A search interface for the Performing Patents Otherwise publication as part of the Politics of Patents case study (part of Copim WP6): this parses data from the archive of RTF files and provides additional data from the European Patent Office OPS API. https://patents.copim.ac.uk
  1. <?xml version="1.0" encoding="UTF-8" ?>
  2. <!--
  3. Licensed to the Apache Software Foundation (ASF) under one or more
  4. contributor license agreements. See the NOTICE file distributed with
  5. this work for additional information regarding copyright ownership.
  6. The ASF licenses this file to You under the Apache License, Version 2.0
  7. (the "License"); you may not use this file except in compliance with
  8. the License. You may obtain a copy of the License at
  9. http://www.apache.org/licenses/LICENSE-2.0
  10. Unless required by applicable law or agreed to in writing, software
  11. distributed under the License is distributed on an "AS IS" BASIS,
  12. WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  13. See the License for the specific language governing permissions and
  14. limitations under the License.
  15. -->
  16. <!--
  17. For more details about configuration options that may appear in
  18. this file, see http://wiki.apache.org/solr/SolrConfigXml.
  19. -->
  20. <config>
  21. <!-- In all configuration below, a prefix of "solr." for class names
  22. is an alias that causes solr to search appropriate packages,
  23. including org.apache.solr.(search|update|request|core|analysis)
  24. You may also specify a fully qualified Java classname if you
  25. have your own custom plugins.
  26. -->
  27. <!-- Controls what version of Lucene various components of Solr
  28. adhere to. Generally, you want to use the latest version to
  29. get all bug fixes and improvements. It is highly recommended
  30. that you fully re-index after changing this setting as it can
  31. affect both how text is indexed and queried.
  32. -->
  33. <luceneMatchVersion>8.10.0</luceneMatchVersion>
  34. <!-- <lib/> directives can be used to instruct Solr to load any Jars
  35. identified and use them to resolve any "plugins" specified in
  36. your solrconfig.xml or schema.xml (ie: Analyzers, Request
  37. Handlers, etc...).
  38. All directories and paths are resolved relative to the
  39. instanceDir.
  40. Please note that <lib/> directives are processed in the order
  41. that they appear in your solrconfig.xml file, and are "stacked"
  42. on top of each other when building a ClassLoader - so if you have
  43. plugin jars with dependencies on other jars, the "lower level"
  44. dependency jars should be loaded first.
  45. If a "./lib" directory exists in your instanceDir, all files
  46. found in it are included as if you had used the following
  47. syntax...
  48. <lib dir="./lib" />
  49. -->
  50. <!-- A 'dir' option by itself adds any files found in the directory
  51. to the classpath; this is useful for including all jars in a
  52. directory.
  53. When a 'regex' is specified in addition to a 'dir', only the
  54. files in that directory which completely match the regex
  55. (anchored on both ends) will be included.
  56. If a 'dir' option (with or without a regex) is used and nothing
  57. is found that matches, a warning will be logged.
  58. The examples below can be used to load some solr-contribs along
  59. with their external dependencies.
  60. -->
  61. <lib dir="${solr.install.dir:../../../..}/contrib/extraction/lib" regex=".*\.jar" />
  62. <lib dir="${solr.install.dir:../../../..}/dist/" regex="solr-cell-\d.*\.jar" />
  63. <lib dir="${solr.install.dir:../../../..}/contrib/langid/lib/" regex=".*\.jar" />
  64. <lib dir="${solr.install.dir:../../../..}/dist/" regex="solr-langid-\d.*\.jar" />
  65. <lib dir="${solr.install.dir:../../../..}/contrib/velocity/lib" regex=".*\.jar" />
  66. <!-- browse-resources must come before solr-velocity JAR in order to override localized resources -->
  67. <lib path="${solr.install.dir:../../../..}/example/files/browse-resources"/>
  68. <lib dir="${solr.install.dir:../../../..}/dist/" regex="solr-velocity-\d.*\.jar" />
  69. <!-- an exact 'path' can be used instead of a 'dir' to specify a
  70. specific jar file. This will cause a serious error to be logged
  71. if it can't be loaded.
  72. -->
  73. <!--
  74. <lib path="../a-jar-that-does-not-exist.jar" />
  75. -->
  76. <!-- Data Directory
  77. Used to specify an alternate directory to hold all index data
  78. other than the default ./data under the Solr home. If
  79. replication is in use, this should match the replication
  80. configuration.
  81. -->
  82. <dataDir>${solr.data.dir:}</dataDir>
  83. <!-- The DirectoryFactory to use for indexes.
  84. solr.StandardDirectoryFactory is filesystem
  85. based and tries to pick the best implementation for the current
  86. JVM and platform. solr.NRTCachingDirectoryFactory, the default,
  87. wraps solr.StandardDirectoryFactory and caches small files in memory
  88. for better NRT performance.
  89. One can force a particular implementation via solr.MMapDirectoryFactory,
  90. solr.NIOFSDirectoryFactory, or solr.SimpleFSDirectoryFactory.
  91. solr.RAMDirectoryFactory is memory based and not persistent.
  92. -->
  93. <directoryFactory name="DirectoryFactory"
  94. class="${solr.directoryFactory:solr.NRTCachingDirectoryFactory}"/>
  95. <!-- The CodecFactory for defining the format of the inverted index.
  96. The default implementation is SchemaCodecFactory, which is the official Lucene
  97. index format, but hooks into the schema to provide per-field customization of
  98. the postings lists and per-document values in the fieldType element
  99. (postingsFormat/docValuesFormat). Note that most of the alternative implementations
  100. are experimental, so if you choose to customize the index format, it's a good
  101. idea to convert back to the official format e.g. via IndexWriter.addIndexes(IndexReader)
  102. before upgrading to a newer version to avoid unnecessary reindexing.
  103. -->
  104. <codecFactory class="solr.SchemaCodecFactory"/>
  105. <!-- ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  106. Index Config - These settings control low-level behavior of indexing
  107. Most example settings here show the default value, but are commented
  108. out, to more easily see where customizations have been made.
  109. Note: This replaces <indexDefaults> and <mainIndex> from older versions
  110. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ -->
  111. <indexConfig>
  112. <!-- maxFieldLength was removed in 4.0. To get similar behavior, include a
  113. LimitTokenCountFilterFactory in your fieldType definition. E.g.
  114. <filter class="solr.LimitTokenCountFilterFactory" maxTokenCount="10000"/>
  115. -->
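<!-- Illustrative sketch, not part of the original configuration: the
     LimitTokenCountFilterFactory mentioned above belongs in the schema,
     inside the analyzer of a fieldType. The fieldType name "text_limited"
     and the choice of tokenizer are assumptions made for this example.
       <fieldType name="text_limited" class="solr.TextField">
         <analyzer>
           <tokenizer class="solr.StandardTokenizerFactory"/>
           <filter class="solr.LimitTokenCountFilterFactory" maxTokenCount="10000"/>
         </analyzer>
       </fieldType>
     With this in place, only the first 10000 tokens of each field value
     would be indexed.
-->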
  116. <!-- Maximum time to wait for a write lock (ms) for an IndexWriter. Default: 1000 -->
  117. <!-- <writeLockTimeout>1000</writeLockTimeout> -->
  118. <!-- Expert: Enabling compound file will use fewer files for the index,
  119. and so fewer file descriptors, at the expense of decreased performance.
  120. Default in Lucene is "true". Default in Solr is "false" (since 3.6) -->
  121. <!-- <useCompoundFile>false</useCompoundFile> -->
  122. <!-- ramBufferSizeMB sets the amount of RAM that may be used by Lucene
  123. indexing for buffering added documents and deletions before they are
  124. flushed to the Directory.
  125. maxBufferedDocs sets a limit on the number of documents buffered
  126. before flushing.
  127. If both ramBufferSizeMB and maxBufferedDocs are set, then
  128. Lucene will flush based on whichever limit is hit first. -->
  129. <!-- <ramBufferSizeMB>100</ramBufferSizeMB> -->
  130. <!-- <maxBufferedDocs>1000</maxBufferedDocs> -->
  131. <!-- Expert: Merge Policy
  132. The Merge Policy in Lucene controls how merging of segments is done.
  133. The default since Solr/Lucene 3.3 is TieredMergePolicy.
  134. Before that, the default since Lucene 2.3 was LogByteSizeMergePolicy.
  135. Even older versions of Lucene used LogDocMergePolicy.
  136. -->
  137. <!--
  138. <mergePolicyFactory class="solr.TieredMergePolicyFactory">
  139. <int name="maxMergeAtOnce">10</int>
  140. <int name="segmentsPerTier">10</int>
  141. </mergePolicyFactory>
  142. -->
  143. <!-- Expert: Merge Scheduler
  144. The Merge Scheduler in Lucene controls how merges are
  145. performed. The ConcurrentMergeScheduler (Lucene 2.3 default)
  146. can perform merges in the background using separate threads.
  147. The SerialMergeScheduler (Lucene 2.2 default) does not.
  148. -->
  149. <!--
  150. <mergeScheduler class="org.apache.lucene.index.ConcurrentMergeScheduler"/>
  151. -->
  152. <!-- LockFactory
  153. This option specifies which Lucene LockFactory implementation
  154. to use.
  155. single = SingleInstanceLockFactory - suggested for a
  156. read-only index or when there is no possibility of
  157. another process trying to modify the index.
  158. native = NativeFSLockFactory - uses OS native file locking.
  159. Do not use when multiple solr webapps in the same
  160. JVM are attempting to share a single index.
  161. simple = SimpleFSLockFactory - uses a plain file for locking
  162. Defaults: 'native' is default for Solr 3.6 and later, otherwise
  163. 'simple' is the default
  164. More details on the nuances of each LockFactory...
  165. http://wiki.apache.org/lucene-java/AvailableLockFactories
  166. -->
  167. <lockType>${solr.lock.type:native}</lockType>
  168. <!-- Commit Deletion Policy
  169. Custom deletion policies can be specified here. The class must
  170. implement org.apache.lucene.index.IndexDeletionPolicy.
  171. The default Solr IndexDeletionPolicy implementation supports
  172. deleting index commit points on number of commits, age of
  173. commit point and optimized status.
  174. The latest commit point should always be preserved regardless
  175. of the criteria.
  176. -->
  177. <!--
  178. <deletionPolicy class="solr.SolrDeletionPolicy">
  179. -->
  180. <!-- The number of commit points to be kept -->
  181. <!-- <str name="maxCommitsToKeep">1</str> -->
  182. <!-- The number of optimized commit points to be kept -->
  183. <!-- <str name="maxOptimizedCommitsToKeep">0</str> -->
  184. <!--
  185. Delete all commit points once they have reached the given age.
  186. Supports DateMathParser syntax e.g.
  187. -->
  188. <!--
  189. <str name="maxCommitAge">30MINUTES</str>
  190. <str name="maxCommitAge">1DAY</str>
  191. -->
  192. <!--
  193. </deletionPolicy>
  194. -->
  195. <!-- Lucene Infostream
  196. To aid in advanced debugging, Lucene provides an "InfoStream"
  197. of detailed information when indexing.
  198. Setting the value to true will instruct the underlying Lucene
  199. IndexWriter to write its debugging info to the specified file.
  200. -->
  201. <!-- <infoStream file="INFOSTREAM.txt">false</infoStream> -->
  202. </indexConfig>
  203. <!-- JMX
  204. This example enables JMX if and only if an existing MBeanServer
  205. is found; use this if you want to configure JMX through JVM
  206. parameters. Remove this to disable exposing Solr configuration
  207. and statistics to JMX.
  208. For more details see http://wiki.apache.org/solr/SolrJmx
  209. -->
  210. <jmx />
  211. <!-- If you want to connect to a particular server, specify the
  212. agentId
  213. -->
  214. <!-- <jmx agentId="myAgent" /> -->
  215. <!-- If you want to start a new MBeanServer, specify the serviceUrl -->
  216. <!-- <jmx serviceUrl="service:jmx:rmi:///jndi/rmi://localhost:9999/solr"/>
  217. -->
  218. <!-- The default high-performance update handler -->
  219. <updateHandler class="solr.DirectUpdateHandler2">
  220. <!-- Enables a transaction log, used for real-time get, durability,
  221. and SolrCloud replica recovery. The log can grow as big as
  222. uncommitted changes to the index, so use of a hard autoCommit
  223. is recommended (see below).
  224. "dir" - the target directory for transaction logs, defaults to the
  225. solr data directory. -->
  226. <updateLog>
  227. <str name="dir">${solr.ulog.dir:}</str>
  228. </updateLog>
  229. <!-- AutoCommit
  230. Perform a hard commit automatically under certain conditions.
  231. Instead of enabling autoCommit, consider using "commitWithin"
  232. when adding documents.
  233. http://wiki.apache.org/solr/UpdateXmlMessages
  234. maxDocs - Maximum number of documents to add since the last
  235. commit before automatically triggering a new commit.
  236. maxTime - Maximum amount of time in ms that is allowed to pass
  237. since a document was added before automatically
  238. triggering a new commit.
  239. openSearcher - if false, the commit causes recent index changes
  240. to be flushed to stable storage, but does not cause a new
  241. searcher to be opened to make those changes visible.
  242. If the updateLog is enabled, then it's highly recommended to
  243. have some sort of hard autoCommit to limit the log size.
  244. -->
  245. <autoCommit>
  246. <maxTime>15000</maxTime>
  247. <openSearcher>false</openSearcher>
  248. </autoCommit>
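<!-- Illustrative sketch, not part of the original configuration: the
     "commitWithin" alternative mentioned in the AutoCommit comment above
     is set on each update request rather than in this file. Assuming an
     XML update message and a hypothetical document id, asking Solr to
     commit the added document within 10 seconds could look like this:
       <add commitWithin="10000">
         <doc>
           <field name="id">example-doc-1</field>
         </doc>
       </add>
     The value is in milliseconds, like maxTime above.
-->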
  249. <!-- autoSoftCommit is like autoCommit except it causes a
  250. 'soft' commit which only ensures that changes are visible
  251. but does not ensure that data is synced to disk. This is
  252. faster and more near-realtime friendly than a hard commit.
  253. -->
  254. <!--
  255. <autoSoftCommit>
  256. <maxTime>1000</maxTime>
  257. </autoSoftCommit>
  258. -->
  259. <!-- Update Related Event Listeners
  260. Various IndexWriter related events can trigger Listeners to
  261. take actions.
  262. postCommit - fired after every commit or optimize command
  263. postOptimize - fired after every optimize command
  264. -->
  265. </updateHandler>
  266. <!-- IndexReaderFactory
  267. Use the following format to specify a custom IndexReaderFactory,
  268. which allows for alternate IndexReader implementations.
  269. ** Experimental Feature **
  270. Please note - Using a custom IndexReaderFactory may prevent
  271. certain other features from working. The API to
  272. IndexReaderFactory may change without warning or may even be
  273. removed from future releases if the problems cannot be
  274. resolved.
  275. ** Features that may not work with custom IndexReaderFactory **
  276. The ReplicationHandler assumes a disk-resident index. Using a
  277. custom IndexReader implementation may cause incompatibility
  278. with ReplicationHandler and may cause replication to not work
  279. correctly. See SOLR-1366 for details.
  280. -->
  281. <!--
  282. <indexReaderFactory name="IndexReaderFactory" class="package.class">
  283. <str name="someArg">Some Value</str>
  284. </indexReaderFactory >
  285. -->
  286. <!-- ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  287. Query section - these settings control query time things like caches
  288. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ -->
  289. <query>
  290. <!-- Max Boolean Clauses
  291. Maximum number of clauses in each BooleanQuery, an exception
  292. is thrown if exceeded.
  293. ** WARNING **
  294. This option actually modifies a global Lucene property that
  295. will affect all SolrCores. If multiple solrconfig.xml files
  296. disagree on this property, the value at any given moment will
  297. be based on the last SolrCore to be initialized.
  298. -->
  299. <maxBooleanClauses>${solr.max.booleanClauses:1024}</maxBooleanClauses>
  300. <!-- Solr Internal Query Caches
  301. There are four implementations of cache available for Solr:
  302. LRUCache, based on a synchronized LinkedHashMap,
  303. LFUCache and FastLRUCache, based on a ConcurrentHashMap, and CaffeineCache -
  304. a modern and robust cache implementation. Note that in Solr 9.0
  305. only CaffeineCache will be available, other implementations are now
  306. deprecated.
  307. FastLRUCache has faster gets and slower puts in single
  308. threaded operation and thus is generally faster than LRUCache
  309. when the hit ratio of the cache is high (> 75%), and may be
  310. faster under other scenarios on multi-cpu systems.
  311. -->
  312. <!-- Filter Cache
  313. Cache used by SolrIndexSearcher for filters (DocSets),
  314. unordered sets of *all* documents that match a query. When a
  315. new searcher is opened, its caches may be prepopulated or
  316. "autowarmed" using data from caches in the old searcher.
  317. autowarmCount is the number of items to prepopulate. For
  318. LRUCache, the autowarmed items will be the most recently
  319. accessed items.
  320. Parameters:
  321. class - the SolrCache implementation to use
  322. (LRUCache or FastLRUCache)
  323. size - the maximum number of entries in the cache
  324. initialSize - the initial capacity (number of entries) of
  325. the cache. (see java.util.HashMap)
  326. autowarmCount - the number of entries to prepopulate from
  327. an old cache.
  328. -->
  329. <filterCache size="512"
  330. initialSize="512"
  331. autowarmCount="0"/>
  332. <!-- Query Result Cache
  333. Caches results of searches - ordered lists of document ids
  334. (DocList) based on a query, a sort, and the range of documents requested.
  335. Additional supported parameter by LRUCache:
  336. maxRamMB - the maximum amount of RAM (in MB) that this cache is allowed
  337. to occupy
  338. -->
  339. <queryResultCache size="512"
  340. initialSize="512"
  341. autowarmCount="0"/>
  342. <!-- Document Cache
  343. Caches Lucene Document objects (the stored fields for each
  344. document). Since Lucene internal document ids are transient,
  345. this cache will not be autowarmed.
  346. -->
  347. <documentCache size="512"
  348. initialSize="512"
  349. autowarmCount="0"/>
  350. <!-- Field Value Cache
  351. Cache used to hold field values that are quickly accessible
  352. by document id. The fieldValueCache is created by default
  353. even if not configured here.
  354. -->
  355. <!--
  356. <fieldValueCache size="512"
  357. autowarmCount="128"
  358. showItems="32" />
  359. -->
  360. <!-- Custom Cache
  361. Example of a generic cache. These caches may be accessed by
  362. name through SolrIndexSearcher.getCache(), cacheLookup(), and
  363. cacheInsert(). The purpose is to enable easy caching of
  364. user/application level data. The regenerator argument should
  365. be specified as an implementation of solr.CacheRegenerator
  366. if autowarming is desired.
  367. -->
  368. <!--
  369. <cache name="myUserCache"
  370. size="4096"
  371. initialSize="1024"
  372. autowarmCount="1024"
  373. regenerator="com.mycompany.MyRegenerator"
  374. />
  375. -->
  376. <!-- Lazy Field Loading
  377. If true, stored fields that are not requested will be loaded
  378. lazily. This can result in a significant speed improvement
  379. if the usual case is to not load all stored fields,
  380. especially if the skipped fields are large compressed text
  381. fields.
  382. -->
  383. <enableLazyFieldLoading>true</enableLazyFieldLoading>
  384. <!-- Use Filter For Sorted Query
  385. A possible optimization that attempts to use a filter to
  386. satisfy a search. If the requested sort does not include
  387. score, then the filterCache will be checked for a filter
  388. matching the query. If found, the filter will be used as the
  389. source of document ids, and then the sort will be applied to
  390. that.
  391. For most situations, this will not be useful unless you
  392. frequently get the same search repeatedly with different sort
  393. options, and none of them ever use "score"
  394. -->
  395. <!--
  396. <useFilterForSortedQuery>true</useFilterForSortedQuery>
  397. -->
  398. <!-- Result Window Size
  399. An optimization for use with the queryResultCache. When a search
  400. is requested, a superset of the requested number of document ids
  401. is collected. For example, if a search for a particular query
  402. requests matching documents 10 through 19, and queryResultWindowSize is 50,
  403. then documents 0 through 49 will be collected and cached. Any further
  404. requests in that range can be satisfied via the cache.
  405. -->
  406. <queryResultWindowSize>20</queryResultWindowSize>
  407. <!-- Maximum number of documents to cache for any entry in the
  408. queryResultCache.
  409. -->
  410. <queryResultMaxDocsCached>200</queryResultMaxDocsCached>
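<!-- Worked example for the two settings above, not part of the original
     configuration. It is a sketch that assumes the collected superset is
     rounded up to the next multiple of queryResultWindowSize (20 here):
       start=0,  rows=10 : 10 ids needed, up to 20 collected
       start=10, rows=10 : 20 ids needed, up to 20 collected
       start=25, rows=10 : 35 ids needed, up to 40 collected
     A collected list larger than queryResultMaxDocsCached (200 ids) is
     not expected to be cached; for instance, start=195 with rows=10
     would collect up to 220 ids.
-->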
  411. <!-- Query Related Event Listeners
  412. Various IndexSearcher related events can trigger Listeners to
  413. take actions.
  414. newSearcher - fired whenever a new searcher is being prepared
  415. and there is a current searcher handling requests (aka
  416. registered). It can be used to prime certain caches to
  417. prevent long request times for certain requests.
  418. firstSearcher - fired whenever a new searcher is being
  419. prepared but there is no current registered searcher to handle
  420. requests or to gain autowarming data from.
  421. -->
  422. <!-- QuerySenderListener takes an array of NamedList and executes a
  423. local query request for each NamedList in sequence.
  424. -->
  425. <listener event="newSearcher" class="solr.QuerySenderListener">
  426. <arr name="queries">
  427. <!--
  428. <lst><str name="q">solr</str><str name="sort">price asc</str></lst>
  429. <lst><str name="q">rocks</str><str name="sort">weight asc</str></lst>
  430. -->
  431. </arr>
  432. </listener>
  433. <listener event="firstSearcher" class="solr.QuerySenderListener">
  434. <arr name="queries">
  435. <!--
  436. <lst>
  437. <str name="q">static firstSearcher warming in solrconfig.xml</str>
  438. </lst>
  439. -->
  440. </arr>
  441. </listener>
  442. <!-- Use Cold Searcher
  443. If a search request comes in and there is no current
  444. registered searcher, then immediately register the still
  445. warming searcher and use it. If "false" then all requests
  446. will block until the first searcher is done warming.
  447. -->
  448. <useColdSearcher>false</useColdSearcher>
  449. </query>
  450. <!-- ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  451. Circuit Breaker Section - This section consists of configurations for
  452. circuit breakers
  453. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ -->
  454. <circuitBreaker class="solr.CircuitBreakerManager" enabled="true">
  455. <!-- Enable Circuit Breakers
  456. Circuit breakers are designed to allow stability and predictable query
  457. execution. They prevent operations that can take down the node and cause
  458. noisy neighbour issues.
  459. This flag is the uber control switch which controls the activation/deactivation of all circuit
  460. breakers. At the moment, the only circuit breaker (max JVM circuit breaker) does not have its
  461. own specific configuration. However, if a circuit breaker needs to be independently configurable,
  462. it is free to add its specific configuration but needs to ensure that this flag is always
  463. respected - this should have veto over all independent configuration flags.
  464. -->
  465. <!-- Memory Circuit Breaker Control Flag
  466. Use the following flag to control the behaviour of this circuit breaker
  467. -->
  468. <str name="memEnabled">true</str>
  469. <!-- Memory Circuit Breaker Threshold In Percentage
  470. Specific configuration for max JVM heap usage circuit breaker. This configuration defines the
  471. threshold percentage of maximum heap allocated beyond which queries will be rejected until the
  472. current JVM usage goes below the threshold. The valid value range for this parameter is 50 - 95.
  473. Consider a scenario where the max heap allocated is 4 GB and memoryCircuitBreakerThresholdPct is
  474. defined as 75. Threshold JVM usage will be 4 * 0.75 = 3 GB. It's generally a good idea to keep this value between 75 - 80% of maximum heap
  475. allocated.
  476. If, at any point, the current JVM heap usage goes above 3 GB, queries will be rejected until the heap usage goes below 3 GB again.
  477. If you see queries getting rejected with 503 error code, check for "Circuit Breakers tripped"
  478. in logs and the corresponding error message should tell you what transpired (if the failure
  479. was caused by tripped circuit breakers).
  480. -->
  481. <str name="memThreshold">75</str>
  482. <!-- CPU Based Circuit Breaker Control Flag
  483. Use the following flag to control the behaviour of this circuit breaker
  484. -->
  485. <str name="cpuEnabled">true</str>
  486. <!-- CPU Based Circuit Breaker Triggering Threshold
  487. The triggering threshold is defined in units of CPU utilization. The configuration to control this is as below:
  488. -->
  489. <str name="cpuThreshold">75</str>
  490. </circuitBreaker>
  491. <!-- Request Dispatcher
  492. This section contains instructions for how the SolrDispatchFilter
  493. should behave when processing requests for this SolrCore.
  494. -->
  495. <requestDispatcher>
  496. <!-- Request Parsing
  497. These settings indicate how Solr Requests may be parsed, and
  498. what restrictions may be placed on the ContentStreams from
  499. those requests
  500. enableRemoteStreaming - enables use of the stream.file
  501. and stream.url parameters for specifying remote streams.
  502. multipartUploadLimitInKB - specifies the max size (in KiB) of
  503. Multipart File Uploads that Solr will allow in a Request.
  504. formdataUploadLimitInKB - specifies the max size (in KiB) of
  505. form data (application/x-www-form-urlencoded) sent via
  506. POST. You can use POST to pass request parameters not
  507. fitting into the URL.
  508. addHttpRequestToContext - if set to true, it will instruct
  509. the requestParsers to include the original HttpServletRequest
  510. object in the context map of the SolrQueryRequest under the
  511. key "httpRequest". It will not be used by any of the existing
  512. Solr components, but may be useful when developing custom
  513. plugins.
  514. *** WARNING ***
  515. Before enabling remote streaming, you should make sure your
  516. system has authentication enabled.
  517. <requestParsers enableRemoteStreaming="false"
  518. multipartUploadLimitInKB="-1"
  519. formdataUploadLimitInKB="-1"
  520. addHttpRequestToContext="false"/>
  521. -->
  522. <!-- HTTP Caching
  523. Set HTTP caching related parameters (for proxy caches and clients).
  524. The options below instruct Solr not to output any HTTP Caching
  525. related headers
  526. -->
  527. <httpCaching never304="true" />
  528. <!-- If you include a <cacheControl> directive, it will be used to
  529. generate a Cache-Control header (as well as an Expires header
  530. if the value contains "max-age=")
  531. By default, no Cache-Control header is generated.
  532. You can use the <cacheControl> option even if you have set
  533. never304="true"
  534. -->
  535. <!--
  536. <httpCaching never304="true" >
  537. <cacheControl>max-age=30, public</cacheControl>
  538. </httpCaching>
  539. -->
  540. <!-- To enable Solr to respond with automatically generated HTTP
  541. Caching headers, and to respond to Cache Validation requests
  542. correctly, set the value of never304="false"
  543. This will cause Solr to generate Last-Modified and ETag
  544. headers based on the properties of the Index.
  545. The following options can also be specified to affect the
  546. values of these headers...
  547. lastModFrom - the default value is "openTime" which means the
  548. Last-Modified value (and validation against If-Modified-Since
  549. requests) will all be relative to when the current Searcher
  550. was opened. You can change it to lastModFrom="dirLastMod" if
  551. you want the value to exactly correspond to when the physical
  552. index was last modified.
  553. etagSeed="..." is an option you can change to force the ETag
  554. header (and validation against If-None-Match requests) to be
  555. different even if the index has not changed (ie: when making
  556. significant changes to your config file)
  557. (lastModifiedFrom and etagSeed are both ignored if you use
  558. the never304="true" option)
  559. -->
  560. <!--
  561. <httpCaching lastModifiedFrom="openTime"
  562. etagSeed="Solr">
  563. <cacheControl>max-age=30, public</cacheControl>
  564. </httpCaching>
  565. -->
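<!-- Example (a sketch of the HTTP exchange, assuming never304="false"
     were set instead of the active never304="true" above; the core
     name "mycore" and the header values are placeholders):
     First request:
       GET /solr/mycore/select?q=*:* HTTP/1.1
     Response (abridged):
       HTTP/1.1 200 OK
       Last-Modified: (time derived from lastModifiedFrom)
       ETag: "(value derived from the index and etagSeed)"
     Revalidation by the client or a proxy cache:
       GET /solr/mycore/select?q=*:* HTTP/1.1
       If-None-Match: "(ETag value from the first response)"
     Response while the index is unchanged:
       HTTP/1.1 304 Not Modified
  -->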
  566. </requestDispatcher>
  567. <!-- Request Handlers
  568. http://wiki.apache.org/solr/SolrRequestHandler
  569. Incoming queries will be dispatched to a specific handler by name
  570. based on the path specified in the request.
  571. If a Request Handler is declared with startup="lazy", then it will
  572. not be initialized until the first request that uses it.
  573. -->
<!-- SearchHandler
http://wiki.apache.org/solr/SearchHandler
For processing Search Queries, the primary Request Handler
provided with Solr is "SearchHandler". It delegates to a sequence
of SearchComponents (see below) and supports distributed
queries across multiple shards.
-->
  581. <requestHandler name="/select" class="solr.SearchHandler">
<!-- Default values for query parameters can be specified here; these
will be overridden by parameters in the request.
-->
  585. <lst name="defaults">
  586. <str name="echoParams">explicit</str>
  587. <int name="rows">10</int>
  588. <!-- Default search field
  589. <str name="df">text</str>
  590. -->
  591. <!-- Change from JSON to XML format (the default prior to Solr 7.0)
  592. <str name="wt">xml</str>
  593. -->
  594. </lst>
  595. <!-- In addition to defaults, "appends" params can be specified
  596. to identify values which should be appended to the list of
  597. multi-val params from the query (or the existing "defaults").
  598. -->
<!-- In this example, the param "fq=inStock:true" would be appended to
  600. any query time fq params the user may specify, as a mechanism for
  601. partitioning the index, independent of any user selected filtering
  602. that may also be desired (perhaps as a result of faceted searching).
  603. NOTE: there is *absolutely* nothing a client can do to prevent these
  604. "appends" values from being used, so don't use this mechanism
  605. unless you are sure you always want it.
  606. -->
  607. <!--
  608. <lst name="appends">
  609. <str name="fq">inStock:true</str>
  610. </lst>
  611. -->
  612. <!-- "invariants" are a way of letting the Solr maintainer lock down
the options available to Solr clients. Any param values
  614. specified here are used regardless of what values may be specified
  615. in either the query, the "defaults", or the "appends" params.
  616. In this example, the facet.field and facet.query params would
  617. be fixed, limiting the facets clients can use. Faceting is
  618. not turned on by default - but if the client does specify
  619. facet=true in the request, these are the only facets they
  620. will be able to see counts for; regardless of what other
  621. facet.field or facet.query params they may specify.
  622. NOTE: there is *absolutely* nothing a client can do to prevent these
  623. "invariants" values from being used, so don't use this mechanism
  624. unless you are sure you always want it.
  625. -->
  626. <!--
  627. <lst name="invariants">
  628. <str name="facet.field">cat</str>
  629. <str name="facet.field">manu_exact</str>
  630. <str name="facet.query">price:[* TO 500]</str>
  631. <str name="facet.query">price:[500 TO *]</str>
  632. </lst>
  633. -->
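<!-- Worked example (a sketch, assuming the commented-out "appends"
     and "invariants" lists above were both enabled):
     A request with the parameters
       q=solr  rows=5  fq=cat:books  facet=true  facet.field=price
     would be processed as if it had specified
       rows=5
         (the request value overrides the default of 10)
       fq=cat:books and fq=inStock:true
         (the "appends" value is added to the request's own fq)
       facet.field=cat and facet.field=manu_exact
       facet.query=price:[* TO 500] and facet.query=price:[500 TO *]
         (the "invariants" replace the requested facet.field=price)
  -->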
  634. <!-- If the default list of SearchComponents is not desired, that
  635. list can either be overridden completely, or components can be
  636. prepended or appended to the default list. (see below)
  637. -->
  638. <!--
  639. <arr name="components">
  640. <str>nameOfCustomComponent1</str>
  641. <str>nameOfCustomComponent2</str>
  642. </arr>
  643. -->
  644. </requestHandler>
  645. <!-- A request handler that returns indented JSON by default -->
  646. <requestHandler name="/query" class="solr.SearchHandler">
  647. <lst name="defaults">
  648. <str name="echoParams">explicit</str>
  649. <str name="wt">json</str>
  650. <str name="indent">true</str>
  651. </lst>
  652. </requestHandler>
  653. <!--These useParams values are available in params.json-->
  654. <requestHandler name="/browse" class="solr.SearchHandler" useParams="query,facets,velocity,browse"/>
  655. <initParams path="/update/**,/query,/select,/tvrh,/elevate,/spell,/browse">
  656. <lst name="defaults">
  657. <str name="df">_text_</str>
  658. </lst>
  659. </initParams>
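<!-- Example (a sketch; the field name "title" is hypothetical):
     According to the Solr Reference Guide, a value defined in a
     handler's own section overrides the matching initParams value, so
     a single handler can still search a different field by default:
     <requestHandler name="/query" class="solr.SearchHandler">
       <lst name="defaults">
         <str name="df">title</str>
       </lst>
     </requestHandler>
  -->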
  660. <!--<initParams path="/update/**">
  661. <lst name="defaults">
  662. <str name="update.chain">files-update-processor</str>
  663. </lst>
  664. </initParams>-->
  665. <!-- Solr Cell Update Request Handler
  666. http://wiki.apache.org/solr/ExtractingRequestHandler
  667. -->
  668. <requestHandler name="/update/extract"
  669. startup="lazy"
  670. class="solr.extraction.ExtractingRequestHandler" >
  671. <lst name="defaults">
  672. <str name="xpath">/xhtml:html/xhtml:body/descendant:node()</str>
  673. <str name="capture">content</str>
  674. <str name="fmap.meta">attr_meta_</str>
  675. <str name="uprefix">attr_</str>
  676. <str name="lowernames">true</str>
  677. </lst>
  678. </requestHandler>
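<!-- Example (a sketch; host, port, core name, document id and file
     name are placeholders, and the extraction libraries are assumed
     to be on the classpath): posting a file to this handler with curl.
     literal.id supplies the value of the document's id field and
     commit=true commits the index once the document is added.
       curl "http://localhost:8983/solr/mycore/update/extract?literal.id=doc1&commit=true" -F "myfile=@example.rtf"
     With the defaults above, extracted fields that are not defined in
     the schema are prefixed with "attr_" (uprefix) and field names
     are lowercased (lowernames).
  -->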
  679. <!-- Search Components
  680. Search components are registered to SolrCore and used by
  681. instances of SearchHandler (which can access them by name)
  682. By default, the following components are available:
  683. <searchComponent name="query" class="solr.QueryComponent" />
  684. <searchComponent name="facet" class="solr.FacetComponent" />
  685. <searchComponent name="mlt" class="solr.MoreLikeThisComponent" />
  686. <searchComponent name="highlight" class="solr.HighlightComponent" />
  687. <searchComponent name="stats" class="solr.StatsComponent" />
  688. <searchComponent name="debug" class="solr.DebugComponent" />
  689. Default configuration in a requestHandler would look like:
  690. <arr name="components">
  691. <str>query</str>
  692. <str>facet</str>
  693. <str>mlt</str>
  694. <str>highlight</str>
  695. <str>stats</str>
  696. <str>debug</str>
  697. </arr>
  698. If you register a searchComponent to one of the standard names,
  699. that will be used instead of the default.
  700. To insert components before or after the 'standard' components, use:
  701. <arr name="first-components">
  702. <str>myFirstComponentName</str>
  703. </arr>
  704. <arr name="last-components">
  705. <str>myLastComponentName</str>
  706. </arr>
  707. NOTE: The component registered with the name "debug" will
  708. always be executed after the "last-components"
  709. -->
  710. <!-- Spell Check
  711. The spell check component can return a list of alternative spelling
  712. suggestions.
  713. http://wiki.apache.org/solr/SpellCheckComponent
  714. -->
  715. <searchComponent name="spellcheck" class="solr.SpellCheckComponent">
  716. <str name="queryAnalyzerFieldType">text_general</str>
  717. <!-- Multiple "Spell Checkers" can be declared and used by this
  718. component
  719. -->
  720. <!-- a spellchecker built from a field of the main index -->
  721. <lst name="spellchecker">
  722. <str name="name">default</str>
  723. <str name="field">text</str>
  724. <str name="classname">solr.DirectSolrSpellChecker</str>
  725. <!-- the spellcheck distance measure used, the default is the internal levenshtein -->
  726. <str name="distanceMeasure">internal</str>
  727. <!-- minimum accuracy needed to be considered a valid spellcheck suggestion -->
  728. <float name="accuracy">0.5</float>
  729. <!-- the maximum #edits we consider when enumerating terms: can be 1 or 2 -->
  730. <int name="maxEdits">2</int>
  731. <!-- the minimum shared prefix when enumerating terms -->
  732. <int name="minPrefix">1</int>
  733. <!-- maximum number of inspections per result. -->
  734. <int name="maxInspections">5</int>
  735. <!-- minimum length of a query term to be considered for correction -->
  736. <int name="minQueryLength">4</int>
<!-- maximum threshold of documents a query term can appear in to be considered for correction -->
  738. <float name="maxQueryFrequency">0.01</float>
  739. <!-- uncomment this to require suggestions to occur in 1% of the documents
  740. <float name="thresholdTokenFrequency">.01</float>
  741. -->
  742. </lst>
  743. <!-- a spellchecker that can break or combine words. See "/spell" handler below for usage -->
  744. <lst name="spellchecker">
  745. <str name="name">wordbreak</str>
  746. <str name="classname">solr.WordBreakSolrSpellChecker</str>
  747. <str name="field">name</str>
  748. <str name="combineWords">true</str>
  749. <str name="breakWords">true</str>
  750. <int name="maxChanges">10</int>
  751. </lst>
  752. <!-- a spellchecker that uses a different distance measure -->
  753. <!--
  754. <lst name="spellchecker">
  755. <str name="name">jarowinkler</str>
  756. <str name="field">spell</str>
  757. <str name="classname">solr.DirectSolrSpellChecker</str>
  758. <str name="distanceMeasure">
  759. org.apache.lucene.search.spell.JaroWinklerDistance
  760. </str>
  761. </lst>
  762. -->
<!-- a spellchecker that uses an alternate comparator
comparatorClass can be one of:
1. score (default)
2. freq (Frequency first, then score)
3. A fully qualified class name
-->
<!--
<lst name="spellchecker">
<str name="name">freq</str>
<str name="field">lowerfilt</str>
<str name="classname">solr.DirectSolrSpellChecker</str>
<str name="comparatorClass">freq</str>
</lst>
-->
  776. <!-- A spellchecker that reads the list of words from a file -->
  777. <!--
  778. <lst name="spellchecker">
  779. <str name="classname">solr.FileBasedSpellChecker</str>
  780. <str name="name">file</str>
  781. <str name="sourceLocation">spellings.txt</str>
  782. <str name="characterEncoding">UTF-8</str>
  783. <str name="spellcheckIndexDir">spellcheckerFile</str>
  784. </lst>
  785. -->
  786. </searchComponent>
  787. <!-- A request handler for demonstrating the spellcheck component.
  788. NOTE: This is purely as an example. The whole purpose of the
  789. SpellCheckComponent is to hook it into the request handler that
  790. handles your normal user queries so that a separate request is
  791. not needed to get suggestions.
IN OTHER WORDS, THERE IS A REALLY GOOD CHANCE THE SETUP BELOW IS
  793. NOT WHAT YOU WANT FOR YOUR PRODUCTION SYSTEM!
  794. See http://wiki.apache.org/solr/SpellCheckComponent for details
  795. on the request parameters.
  796. -->
  797. <requestHandler name="/spell" class="solr.SearchHandler" startup="lazy">
  798. <lst name="defaults">
  799. <!-- Solr will use suggestions from both the 'default' spellchecker
  800. and from the 'wordbreak' spellchecker and combine them.
  801. collations (re-written queries) can include a combination of
  802. corrections from both spellcheckers -->
  803. <str name="spellcheck.dictionary">default</str>
  804. <str name="spellcheck.dictionary">wordbreak</str>
  805. <str name="spellcheck">on</str>
  806. <str name="spellcheck.extendedResults">true</str>
  807. <str name="spellcheck.count">10</str>
  808. <str name="spellcheck.alternativeTermCount">5</str>
  809. <str name="spellcheck.maxResultsForSuggest">5</str>
  810. <str name="spellcheck.collate">true</str>
  811. <str name="spellcheck.collateExtendedResults">true</str>
  812. <str name="spellcheck.maxCollationTries">10</str>
  813. <str name="spellcheck.maxCollations">5</str>
  814. </lst>
  815. <arr name="last-components">
  816. <str>spellcheck</str>
  817. </arr>
  818. </requestHandler>
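<!-- Example (a sketch, not enabled here): as the note above
     recommends, the spellcheck component can be attached to the
     handler that serves normal queries instead of a separate "/spell"
     handler. The parameter values shown are illustrative:
     <requestHandler name="/select" class="solr.SearchHandler">
       <lst name="defaults">
         <str name="spellcheck">on</str>
         <str name="spellcheck.dictionary">default</str>
         <str name="spellcheck.collate">true</str>
       </lst>
       <arr name="last-components">
         <str>spellcheck</str>
       </arr>
     </requestHandler>
  -->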
  819. <!-- Term Vector Component
  820. http://wiki.apache.org/solr/TermVectorComponent
  821. -->
  822. <searchComponent name="tvComponent" class="solr.TermVectorComponent"/>
  823. <!-- A request handler for demonstrating the term vector component
  824. This is purely as an example.
  825. In reality you will likely want to add the component to your
  826. already specified request handlers.
  827. -->
  828. <requestHandler name="/tvrh" class="solr.SearchHandler" startup="lazy">
  829. <lst name="defaults">
  830. <bool name="tv">true</bool>
  831. </lst>
  832. <arr name="last-components">
  833. <str>tvComponent</str>
  834. </arr>
  835. </requestHandler>
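<!-- Example request (a sketch; the core name and document id are
     placeholders). tv.tf and tv.df ask for term frequency and
     document frequency. Term vectors are only returned for fields
     that are indexed with termVectors="true" in the schema:
       /solr/mycore/tvrh?q=id:1&tv.tf=true&tv.df=true
  -->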
  836. <!-- Terms Component
  837. http://wiki.apache.org/solr/TermsComponent
  838. A component to return terms and document frequency of those
  839. terms
  840. -->
  841. <searchComponent name="terms" class="solr.TermsComponent"/>
  842. <!-- A request handler for demonstrating the terms component -->
  843. <requestHandler name="/terms" class="solr.SearchHandler" startup="lazy">
  844. <lst name="defaults">
  845. <bool name="terms">true</bool>
  846. <bool name="distrib">false</bool>
  847. </lst>
  848. <arr name="components">
  849. <str>terms</str>
  850. </arr>
  851. </requestHandler>
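<!-- Example request (a sketch; the core and field names are
     placeholders): list up to 10 indexed terms of the "name" field
     that start with "pat", together with their document frequencies:
       /solr/mycore/terms?terms.fl=name&terms.prefix=pat&terms.limit=10
  -->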
  852. <!-- Query Elevation Component
  853. http://wiki.apache.org/solr/QueryElevationComponent
  854. a search component that enables you to configure the top
  855. results for a given query regardless of the normal lucene
  856. scoring.
  857. -->
  858. <searchComponent name="elevator" class="solr.QueryElevationComponent" >
  859. <!-- pick a fieldType to analyze queries -->
  860. <str name="queryFieldType">string</str>
  861. <str name="config-file">elevate.xml</str>
  862. </searchComponent>
  863. <!-- A request handler for demonstrating the elevator component -->
  864. <requestHandler name="/elevate" class="solr.SearchHandler" startup="lazy">
  865. <lst name="defaults">
  866. <str name="echoParams">explicit</str>
  867. </lst>
  868. <arr name="last-components">
  869. <str>elevator</str>
  870. </arr>
  871. </requestHandler>
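<!-- Example elevate.xml (a sketch; the query text and document ids
     are hypothetical):
     <elevate>
       <query text="patent">
         <doc id="DOC-1" />
         <doc id="DOC-2" exclude="true" />
       </query>
     </elevate>
     With this file, a request to "/elevate" whose query is exactly
     "patent" would list DOC-1 first and leave DOC-2 out of the
     results, regardless of their scores.
  -->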
  872. <!-- Highlighting Component
  873. http://wiki.apache.org/solr/HighlightingParameters
  874. -->
  875. <searchComponent class="solr.HighlightComponent" name="highlight">
  876. <highlighting>
  877. <!-- Configure the standard fragmenter -->
  878. <!-- This could most likely be commented out in the "default" case -->
  879. <fragmenter name="gap"
  880. default="true"
  881. class="solr.highlight.GapFragmenter">
  882. <lst name="defaults">
  883. <int name="hl.fragsize">100</int>
  884. </lst>
  885. </fragmenter>
  886. <!-- A regular-expression-based fragmenter
  887. (for sentence extraction)
  888. -->
  889. <fragmenter name="regex"
  890. class="solr.highlight.RegexFragmenter">
  891. <lst name="defaults">
  892. <!-- slightly smaller fragsizes work better because of slop -->
  893. <int name="hl.fragsize">70</int>
  894. <!-- allow 50% slop on fragment sizes -->
  895. <float name="hl.regex.slop">0.5</float>
  896. <!-- a basic sentence pattern -->
  897. <str name="hl.regex.pattern">[-\w ,/\n\&quot;&apos;]{20,200}</str>
  898. </lst>
  899. </fragmenter>
  900. <!-- Configure the standard formatter -->
  901. <formatter name="html"
  902. default="true"
  903. class="solr.highlight.HtmlFormatter">
  904. <lst name="defaults">
  905. <str name="hl.simple.pre"><![CDATA[<em>]]></str>
  906. <str name="hl.simple.post"><![CDATA[</em>]]></str>
  907. </lst>
  908. </formatter>
  909. <!-- Configure the standard encoder -->
  910. <encoder name="html"
  911. class="solr.highlight.HtmlEncoder" />
  912. <!-- Configure the standard fragListBuilder -->
  913. <fragListBuilder name="simple"
  914. class="solr.highlight.SimpleFragListBuilder"/>
  915. <!-- Configure the single fragListBuilder -->
  916. <fragListBuilder name="single"
  917. class="solr.highlight.SingleFragListBuilder"/>
  918. <!-- Configure the weighted fragListBuilder -->
  919. <fragListBuilder name="weighted"
  920. default="true"
  921. class="solr.highlight.WeightedFragListBuilder"/>
  922. <!-- default tag FragmentsBuilder -->
  923. <fragmentsBuilder name="default"
  924. default="true"
  925. class="solr.highlight.ScoreOrderFragmentsBuilder">
  926. <!--
  927. <lst name="defaults">
  928. <str name="hl.multiValuedSeparatorChar">/</str>
  929. </lst>
  930. -->
  931. </fragmentsBuilder>
  932. <!-- multi-colored tag FragmentsBuilder -->
  933. <fragmentsBuilder name="colored"
  934. class="solr.highlight.ScoreOrderFragmentsBuilder">
  935. <lst name="defaults">
  936. <str name="hl.tag.pre"><![CDATA[
<b style="background:yellow">,<b style="background:lawngreen">,
  938. <b style="background:aquamarine">,<b style="background:magenta">,
  939. <b style="background:palegreen">,<b style="background:coral">,
  940. <b style="background:wheat">,<b style="background:khaki">,
  941. <b style="background:lime">,<b style="background:deepskyblue">]]></str>
  942. <str name="hl.tag.post"><![CDATA[</b>]]></str>
  943. </lst>
  944. </fragmentsBuilder>
  945. <boundaryScanner name="default"
  946. default="true"
  947. class="solr.highlight.SimpleBoundaryScanner">
  948. <lst name="defaults">
  949. <str name="hl.bs.maxScan">10</str>
  950. <str name="hl.bs.chars">.,!? &#9;&#10;&#13;</str>
  951. </lst>
  952. </boundaryScanner>
  953. <boundaryScanner name="breakIterator"
  954. class="solr.highlight.BreakIteratorBoundaryScanner">
  955. <lst name="defaults">
  956. <!-- type should be one of CHARACTER, WORD(default), LINE and SENTENCE -->
  957. <str name="hl.bs.type">WORD</str>
  958. <!-- language and country are used when constructing Locale object. -->
  959. <!-- And the Locale object will be used when getting instance of BreakIterator -->
  960. <str name="hl.bs.language">en</str>
  961. <str name="hl.bs.country">US</str>
  962. </lst>
  963. </boundaryScanner>
  964. </highlighting>
  965. </searchComponent>
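<!-- Example request (a sketch; the core and field names are
     placeholders, and the field is assumed to be stored): hl=true
     switches highlighting on, hl.fl names the field to highlight and
     hl.fragmenter selects the "regex" fragmenter declared above
     instead of the default "gap" fragmenter:
       /solr/mycore/select?q=content:patent&hl=true&hl.fl=content&hl.fragmenter=regex
  -->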
  966. <!-- Update Processors
  967. Chains of Update Processor Factories for dealing with Update
  968. Requests can be declared, and then used by name in Update
  969. Request Processors
  970. http://wiki.apache.org/solr/UpdateRequestProcessor
  971. -->
  972. <!-- Add unknown fields to the schema
  973. An example field type guessing update processor that will
  974. attempt to parse string-typed field values as Booleans, Longs,
  975. Doubles, or Dates, and then add schema fields with the guessed
  976. field types.
  977. This requires that the schema is both managed and mutable, by
  978. declaring schemaFactory as ManagedIndexSchemaFactory, with
  979. mutable specified as true.
  980. See http://wiki.apache.org/solr/GuessingFieldTypes
  981. -->
  982. <!-- Deduplication
  983. An example dedup update processor that creates the "id" field
  984. on the fly based on the hash code of some other fields. This
  985. example has overwriteDupes set to false since we are using the
  986. id field as the signatureField and Solr will maintain
  987. uniqueness based on that anyway.
  988. -->
  989. <!--
  990. <updateRequestProcessorChain name="dedupe">
  991. <processor class="solr.processor.SignatureUpdateProcessorFactory">
  992. <bool name="enabled">true</bool>
  993. <str name="signatureField">id</str>
  994. <bool name="overwriteDupes">false</bool>
  995. <str name="fields">name,features,cat</str>
  996. <str name="signatureClass">solr.processor.Lookup3Signature</str>
  997. </processor>
  998. <processor class="solr.LogUpdateProcessorFactory" />
  999. <processor class="solr.RunUpdateProcessorFactory" />
  1000. </updateRequestProcessorChain>
  1001. -->
  1002. <!-- Language identification
  1003. This example update chain identifies the language of the incoming
  1004. documents using the langid contrib. The detected language is
  1005. written to field language_s. No field name mapping is done.
  1006. The fields used for detection are text, title, subject and description,
making this example suitable for detecting languages from full-text
  1008. rich documents injected via ExtractingRequestHandler.
  1009. See more about langId at http://wiki.apache.org/solr/LanguageDetection
  1010. -->
  1011. <!--
  1012. <updateRequestProcessorChain name="langid">
  1013. <processor class="org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessorFactory">
  1014. <str name="langid.fl">text,title,subject,description</str>
  1015. <str name="langid.langField">language_s</str>
  1016. <str name="langid.fallback">en</str>
  1017. </processor>
  1018. <processor class="solr.LogUpdateProcessorFactory" />
  1019. <processor class="solr.RunUpdateProcessorFactory" />
  1020. </updateRequestProcessorChain>
  1021. -->
  1022. <!-- Script update processor
  1023. This example hooks in an update processor implemented using JavaScript.
  1024. See more about the script update processor at http://wiki.apache.org/solr/ScriptUpdateProcessor
  1025. -->
  1026. <!--
  1027. <updateRequestProcessorChain name="script">
  1028. <processor class="solr.StatelessScriptUpdateProcessorFactory">
  1029. <str name="script">update-script.js</str>
  1030. <lst name="params">
  1031. <str name="config_param">example config parameter</str>
  1032. </lst>
  1033. </processor>
  1034. <processor class="solr.RunUpdateProcessorFactory" />
  1035. </updateRequestProcessorChain>
  1036. -->
  1037. <!-- Response Writers
  1038. http://wiki.apache.org/solr/QueryResponseWriter
  1039. Request responses will be written using the writer specified by
  1040. the 'wt' request parameter matching the name of a registered
  1041. writer.
  1042. The "default" writer is the default and will be used if 'wt' is
  1043. not specified in the request.
  1044. -->
  1045. <!-- The following response writers are implicitly configured unless
  1046. overridden...
  1047. -->
  1048. <!--
  1049. <queryResponseWriter name="xml"
  1050. default="true"
  1051. class="solr.XMLResponseWriter" />
  1052. <queryResponseWriter name="json" class="solr.JSONResponseWriter"/>
  1053. <queryResponseWriter name="python" class="solr.PythonResponseWriter"/>
  1054. <queryResponseWriter name="ruby" class="solr.RubyResponseWriter"/>
  1055. <queryResponseWriter name="php" class="solr.PHPResponseWriter"/>
  1056. <queryResponseWriter name="phps" class="solr.PHPSerializedResponseWriter"/>
  1057. <queryResponseWriter name="csv" class="solr.CSVResponseWriter"/>
  1058. <queryResponseWriter name="schema.xml" class="solr.SchemaXmlResponseWriter"/>
  1059. -->
  1060. <queryResponseWriter name="json" class="solr.JSONResponseWriter">
  1061. <!-- For the purposes of the tutorial, JSON responses are written as
  1062. plain text so that they are easy to read in *any* browser.
  1063. If you expect a MIME type of "application/json" just remove this override.
  1064. -->
  1065. <str name="content-type">text/plain; charset=UTF-8</str>
  1066. </queryResponseWriter>
  1067. <!--
  1068. Custom response writers can be declared as needed...
  1069. -->
  1070. <queryResponseWriter name="velocity" class="solr.VelocityResponseWriter" startup="lazy">
  1071. <str name="template.base.dir">${velocity.template.base.dir:}</str>
  1072. </queryResponseWriter>
  1073. <!-- XSLT response writer transforms the XML output by any xslt file found
  1074. in Solr's conf/xslt directory. Changes to xslt files are checked for
  1075. every xsltCacheLifetimeSeconds.
  1076. -->
  1077. <queryResponseWriter name="xslt" class="solr.XSLTResponseWriter">
  1078. <int name="xsltCacheLifetimeSeconds">5</int>
  1079. </queryResponseWriter>
  1080. <!-- Query Parsers
  1081. https://lucene.apache.org/solr/guide/query-syntax-and-parsing.html
  1082. Multiple QParserPlugins can be registered by name, and then
  1083. used in either the "defType" param for the QueryComponent (used
  1084. by SearchHandler) or in LocalParams
  1085. -->
  1086. <!-- example of registering a query parser -->
  1087. <!--
  1088. <queryParser name="myparser" class="com.mycompany.MyQParserPlugin"/>
  1089. -->
  1090. <!-- Function Parsers
  1091. http://wiki.apache.org/solr/FunctionQuery
  1092. Multiple ValueSourceParsers can be registered by name, and then
  1093. used as function names when using the "func" QParser.
  1094. -->
  1095. <!-- example of registering a custom function parser -->
  1096. <!--
  1097. <valueSourceParser name="myfunc"
  1098. class="com.mycompany.MyValueSourceParser" />
  1099. -->
  1100. <!-- Document Transformers
  1101. http://wiki.apache.org/solr/DocTransformers
  1102. -->
  1103. <!--
  1104. Could be something like:
  1105. <transformer name="db" class="com.mycompany.LoadFromDatabaseTransformer" >
<str name="connection">jdbc://....</str>
  1107. </transformer>
  1108. To add a constant value to all docs, use:
  1109. <transformer name="mytrans2" class="org.apache.solr.response.transform.ValueAugmenterFactory" >
  1110. <int name="value">5</int>
  1111. </transformer>
  1112. If you want the user to still be able to change it with _value:something_ use this:
  1113. <transformer name="mytrans3" class="org.apache.solr.response.transform.ValueAugmenterFactory" >
  1114. <double name="defaultValue">5</double>
  1115. </transformer>
  1116. If you are using the QueryElevationComponent, you may wish to mark documents that get boosted. The
  1117. EditorialMarkerFactory will do exactly that:
  1118. <transformer name="qecBooster" class="org.apache.solr.response.transform.EditorialMarkerFactory" />
  1119. -->
  1120. <schemaFactory class="ClassicIndexSchemaFactory"/>
  1121. </config>