Gene Sare_2407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2407 
Symbol 
ID5703691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2762729 
End bp2772034 
Gene Length9306 bp 
Protein Length3101 aa 
Translation table11 
GC content70% 
IMG OID641271884 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001537255 
Protein GI159038002 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.195315 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCACT CCCTCGACGC CGTATCCCCG GAGAAGATCG CCATTGTCGG CCTCGGCTGC 
CGATTGCCGG GCGGCGCCTC GGACCACCGG TCGTTCTGGC GCAATCTGGT CAATGGTAAG
GACTGCATCA CCGATACACC TGCAGACCGG TACGACGTGC GCACGCTCGG CAGCCGGGAC
AAGGCCAAGC CGGGCCGGCT GGTCGGCGGC CGGGGCGGAT ACATCGACGG CGTGGCCGAC
TTCGACCCGG CCTTCTTCGG AATCAGCCCA CGCGAAGCCG CGCATATGGA CCCGCAGCAG
CGCAAGCTGC TGGAGGTCGC CTGGGAGGCG CTGGAGGACG GCGGACAGAT TCCGGGTGAC
CTGGCCGGTA CGGACGTTGG CGTCTTCATC GGCGCATTCA CACTGGACTA CAAAATCCTC
CAGTTCGCTG ACCTCAGCTT CGAGACCCTG GCGGCACACA CCGCGACCGG AACCATGATG
ACGATGGTCT CCAACCGCGT CTCCTACTGC TTCGACTTTC GCGGGCCGAG CCTGTCGATC
GACACCGCGT GTAGCTCATC GCTGGTCGCC GTGCATCTGG CCCGGCAAAG CCTGCTGCGC
GGGGAGAGCC GGATCGCGCT GGCCGGCGGC ACGCTGCTGC ACCTCACCCC GCAGTACACG
ATCGCCGAGA CCAAGGGCGG CTTCCTGTCA CCGGAGGGCC GGTCGCGCCC ATTCGACGCC
TCCGCGAACG GGTACGTGCG CGCGGAGGGG GTCGGGGTCG TCGTGTTGAA ACGGCTGTCC
GACGCCGTGC GCGACGGCGA CCCGATCTAC GCCGTGATCG CCGGCAGCGG TGTCAACCAG
GACGGCCGCA CCAACGGCAT CACCGTCCCG AACGCGGACA TGCAGACATC GTTGATCGAG
CGGGTCTGCG CCGAGGCCGG GGTGTTGCCG GGCAGCCTGC AATACATCGA GGCGCACGGC
ACGTCCACCC CGATAGGCGA CCCGATCGAG GCGAACGCAC TCAGCCGCGT GCTGGCCATC
GGACGCACGC CGGGCGAGCG GTGCTATGTG GGCTCCGTCA AGACGAACAT CGGGCACACC
GAGTCGGCCG CGGGAGTCGC CGGCCTGATC AAGACCGCGC TGGCGCTCAA GCACCGGATC
ATCCCCCCGC ACATCAACTT CGAGCGGATC AACCCCGCGA TCGACGAGGC ATCCCTGTCG
TTCGAGATCC CCACCGAGCC GACTCCCTGG CCGAGGCACA CCGGACCGGC TCGCGCCGGG
GTCAACTCGT TCGGGTTCGG TGGCACTAAC GCGCACGTGC TCCTGGAGAG CGCACCGACG
GCCACAGTGG ATTCAGTCGC GATGCCCGTG CCGCCGAGCC ACACCATCCT GCCGCTGACC
GCCCGCGACC CGGCGGTGCT GCTGCGGCTC GCCGCCGGCA TCCACGCGGA GCTGACCGGC
AGCGACGTCA CGCCGGCAGA CCTCGGGCAC ACGCTGTCCC ACCGACGGCA GCATCACGAC
TTCCGACTGT CGATCGTCTA CTCCTCACGC GAGTCGTTGG CCGAGGCGCT GGCCGCCTAC
GGCTGCGGTG AGCGGCACCT CCGGGTGCTC ACCGGCCAGC GCCGGGACCC GGCGGACCGG
CGACTGGTGT GGGTGTTCAC CGGCATGGGC CCGCAGTGGT GGGCAATGGG CCGGCAACTG
TTTGAGATCG AGCCAGTCTA CCGCGCCACG GTCGAACGAT GCGACCGGGT GATCCGCCGG
CTCACCGGAT GGTCGCTGAT CGACGAGCTG AACGCGGACG AGGTCGACTC GCACATGGCG
GAGACGTGGC TGGCCCAGCC GGCCAACTTC GCGGTCCAGA TCGGTCTTGC CGCCCTGTGG
CGCAGCTACG GCATCCGGCC GGATGCGATC GTCGGGCACA GCACCGGCGA GGCGGCGGCG
TTCTACGAGG CCGGCGTGTA CTCGCTGGAT GACGCGGTGC GAGTGGTCGT GCACCGCAGC
CGGCTCCAGC AGAAGCTGGT CGGCGCCGGC ACCATGCTCG CGGTCGGCCT GACCGAGCCC
GAGGCGCGGC AGCGGGTGCT CGCCTACGGC CACGCGATCT CGGTGGCCGC GGTCAACAGT
CCCGGCACCG TCACGCTCGC CGGGGACGAG GACGCGCTCA CCGACCTCGC CGAGAAGCTG
ACAGCCGAGC AGACCTTCGC GAAATTCCTC GCGGTGCAGG TGCCGTATCA CAGCGCGCGG
ATGGATGCGA TCAAGGACGA GTTGCTCACC TCGCTCGCCG GTCTGGTCCC GCGGCAGGCG
CGGGTGCCGC TTTACCTGAC CGCCAGGGAC GGCGTGGCGG ACGGCCCGGA GCTCGACGCC
GGGTACTGGT GGCGCAACGT GCGGGACACC GTCCGGTTCG GCGCCGCAGC CGACCGGCTG
ATCAGTGACG GCTATCGGTT CTTCCTGGAG ATTGGCCCGC ACCCGGTGCT AGGCCACTCG
ATCCAGGAGT GCCTCGCGGA TCGCGACGTG CAGGGCCGCA CCCTGCCGTC GATCCGGCGC
ACCGAGAACG AGCCGGAGCA GATGATCATG TCGCTCGCCG CCCTGTTCAA CGAGGGCTTC
CCGATCGCGT GGGACATTCT CCAGCCGGAC GGGGTTCCGG TGCCCCTTCC GCCATACCCG
TTCAAGACCG ATCGCTACTG GGTGGAGCCG GCGCCGGTAG CGCAGATCCG GCTCGGCCGG
CGCGATCACC CACTGCTGGG ACGGCGGCTG GCCACCGCGG GACCGGTGTG GGAGGTCAAA
CTGGACACCG AGATCGCGCC CTACCTCGAC GATCACCGGA TCCAGGGCAA TGTCGTCTTC
CCCGCCGCCG GATACATCGA GATGGCCGCG CAGGCGGCCC GTGCCGTCAC TGGCGGGGCC
GGGACCACGC TGACCGGCAT CGAGCTGCGC AAGGCGCTGT TTCTCACCAG CGGCGACACC
CGAACCGTGC AGACGACCAT CACCGCGGAG GATGGAGGAT TCACGGTGGC ATCACTCACC
GCTGACCCCG CCGAGCCCAC GGTACACGCG AGTGGGGCGC TGACCGCCGG TCGATCGCAC
GGGCACGCCC CACCACTGGA CGTGACGGCG ATCCGCGACC GCGCCGCGCT GCACCTGAGC
TCCGAGGCGT GCTATACCGG GCTCGCCGCG CAGGGATACG GGTACGGCCC GGCGTTTCAG
GCCATCGCCG AGGTGTGGAT CGGGCCGGAC GAGGCACTGG CCCGGCTCCG GCCACCAGCC
GCGCTGAACG GCGGCGCCGC CGAGTTCCAC ACCCATCCGG CGCTGCTCGA CGCCGCCTTT
CAGACGCTGC TCACCCCGCA GCTGCTAGAG CCCGGCGGAG ACGACGACCG GCGCGGAATC
CGGCTTCCGC TATCCATCGG AGAGGTCTGC CTCGACGCAT TCGGCGATCG GGAGCTGTGG
GTGCACGCCA CGATCACCCG GCGGCACGTC AGCGAACTCG TCGGCGACCT CACGGTATAC
GCGGGGGACG GCACGCCGGT GGGCGCGATC CGTGCGTTCC GGGCCGCCGA CGTGGAGAAG
GCGTCAGCGG CGGTCAGCAT CGGCACAATC GACGGCTGGC TGGCCGAACC GGCCTGGGTT
CCGCTCCCGC CGCTGGCTGA GCCGGCGGCG GACGCGACGC CGGCCGGTGC GCGATGGCTG
ATCCTGGCCG ACGGCACCGG CTTGGCGGAG GAGTTCGCCG GCCTGGTCGT GGCGCACGGC
GGACAGTGTC ACCTGGTCCG GCCCGGCACC GCGCTGCACG TCGACGTCTC CGCGGGACGC
TCGACGGTGC GACCGGACTC CGCCGAGGAT CTGCGCCACG TGCTCGCCGG CAGCGGCGGC
CCGGCCGGTC ACATCCTGCA CTTGTGGAAC CTCGACCTGC CGGACTTCGG CGAGCTCACC
GCCATGGACC TGCCACACTG CGGCACGCTC GGCGCCTACT CTCTGATCGC GCTCGCTCAG
GCGCTGTCTG GCCAGAAGAC CGGCGGCCGG CTGCACGTGG TGACCCGGGG CAGCCAGGCG
GTGGACGGGT CCGGCGTGAC GCAGCCGACC GGCGCACCCG CCTGGGGTGT CGCCCGGGTG
CTGCGGGACC AGGAACTGCC CGGCCACTGC GGGAAGATCG TGGATCTGGG CGTGCCGAGC
CACACGGCGG AGGCAGCGGA GCTGTGGCGG GAGTTCGCTC ACGACGACGA GGATGAGATC
GCGCTGCGCG GCGGGGACCG ATTCACCTGC CGGCTACGGC CGGCGGTGGA ACTGCACCGC
CCGTTGCCGC TGCGGCTCCG GCCCGACGGC GCCTACCTGG TGACCGGCGC GTTCGGCGCG
CTTGGCCGGC TGCTGTGCCG GACCCTGGTG CGCCGAGGCG CGCGGCGGCT CGTGCTGGTC
GGGCGTACCC CGATGCCACC ACGAGAGCGG TGGGACGAAC CCGGGTGGGA CGACGCGACC
CGGGAGCGGA TCGCGTTCCT GACCGAGCTG GAGACACTCG GCTGTCAGCC GCTCGTCGCC
CAGGTCGACG TCACCGACGA GCACGCGCTG ACCCTCTGGC TGGGCGAGTA CCGGCGGCGG
CGGTCCGCGC CGATCCGTGG CGTGTTCCAC CTCGCCGGCC AGGTCCGGGA CACCCGCGTC
GAGCGAATGG ATCGCGAGAC GTTCGACGCC GCCTACGACC CGAAGGTGGC CGGCGCCTAT
CTGCTGCACT GGCACCTGCG CGACGAGCCG CTGGACCACT TCGTGCTCTT CGCGTCGATC
GCCTCGCTGC TGACCACGGC CGGGCAGGCG AACTATGCCG CCGGCAACGC GTTTCTAGAC
GCCCTGGCCC ATCATCGGCG GGCCAGCGGC CGAACGGCGT TGAGCCTGGA CTGGGGGCCG
TGGGCCACTG GAATGATCGA GGAACTGGGC CTGGTCGATC ACTACCGGAA CAACCGCGGC
ATGAGTTCGC TGTCCCCGAA CACGGGCATG GCGGTGCTGG AACGCGTCGC CGGCGAGAGC
CGCCCGCAGC TGATCGTGGC GACGGTGGTC AACTGGCCTA CCTTCCTCTC CTGGTACGAG
CGGCCGCCCG CGCTGGTGAC CGAGCTGGCC GCCACGACCG CGTCGCCGGC GGACGCCGCG
CACAGCAGCT TCCTGGAGGA GTTCCGGGAC GCCGGCGAGG ATGACCGGCA CCGGCTGCTG
AGTGAGCGGT TCGTCGCGGT GGTCGCCGAC GTGCTGCGAA CCCCCGTCGA GACCGTGGAC
CGGTCGGCGG GCGTCACCGC ACTCGGTCTC GACTCGCTGC TCGCCATGGA ACTGCGGGCC
CGGATCTCGG CCGAGCTGCA CGTTGCACTC CCGGTGGTTG CGCTACTCAG CGGCGTCTCG
GTCACCGACC TGATCGGGCA GGTACACGAG GGCCTGCTCG ACGTGCTGGA CGCCGGCGAC
GTAACCGCCT CAGATGTGAC GGTCCACGCC GACGAGGCAC AGTATCCGCT CACCGAGAAC
CAGAACGCGC TGTGGTTCCT CAAGCAGCTG GATCCACACG GCTTCGCCTA CAACATCGGG
GGAGCGGTCG AGGTCGGCGT CCCGATCGAG CCGGACCTCA TGTTCGAGGC CGTCCGGGCG
CTGATCGCCC GGCACCCGAG CCTGCGGGCG AACTTCCTGA TGGAGCAGAG TCGGCCGGTC
CAACGCATCT CGGCGGAGCC TCGGGCGGAC GTGGCACTGA TCGACGTCCG TGGCGACGAG
TGGGACGACG TCTATCAGGC GATCATCCGG GAGTACCGGC GACCGTACGA TCTGGAGCAC
GATCCGCTGC TACGGTTCCG GTTGTTCCGG CGGGGTGAGG ACCGCTGGAT CATTATGAAG
GCCGTCCACC ACATCATCTC GGACGCCATT TCCACGTTCA CGTTTATCGA GGAGCTGTTC
GCCGTCTACG AGGGGCTGCG GCGCGGCGAG ACCGTGTCGC TGCCCCCGGT GGCGTCGTCG
TACCTCGACT TCCTCAACTG GCAGAACCGG TTCCTGGCGA GCCCGCAGGC GAAGCGCTCC
CTGGACTACT GGACGGCCCA CCTGCCGGCC CAGATTCCGA CCCTCCAGCT CACCACGGAC
CACCCGCGGC CCGCAGTGCA GACCCACAAC GGCGCGTCCG AATTCTTCAC GCTCGACCCA
CGGCTGAGCG AACGGGTGCA CACCACGGCC CGGAAGCACA ACGTCACCCC GTTCATGGTG
CTGCTCTGCG CCTACTACCT GTTGCTGCAC CGGTACACCG GGCAGGACGA CGTCATCGTG
GGTAGCCCGG TGACCGGGCG TACCCAGGAG AAGTTCGGCT CGGTCTACGG CTACTTCGTC
AACCCGCTGC CGTTGCGGGC GAACCTGGCC GGCGACCCGG CCGTTGGCGA GTTGCTCGCA
CAGGTGCGCG ACACCGTCCT CAACGGACTG GACAACCAGG AATACCCGTT CGTACTGCTG
GTGGAGCAAC TCGGGCTGCA ACACGACCCG AGGCGGTCGG CGGTCTTCCA GGCGATGTTC
ATCCTGCTGA CGCACAAGGT GGAGACGGAA AAAGATGGCT ACCGGCTGAC ATACATCGAG
CTGCCGGAGG AGGAGGGCCA GTTCGATCTG ACCCTGTCGG TGTACGAGGA GGAGTCCGAC
GCGCGATTCC ACTGCGTATT CAAGTACAAC ACCGACCTCT TTCTGCCGGA CACGATACGG
CGGCTTTCCC GGCACTACGT ACGACTGCTC GACTCGCTGA CGGCGGCGGC CGTGGACACC
CCGATCGCCC GGCTGGAACT GTTTGCCGAG GAGGAGCGAG AGCGGATGCT CCACGAGTGG
AGCGGCGCGG ACCGCCGGGC CGAGTACGGC TCGCCGGTGC ACGAGCTGAT CGTCGCTGCC
GCCGCGGCGC GGCCCGGCGC AGACGCGGTG GTGTCGCCGG GTGGGAACGG GCCGGCGCAG
CGGCTGACCT ACGCGGCGCT CGAACGCCGG TCCCGCGCGC TCGCGCACCG GTTGCGGCGC
CTCGGTGTCC GGCACGGCAC GGTGGTCGCG CTCTGCCACG AGAAGTCCGC GGACTTGATC
GTATCGATCC TCGCGGTGCT CCGGGCCGGC GGCGCCTATC TGCCGCTGGA CCCTGGGTAT
CCGCCGGAAC GCCTGACGTA CCTGGTGGAC AACGCCGGCG CGGCCGTGTT GCTCGCCGAC
GACGCCGGTC TGGCGAGGCT GCCCAGGGCA TCCTGCGACG TGCTGGATGT GGCCGCGCTG
CTGGCACATA CGGATGGGGA GCCCCAAGCA GATCTCTGCG TACGGGTGAC GCACGACGAC
GCGGCCTACG TCATCTACAC CTCCGGATCG ACCGGCATTC CGAAGCCGGT CCGGGTCACC
CACGGCAATC TGGCCGCCGT CCACGCGGGC TGGCGCACCG AGTACGGGCT GGACTCCGAC
GTGCGGGTGC ACCTGCAGAT GGCCGGCGTC GCGTTCGATG TCTTCACCGG CGACCTGGTG
CGTGCGCTCT GCTCCGGCGG CACGCTGGTG CTCGCCGACC GCGATCTGCT CCTCGACCCC
GGCCGCCTGT ACCACACGAT GACCGAGGAG CGGGTGGACT GCGGCGAGTT CGTACCCGCA
GTCGTTCGCG GCCTGCTCAC CCACTGCGAG CGGCACGGGT TGCGGCTGGA CTTCCTGCGC
CTGCTGGTGG TCGGATCGGA TGTCTGGAAA GCGGAGGAGT ACGGGCGGCT GCGCGCGATC
TGCGGTGCCT GCACGCGCGT GGTCAACTCG TACGGGCTCA CTGAGGCAAC CATCGACAGC
ACGTACTTCG AAGGCCCGGT GGACGGCCTG GAACCCGGCC AGATGGTGCC GATCGGCCGA
CCTTTCCCGA ACAGCGCGGT CTACCTGCTC GACCGGCACG GCGAGCCGGT ACCGCCCGGC
GTACCCGGTG AACTCTGGGT CGGCGGGGAC GGTGTGGCGG CCGGCTATCC CGGCGACGAG
GAGCAGACGG CGCAGCACTT CGTCACCCGG ACGCTGAGCC GCCGTCCAGA CGCGGCCGCG
TTGCGGCTCT ACCGCACCGG AGACCTCGGC CGGTGGGACG CCGACGGGGT GCTGCACCTG
CTGGGCCGGC CCGACAACCA GGTCAAGGTG CGCGGGCACC GGATCGAGAC CGGCGAGGTG
GAGTCACACC TGCTGCGCCG GCCCGAGGTG GCGGAGGCGG TGGTCGTGGT CCGCCCGGAC
GCCGCCGGTG AGCCTGCGCT TGCTGCCTAC TGGGTGCCGG CTAGCCCCGG CGAGGCCGCG
CCTGACGCCC GGGACCTGCG CCGGTGGCTC GCCGACCGGC TGCCGACCTT TATGATCCCG
ACCTACCTGA CGGCGCTGGA CGCGCTGCCG CTGACCCCGA ACGGCAAGGT CGACGCGGCC
GCACTGCCGG CACCGCGCCC GGAGGCCGGT GCCGGCGAGC CAGAACCCCC AGTCACGCTG
TACGAGGTCC GAATGGGCGA GCACTGGCGT GCGCTACTCG GCTGCGAGCC TGACCTGCGG
CTGGACTTCT TCGAGGCCGG CGGCACGTCG ATCAAGCTTG TTGAGTTGAT CTACCGGCTC
CGGCAAGAGT TCAACATCGA GATCCAGGTG AGCCGCCTGT TCCGGATCAC CACGCTGCGC
GGCATGGCCG ACACGGTCGA GCAAGTCGTC ACCGGCCGGC TCAGCGGCGG GCAGTCCTAC
CTGACGTTCA ACGCCGACGC CGCCCCGGCG CTGTTCTGCT TCCCGCCGGC CGGCGGGCAC
GGGCTGGTGT ACCGTCAGTT CGCGGCGCAG CTGCCCGAGT GGCGAATCGT CGGGCTCAAC
TACCTGGCCG GGGACGACAA GGTAGCGCGT TACGCGGACC TGGTCGAGCG GCTCCAGCCG
GCCGGTCCCT GGTTACTCCT CGGCTACTCC CTCGGCGGGA ACCTGGCCTT CGAGGTGACA
AAGGAACTGG AGGGCCGGGG CCGGGCCGTC CGCGACGTGA TAATCGTTGA CTCGTACCGC
ATCGTGGAGC CGTTCGAGTT CGGCCCGGAC GAGTTCGCGG CGTTCGAGCG GGAGCTGGCC
GAGCACCTGC TTCGGCACAC CGGGTCGGAG ATCGTCGCCC GGGAGACCCG AGAGCAGGCC
CGGGAGTACA TCGAGTTTTG CCGCCAGACG CCGAACACCG GCATGGTCCA GGCGGCCGTG
ACCGTCCTTA GCGATCGGGA CAAGACCGCG CTCTACGCGG CCGGCGAGCG CGGAAGCTGG
ACCGGCGCGT CGGCCCGCGG GGACACGCTG CTGGCCGGTT CCGGAACGCA CGCCGAGATG
CTCGACCAGG AGCACGCCGT CCACAACGCC GCCCTGATCC GCGAGGTCCT GGGTGCCCCC
AACTGA
 
Protein sequence
MRHSLDAVSP EKIAIVGLGC RLPGGASDHR SFWRNLVNGK DCITDTPADR YDVRTLGSRD 
KAKPGRLVGG RGGYIDGVAD FDPAFFGISP REAAHMDPQQ RKLLEVAWEA LEDGGQIPGD
LAGTDVGVFI GAFTLDYKIL QFADLSFETL AAHTATGTMM TMVSNRVSYC FDFRGPSLSI
DTACSSSLVA VHLARQSLLR GESRIALAGG TLLHLTPQYT IAETKGGFLS PEGRSRPFDA
SANGYVRAEG VGVVVLKRLS DAVRDGDPIY AVIAGSGVNQ DGRTNGITVP NADMQTSLIE
RVCAEAGVLP GSLQYIEAHG TSTPIGDPIE ANALSRVLAI GRTPGERCYV GSVKTNIGHT
ESAAGVAGLI KTALALKHRI IPPHINFERI NPAIDEASLS FEIPTEPTPW PRHTGPARAG
VNSFGFGGTN AHVLLESAPT ATVDSVAMPV PPSHTILPLT ARDPAVLLRL AAGIHAELTG
SDVTPADLGH TLSHRRQHHD FRLSIVYSSR ESLAEALAAY GCGERHLRVL TGQRRDPADR
RLVWVFTGMG PQWWAMGRQL FEIEPVYRAT VERCDRVIRR LTGWSLIDEL NADEVDSHMA
ETWLAQPANF AVQIGLAALW RSYGIRPDAI VGHSTGEAAA FYEAGVYSLD DAVRVVVHRS
RLQQKLVGAG TMLAVGLTEP EARQRVLAYG HAISVAAVNS PGTVTLAGDE DALTDLAEKL
TAEQTFAKFL AVQVPYHSAR MDAIKDELLT SLAGLVPRQA RVPLYLTARD GVADGPELDA
GYWWRNVRDT VRFGAAADRL ISDGYRFFLE IGPHPVLGHS IQECLADRDV QGRTLPSIRR
TENEPEQMIM SLAALFNEGF PIAWDILQPD GVPVPLPPYP FKTDRYWVEP APVAQIRLGR
RDHPLLGRRL ATAGPVWEVK LDTEIAPYLD DHRIQGNVVF PAAGYIEMAA QAARAVTGGA
GTTLTGIELR KALFLTSGDT RTVQTTITAE DGGFTVASLT ADPAEPTVHA SGALTAGRSH
GHAPPLDVTA IRDRAALHLS SEACYTGLAA QGYGYGPAFQ AIAEVWIGPD EALARLRPPA
ALNGGAAEFH THPALLDAAF QTLLTPQLLE PGGDDDRRGI RLPLSIGEVC LDAFGDRELW
VHATITRRHV SELVGDLTVY AGDGTPVGAI RAFRAADVEK ASAAVSIGTI DGWLAEPAWV
PLPPLAEPAA DATPAGARWL ILADGTGLAE EFAGLVVAHG GQCHLVRPGT ALHVDVSAGR
STVRPDSAED LRHVLAGSGG PAGHILHLWN LDLPDFGELT AMDLPHCGTL GAYSLIALAQ
ALSGQKTGGR LHVVTRGSQA VDGSGVTQPT GAPAWGVARV LRDQELPGHC GKIVDLGVPS
HTAEAAELWR EFAHDDEDEI ALRGGDRFTC RLRPAVELHR PLPLRLRPDG AYLVTGAFGA
LGRLLCRTLV RRGARRLVLV GRTPMPPRER WDEPGWDDAT RERIAFLTEL ETLGCQPLVA
QVDVTDEHAL TLWLGEYRRR RSAPIRGVFH LAGQVRDTRV ERMDRETFDA AYDPKVAGAY
LLHWHLRDEP LDHFVLFASI ASLLTTAGQA NYAAGNAFLD ALAHHRRASG RTALSLDWGP
WATGMIEELG LVDHYRNNRG MSSLSPNTGM AVLERVAGES RPQLIVATVV NWPTFLSWYE
RPPALVTELA ATTASPADAA HSSFLEEFRD AGEDDRHRLL SERFVAVVAD VLRTPVETVD
RSAGVTALGL DSLLAMELRA RISAELHVAL PVVALLSGVS VTDLIGQVHE GLLDVLDAGD
VTASDVTVHA DEAQYPLTEN QNALWFLKQL DPHGFAYNIG GAVEVGVPIE PDLMFEAVRA
LIARHPSLRA NFLMEQSRPV QRISAEPRAD VALIDVRGDE WDDVYQAIIR EYRRPYDLEH
DPLLRFRLFR RGEDRWIIMK AVHHIISDAI STFTFIEELF AVYEGLRRGE TVSLPPVASS
YLDFLNWQNR FLASPQAKRS LDYWTAHLPA QIPTLQLTTD HPRPAVQTHN GASEFFTLDP
RLSERVHTTA RKHNVTPFMV LLCAYYLLLH RYTGQDDVIV GSPVTGRTQE KFGSVYGYFV
NPLPLRANLA GDPAVGELLA QVRDTVLNGL DNQEYPFVLL VEQLGLQHDP RRSAVFQAMF
ILLTHKVETE KDGYRLTYIE LPEEEGQFDL TLSVYEEESD ARFHCVFKYN TDLFLPDTIR
RLSRHYVRLL DSLTAAAVDT PIARLELFAE EERERMLHEW SGADRRAEYG SPVHELIVAA
AAARPGADAV VSPGGNGPAQ RLTYAALERR SRALAHRLRR LGVRHGTVVA LCHEKSADLI
VSILAVLRAG GAYLPLDPGY PPERLTYLVD NAGAAVLLAD DAGLARLPRA SCDVLDVAAL
LAHTDGEPQA DLCVRVTHDD AAYVIYTSGS TGIPKPVRVT HGNLAAVHAG WRTEYGLDSD
VRVHLQMAGV AFDVFTGDLV RALCSGGTLV LADRDLLLDP GRLYHTMTEE RVDCGEFVPA
VVRGLLTHCE RHGLRLDFLR LLVVGSDVWK AEEYGRLRAI CGACTRVVNS YGLTEATIDS
TYFEGPVDGL EPGQMVPIGR PFPNSAVYLL DRHGEPVPPG VPGELWVGGD GVAAGYPGDE
EQTAQHFVTR TLSRRPDAAA LRLYRTGDLG RWDADGVLHL LGRPDNQVKV RGHRIETGEV
ESHLLRRPEV AEAVVVVRPD AAGEPALAAY WVPASPGEAA PDARDLRRWL ADRLPTFMIP
TYLTALDALP LTPNGKVDAA ALPAPRPEAG AGEPEPPVTL YEVRMGEHWR ALLGCEPDLR
LDFFEAGGTS IKLVELIYRL RQEFNIEIQV SRLFRITTLR GMADTVEQVV TGRLSGGQSY
LTFNADAAPA LFCFPPAGGH GLVYRQFAAQ LPEWRIVGLN YLAGDDKVAR YADLVERLQP
AGPWLLLGYS LGGNLAFEVT KELEGRGRAV RDVIIVDSYR IVEPFEFGPD EFAAFERELA
EHLLRHTGSE IVARETREQA REYIEFCRQT PNTGMVQAAV TVLSDRDKTA LYAAGERGSW
TGASARGDTL LAGSGTHAEM LDQEHAVHNA ALIREVLGAP N