Gene Cagg_0012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0012 
Symbol 
ID7269008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp12949 
End bp21300 
Gene Length8352 bp 
Protein Length2783 aa 
Translation table11 
GC content55% 
IMG OID643564884 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_002461401 
Protein GI219846968 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.296104 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACGA CGGTCACCAG GGCACCAACA CAAGGTTGGG AACTCCTGAC CCAATTGGCC 
CTACATAGTC AACAACCACT CGGTACTACC GAGCCGGAAG CGTTGTGCCG CCTGATCGGC
GAGTTACTCG AACGGCACCT TGGCGTCGGC GGTCGCTTGA CGTTATTGGC CGGCGAGCAC
GAGCTGGTCA GCGTGGCATG GGGTGATGCG CCAACGAATG GCTATGCTTT CGATCTCCGC
GACGCGAACC ATCATTATGG CCGGCTTACC TTAGCGGCGA CCTTCGATTC GGCATTTACT
AATGCACTGA CTGCTCAAAT TACTACTCTC CTCAGTATCT GGCAACGATC CCAACTGGCC
GAGCGCATCG AACGATTGCG GGCTCTAGGC TACACAACCT TAGAGCATGC TGGCCTTGGT
GATATGACGG CTGCACTTGA GCAACTGTGT CGTGAAGCAT GCAGCTTGTT GCCGGCGAAA
GCACTCGCCC TGTACTGGAT TAACTTCAAT GATCAGACCC TTTTACGGGC AGTAAGTAGT
GCAGGCAGTA CGGTTTTCCC CTCACGACTG GCGATTGAGG ATAATGAAGC TATTGCCGAA
GCGATCACGT TTCAGACCGG TCAGCATGGT ACCTGGCAAT ATCATTTGTC GCGCCGTGAC
ATACCGACTG AGTGGCCGAT GCTGGTTGAA CCACTCAGTG TTGGCGTTGA GTTAGTTGGA
TTACTGATCC TGATCGATCC CGGACCTGAT GCGGCGATCA CCATCCTGTC GTTGGCTCGT
CCGCTTGCGC TCTTGCTGCA CGCCACATTG TTACAACAAC ACGAAAGTCG GCACGCGCGC
GAACTGTTTG TATTGTACGA GAACAGTCTT GAGATCGGAT CGGGATTGGT CATCGAAGAG
ACCATTGCAC GGGCAACCGA GAACATGGCA TTAGCGCTCA ATGCCGATTT TGGGGCGGTG
TACTTGATCG ATCCGAACCG GCCATATCAA GCACAGACGA TTGCTGTGCA TAGCGAACAT
CTCAGTGGCG TGCGCGGCTT TGATACCATA CCCCTTACCG CAACGCTCCA AAGGTTCATG
GAGCAAAGTG AACCGGTTCT GATAGCCAAC ACGCCACTGA CGGCGCAAGA TAATCCGATA
GCCGCTCACG CAACGATGTT TGGTTGCCAA AGTCTCTGGT TGACACCGTT ACGGAGCAAA
GAGCAGGTTA TTGGATTCTT GGCGATGGGC TACGCCGCCT CGAGCTATAT GCTCGATCAG
ATTGAGCGCA ACCTGTTACA GGTGTTGAGC GCACAGATCG CGACTACCGT ACAACACCGG
CGGTTGTACG ATGCAGCTCA ACAGCGTGCC GGTGAACTGG AACGACTACA AGAGATTAGT
GAGTCGCTCG CTGCTGACCT CTCGCTCGAT GAGACACTAG AGTCGACGAT AGCCGGTGTT
GCTAAGCTCG TCCGATTCAG CGGTGCTCGG ATCACCCTGT ACGACGAACG CTACCAGACG
ATGCGTATCG CTTTCCAGCA TGGATTGCGG TGTCAGGTCG GTGAGGAGGG TGATTTCCTG
AGTTCGTGGA TCGCACGCCA TCAACGACCG CTGCGGATCG GTAATCTTTC CCAACCGCCG
GTGTGGTTGG GGGCACTTGA TAATGCCTGC ATTCTGATCC TCGTTAACGA TATGCCTGCC
CAAAGTTATT TGGGCATGCC GTTGCGGAGT GGCAACACGC TCTTGGGTGC ATTGGAGCTT
GTTTCGACGC AAGCGGGAGC CTTTAGCGCC GAAGATGAGC GGTTGTTGAG CATTATTGCC
GGTCAATCGG CACGTGCCAT CGCCAATGCT AACCGATTTG CACAAGTTGA TAGCAGCTTG
CGCCTGCGCA TCGAACAACT GCGCGCGCTC CAACGGATCG GTAGTCAGTT AGCGATTACC
CTCAACCAAA ACGAGATTCT TGCCTTTGTA CTTGAGCAAG CACTACGCGC TACCGGTGCA
AGTCACGGTT TGATCGCATT ACGAGTAAAT TCCTCGGAGG CGACCAATGG TGGCCGCTCA
ATGACCGAAT TGTTGCACCG ATCACTGACC ACCGACCTTG ATAGTGAAGA CGGTGCATTT
GTGATTGTCG AAGTCATTGG CTACGCACCG CAACTACGTA CCCGACTGCT CGGCAGCGCG
ATTGATCAAG ATGTACATAC TGCCCAAGAG TCAATCGTTC GTCGGGAGAT TGCAATCGGT
GACTACCTCA GTGAAGATGA GCGCAGTGCA TTACTTTGCC CGAACGCCAG TTCAGCGCTG
GCAGCACCCA TCTTCTACCA AGGTAGTGTG TACGGAGTAG TGCTGTTGCT CGCGGAACAA
CCACACTACT TCGACCACGA CGCGGCTGAA TTTCTCCGTG CCCTCACTCA CCAGGCGGCA
ATTGGGATCG GGAATGCTCA GCACTACTTC GAGCTTGAGC ATATGGCCAA AATGCTCCAG
CGGCGAGCCG ATATTCTCAA CGACGTGTTA GAGATTGGGC AGGCCCTGCG CGCCGATCAG
TCGCTGCCGA GTGTGCTTGA GCAGATCGGG TATAGCATCA TTGAAGCGCT CGGCCTGCGG
ACCGTCATGT TCTGCCTTGT TCAGCCCGAT GATCCGAATC GCCTCTATCT GGAGGCGGCT
GCCGGTATTC CTTTATCGGA ACAAGCGCGC TTGGCCGAAC ATCCATTGGC GTTAACACTG
GCGACGCGCT ATCTTGACCC ACGCTTCCGG CTCGGACGAA CATACTTTGT GCCGGCTGCC
GAGGCTCGCT TACTGGAAGC TGGGTTTGAC ACGAGAATAT TTGACTATCA CCCATTCAGC
GATGAGCGGG CTGAGCACGA ATGGCAACTG AATGACCGGC TGTGCGTACC ACTTTATTCG
ACCAACGGGA TGTTGTTGGG CATGATCTTC GCCAGTGACA CCGAAGATCG GCAACGGCCA
ACAGCACGGA TGGTCGAACC ACTTGAGATT TTTGCTGATC AAGCTGCTAT TGCGATTGAA
AACCATCTGC TGTTGCAGGA AGCTCGCGAC CGGGCCGAAG AAATGGCGGC ACTCTTCCAG
GTTGGTGCCG CAGCAACCTC GACAACCGAC CTCGACATCT TACTTGAGCG TGTCTATCAA
GAGATAGTAG CTTTCCTCGG CATACCGGAT TTCTTCTACA TCGCTAGCTA CAATGCCGAG
CAAGAAACGA TACGCTTCGA GCTATTTAAA CAACATGGTG TAACGGTTCC TGCATACCAT
AAGCGTGTCA CACCGAAGCA AGGACTAACT GCTAAGATTA TTGATGAGGG CAAGCCGCTT
CGGATCGATG ACCTTTTGCA AGCAAACGAT CTGCGTAATC AGTTGTTCTT GTTAGTTGAT
GAGGGTCGGC AGGTACGTTC GTGGATCGGT GTGCCGCTGA TCAGCCAAGG AGCGGTGATC
GGCGTTTTAT CGTTACAGAG TGAAACCGCC GCCGCATTTT CTGAACGCAC ATTACGCTTT
TTAGCCGCGC TGGCCAACCA ACTCGCCATC GCGCTTGAAA ATGCTCGTCT CTTCGCCGAT
CGCGAACAAC AGATTCGCGA GCTGAACATT ATCAACCGAA TCGGGCAAAT CATCGCTGCA
ACGCTCGATC AACGGCAAAT GCTGCGCGAT GTGTACGAGC AGTTACGCCA CTTCCTTCCC
CTCGACTCTT TCGTCGCATT TTTGTACGAA CCGGAAAGTG GCGAGATGAC GTTGTGTTAC
GAAGTCGACG AAGGGGTTGA GTCATTTACC GAGCATCGTC AACCACCTAC ACCGGGGAGC
CTAACCGAAC GAATTATTCA GACCCGCCAG CCTTTACAAT TTACCGATCT TAGCATCGAG
GCTGCCCAAG CCGGTTTTCA GCCGGTACGT TTTGGGAGTG AACGTCCCTC GGCAGCATGG
CTCGGCGTGC CATTGCTGAT CGGGGACCAG ACGGTCGTTG GCGTACTGGC AGTGATGAGC
TATACTCCCG GTATCTACCA CGAGCGGGAG CGCGCTTTCC TGACGACGGT TGCCAGCCAA
TTAGCGCTCG GCGTTCAGAA TGCTCGCCTC CTCGAGCGCG CACAAACGCA AGTTGAGCAA
CTGGCATTGA TTAACCGGGT TGCAGCCAAA ACCAATGCAC TGACCGATCT CAAAGCCATT
TACCAAGAGA TCGTCAATGC AATGGCGACC GCGACGGGGG TTGATCAGGC ACGGATGGTG
ATCTACGACC GTGAAAGCGG CTATGCACCG GCCGTCGCCG AGTTTGTCGA TAGTGGACTG
CTCGATCAAC TGCGCATCCA ACTCTTCGAC AATCCTTCGG TTACTTGGCT CGATCGCGAG
AAACGACCGC TGGTTTCAGA AGATGCGCAA AACGATCCAC TGTTTGCGCC GTCGCATGAG
ATATTCCGGG CACTTGATAT TCGTTCCATT GGTATCATCC CGATCATCTT GAATGATGAA
GTGATCGGCG CAGTCGGGCT TGACTTTGTC GGACGAACAG GTACGTTCTC GCCACAGGTC
CTCGAGTTGT GCCAGACCCT GGCCAACCAG ACGAGTACCG CCATTGCACG CGCCCGTGCA
ACGGCTGAAG CGCAACGGAG TGCTGAGGCG ACCCGGCAGA AGGTCAGCGA ACTATCAACG
CTGCTCGACG CTGCTCGGAT CTTGTCGTCG CTGCTCCGCC CGCAAGAGGT GCTCAATAAA
CTGATGGAAT TGGTCAGCCG TCAGTTTAAT GTCACGACGG TAGCACTATG GACGATCAGC
GAAGGCAATG TCCTCACACC GGCAGCCCTC GATGGTATCC CGTCTGAGCA GGGGCGCAGG
ATGCGCGTAC CTATCGGTCA GGGCTTCACC GGACGAGTAG CCGAGACCGG TCAGCCCCTG
ATCATCGAAG ATGTGAATGA AGAGGGCGGA TCGCTCTACC CCAACTACCA GCAGCGCAAT
AATCTCATCT CATTTATGGG TGTACCGGTC ATCTACCGCG AGCAGATCAT CGGTGTGCTT
AGTGTGATGA CCAATTACCG ACGTCGCTTT ACGAACGATG AGATGGTGTT GCTGTCCGGT
CTGGCCAATC AGGCGGCAAC AGCACTGGAG AATGCGCGCC TGTTTGAGGA GCGTGAGCGA
CGTATCAACG AGCTGACTAT TATTAACCGG ATTAGTGCTG AAGTAAACGC TTCGCTCGAT
GTTATTGAAC TGATCGAACG ATTGCACGCC GGGATCGGTG AAATTATCGA TACTAGCACG
TCGCTGATCG CTCTCTACGA TGAAGCGACC AACACGATTA GCTATCCAAT TGCCTACGAC
CGCGGTCAAC GAGTTACGCT TGAGTCGCAA CCACTTGGCT ATGGAACAAA TGGATGGGTC
ATCCGAAACC GTCGTCCGCT CCTACTCGGC ACAGCGTCTG CCGCCCGTGC GATGGGTCTG
CTGATCGACG AGGGGCGGAT CGGCGATACA TCGGCTATCG AGGAGTCGTA TCTGGTCGTT
CCCATTATCT ACGGCAATCA GGTACTGGGT GTGATTAATA TCCAGAGCTA TACGCAACAT
GCCTTCGACG AGAACGATCT GCGTTTCGTC ATGACAGTAA CGAATCAGGC GGCTGTTGCA
TTAAACAACG CCCGCCTGTT TGCCGAAACC CGCCAAAATG CGAATGAGAT GAGTACCCTG
TTCGAGGTGT CGCAGAGCTT ATCGGGCACA CTCGAACCTG ACACAATTCA GATGCTTATC
GCGGATGCAG CGGTGCGCTT GCTTCGTGCC GAACTGGGAG CAGTACTTCG GCTCGACCGG
CGTGGGAATA TTGAGCGCCA GATCCTCATC GATGGCCTTG AATTCCGTGA AGATATTCGT
ATTGATTTTC GGCGCGATGG CTTGACTACC GCCTTGTTAC GACGCGATCA GCCGATTGCG
ATTAGCGATT TAGCCGAGTT CGATGGCGGT GATCACCATG CATTGAAGCT CGGTGTACGT
AGCGCACTGG GCATTGCGGT TGGCCCAATC GAAGAACGCC TCGCCGTGAT CTGGGTTGGC
ATGCGGACAC CTTTTGAATG GACCCAGCAC CAAATTTCGC TGCTCTCGAT CTTGGCCAAC
CAAGCGGCAC AAGCCCTGAA GAGTGCGCAA CTCTTCGCCG TCGAGCAAAA ACGGCGACGA
CAGGCGGATA TGTTGCGTGA GATAGCTCAG TCGTTTACGT CGACCCTTGC GTTGCGCGAA
ATTCAGACAC TGGTCCTCGA GCAGTTGCAG CAGATCGTAC CCTACGATAG CGCAGCAATG
CTCCTGCGTG ATGAGGGGTA CGGTGATCTA CGAGTAGTCG AGGTGCGTGG CACAACGTGT
ACCCTACCGG TTAATAGCAC CTTGATCCTC GAAAATAGTA GTCTCTTCCA ATCGATGGCG
ATTGAGCATC AGCCGATCCT GATCGCCGAT ACGTATCGCG ATCCGCGCTT TCACGAACTA
CGCCGACTGG GCTGGTGTGA TGGTGCGTGG ATCGGAGCGC CGCTGCTGGT TGATAACGAA
CTCGTGGGTG TCTTGACGGT TCACAATATC GAACCGGCTG CCTACGATGA AGAGGCATTA
GCGGTTGTAT TTACTATCGC CAGCCAAGCA AGCCAAGCCA TCCAGAATGC GCGTCTGTTC
GATCAAATTA GCAATCTCGC CGCCGATCTC GATGCGCGTG TGCGCGAACG CACCGCCGAA
CTCGAACAAG CGACCCAGTT GCTGTCGGAA GAGAAGGAAC GCCTCGAGGC AGTCAACCGC
ATTGCCCTAG AACTGACGAC TCAACTCGAT CTCGACTTGA TCATTCAGCG GGCGCTTGCA
TTGATCTCCG ACAGTCTGCA AGTGGATCGC GGGTCGCTCA TGTTGCGTGA TGTGGAAAGT
GGAGCGTTAA TCTGCCGCGC AGTGTTGCAC GGTCGTGGGC AGGTGCAAGC AGCCAATGTG
CCGCTACGAT TCGCGAACAA TACCGAGGGC TTGGCAGGCT GGATTATGCA GCACCAAGAA
CCGGTGAACA TCCGTGACGT TACGCATGAT CCGCGCTGGG TTCAGGAATC GGGTCGTGCC
GATACGGTTC GTTCGGTTGC CGGTGTCCCA TTGCAGACGG GTGATACTGC AATTGGTGTG
ATCATTCTGT CTAGCCCGCA ACCTAATTAC TTCAGCGATT CGCAAATGAA TCTGCTCGGC
ACAATTGCGA GTGTGATCGC TGCTGCCGTC TACAATGCCC AGCTTTACGG TTTCGTGAAC
GATCTGGCGC TCCGCAATGC AGTGCTCCTC GAAGAGCAGC GTGCCGAGTC GTCGAAGAGT
GCCGCTATCT TCCGCTCGAT GACGGAAGGT GTTATTGTGC TCGATACGAC GAATCAGATT
ACGCTGTTCA ACCCGGCTGC GGAACAGATG CTCGATATTG CGGCTGACCT TGTTCTCGGT
CAACCCCTCG CTCATTTAGC CGAGGTTGGG AACGACGACA TCAGCCGCCG ACGTGGGCAG
ACGATCTACC AAGGGATTTT GCACGGCCTA CGCCGGATGC GCGAGACCCA GGGTATCTTT
AGCACATCAA TTGACTTAAC CGATCCGACA CAGGTGATCG CGGTGAATAT CGCGCCGGTT
CGCGGCCTTG ACACCGATTA CGGCACCGTG GCTGTGTTGC GCGATATTAC CCGCGAAATT
GAGGCAGATA AGGAGAAACG TCAGTTTATC TCGGACGTCA GCCACGAGCT ACGTACACCG
TTGACGGCGA TCAAAGGGTA TGTCGATGTC CTGCTGCTTA CGGCCAGTCT GAACCTGACG
CCGGATCAGC TCAGTTATCT GAACATTATC AAGAATAACA CTAACCGACT CCGCGCCCTT
ATCGAAGACA TTCTCGAGTT CTCACGGCCC GACTCGAAGA AGAAACTGAC TTTTACGCAA
GTTGATATTC CAACGGTGAT CGAGGAAGTA GTGCAGTCGT TGCGGTTGGA ATACGAGCGT
AAGGGGATGA AGGTGCGGAT CGATACCTCG CCGTCGCTGC CACCGGTTAT CGCCGACCAG
AAACGGGTTA GTCAGATTAT CTTCAACCTC TTCTCGAACG CGGTGAAATA CACCTACGAA
GGTGGCTCGA TTACCGTTCG AGCGTTCGTT AATCGAGCGA ACATGATGCA AATTGAAGTG
GAAGACACCG GTGTTGGAAT GTCACCGGAG CAGTTGAAGA AACTCTTCCG CCCCTTCTAC
CGCGCCGACA ACCCGCTGCG TGATATTGCC GGTGGTACCG GTTTGGGTCT GAGTATCGCC
AAGCAACTGG TAGAGATGCA TGGTGGCGAG ATCTGGGTGA CCAGCGAGCT AGGTAAAGGC
AGTACGTTCT CGTTTGCCAT CCCGCTCCAG CAAACGAAGA GCACCGACAA CGACGAGGAG
GTGGAGGTAT GA
 
Protein sequence
MDTTVTRAPT QGWELLTQLA LHSQQPLGTT EPEALCRLIG ELLERHLGVG GRLTLLAGEH 
ELVSVAWGDA PTNGYAFDLR DANHHYGRLT LAATFDSAFT NALTAQITTL LSIWQRSQLA
ERIERLRALG YTTLEHAGLG DMTAALEQLC REACSLLPAK ALALYWINFN DQTLLRAVSS
AGSTVFPSRL AIEDNEAIAE AITFQTGQHG TWQYHLSRRD IPTEWPMLVE PLSVGVELVG
LLILIDPGPD AAITILSLAR PLALLLHATL LQQHESRHAR ELFVLYENSL EIGSGLVIEE
TIARATENMA LALNADFGAV YLIDPNRPYQ AQTIAVHSEH LSGVRGFDTI PLTATLQRFM
EQSEPVLIAN TPLTAQDNPI AAHATMFGCQ SLWLTPLRSK EQVIGFLAMG YAASSYMLDQ
IERNLLQVLS AQIATTVQHR RLYDAAQQRA GELERLQEIS ESLAADLSLD ETLESTIAGV
AKLVRFSGAR ITLYDERYQT MRIAFQHGLR CQVGEEGDFL SSWIARHQRP LRIGNLSQPP
VWLGALDNAC ILILVNDMPA QSYLGMPLRS GNTLLGALEL VSTQAGAFSA EDERLLSIIA
GQSARAIANA NRFAQVDSSL RLRIEQLRAL QRIGSQLAIT LNQNEILAFV LEQALRATGA
SHGLIALRVN SSEATNGGRS MTELLHRSLT TDLDSEDGAF VIVEVIGYAP QLRTRLLGSA
IDQDVHTAQE SIVRREIAIG DYLSEDERSA LLCPNASSAL AAPIFYQGSV YGVVLLLAEQ
PHYFDHDAAE FLRALTHQAA IGIGNAQHYF ELEHMAKMLQ RRADILNDVL EIGQALRADQ
SLPSVLEQIG YSIIEALGLR TVMFCLVQPD DPNRLYLEAA AGIPLSEQAR LAEHPLALTL
ATRYLDPRFR LGRTYFVPAA EARLLEAGFD TRIFDYHPFS DERAEHEWQL NDRLCVPLYS
TNGMLLGMIF ASDTEDRQRP TARMVEPLEI FADQAAIAIE NHLLLQEARD RAEEMAALFQ
VGAAATSTTD LDILLERVYQ EIVAFLGIPD FFYIASYNAE QETIRFELFK QHGVTVPAYH
KRVTPKQGLT AKIIDEGKPL RIDDLLQAND LRNQLFLLVD EGRQVRSWIG VPLISQGAVI
GVLSLQSETA AAFSERTLRF LAALANQLAI ALENARLFAD REQQIRELNI INRIGQIIAA
TLDQRQMLRD VYEQLRHFLP LDSFVAFLYE PESGEMTLCY EVDEGVESFT EHRQPPTPGS
LTERIIQTRQ PLQFTDLSIE AAQAGFQPVR FGSERPSAAW LGVPLLIGDQ TVVGVLAVMS
YTPGIYHERE RAFLTTVASQ LALGVQNARL LERAQTQVEQ LALINRVAAK TNALTDLKAI
YQEIVNAMAT ATGVDQARMV IYDRESGYAP AVAEFVDSGL LDQLRIQLFD NPSVTWLDRE
KRPLVSEDAQ NDPLFAPSHE IFRALDIRSI GIIPIILNDE VIGAVGLDFV GRTGTFSPQV
LELCQTLANQ TSTAIARARA TAEAQRSAEA TRQKVSELST LLDAARILSS LLRPQEVLNK
LMELVSRQFN VTTVALWTIS EGNVLTPAAL DGIPSEQGRR MRVPIGQGFT GRVAETGQPL
IIEDVNEEGG SLYPNYQQRN NLISFMGVPV IYREQIIGVL SVMTNYRRRF TNDEMVLLSG
LANQAATALE NARLFEERER RINELTIINR ISAEVNASLD VIELIERLHA GIGEIIDTST
SLIALYDEAT NTISYPIAYD RGQRVTLESQ PLGYGTNGWV IRNRRPLLLG TASAARAMGL
LIDEGRIGDT SAIEESYLVV PIIYGNQVLG VINIQSYTQH AFDENDLRFV MTVTNQAAVA
LNNARLFAET RQNANEMSTL FEVSQSLSGT LEPDTIQMLI ADAAVRLLRA ELGAVLRLDR
RGNIERQILI DGLEFREDIR IDFRRDGLTT ALLRRDQPIA ISDLAEFDGG DHHALKLGVR
SALGIAVGPI EERLAVIWVG MRTPFEWTQH QISLLSILAN QAAQALKSAQ LFAVEQKRRR
QADMLREIAQ SFTSTLALRE IQTLVLEQLQ QIVPYDSAAM LLRDEGYGDL RVVEVRGTTC
TLPVNSTLIL ENSSLFQSMA IEHQPILIAD TYRDPRFHEL RRLGWCDGAW IGAPLLVDNE
LVGVLTVHNI EPAAYDEEAL AVVFTIASQA SQAIQNARLF DQISNLAADL DARVRERTAE
LEQATQLLSE EKERLEAVNR IALELTTQLD LDLIIQRALA LISDSLQVDR GSLMLRDVES
GALICRAVLH GRGQVQAANV PLRFANNTEG LAGWIMQHQE PVNIRDVTHD PRWVQESGRA
DTVRSVAGVP LQTGDTAIGV IILSSPQPNY FSDSQMNLLG TIASVIAAAV YNAQLYGFVN
DLALRNAVLL EEQRAESSKS AAIFRSMTEG VIVLDTTNQI TLFNPAAEQM LDIAADLVLG
QPLAHLAEVG NDDISRRRGQ TIYQGILHGL RRMRETQGIF STSIDLTDPT QVIAVNIAPV
RGLDTDYGTV AVLRDITREI EADKEKRQFI SDVSHELRTP LTAIKGYVDV LLLTASLNLT
PDQLSYLNII KNNTNRLRAL IEDILEFSRP DSKKKLTFTQ VDIPTVIEEV VQSLRLEYER
KGMKVRIDTS PSLPPVIADQ KRVSQIIFNL FSNAVKYTYE GGSITVRAFV NRANMMQIEV
EDTGVGMSPE QLKKLFRPFY RADNPLRDIA GGTGLGLSIA KQLVEMHGGE IWVTSELGKG
STFSFAIPLQ QTKSTDNDEE VEV