Gene Haur_4114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4114 
Symbol 
ID5735975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5251387 
End bp5261799 
Gene Length10413 bp 
Protein Length3470 aa 
Translation table11 
GC content52% 
IMG OID641281268 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001546874 
Protein GI159900627 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTAACAG GCATTGCTCA CCTCGCCCGT GAGCAATCGC CTGATTTGCC AACGTTCGCG 
CAGCGCTGTG CCGATTTGCT CAAACAAAGC GGCATGTTTG CTCAAGGGGC GATCTTACGC
CTTGGTCAAC AACGTTCGTT GTTGACTGCA TGGGGCATGA ATAAACGCGC AACCAATCGG
TTGTTGCCTG CTCGCGTTTT AAATGGTGGA AGTGAGGTTC GCCTGCCAGA TGGTTGGCAG
ATGTTTGAGA TTGGCACTGC CGAGGTGCCT GGCGTTTTGG TCGCTCCAGT TACAATTGAG
CCTGAACCAC TGACGGTTTT GGTCGCGCAA CTAACCAGTT TGTGGCAGGG CGTGGCCTTG
CAAGAGGAAT TTGCCCGCCG TGAACGCACG GTTGCAGCCA TGACTGAATC GTTGCAAGGG
TTGGCTCAGC AACTCAATGC CGACGATTTC TTGCAAACCT TGGTCACTCA AGCAACCGAA
TTACTGGGCG CTGCTGGCGG TGGGGTCTAC ATGACCGATT CCAACCAGCA ATACTTGGAG
TTGCGCCAAG TTGTGCAATT TCCGGCCAAT TGGAATGGCG CACGGATTCA GGTTGGCAAG
GGTGTGGCTG GCAAGGTGGC CCAAAGTGGC AAGCCGATGC TGGTCAATGA TTATGCCCAA
GCTAAAGAAA AATACGATAA TCTGCCCGAG GGAATTAACT TCACGGCTGT GATGGCCGCG
CCGCTCCGCG CCGACGAGGC GATTATCGGG GTCTTGGTGC TGGTGCATAT CGAGCCTGAT
CGCGGTTTTC AAAATGCTGA TTTGGCCTTG CTTGAATCAT TCGCTGCCCA AGCTTCGTTG
GCGATGCGCA CAGCTAAACT CTTCGATGCC CAACGTCAAC GCTCGCGCGA ATTGTATTTG
CTGTATGAAA ATAGCTTGAC GGTTGGCTCA TCACTTGATC TTAGCCATAT TCTCAATCGA
TTAACTGAAA ATGTCTTGCT GGCCCTTGGG GTCGAGCAAT GTTTGCTGTT GCTATGGGAT
GATCGGCGCA AACTCTGTGA GTTGGTGGCT CAAGCCACCG ACGATGATGC TGCCAATTCG
CTTGATCTCT CGATTGGGGC AACCTATGAG TTGCGGCCTG AATCGATTTT GAAATTATCG
TTTGATACCC AACAACCGGT CGTGGTTGTC GATATTGATA CTGACCCGCG GATCAATGCT
CAACGTTCAT GGCTCAAGTC CCGTGGCATT CGCAGCGCTT TAGGTTTGCC GATGCTGCTC
AAGGAGCGGG TAATTGGGAC ATTGCTCTGT ACCACTACCT CGAAAACCCG TACCTTCAAT
CCTGGCGAAA TTACCTTGGC CCAAACCTTG GCGGCCCAAG CAGCCACAGC GATTGGCAAT
ACGCGCTTAT TAAATGATGA GCGCCGCCGC AATGCTGAGC TTTCCGTGCT GCAATCGCTG
AGCGCCAAAT TGACCTCAGG CGTGAGCCTG CAAGTTGCAC TCGAAAGCAT TGGCGAAAGC
GTTGTCCAGC TCTTTGACAA TATCGATCTG GAAATTTGTC TCTACGATCC GCAAAATCAG
GTCTTGAATA GCCAGTTTGC CACGCCACGC ACTCGCCAGC ACTATGCTAG CGAAGGTGGT
TCGTATGGCA TCGATCAAGG CTTGACTGGT TGGTTGGCGC GGCATCGCTC CACCTTGCGC
ATCGACGATT TGCAACGCCA AAAGATTGTC AAGCCAATTC GACCCCACGA AACTGAAAGT
GGCTTGGCCT TCCGCTCATT CTTGGGCGTG CCAATGCTGA TTGGCGATCA ATTGATCGGG
ACGTTGGAGC TTGGTTCAAG CACGGTTGGG CGGTTTGATG CTGAAGATGA GCGCTTGTTG
AATATTATTT CTTCACAAGC AGCCCAAGCG TTGCGCAATG TTCAACGCTA CGAAGCCACC
GACGAAGTGT TGCGCGAACG GGTACGCGAG CTTTTGGCGC TCCAACGCAT CAGCCGCGAA
CTAACCTCGA CCTTGCAGCT AGAGCAATTG CTGCCCGCGA TGTTAACCGA AATCACCCAA
GCTACTGGCT GTGGCTATGG GATTGTCGTG CTGCATAACG AGGATGAATC GCTGCAAGTT
ATCGCACAAA CTGGCTACAA TTCAGCCGAA GCTGCTGGCG TTCTGGCGCT ACCATTGCTG
AATAATGGCT TGCTGAGTGC GCCAATGCAA CGCGCCGAAG CCCTGATTTA TGATGATGTA
ACCGTCTTAG AAGCGAATAT TGCCTGGGGC GAAATTCGTT CGTTGCTCAT GGCTCCAATT
TTGTATGAAA ATCGGGTGGC TGGGGCAATT ATTGCTGGCG ATGCCAAAGG TTATTCTTTC
GATCACGCTG CGCTCGATTT TGTACGAGCT GTGGCCGACC AAGCTGCTTT GGCCATCGGT
AATGCTCAAC ACTACGAGGA GCAAGTTAAG CAACGTGAAT TGTTGCAACA ACGCGCAAGC
CTCTTAAATG AAGTGTTGGA GATTGGTAAC GCGCTACGGG CCGATATGGA GCTGAGCAAT
CTGCTTGAGC AAATTGCCTT TAGTGTAACC GAAGCCGCTG GCTATCGCAT GGTCTTGTTC
AATCTGATCG ATCCAGCGCG TCCGACGATT ATGCGCACCG CTGCCGGCGC GGGGATTGCC
CTCTCCGATT TGGAACAATT GCGCAGCGAA GATATTTCGA TCGAAACTGT GCACCCCTTG
CTCGACCCCC AATATCGGAT TGGCCGAGCC TATTACATCA CCCGCGGCAG CGATTCGAAT
GCTTGGCACG ATGGCGATCA ACTGCTCGTG CCGCTCTACT CGACCGAACG CGAATTAATT
GGGATTATGA CGGTCGATGA TCCGTTTACC CACGAAGCGC CAACTCGCCG CACGGTCGAA
ACCTTGGAAA TTTTCGCCAA CCAAGCAGCA ATTGCAATCG AAAATGCTTG GCTGTTTGAT
CAGCGTTCGC GCCAAATTGC CGAGCTGGCG ATTATCAACC GCATTAGCCG TGCTGCAACT
GCTAGCCTCG AATTCAATGA TTTGGCGCGT GAAGTCTACA ATGTGCTCCG CGAAAACCTG
CCAATTCGTG CCTACTATTT GGCAGTCTTT GATACCCAGC GCAATAGTGT GGTCAAATCG
TTGGCGATTG ATGACGAACA CTTTATGCCC GATGTGGTCA ATGGGCCAAT TAACGAAAGC
TCGTTGATGG CACGAATCAT CAAGCAGCGC AAAACGTTGT ATTTCAATGA TATGACCACC
GAATTTCATT ACGACGAGGA GAATAGCCCA CCGCGTAATG ATGAAGGCAA TAGCACCGAT
GTTCCGCGTT CGTGGATTGG CGTGCCACTA TTGTTGGGTG ATGGTACGGT GCGGGGGGTG
CTTTCGCTGC AACATAATGA AGCTGGGCGC TATGGCGAAC GCGATGCGGC GGTGCTCGAT
ACGATTGCCA ACCAATTGGC TGTGGCAATT GAAAACTCGC GCCTCTACAC CGATACCCAA
TCGCGCTTGA ATGAATTGGC GCTGATCAAC AAAATTGGTA GCCTAACCAA TTCAACCCTC
GATTTTGTCG AAATTCTCAA GGGTGTTTAC GAATCATTGC GCACAACCTT GGAATTGAAC
GTCTTCTATA GCTTCGTCTA TGATCCAACT CATGGCGAGA TTGTGCTGCG GGTCAATGTT
GAAGAAGGCA AATTCTTCAT CGAAGAGCAA CGCGAGAAGC TGTTGCCCAA CTCGCCGTCA
GCCCATGTTG TTAACGCCCT TGAACCCTTG GTCTTCCGCG ATATGCCCCA AGAAATTGAG
GGCAAATATA CGATCAAGCG CTTTGGCAAC CCCGATAAAT GGGTACGCTC GTGGATTGGC
GTGCCGCTGA AGATTCGCGA TGACACCGCT GTGGGGATGC TCTCGGTTCA ACACTACGAG
CCAAATATCT ATAGCGAACG TGAAGCTGAG TTGCTGCAAA CAATCGCTAG CCAAGTGGCC
TTGGCGGTGC AAAATGCCCG CCTCTTCGGC GACCGCGAAC GCCAAATTCG CGAGTTGGAT
GCGATTAGCC AAATTGGCCA GTTGATGAGC GCCTCACTTG ATCTGGGTGA AATGCTGGGT
TTGACCGCTG AATACTTACA AGAAGTAACC TCAGCTCCAG TTTTTTACAC CATGATCTAC
GATGCTGAGC ACGACCAAAT TACCGATGGC TATGCTGTCC AGGAAGGTGT GCCAGCTGAA
CGCACGCCGC GTGGCAAGCC ACGCCCAGGC AGCCCAAGCG CGTGGGTGGT GCAAAATCGT
GCGCCCTTGA TTTTGGCCGA TGTTAATAAT AAAGCTGAGT TGCAACAAAA GGGCGTTCAG
CCAATGGCCA ATCCAATCGA GGGCAAAAAT AAATCGCCCC GCTCATGGAT TGGCGTGCCG
ATTATTGCTC GCGACGGCGC ACCAATTGGG ATGCTCTCGT TACAAGATTA TCGGACGAAT
GCCTTTGATC AGCGCACAGT CGCCTTCTTA ACCAATGTGG TTTCGCACAT TAGTTTGGGT
GTGCAAAAAG TTCAGCTGTT CAACGAACGT GATCGTCAAA TTAAAGAGCT GGATGTGATT
CGCCGCGTCG GCCAAGTTAC TAGCTCAACC TTGAACGCTG ATGAATTGAT GCAGGGCGTG
TATAACGTGT TGCGTGAGTT CTTGCCGATT GAAATTTTCA ATCTGAGCGT TTTTGATAGC
GATCTGACGA TACGTATCCA TAGCTTTGTG ATTGATCGTG GCTTGATTAT TCAATATCCC
AAAGCTACTC CGATTACGCC GCACTCGCTA ACGGCTTGGA TTTTGGAGCA TCGTCAACCA
TTGCTGTTCA AGCATGTTGA CGAGGAAATG AAGGCCTATC CCGATATTCA TCCACGGGTG
CAATCTGCCG ATACGGTGCT TTCGCAATCG TTGATGGGTG TGCCATTGCT CACCAGCAAC
GACGAACCGC TTGGGATTTT GCTGCTACAA CACTATAAGC AAGCGATGTT CGATGAACGC
GATTTGCAAT TGCTGATCAA CGTGGCCTAT CAGGTAACGT TGGGGATGCA AAATGTGTTG
TTGTATAGCC AAACTCAGGA TGGTTTGGAG CAACTGGCAA CTGAAGCTGA ACGTTTGGCA
CTGATCAATC ACGTTTCTGA TCTAACCGCA TCGACGCTTG ATACCCAAGT GCTCTACGAT
TTGGCGGTTG AAGAAATGGC CCAAGTGACC GATGCTTCGC AAGCTCGCTT GGTGATTTTC
GATTACGAAG CCCAAGTTGG CTACACCCGC GCCGAATTCC CCAAAGGTGA TATGAGCATC
GAAGTACCGG TGGCCAACAA CGGCACGATT CCATGGATGC AGCGCCATCG CCGCCCATTG
GTCATCAACA ATCCGCTTGA GCACGAATTA ACTGAATCGT TCCGCGAAAC GATCAGCCAA
TTGGGCATTC AAGCGCTGAT GTTGGTTCCG TTGATTGTCA AGGGCGAAGT CGTTGGCTCG
ATCGGGCTTG ATCATATTGG TCAGGGCCGT CATTTCAGCC AACGCGATGC TGAAACTTGC
CAAACCATCG CTAACCAAAT TTCGCAAGGC TTGGAAAATG CCCGTCTATT TGCTGAAACT
GAACGCCAAG CTCATGTGCT GAGCCGCAAA GTTGGCGAAT TGTCGGTGCT CTTGGAAGCC
GGTCAAGCGC TTAGCTCAAT CCACGAGCCA ACTCAAGTCT TGGATACCTT GGTACGCTTG
GTGGCTCGCC AACTCAATAT CGAAACCGTG ATTCTGTTTA CTGGTGATGA AGAACTTCAG
CCAGCAGCCT CGTTGGGCTT GCCAACCGAT TTTGTGCAAA GCTTACGGGT CAAGCGCGGC
GAGGGTTTGG TTGGTACGGT CGCTGAGCAA CGTAAACCGC TAACAACCAG TGCTACCGAT
GGCGAATCGC CGATCATCTC GCAACATGTT GAGTTCAATC GCGACCATGG CCTTTCGGTC
TTTATGGGCG TACCGATTGT CTATCGCAAT GAGTTGCTGG GTGTGTTGAG CGTGATGTCG
GGGGCTGGCG CTTCATTCAA CGAAGATGAT ACAGCGCTGT TAAGTGCTTT GGCCGACCAA
GCGGCAATCG CGATTGAAAA TACTCGCCTG TTTGTTGAGC GTGAACGCCA AATTACGGTG
TTGCAAGCAT TAAATGATGT GACTCAAGCG ATCACTTCGA CGCTTGATCC ACAAACCTTG
CTGCGCCAAT TGCACTTACG CCTGAGCAGC GTGGTCGATA CGCGCTACTC GTTCATCGTG
CTCTACGACA GCGATCATAA TGTGCTGACC TTCCCTGTGG TGATGAATGC TGGCCGTGAG
GAGCGGCTGG AGCCGCAAGC TTTGGCCGAA GGCGTGCTGA GCCGAGTGAT TCGCGGTCGG
CGCACGATTG TGCTGAACAC CATCGAAGAA ACTGCCAATG CCTCGCGCTT CTTGCTGGGC
CACGATACGC CGATGGCTTC GTGGGTCGGG ATTCCGATTA TGCTGGGCGA TATGGTTTTG
GGGGTTATTA CCATCCAAAG CCCCCATCAA CGAGCATTCA GCACCGAAAC CATTCAATTC
TTGCAAGCGG TCGCCAGCCA AACGGCAATT GCGCTTGAAA ATGCCCGCTT GTTTGCTGAT
CGCGAACGTC AAGTAACCGA AGCAGCAATT TTGTCGTCGA TCAGCCAATC GATGACTGCG
ACGCTCTCGC CCGAAGAATT GGCTAGCTCA ATTTTGAAGG GCTTGACCGA AATCTACAAT
ACTGACAATG CGTATATCGT GTTCTACGAT GCGCCAACCA ATATGATCTC GACCACGGTT
GGCTTTAGCA ATGGTCAGCC CTATTCGTTG ATTTCGCATG TACTGAGCAA CAACTTCCTC
AAAACGATGT TGTTTGAGCA ACGTCCATTG ATGTTCAACA GCAGCAGCGA TTTGCTGAAT
GAGCGTTTCT CGCCACGGCT TGAAGGCATT CCCGATGCGA TTGAACCGGA ATCGGTAATT
GCGGCTTCGA TCACCATGGG TAGCCAAACC TATGGGATGA ATATCAATCA GCCGTTGGGC
GTGCTTGTTA TTCAAAGCCC ACAACCTAAC ACATTCAGTC GGGCACAACT TCAGTTCCTC
GAATCGTTGG CTAATCAATC GTCGGCAGCT GTGCAAAAAG CCATGCTGTT TACCGAACGT
GAACGCCGGA TTCGTGAACT GGATACGCTC AACCGCATCA GCCAAGGGAT TACCTCAACC
ATTAGTTTGG ATGAAATCTT AGAGCGCTTG TATGCTGGTT TGGGCGAAAT TGTTGATGTT
AGTACCGCCT TCATTGGCTT GTACGATCCA CAAACCCACT CGATGGAATT CCGTGAAGCC
CATGATCGGG GTGCACCAGT CACGATTGGT TCACGCCGAT TGTCGGTTGG CGTGCCAGCG
TGGGTTATCG AACATCGCCA GCCGTTGTTG CTGAACACCT CGGAAGAAGC CAACGAATAT
CGTGATGCCC AAACCCAAAT CACCTCGGAA ACCAGCGCCA TGCGGGTTGG TGGTCAAGGT
GAGATTGAAC AATCGTATCT GGTTGTGCCA ATTGTGGTCG GCGTTGATGT GATTGGGATT
ATCAATATTC AGAGCTATGA AGAGTATTCG TTCTCTGAAT ATGATATGAG CTTTGTCACC
ACGGTTGCGG CGCAGGCAGG CGCGGCGATT GCCAATGCTC GCTTATTCTC CGAGCGTGAA
CAATCGATTC AGCGCTTGAA TACCCTGAAC AATATCGGCC AAGCCTTGAG TTCGACGGTG
CGTTTCGACG ACTTGCTGCG GGTGATTTAC GAACAAACTG GCAAGTTGGT CAAAACCGAA
AACTTCTATC TGGCGCTGTA TGATGAGCGC AATAAAGAAG TGACCTTCCC ACTGTATTAC
GAATATGGCC ATCCAATTAA CGTGATGCCG CAGCGCGGCG TAAATGGCTT GACTGAATAT
GTCATTCGTA AGCGCCAGCC GTTGTTGCTG CAAGGCCCAC ATATCGCCGA ACGCATGACC
GAAATGAAGG TCGATCAAGT TGGCGATATG GCGCGTTCAT GGCTAGGGAT TCCGCTGATC
GCCGCCGATA AGGTTGTCGG GGTTATGACG GTTCAAAGCT ATGAGCAAGA TAATGCCTAT
AGCGATGAAT TGGTGCAATT GTTGCTCACG ATTGCCTCGC AAGCCGCCCA AGCCCTGGAA
AATGCACGGC TCTTCTCCGA ATCACGCCAG AGCGTGCGCG AACTTTCGAC GCTTTCAGAA
ACCAGCGTTT CCTTGGCCAG TACCTTGGAA ATCGACGAAT TGATGGCGAT TAGTGCTTCG
AGCGCAATCG AAATGTCGCG GGCCGATTTC GGCGGGATTA TCGTGGTTGG AGGCGATGGC
TATACAATTA CCAACTCATT GGCGCTCAAC CGCGACCATA TGGAATTGGA ACTGCCCGAT
ACCAACGAGC TTGATGTTGG CAGCATGGAA ATTCTGCGGC CATTGCGCTC AGGCCATCCA
TTGGCGTTGT TCGATGCTAG CACCGACGAA GGCTTAGCGC CATTCGTTAA TAGCATCGGC
ATGCGTGGCT CGATCTTCTT GCCAATGTTG CGCGAAGGCT TGCGCGGCGT GGTGTTCGTG
GGGATGGACG AACCATTTAC CTTCAACGAA CGCACAATTT CGAGCTTGAT GATTTTGGCA
ACCCAAATTG GCCAAGCGAT CAATAATGCT CAATTGTTCG ATCAAATTCG CCGTTTCAAC
CTTGAACTCG AAGAAATCGT CGATCAACGA ACCTTGGAAC TTAAGGGCGA AAAGGAACGG
GTCGAAGCGC TCTACAACAT TGCAACCGAA CTTGGGACGA CCCTCGACCG CGACGAACTC
TTGTTGCGCA CACTTGATCT TGCGGCTTCG GCCTTGATGG TACGGCGTGG GGCGGTCTTC
TTGCTCGACC GCGAATCCAA AGATTTGGTG TGTCATGCAG TGCTCAACGA AGAATTGGGC
TTGAAATCAA TCGAAACGCG AGTACGCTTC CAACACCCAG GCTTGGCCAA CTGGATCATC
GAACACAACG AAGGCGTGGT GATTTCCGAT GTTTGGTTGG ATGAGCGCTG GACGAATGCT
CAAAGTGGTC GCGGCGACGA TGTACGCTCG GTGATTGCTG TGCCATTGCT TTCGAGCGAC
GCACCGCAAG GTGTGTTGAT GCTCTATAGC AACGAAATTG GCTTCTTCAC CCAAGATCAC
CTGCGCTTCC TCTCGACGAT TGGTGGTGAA GTTTCTTCGG CCTTGCACAA CGCCGACCTT
TACACCTTGG TGTATGATTC GGCTGACCGG CTCTCTGATG CGATGTGGCA ACAACGCGAA
GAAGCCAGCA AGACCGCTGC TATTCTCCAG AGTGTTTCGG AAGGGGTTAT GGTGCTTGAC
CACCAAACCG AGAAGATCAT TCTGTACAAC CCTGCCGCCG AAGATGTGCT GCGGATTCCG
CGCTCGGAAG TGATGCACAA CAGCTTGCAA GTGGTGGCGG CTCCGCGCAA CGAAGAAGAA
CTCCACGAAG AAGGTCGTTC GTTGCTGCTG TATGCTGGTT TGCACGAAGG CATTCAGGTG
GTGCAACGCT CTGAAGGCGT GCATCGCAGC ATGATCGAAT TGCCAGGCCA ATCGATTGCG
GCTAACTTTG CCCCAGTGGT TGGCGAAGAA AGCTCACGCT TCGGGGTTGT GGTGGTCTTG
CGCGATATCA CCCGTGAAGT TGAAGCCGAT CGGGCTAAGC GCGACTTTGT GGCGACGGTT
TCGCACGAGT TGCGCACGCC GCTTACTCCA ATTCGCGGCT TTGTCGATTT GCTCTTGCTG
GGTGCGGTTG GCCAACTCTC CGACCCACAA CGCGAAATGT TGAATACTGT CAAGACCAAT
GCAATGCGCA TGGTGGGCTT GGTCGAAGAC CTGTTGGAAA TTGGGCGCTT GGAAGCAGGC
AAGATTGTCT TGAACACTGC GCCGAACCAA ATCAACCAAT TGGTGCGCGA TATCGTGGCG
ACTTGGGGAC TTGAGATCGA GAAGAAGAAT ATGACGCTGA AACTCGAGCT TGACGACACA
TTGCCATTAA TCGAGTATGA TAGTAAGCGG ATTGGACAGG TCTTGACGAA CATGGTTTCG
AATGCGATCA AGTATACCTA CGCTGGTGGC GATGTGATAA TCCGCACCTT CATCAACGAG
GACAAGATGA TCCAGCTTGA TGTCAAAGAT ACCGGGGTCG GTTTGACAGC CGAGCAACAA
AAGAGCATGT TCAAACGATT CTATCGAGCC GATAGTCCCC TACGTGATGA AGTTGGCGGC
ACAGGCTTAG GGCTTTCCAT CGCTAAGTCG TTTATCGAAT TGCATAATGG CGATATGTGG
GTGCAAAGTG TCTATGGTGA GGGCAGTACA TTCAGCTTCT CGCTACCAGA AGTGCAACCA
CGCCCCGACT TAGGCGAACG TGATGAGGTC TAG
 
Protein sequence
MLTGIAHLAR EQSPDLPTFA QRCADLLKQS GMFAQGAILR LGQQRSLLTA WGMNKRATNR 
LLPARVLNGG SEVRLPDGWQ MFEIGTAEVP GVLVAPVTIE PEPLTVLVAQ LTSLWQGVAL
QEEFARRERT VAAMTESLQG LAQQLNADDF LQTLVTQATE LLGAAGGGVY MTDSNQQYLE
LRQVVQFPAN WNGARIQVGK GVAGKVAQSG KPMLVNDYAQ AKEKYDNLPE GINFTAVMAA
PLRADEAIIG VLVLVHIEPD RGFQNADLAL LESFAAQASL AMRTAKLFDA QRQRSRELYL
LYENSLTVGS SLDLSHILNR LTENVLLALG VEQCLLLLWD DRRKLCELVA QATDDDAANS
LDLSIGATYE LRPESILKLS FDTQQPVVVV DIDTDPRINA QRSWLKSRGI RSALGLPMLL
KERVIGTLLC TTTSKTRTFN PGEITLAQTL AAQAATAIGN TRLLNDERRR NAELSVLQSL
SAKLTSGVSL QVALESIGES VVQLFDNIDL EICLYDPQNQ VLNSQFATPR TRQHYASEGG
SYGIDQGLTG WLARHRSTLR IDDLQRQKIV KPIRPHETES GLAFRSFLGV PMLIGDQLIG
TLELGSSTVG RFDAEDERLL NIISSQAAQA LRNVQRYEAT DEVLRERVRE LLALQRISRE
LTSTLQLEQL LPAMLTEITQ ATGCGYGIVV LHNEDESLQV IAQTGYNSAE AAGVLALPLL
NNGLLSAPMQ RAEALIYDDV TVLEANIAWG EIRSLLMAPI LYENRVAGAI IAGDAKGYSF
DHAALDFVRA VADQAALAIG NAQHYEEQVK QRELLQQRAS LLNEVLEIGN ALRADMELSN
LLEQIAFSVT EAAGYRMVLF NLIDPARPTI MRTAAGAGIA LSDLEQLRSE DISIETVHPL
LDPQYRIGRA YYITRGSDSN AWHDGDQLLV PLYSTERELI GIMTVDDPFT HEAPTRRTVE
TLEIFANQAA IAIENAWLFD QRSRQIAELA IINRISRAAT ASLEFNDLAR EVYNVLRENL
PIRAYYLAVF DTQRNSVVKS LAIDDEHFMP DVVNGPINES SLMARIIKQR KTLYFNDMTT
EFHYDEENSP PRNDEGNSTD VPRSWIGVPL LLGDGTVRGV LSLQHNEAGR YGERDAAVLD
TIANQLAVAI ENSRLYTDTQ SRLNELALIN KIGSLTNSTL DFVEILKGVY ESLRTTLELN
VFYSFVYDPT HGEIVLRVNV EEGKFFIEEQ REKLLPNSPS AHVVNALEPL VFRDMPQEIE
GKYTIKRFGN PDKWVRSWIG VPLKIRDDTA VGMLSVQHYE PNIYSEREAE LLQTIASQVA
LAVQNARLFG DRERQIRELD AISQIGQLMS ASLDLGEMLG LTAEYLQEVT SAPVFYTMIY
DAEHDQITDG YAVQEGVPAE RTPRGKPRPG SPSAWVVQNR APLILADVNN KAELQQKGVQ
PMANPIEGKN KSPRSWIGVP IIARDGAPIG MLSLQDYRTN AFDQRTVAFL TNVVSHISLG
VQKVQLFNER DRQIKELDVI RRVGQVTSST LNADELMQGV YNVLREFLPI EIFNLSVFDS
DLTIRIHSFV IDRGLIIQYP KATPITPHSL TAWILEHRQP LLFKHVDEEM KAYPDIHPRV
QSADTVLSQS LMGVPLLTSN DEPLGILLLQ HYKQAMFDER DLQLLINVAY QVTLGMQNVL
LYSQTQDGLE QLATEAERLA LINHVSDLTA STLDTQVLYD LAVEEMAQVT DASQARLVIF
DYEAQVGYTR AEFPKGDMSI EVPVANNGTI PWMQRHRRPL VINNPLEHEL TESFRETISQ
LGIQALMLVP LIVKGEVVGS IGLDHIGQGR HFSQRDAETC QTIANQISQG LENARLFAET
ERQAHVLSRK VGELSVLLEA GQALSSIHEP TQVLDTLVRL VARQLNIETV ILFTGDEELQ
PAASLGLPTD FVQSLRVKRG EGLVGTVAEQ RKPLTTSATD GESPIISQHV EFNRDHGLSV
FMGVPIVYRN ELLGVLSVMS GAGASFNEDD TALLSALADQ AAIAIENTRL FVERERQITV
LQALNDVTQA ITSTLDPQTL LRQLHLRLSS VVDTRYSFIV LYDSDHNVLT FPVVMNAGRE
ERLEPQALAE GVLSRVIRGR RTIVLNTIEE TANASRFLLG HDTPMASWVG IPIMLGDMVL
GVITIQSPHQ RAFSTETIQF LQAVASQTAI ALENARLFAD RERQVTEAAI LSSISQSMTA
TLSPEELASS ILKGLTEIYN TDNAYIVFYD APTNMISTTV GFSNGQPYSL ISHVLSNNFL
KTMLFEQRPL MFNSSSDLLN ERFSPRLEGI PDAIEPESVI AASITMGSQT YGMNINQPLG
VLVIQSPQPN TFSRAQLQFL ESLANQSSAA VQKAMLFTER ERRIRELDTL NRISQGITST
ISLDEILERL YAGLGEIVDV STAFIGLYDP QTHSMEFREA HDRGAPVTIG SRRLSVGVPA
WVIEHRQPLL LNTSEEANEY RDAQTQITSE TSAMRVGGQG EIEQSYLVVP IVVGVDVIGI
INIQSYEEYS FSEYDMSFVT TVAAQAGAAI ANARLFSERE QSIQRLNTLN NIGQALSSTV
RFDDLLRVIY EQTGKLVKTE NFYLALYDER NKEVTFPLYY EYGHPINVMP QRGVNGLTEY
VIRKRQPLLL QGPHIAERMT EMKVDQVGDM ARSWLGIPLI AADKVVGVMT VQSYEQDNAY
SDELVQLLLT IASQAAQALE NARLFSESRQ SVRELSTLSE TSVSLASTLE IDELMAISAS
SAIEMSRADF GGIIVVGGDG YTITNSLALN RDHMELELPD TNELDVGSME ILRPLRSGHP
LALFDASTDE GLAPFVNSIG MRGSIFLPML REGLRGVVFV GMDEPFTFNE RTISSLMILA
TQIGQAINNA QLFDQIRRFN LELEEIVDQR TLELKGEKER VEALYNIATE LGTTLDRDEL
LLRTLDLAAS ALMVRRGAVF LLDRESKDLV CHAVLNEELG LKSIETRVRF QHPGLANWII
EHNEGVVISD VWLDERWTNA QSGRGDDVRS VIAVPLLSSD APQGVLMLYS NEIGFFTQDH
LRFLSTIGGE VSSALHNADL YTLVYDSADR LSDAMWQQRE EASKTAAILQ SVSEGVMVLD
HQTEKIILYN PAAEDVLRIP RSEVMHNSLQ VVAAPRNEEE LHEEGRSLLL YAGLHEGIQV
VQRSEGVHRS MIELPGQSIA ANFAPVVGEE SSRFGVVVVL RDITREVEAD RAKRDFVATV
SHELRTPLTP IRGFVDLLLL GAVGQLSDPQ REMLNTVKTN AMRMVGLVED LLEIGRLEAG
KIVLNTAPNQ INQLVRDIVA TWGLEIEKKN MTLKLELDDT LPLIEYDSKR IGQVLTNMVS
NAIKYTYAGG DVIIRTFINE DKMIQLDVKD TGVGLTAEQQ KSMFKRFYRA DSPLRDEVGG
TGLGLSIAKS FIELHNGDMW VQSVYGEGST FSFSLPEVQP RPDLGERDEV