Gene HS_0209 details


Gene Information

Locus tag: HS_0209
Symbol: hsf
ID: 4239724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 197001
End bp: 212432
Gene Length: 15432 bp
Protein Length: 5143 aa
Translation table: 11
GC content: 41%
IMG OID: 638103745
Product: large adhesin
Protein accession: YP_718416
Protein GI: 113460355
COG category:
COG ID:
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
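
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the reported gene length includes the stop codon. Neither assumption is stated in the record itself, but both agree with the values shown.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for HS_0209.
gene_len = gene_length_bp(197001, 212432)
prot_len = protein_length_aa(gene_len)
print(gene_len, "bp;", prot_len, "aa")
```

Under these assumptions the computed values match the Gene Length (15432 bp) and Protein Length (5143 aa) fields.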


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA TCTTCAAAAC CAAATACGAT GTAACAACAG GTCAAACGAA AGTGGTGTCT 
GAATTAGCGA ATAACCGTCA GGTGGCGAGC CGTGTCGAGG GAGCGTCGGT CGGTGTGGGT
CAACCGAAGT GCGGTGTGTT TTTAGGCATG TTTAAGGTTC TGCCATTGGC ATTATTGATG
AGCGGGCTGT TATCGAGTGC GGCGTATGGG GCGAATGTGT GGATTGACGT TAATCCTAAT
GGAATAACTG GTGGTGATAA TAAGCCTTCA TCCGTTTGGT ATAATGGAGG TCTATCTGAT
AATAAGAATG AAGTCGTTTT ATTAGCCGAT GGTGAAAGTA CGACAGGAGC AAAGGCTCAA
TATAAAAATA AAGACTTTAA AAGAACGGTA ATTATAGGAT CGCGTGCAGT TGGTGGAGGT
AATGATGCTA CTGCTATAGG TTATCGTGCT ATTGTAGGTA AAAACCTTGC TGAAACGAAT
ACAGATGACA GCCATCAAGG AACGGCAGTG GGATATAGGG CCTTTTCTTA TGGTAATGAA
TCTGTGTCGT TGGGGAATGA TACGGTGGCA TATGGTGAAT CATCTATTTC TATTGGGTCG
GATAATGTAG GTAGAAATAT AGGCAAGTAT TCACGGACAG GATTAAGTTA TGATATTTGG
AAATTATATC GTCAAAATGG ATCGAAGTTT AATTATACGG GTGAATATGC AGCAATAGAT
GGGGCTGACC CTGATACGAG TCGGAGTAAA TATGAAGCAT ACCTAAGCGA AACAAATCCT
ACAAAACCTT TTTACAGAAC TCATAACTGG GCTTTTGGGG ATTCATCTAT TGCTATTGGT
AGTAGAAACG TTGCTTATGG ACATGGATCG CTTGCTATGG GAACATTATC AGTTGCAAAA
GGAGACTATT CAACAGCATT AGGGACTGCT ACTCTGGCAT TTGGTAATTC ATCTGTTGCT
TTAGGAAATG AAAGTTATGT ATATGCAGGT AATTCAATCG GAGTAGGAAA TGAAGTACAA
GCTATTTCAG ATGGTTCAAT GGTATATGGG TATCAATCTT ATGCAGGAGG ACCTGGGTCT
ATAGCAATAG GTAAAAGGGC GTTATCTAAT GTTGAGCCAT CAGCACATTT TAAACAAACT
GTAGAGGATT TTGGAAATCT ATGGTATGAA GGAAAAAGTA CGGTACATGC ACTAGGTATG
TTAGATAATC CACAAAATCA TGGAAAATCA AAAGATTTAG ATGGATATTT TTTACCTGCT
ACAGAGAAAC AATCAGGAAC AGAAGAAGAA AAGGCAAAGG GGAAAAATGG TGGAGCAGTA
GCCCTAGGAT ATTACGTATA TGCTCTAGGA GAGAACTCTG TTGCATTAGG AAGACAAGCT
TATTCTAAAG GAGATAGATC TATTGCAATA GGACCTTATG CTTATGGTGG ATATGAGAAA
ACAATTGCAA TGGGATATGG TGCTAAGGCT ATGAGTAGTC AATCACTAGC ATTAGGAAGT
TTATCAAGAG TTGAGGGTAA AAATTCAGTA GCATTAGGAG TTGAGGCTAA AGTTTTAAAT
GATGAAAGTA ATTCTGAATT AAATGGACAA AACTCAATGG CTTTAGGGAA TAATTCAGAA
GTTACGATGA AAAACTCAGT AGCAATCGGT AATATGTCAA ATACAAGATA TTATTATACA
GGAGATAAAA GTAATCCTAC CCCTTCTTCT AATAATAATA ATAATAATAA TAATAATGCT
ATAACATTGC CTGCGTACAT ACCTAAAGGA ACAAGTTATA GTTATACATC AACTTCAGAT
GATGGTGTAA TTTCAGTAGG TGGTTGGGAT AGAAGGGATG GTAAATTAGG ACGTAGAAGA
ATAATCAATG TAGCACCCGG AGCATTAGAT TCAGACGCAG CGACTGTAGG GCAATTGAAA
GCATTAGAAT ATGCATATAA AGAGGGAGTG GTTGCTTATT ATACGGTAGA AGGTGGTAAA
AATTATAAAG TAGTAAAGGA TGCAGACGGT AAATTTTATA AGGCAAATAC AGAAAATGGA
ACGCCGTTAG ATAGTACAGC AATAGCTGCA GATAAGGTTT TTGTAGGTCC GAAAGGAGCA
AATGAAAGAA GCCAATCAAT ACAAGTTAAT GGTAGGAGAA GAAATGTAGT TGATATGGGC
GATAAAATTA AATTTGCTCA TATTTCAGAT GGAGAGATTA CTAGCGGTTC CGATCAGGCG
ATTACGGGTA ATCAGTTAAA TCAATTAGGT AGTACTATTT TAGGATTAGG TGTACAAAGT
AATGATAAAA CAAAATTTGA TACTCCTAGT TTTACGAAGG TTCAATATGA AGGTTCAACA
GGAACTCAAA ATCATACAAC ATTTAAAAAT GCGATAGATG AAAGCATAAA AGCGATTAAT
AAGGGGCTTG TAATTCAAGG AGATGATAAA CAAGGGAAAA TCTCCCTAGG GTCAACTTTA
AATATTAAAG CAGGAAATAC AAGTGTTACA GACCAAGGAA CAAAAACGGA ATATAAAAGC
GATAATATTA GAACAGCATA TCAACCGAAG AACAAAACAC TTTTAATAGG CATAAAAGAA
AATCCAACCT TTAAAAGTGT AACTGTGAGT GATCCAATTA CAGCGAGTAG TCCAGAAGGA
ACATTAACTA CTAAAAAATA TGTTGACGAC CAACTCAAAA ATGTCTCGAC CAATTTGCAT
TTCTTATCAG TACAAGGGAC GGATAAGACC GCAGGCAACT ACAACAACGA TGGAGCAAAA
GCGAGTTATT CTGTTGCTAT TGGGGTGAAT GCTCAAGTAG TAGATCCTAC TATTGTAGTG
ACTGGAAACA AACCAAAACC GACAGCAACA GCAGGTATAG CGATAGGATA TAACGCTAAA
TCTGAAGCAG AAAATGCGGT TGCTATAGGT AGAGATGTAA GTATTGACGT ACCGAATTCT
TTTGTTATGG GGTCTAATAA TACAGTTACG CAAAGCTTTA AAGAGACAAA TGGTGCGGTT
GTTGTGATCG GTAGCGGAAC AAAATTAGTA GAATCAAAAA GTTCTATCGC TATCGGTGCG
GTTTATAAAG TACATCAGGG TAAAGCAGAT GGCACTTTAA TTGAAAACGC CGCTTGGACA
GCGTCTATCG GGAATAAAAA TAAAATTAAA AATGGTACGG ATATTGTTGC ACTGGGGAAT
AATATTGAAG TCAAAGATGA AACAAATGAG GAAAATCCAA AAAATAAAAC AAGAATTGCT
AATAATGATT TAATTCTGAT TGGTAATGGA GCGACAGCTG AAACCGCAAA AAACAGTGTG
CTTGTTGGGG CAAAATCAAA CGCTGGCAAG GGTGCAAAAA ATGCGGTAAT TATCGGTCAT
AGTGCAGAAG CGAAAGCAGA AGCCGAAGGT GCGGTCGTGA TTGGACAAGG TGCGAATGTT
CAAACTAAAG CAACGGGTGC CATTGCCTTT GGTCAAAGTG CCACTGTCAA TGCTGAAGCA
AGCAATGCCA TTGCCTTTGG TAAGAGTGCT AGTGTTCAAC CCAACGCAAC TAGCGGTATT
GCTTTTGGTC AAAGTGCGAG TGTATCAGTG GCAGACGGTA TAGCCTTAGG AAGTCACTCT
GTTGCAAGTA CAGCTGGCGA TAAAGTAGGG TTTAATACCG TAACTGGTGC GACTAAAGCA
GATCCAGTTT CTCATGGTGC ATGGAAATCA AAATATGGTG CATTGTCGAT TGGTGGCAAT
AATAATGGCA CACGTCAAAT CACCGGAGTC GCCGCAGGGA CAAACGACAC CGATGCCGTC
AACGTGGCAC AGTTAAAAGA AGCGACATTG CATTTTGTGT CTGTTAATGG TGGAAGTAAA
ACAGACGAGA ACTATACTAA TAATGGTGCG ACTAAAACAG GTGCAATTGC ATTAGGAATT
GGAGCAAAAG CGGCGTCTGA AAACTCCATC GCTATGGGTA AAGACTCGAA AATTGAGGCG
GAAATATCGA ATGCCGTCGC TATCGGGGCG AATAATCACC TAAGAGGTCT AAGAGGGAAG
GATAATAAAG ACCAATATAA GCATACGGTG GCTATTGGTT CAGACAATAT CATCACCGGA
CGAAAAATCG TGAATTTAGG TTCGGGGAAT AAGATAGGAA ATGCTGAAGA TAGTTATCAA
ACTGACAAAA AAGCAGGTGC GGTCAGCATA CGAGTGATCG GGGACGATAA TAAGGTTCAT
GGAGTATGGA ATACGGTCAT CGGCGAAGAG AATAAGTTAG AGTCATCAAA CTGGACTCAA
GTGATTGGGG ATTACAATGC CGTCACGAAG TCAGACTATG CCATCGTTAT TAGTAGTAAT
CAAGCCCAAA AAGGTGATCA ACAAAAAACA ACAAATACCG TTACGAATTC AAACTATGCT
ATCGTCATAG GTAACCAAGC CAAAGCAACA GATGCGAAGA ACAGTGTGGT GATCGGAGCG
TCTGCAAACT CTACAGCGGA AAGTGCGGTA GTTTTAGGTA AAGGAGCGAC TGTTCAAGCC
AACGCCACCG GTGCGGTAGC GATTGGGGAG GGTGCGTCTG TATCTACTAA CGCTGGTGAA
TCAATAGCTT TAGGAAAAGG TTCAAAAGCG ACACAAAAAG AAAATACTGT CTCAACATAT
ACAGCAACAC AAACAAGTAA TATTAAATTT AATGGGTTTA GTGGTTCAGG AAATGATAAA
TCAGTTTTAA GTATAGGAGA TGGTGGAAAA GAACGTGTTA TTAAACATGT AGCACCAGGG
AAAATTAGCA ATGATTCAAC AGATGCAATA AATGGAAGTC AGTTATATAG TGTAATAGAT
GTATTTGGTC ATTTAGGAGC AACTGTATTA GGAGCAGAAG TAGATGCAAC TAAAGGATTT
AAACAAAGCA CTTTTGAAAA TGTTAAGTAT AAAGATGGTG GAAGAGATCA AAAAAATACA
TTTAAAGCTG CAATTGATGA AACAATTAAA GCGATTAATA AAGGAATTGT AATTAGTGAT
GGAAGTGAAA CAGGTACACG TCAATTAGGA GAAACTTTAA CAGTTAAAGC TGGGAATATA
GATAAACCAC AAACTGATTC AGATGGATTT AGTAGTAATA ATATTAAAAC TAAATATCTA
AAAGATAATG GAGAAATTTT AATAGGAATA AAAGATAAAC CAGAGTTTAA AGAGGTAACA
GTTAGTCAAG AGCTAACAGA TACAAGCAAA GACAATGTAT TAACAACTAA AAAATATGTT
GATAATAAAT TAGCAAATGT AGCATCTAAA TTTACAGTTA GTGGAAATGA AGGAAGTCAG
TTTGAAATAA ATAAAGATAA TAATAAATTA TCTATTAAGG GAGAAACTAC AAATAACAAT
ATTAAAACAA AAGCGACTTC TTCAAAAGAG ATAGAAATAT CGCTTAGTGA TACATTAACA
GGAATTACAT CAATAGGGAA AAATTCTGAC AATGGAATTA CATTTAATAC AAATGCTACA
ACTATAAAAC TTGGAGGAGT TTCTCTAAGT TTAAATAAAG ATAATACAGC AGTTAAAATT
TCTGGTGTAG CAGATGGAAA AGATACTAAT GATGCAGTAA ATAAAGGGCA ACTGGACAAA
AAACAAGATA AATTGACAGG TACTATCACC GCAAATAATG GGATAAAGCT TACCGGAACT
GCAGCTAATA GTATCGCAAG CGATATCACT TTAAGTTTAG AAGATGGCTT AAAAGAGAAA
ATCGACAATG CGTTAAGTAA AGCCGACGCA GGGAATACTT ATGCGAAAGT AGATGCGTCG
AATATAGAAA ATGGCAATAA AGCTAGCTGG AGAACGACGC TAGACGTCTA TTCTAAAACC
GAAGCCAACG CTGAAATCGC CAAAGCGAAA GAAACAGTAA CAAATGGAGA TGGAATTACA
GTCAGTGCAA CGCCGGACGG CACAGACGGA CCGAAAACCT TTACGGTAGC ATTGGCAGAC
GAGTACAAAA CCAAAATCAA CAGTATCGGC ACGGGTTCTG TCGCAGATAA CGATAAAAAC
ACCGTCACCG GCGGCAAGGT GCATACAGCC ATTGAAGATG CAAAAACTAC TTTAACCTCT
ACAATTAATG GTAAAGCGGA CAAAAATTTA AGCAATATCG ATGAAACGGG TAAAGCAAAA
ATTAAAACGC TTGCTACAGA AGCGATTGAT GTACAAGGTA AAGAGGGCGA GTTAAAAGTT
ACCCCTAGCA CGAGTGGCAA TAAAAAGTCA TTTACGGTTT CATTAAATGA TGACATTAAA
ACTAAAATTG ATGGCATTGG TACTGGAACG GTAGTGGCTG ATAATAACAA AACCGTAACG
GGCGGTGCTG TACATACTGC TATTGAGGCG GCAAAAACTG ACTTAAAGGG TAAACTGTAT
GCAAATACAG CAACATTTGG ACTAAAAGGC AATGATAGTC AAGAAGTGAC CAAAAAATTG
GATAACACTA TCGAAATCAA AGGCACTGAC ACGGCGAAAA GCGGACAAAC CAACATCTAC
GTCTCGAAAG CCGATGAAAA CGGTCTAAAA ATCGAATTGG GAGAAAATTT AAAAGGGATT
AAAAAGATTT CTCAAGGGGA AAAAGCAGTC ATTAGTTTAG AGGACAAAGC GTTAACTTTA
ACGAGTAACA ATAATAAAGT TGCGGTTAAA TCTGACCATG TATTGTCAGA TAAGGACATC
TATATAGGCA CAAAAACTGA TGATAATAAA TTAGTCAAAA AATCCGAATT AGGCGGAGGA
TATATTACTT TTGAAGATGA AAATACATCA AAAGGCACTA AAAATGTGAA TTTTGGTAAG
AAAGTCATAT ATGGAAAATC AGCTGAAATT ACACCAACAG TGGAAGCTTC AATAGATGAC
GCAAAAGTAA CATTTGCTAT TAATGACGCT TCAATTGAGG GAACGAAATT AAAAGACGCT
ACCATAGCGG AGGCAAAACT CGATCAAGGT ATTAAAACCA AACTGAATAA AACGTTTAAA
GTTAAGGCTG GAGATGAATC TAGTGATAAC CTTATCGGCG AAGAATTAGA ATTTGCAACA
AAAGATGCTA ATTTAACCGT TGGATTAGAT GCCACTAAGA AAAAAATAAC TTATGGATTA
AGTTCAGCAT TAACGGGGAT TGAATCTATA GGCAAAGACA ATAATACTAA AATCACTTTC
AAAAATAATG GTCAAAATGA AATAGACTTT ACAGTAGGCA CTAATACAAC CTATAAATTC
ACTGACAGCG GATTAGATTT AGCTAGCAAA CCAATCACAA ATCTAGGAAG CGGTTTAGAG
CAAAAAACGG GTAATGGTGG CAGACAAGAT TTAGATGAGT TTTTAAAACT TACTGCCACA
AATGGGCAAC CAAGCGATGG CAAATTAAAC AAAGCCATCA ATGCAGGCGA TTTGCTTCAT
GTCGCTCAAG GCTTGGTGGG CAAGGGCTTG AAGTTTAAAG CTGACAGTGC AAGTGCGGAC
GGTTCTACAA AAACGGAAAT GACCATAGCA CTTGGACAAG CAGTAACCTT TAAAGGCGAC
GGTAAATACC TCACAACAAA ACTGAACGAC ACCAATGGTG AGATTTCCTT TAACCTCAGC
GTGGCGGAAA GCATTAGTGA TGCTTCAACA GATGGGGATT CAACTCAAAA CACGTCCAAC
AGCAAACTTG TAACTGAAAA TGCGGTGAAA AATTTTGTCA CCAACAAAAT TAATAACCTG
AGCAGTACGT TGCAGTTAGA GGGGGATAAT ACCGAAAATG ATAAAGATCC GGCTGATCCG
ATTGGAAAAG TGGAGTTGAA AACCGAGAAG CTGAAATTGA CTGGGGAGAA TGATTTCATT
GAAACAGAGG TGAAAGCAAA CGACCCAACC GTTACTATTA AATTGGCTCA AAAAGTCAAA
GATAAGCTAG CGATTATTAA GGTTGGTGAG AATACTAACG GCGATAATTC TTTTGCGTTG
GGTAAAGATT CAAAACTAGA GACGAAAAAG ACCGCACCGA CCGGAATCAC ACAGAATGTA
GGTGGTAATG ATGTCACTAT TACTTGGAGT AATGCAGGTG CAGGACAAAA TAAAGAAGTC
GTTAGCGTGG GTGATACCGG CAAAGAACGC ATAATTACCC ATGTTGCTGC AGGTGACGTA
CGTGACGGCT CCACCGACGC CGTCAACGGC GGGCAACTCC ACCGCGTGAT TGATGTATTT
GGTAAATTAG GACTTGACGT CCTCGGAGCG GAAAAAGCGG ATAGCGGAGA TGGCTTTAAG
AAATCAAAAT TTGATGTGGT GAAGACTAGT GATAATACGA CCGATTCAAA TCCAGAGAAA
TCTGAGAAAA CCTTTAAAAC AGCCATTGAA GATAATATTG CAGCCATCAA CAAAGGCTTG
AAATTTGCTG GCGATAACGA CGGAGAGAAA CAACTCTATC TCGGCTCAAC CTTGAACATT
AAAGGGGCGA AAGGCGAGGC TTCAAGTGCG GGTTCAAATT CAGCTTCAAG TGATACAACA
AATAACCACC AAAATATCTT CACTAAAGCC AGCGATACGG GGTTAGAAAT TGCACTGAAT
GAAGCGTTAC AAGGCATTAG CTCGATTAGC GGTAAAAAAG GTACTGACGG AAGTGCGGTA
GCGAAAATTG ACTTTACGAG TAGCGGTACT TCAACCAGCC CAACAGTCAA AATCACCGCA
GATGGCGGAG AATTTACTTT TGGTAAAGAC GGCTTGAACT TAAACAGCAA ACAAATCACA
GGTATCGCAA GCGGTTTAGG CTTGAAAGAT AGTGTTGATG GAAATAACGG CGGAAGTGCT
GGAACTAGCA ATTCTGACAC TGACATTATC AACAAAGTCC TCTCGGGCAA TCCTGATAAA
GATAATAACA ACGCCGTCAA TGTAAAAGAC CTGTCCAAAG TCGCTAACGC CCTTGTGGAA
AAAGGACTAT CCTTTGAGGG GAATGGTGGC AATACGGATA AAGTTACTCG CAAACTCGGT
GATACCTTGA AAATTGTGGG TAAAGGAAGT GATGCTAATT CAATAACAGC CACTGAAAAC
AACATTAAAG TGTCTAAAAA TACAACAAAC GATGGACTGG AAATTGGCTT GTCGGAGACT
TTAGCAGGCA TTAAATCTAT TGCCAATGGC GATAAAGCAA AAATTGCTTT AGATAATGAT
AAGAAAACCA TTACCTTTAC CGCAGGAGAA ACAAATAACA ATGTCACTCT AAGCGAAGGT
GAATTCAGCG GCGTTTCTGA AATTAATAAA GCAGACGGCA AAGGAGCGTT GAAACTTGCT
GATAGCACGG CAACTTTAGA AAGTGCTAGC GGTAATTCCA ACGTTGCCTT GAAAGCCAAT
GAAGCCACCG TTACGGCTGG TACGGATAAA GGCTCACTAA AACTTGAAGC CGCCAAAGCG
ACTTTAGAAA GTGCGAAAAA CGGTTCAAAC GTTGCTTTAG ATGGTACTAG TGCAACCTTA
AGTGCGGGCA ATGGTAAAGG AAGCATTAAA GTTGCTACTG GCTCAGGTGA TGATGCTAAC
AAAATCGAGT TAAGTCCGGA AAACGGCTCT GCTGTTACCT TAGCGAAAGA CGGCACAAAC
GGTGTCAAAG CCACGGGGTT ATCTACGGTT GGGTTAGATG GTGATAACGC TTTAGTGTTT
ACAAATGGTG CTGGCAACAA GGCAGAGTTA AAAGTTGGCG GAAGTGCATT GACATTTACT
AAAGCCACTA CCGGAAATAC CGTGAAAATC TCTAATGTGG CGGTTGGTAA GATAGAAAGT
AGTTCATCAG AAGCTATTAC AGGCGGACAG CTTCATGATT TAGCTACTCA TTTAGGGGTT
GCAGTTGATA ACTCAGGTAA AACTACGTTT ACAGCACCAT CATTTGCAAA AATTAATGGT
AGTGAAGCAC CTAAAACCTT TAAAGGAGCA ATTGATAACC TAATCACCGC CGTGAATGGT
GGATTGACAT TCAGGGGGAA TGATAATAGT AGCTCCCCAA GTTCAACCAC ATTGCAGTTA
GGCAGAACAT TGACCATTGA TAGCTCAGAA AGTAAAACAA AATTAGAGAA CTCTAAAGCA
ACTAAAGAGA AAGACATCAC CACAAAACTA GAACCATCGA ATGGTAGCGA CTCTCAAGCA
GGTACACTCA CGCTGACTTT AAATAAAGCA ACATCTGTCG ATGAAAATGA TGAAAGGGTT
GTTACATCTA GTGCGGTAGC AAATAAATTA AAAAACTATA CGACAACAGA AACATTAGAA
GATGACTTTT TAAGAGTTAC TGGAGAAAAT ATAAATGGTC ATCAAAAAGA GTTTGGCAAA
AATGTCGGAC TTGAAGAAGT AAAACTTGAA GAAGATGAAA CAGAAGGCAC AAGCGAATTA
GTACAAGCCA AAGCCTTAGT CGACTACCTA AAAGGCACAG GGGAAAAATC CGTTAAACTT
TCTGATTCCG CAAAAACACA AGCGATAGGC GAAGGCTCTA TTTCTATCGG TCATAATGCC
GTGTCAAGAA ATGAGGGATC CATTGCCATG GGTTATAACT CTGAGGCGTC TAATAGCGGT
GCAATCTCTA TCGGTCAGGG TTCAACTGTA TTGGGAACAA GTTCTGTCGC TGTAGGTAAA
GAAAATGATG TTAAAGGTAA CTTCTCTTTT GTGTTAGGTG AAAGCAATAC GCTTGATAAA
GAACAAACCT ATGTCATCGG TTCGGATAAT ACAATCAGCG GTGCTAAGAA TATTGCTATA
GGATTGGGCA ATACAGTTGG CGGCAATGAA AACATCGTAT TAGGCTCTCA TGTAGACCTC
AAAGATGATG TAGAAGGTGC GATTGTATTA GGCGATAAGT CAATAGGCGT ATCTAATGCT
GTATCGGTCG GGAATGCGTT AACAAAAAGA AGGATTGTAT TTGTCGATAC CCCACAAGGT
GAATACGATG CCGTCAATAA GAAATATGTT GATGGCTTGA CCTTATCGTA TAAAGAGAAT
AGCACAGGAC CTGCGAAATC TATCAACCTC AAAACAGGGG CGTTAGACTT CGTGAAGTCG
GAAAATATTT CTGTTGCTGT GGAAGCGGAT GGAAAAATTA CGCATACGCT TAATAATAAT
CTTACAAGTA TAGTAAGTAT TTCAGGTGGT AAAAACGGAA GTGAAGATGC TGCGAAAATC
ACGTTATCCT CAAAACCTGA AACAGAAGAA GAGGCTAAAG ATTATGTTAA AACAGTCACT
ATTAACGATG CTAAGTTAAA AGGTCTGTTA GACGGTGAAA TTGCTGAGAG TTCAAAAGAA
GCCGTAACAG GTAAACAACT CGCTGATTTG ACTAAGCAAT TAGGCGTTGA TGTTAGTACT
TCAGATAAAA CGAAATTTAC TGCTCCAATC TTTGAATATC TGTCTAAGGT GGACGGCTCA
TTCAACAAGA ACCCAACCAC ATTGAAAGGT GCTATTGACG AAGCAAGAGC GAAGTTAAAC
GAGGGCTTGA AATTCGGCGG TGATATTCCA AGTGCGGGTA CAAATACCAA CAATACCCAC
TACCTCGGCT CAACTATTAA CATTGTTCGT TTAGGTACGC CGACAGGGAC AGGTGCGGTA
GCTCCAACAT CAACCGATGG TTACAGCGGT AGCAACTTAA TCACGCAATA CACCAACGAT
AAGGGCAATG CCAAAATCGA AATCGGCTTT AAAGACGCAC CGACATTTAG CAAAGTGACT
TTATCACAAG AGCAAAAATA TGGTGAAGCT GACAAAGTTG GCAGCAACGA CCTGATTACC
AAATCCTACC TTGAGGGGGC GTTAAGCAAC TTTAAGTTTA ATGTGGAATA TGGTGATAAG
AAGGTCCAAA TCGGTCGTGG CGATACCTTG AAATTTGCAG ATGGCTTGAA TATTCAAGGT
AGCTTGAAGC AAGAGGGGGC AACACAGCCA AGTGTGGTAA CTTCAACTAC AGCACCAACA
ACAGTAAACA CCCCATCAGG TAGTGAAAAT GGCGGTGCAG GTACTACGGT CGTTTCAAAT
GGTGCTGATA GTTCAAATAG TGGCAGTTCT GATAGCACAA GCAATATGGC TTCGTCAGGT
GGTACAAATG GTGCAAGTAG TGATAGTGCA GATACAGCGG TCGCTTCAAG CTCTCCAACT
GCGGGAACTT CAAGCACACC AACGACCACG CCAACCACTA CGACCTCAAC GACAACAGCT
GTAGTCACCA TTGGCACGAA AAATGATTTA ACCAATATCA CCTCGATTTC CTCTAAAGCT
GGAAGTGCGG ACGGAAATGA TGCAGACAAT GGTACAACTG GAGACGTGAC TAAATTATCT
TTAACACCAG AAAATGCCAC ATTCCAAGTC GGCACAACAG GTTCTAAAGT GAAGATTGAT
AAAGAGGGTA TTTCACTCAC GCCACAAGCG ACAGATCCGA AAGATCCATC TGCTAACGCT
CCGTCTATTA CCATTAACGT AGGTTCTGTA CCAGCTACTC CAAACACAAG CAGTCCAGCA
CAACCTGCTG ACCCAAGCTT GCCTGCTGTT GATAACGGAC CTTCAATAGC GTTTGCAGCC
AAAGGCGGTA GTAATGGCTC AAAAGAGGGA ACCGGTACAA TTAAATACCT TAAAGACAGA
ACTGTTAAAA ATACTCAAAG CATGAATGAC AAGGACAAAT ACGGTGAGGG CGATAACAAA
GGCAACGCTG CCACAGAAGG GGCGGTGAAA GAGCTTTATG ACTCCGGCTT GAAATTCGCC
GGCAATGATA ACGTGGAAGT CGCTAAAAAA ATCGGTGACA AACTGGCGAT TGTGGGACAA
GGTTTAACAA AAGACAAAGT TGCTACATTC AAAGGCACCG ACGGCAATAT TGCGGTGACC
GCTAAACCAA ATAGTGGTAG TGATGCTAAT AGCAAGCTGG AAATTTCCCT CAGCGAATCT
TTGAAAGATA TGAATTCTTT TGAAACCAAA GCAAAAAAAG TTACTGGTGG GAAAGACGAT
TTGTATGCGA AAAGTAAATT AGATGGTGAG GGCTTGCATC TTACGCCTTT TGCAGGTATT
AGTGAGCAAG ATGTTATTGG TCCGCTAACC GACAAAGCCG CTCATTACGG GTTACTTGGC
TCAACAGTGA AAGATGGTGA TAAAACCAAT ACGCAAACAG CGGGCGAAAG TAAGCTGACA
CAAGATGGCA ATACGAATGT GTCCACTGCC ACTGAAACGA AGCTCACAAG TAAAGACGGC
GAGACCGCAA GCTTGACTGC CAAAGGCTTA ACCGTAGGTG ATAGCACTAA GGATGGCGAC
AAAACCCACG CTGTGTATGG CAAAACTGGC TTTAGCGTAA AAGGTAAAGA TGGTTCAAGT
GAAATCGTGA GCTTGAAAGT CGCAAATGGT CAAGACGGCA ACGCTCAATC CGCTACCCTT
GCTTTTGCTA AAGGCACTGA CAGTAAATCA GGCACAGGCA CAATCAAAGG GCTTGCGGAC
ATTAAGCCAG ATGAAACTGA TGGAAGCCTT GCAGCGAATA AAAACTATGT GGATGAGAAA
GTGTCAGATT TGGACAGCAA TCGTCCGTTT GATTTCTATG TTAAAGAGGG AGAGAAAGAA
ATCAAAGTCG TGAAAGGACG TGATGGTAAG TTCTATAAAC CTGAAGACTT GAAAGGGGCG
AAGTATGTCG CTGGTACGGC TGATGGTGAT AAAGGAAAAT ACACCAAAAA TGGTCAAGAT
GTTAAATCAT CTATAGCGGA CAAACAAGCC GCTGTGGTGA TTAAGGCTGA GCCGACCACA
TCACCGATGA CCATCACCAA TGTCAAAGAT GGGGATTTAA CAAATACTTC GACCGATGCC
ATCAATGGGG GACAGCTTGT CAAAGCGACA GGGGCGAAAT TCATTGATGA TCCAAATTCT
TCTGAAACTG CTCCAAAACC GAAGATAATG GTTTTTGCCG ATGGTAGAGA TGGAAAATCG
GGACTTGAAG CAGAGGCGAT GGCGACTAAA GGCTTAACTG GTAAAGACGG TTTAAACGGC
AAAAATGCGA ACGACAAAGC CAATGCTTTA CGAGATGGTG AAGCAGGAAC GGTGGTATTT
ACAGATCGTC AAGGCAATAG ATTAGTCAAA GCCAATGATG GTAAGTACTA TGAAGCAGGT
GATGTTGAGG CTGATGGTAA GCTTAAACAA GGTGCAAGTG CGGTTGAAAA ACTACAACTT
TCGTTAGTAA ACCACGAAGG TGAAGCTACT AAGCCTGTTG CCTTAGGCAA CGTGGCGAGC
GGTTTGGAAC TACCTGCCCC TGAAACTGAT CCAGCTAAAG CAAAAGAGGC AAAAGAAAAA
ACAGCTCAGC TAGCAAATGC GGTGAAAGAT AAAAAAGCAG AGGTGAGTGA CAAAGCCAAA
ACACTTTCGG ATAAAGCAAA AACCTTTACC CGTTTAACCT TAGCCGTGAA CGGTTTGGAA
CAAGCGGCGA ATGCTTTACC GGATGGTAAG GCGAAAGCAC AAATCGAAGC TCAGTTAAAA
GAGACTCAAG AGGAACTTGC TAAGGCACAA AAAGCACTTG AGACAGCTAA GACTGACTTA
CAAACAGCAC AAGAAAACTT GAAAACAGCG AATGCTGACT ATGAAGCTAA CTACGAGGGC
TATGCGAAAG TGGCTGACTT GGTTAGTGCG GATAGCAAAG CGAACGTTGC GAATGTCGCT
ACCGTTGGCG ATTTGCAAGC TGTGGCAAAA TCCGGCATGA AGTTTAAAGG CAATGACGGC
GTGGAAGTGC GTAAACAGCT GAGTGAAACC CTGTCCATCA TAGGCGAGGG AACATTTAAC
AGCAACCGCA CTGCTGCCGG CAATATCAAA GTGGAAATGG CACAAGATGG CAAAGGCTTA
GAAGTTAAAC TGTCTGACCA GTTGAAAAAC ATGACTTCCT TTGAAACTCG TGAAGTGAAC
GGTAAGAAAG CCCGCTTGGA TAGCAACGGT TTGAGAGTGG TCAATAAAGG CAAAGATGGC
ACTGATGATA AAGCCAAATC TGCAACTTAT GGAGCGGAAG CGGTGGTGTT GGAAGATAAA
AACAAAAGCG ATAAGGCTGT GATGACCGCA GGTGGTATCC GTTTTGCGGA CAGTTCAACC
CAAGCGACCG CACTAAGTAC CGAGCTGAAC AAAAACGGAT TGACAGTGAA CGGTGCAGGC
GGTCAGATTC ACATTGATGG TACAAAAGGG GTTATCACCG TGCCGAATAT TAAACCTAAT
GCAGATGGTC ATGTGGTAGT AAACAAAAAC TACGTGGATA CCAAAAACAA TGAACTGCGG
ACACAAATGA ACAATAATGA TCGCAATATG CGCGCGGGTG TAGCACAAGC TGTTGCACAA
GCGAACTTAC CGATCAATAT ATTACCAGGT AAAAGTACGT TAAGTTTGGC GACTGGTAAC
TATATGGGTA CGCAAGCCTT TGCGGTAGGT TATTCCAGAG TATCTGATAA TGGTAAACTT
AGCGTTAAGT TCAGCTTAGG ACATGGCGAT AAGAAAACTT CTGTGGGTGC GGGAGTTGGT
TATAGCTGGT AA
 
Protein sequence
MNKIFKTKYD VTTGQTKVVS ELANNRQVAS RVEGASVGVG QPKCGVFLGM FKVLPLALLM 
SGLLSSAAYG ANVWIDVNPN GITGGDNKPS SVWYNGGLSD NKNEVVLLAD GESTTGAKAQ
YKNKDFKRTV IIGSRAVGGG NDATAIGYRA IVGKNLAETN TDDSHQGTAV GYRAFSYGNE
SVSLGNDTVA YGESSISIGS DNVGRNIGKY SRTGLSYDIW KLYRQNGSKF NYTGEYAAID
GADPDTSRSK YEAYLSETNP TKPFYRTHNW AFGDSSIAIG SRNVAYGHGS LAMGTLSVAK
GDYSTALGTA TLAFGNSSVA LGNESYVYAG NSIGVGNEVQ AISDGSMVYG YQSYAGGPGS
IAIGKRALSN VEPSAHFKQT VEDFGNLWYE GKSTVHALGM LDNPQNHGKS KDLDGYFLPA
TEKQSGTEEE KAKGKNGGAV ALGYYVYALG ENSVALGRQA YSKGDRSIAI GPYAYGGYEK
TIAMGYGAKA MSSQSLALGS LSRVEGKNSV ALGVEAKVLN DESNSELNGQ NSMALGNNSE
VTMKNSVAIG NMSNTRYYYT GDKSNPTPSS NNNNNNNNNA ITLPAYIPKG TSYSYTSTSD
DGVISVGGWD RRDGKLGRRR IINVAPGALD SDAATVGQLK ALEYAYKEGV VAYYTVEGGK
NYKVVKDADG KFYKANTENG TPLDSTAIAA DKVFVGPKGA NERSQSIQVN GRRRNVVDMG
DKIKFAHISD GEITSGSDQA ITGNQLNQLG STILGLGVQS NDKTKFDTPS FTKVQYEGST
GTQNHTTFKN AIDESIKAIN KGLVIQGDDK QGKISLGSTL NIKAGNTSVT DQGTKTEYKS
DNIRTAYQPK NKTLLIGIKE NPTFKSVTVS DPITASSPEG TLTTKKYVDD QLKNVSTNLH
FLSVQGTDKT AGNYNNDGAK ASYSVAIGVN AQVVDPTIVV TGNKPKPTAT AGIAIGYNAK
SEAENAVAIG RDVSIDVPNS FVMGSNNTVT QSFKETNGAV VVIGSGTKLV ESKSSIAIGA
VYKVHQGKAD GTLIENAAWT ASIGNKNKIK NGTDIVALGN NIEVKDETNE ENPKNKTRIA
NNDLILIGNG ATAETAKNSV LVGAKSNAGK GAKNAVIIGH SAEAKAEAEG AVVIGQGANV
QTKATGAIAF GQSATVNAEA SNAIAFGKSA SVQPNATSGI AFGQSASVSV ADGIALGSHS
VASTAGDKVG FNTVTGATKA DPVSHGAWKS KYGALSIGGN NNGTRQITGV AAGTNDTDAV
NVAQLKEATL HFVSVNGGSK TDENYTNNGA TKTGAIALGI GAKAASENSI AMGKDSKIEA
EISNAVAIGA NNHLRGLRGK DNKDQYKHTV AIGSDNIITG RKIVNLGSGN KIGNAEDSYQ
TDKKAGAVSI RVIGDDNKVH GVWNTVIGEE NKLESSNWTQ VIGDYNAVTK SDYAIVISSN
QAQKGDQQKT TNTVTNSNYA IVIGNQAKAT DAKNSVVIGA SANSTAESAV VLGKGATVQA
NATGAVAIGE GASVSTNAGE SIALGKGSKA TQKENTVSTY TATQTSNIKF NGFSGSGNDK
SVLSIGDGGK ERVIKHVAPG KISNDSTDAI NGSQLYSVID VFGHLGATVL GAEVDATKGF
KQSTFENVKY KDGGRDQKNT FKAAIDETIK AINKGIVISD GSETGTRQLG ETLTVKAGNI
DKPQTDSDGF SSNNIKTKYL KDNGEILIGI KDKPEFKEVT VSQELTDTSK DNVLTTKKYV
DNKLANVASK FTVSGNEGSQ FEINKDNNKL SIKGETTNNN IKTKATSSKE IEISLSDTLT
GITSIGKNSD NGITFNTNAT TIKLGGVSLS LNKDNTAVKI SGVADGKDTN DAVNKGQLDK
KQDKLTGTIT ANNGIKLTGT AANSIASDIT LSLEDGLKEK IDNALSKADA GNTYAKVDAS
NIENGNKASW RTTLDVYSKT EANAEIAKAK ETVTNGDGIT VSATPDGTDG PKTFTVALAD
EYKTKINSIG TGSVADNDKN TVTGGKVHTA IEDAKTTLTS TINGKADKNL SNIDETGKAK
IKTLATEAID VQGKEGELKV TPSTSGNKKS FTVSLNDDIK TKIDGIGTGT VVADNNKTVT
GGAVHTAIEA AKTDLKGKLY ANTATFGLKG NDSQEVTKKL DNTIEIKGTD TAKSGQTNIY
VSKADENGLK IELGENLKGI KKISQGEKAV ISLEDKALTL TSNNNKVAVK SDHVLSDKDI
YIGTKTDDNK LVKKSELGGG YITFEDENTS KGTKNVNFGK KVIYGKSAEI TPTVEASIDD
AKVTFAINDA SIEGTKLKDA TIAEAKLDQG IKTKLNKTFK VKAGDESSDN LIGEELEFAT
KDANLTVGLD ATKKKITYGL SSALTGIESI GKDNNTKITF KNNGQNEIDF TVGTNTTYKF
TDSGLDLASK PITNLGSGLE QKTGNGGRQD LDEFLKLTAT NGQPSDGKLN KAINAGDLLH
VAQGLVGKGL KFKADSASAD GSTKTEMTIA LGQAVTFKGD GKYLTTKLND TNGEISFNLS
VAESISDAST DGDSTQNTSN SKLVTENAVK NFVTNKINNL SSTLQLEGDN TENDKDPADP
IGKVELKTEK LKLTGENDFI ETEVKANDPT VTIKLAQKVK DKLAIIKVGE NTNGDNSFAL
GKDSKLETKK TAPTGITQNV GGNDVTITWS NAGAGQNKEV VSVGDTGKER IITHVAAGDV
RDGSTDAVNG GQLHRVIDVF GKLGLDVLGA EKADSGDGFK KSKFDVVKTS DNTTDSNPEK
SEKTFKTAIE DNIAAINKGL KFAGDNDGEK QLYLGSTLNI KGAKGEASSA GSNSASSDTT
NNHQNIFTKA SDTGLEIALN EALQGISSIS GKKGTDGSAV AKIDFTSSGT STSPTVKITA
DGGEFTFGKD GLNLNSKQIT GIASGLGLKD SVDGNNGGSA GTSNSDTDII NKVLSGNPDK
DNNNAVNVKD LSKVANALVE KGLSFEGNGG NTDKVTRKLG DTLKIVGKGS DANSITATEN
NIKVSKNTTN DGLEIGLSET LAGIKSIANG DKAKIALDND KKTITFTAGE TNNNVTLSEG
EFSGVSEINK ADGKGALKLA DSTATLESAS GNSNVALKAN EATVTAGTDK GSLKLEAAKA
TLESAKNGSN VALDGTSATL SAGNGKGSIK VATGSGDDAN KIELSPENGS AVTLAKDGTN
GVKATGLSTV GLDGDNALVF TNGAGNKAEL KVGGSALTFT KATTGNTVKI SNVAVGKIES
SSSEAITGGQ LHDLATHLGV AVDNSGKTTF TAPSFAKING SEAPKTFKGA IDNLITAVNG
GLTFRGNDNS SSPSSTTLQL GRTLTIDSSE SKTKLENSKA TKEKDITTKL EPSNGSDSQA
GTLTLTLNKA TSVDENDERV VTSSAVANKL KNYTTTETLE DDFLRVTGEN INGHQKEFGK
NVGLEEVKLE EDETEGTSEL VQAKALVDYL KGTGEKSVKL SDSAKTQAIG EGSISIGHNA
VSRNEGSIAM GYNSEASNSG AISIGQGSTV LGTSSVAVGK ENDVKGNFSF VLGESNTLDK
EQTYVIGSDN TISGAKNIAI GLGNTVGGNE NIVLGSHVDL KDDVEGAIVL GDKSIGVSNA
VSVGNALTKR RIVFVDTPQG EYDAVNKKYV DGLTLSYKEN STGPAKSINL KTGALDFVKS
ENISVAVEAD GKITHTLNNN LTSIVSISGG KNGSEDAAKI TLSSKPETEE EAKDYVKTVT
INDAKLKGLL DGEIAESSKE AVTGKQLADL TKQLGVDVST SDKTKFTAPI FEYLSKVDGS
FNKNPTTLKG AIDEARAKLN EGLKFGGDIP SAGTNTNNTH YLGSTINIVR LGTPTGTGAV
APTSTDGYSG SNLITQYTND KGNAKIEIGF KDAPTFSKVT LSQEQKYGEA DKVGSNDLIT
KSYLEGALSN FKFNVEYGDK KVQIGRGDTL KFADGLNIQG SLKQEGATQP SVVTSTTAPT
TVNTPSGSEN GGAGTTVVSN GADSSNSGSS DSTSNMASSG GTNGASSDSA DTAVASSSPT
AGTSSTPTTT PTTTTSTTTA VVTIGTKNDL TNITSISSKA GSADGNDADN GTTGDVTKLS
LTPENATFQV GTTGSKVKID KEGISLTPQA TDPKDPSANA PSITINVGSV PATPNTSSPA
QPADPSLPAV DNGPSIAFAA KGGSNGSKEG TGTIKYLKDR TVKNTQSMND KDKYGEGDNK
GNAATEGAVK ELYDSGLKFA GNDNVEVAKK IGDKLAIVGQ GLTKDKVATF KGTDGNIAVT
AKPNSGSDAN SKLEISLSES LKDMNSFETK AKKVTGGKDD LYAKSKLDGE GLHLTPFAGI
SEQDVIGPLT DKAAHYGLLG STVKDGDKTN TQTAGESKLT QDGNTNVSTA TETKLTSKDG
ETASLTAKGL TVGDSTKDGD KTHAVYGKTG FSVKGKDGSS EIVSLKVANG QDGNAQSATL
AFAKGTDSKS GTGTIKGLAD IKPDETDGSL AANKNYVDEK VSDLDSNRPF DFYVKEGEKE
IKVVKGRDGK FYKPEDLKGA KYVAGTADGD KGKYTKNGQD VKSSIADKQA AVVIKAEPTT
SPMTITNVKD GDLTNTSTDA INGGQLVKAT GAKFIDDPNS SETAPKPKIM VFADGRDGKS
GLEAEAMATK GLTGKDGLNG KNANDKANAL RDGEAGTVVF TDRQGNRLVK ANDGKYYEAG
DVEADGKLKQ GASAVEKLQL SLVNHEGEAT KPVALGNVAS GLELPAPETD PAKAKEAKEK
TAQLANAVKD KKAEVSDKAK TLSDKAKTFT RLTLAVNGLE QAANALPDGK AKAQIEAQLK
ETQEELAKAQ KALETAKTDL QTAQENLKTA NADYEANYEG YAKVADLVSA DSKANVANVA
TVGDLQAVAK SGMKFKGNDG VEVRKQLSET LSIIGEGTFN SNRTAAGNIK VEMAQDGKGL
EVKLSDQLKN MTSFETREVN GKKARLDSNG LRVVNKGKDG TDDKAKSATY GAEAVVLEDK
NKSDKAVMTA GGIRFADSST QATALSTELN KNGLTVNGAG GQIHIDGTKG VITVPNIKPN
ADGHVVVNKN YVDTKNNELR TQMNNNDRNM RAGVAQAVAQ ANLPINILPG KSTLSLATGN
YMGTQAFAVG YSRVSDNGKL SVKFSLGHGD KKTSVGAGVG YSW
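
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, not part of the original record, of how the blocks might be joined and how the gene sequence could be checked against the protein sequence. The helper names are hypothetical. The codon table is the standard one, whose amino-acid assignments translation table 11 shares; alternative start codons permitted by table 11 are not handled, which is sufficient here because the gene starts with ATG.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(lines):
    """Remove all whitespace from block-formatted sequence lines and join them."""
    return "".join("".join(line.split()) for line in lines)


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_gene_line = "ATGAACAAAA TCTTCAAAAC CAAATACGAT GTAACAACAG GTCAAACGAA AGTGGTGTCT"
last_gene_line = "TATAGCTGGT AA"

head = translate(join_blocks([first_gene_line]))
tail = translate(join_blocks([last_gene_line]))
print(head, tail)
```

The first gene line translates to the first twenty residues of the protein sequence (MNKIFKTKYD VTTGQTKVVS). The last gene line is in frame, because the preceding 257 full lines hold 15420 bases, so it translates to the final residues YSW followed by the TAA stop codon.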