Gene Haur_2169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2169 
Symbol 
ID5734056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2737182 
End bp2750255 
Gene Length13074 bp 
Protein Length4357 aa 
Translation table11 
GC content52% 
IMG OID641279310 
ProductLamG domain-containing protein 
Protein accessionYP_001544937 
Protein GI159898690 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATATTC GATCATCGCG CTCGTTGATT GTCCGAGCAA CTTGGCTAGC TCTGTTGGCA 
CTGCTGATCA GTCCGTTTTT GCTTGGTTGG CTTGATCAGG CTGCAATGCC GCAAACGGCG
ATCGAACGTG CATGGGAGTT GGCCGGAGCC AGTGGCAGCT ATCGCTACCA AGCGACTATC
GAGCAAACAA CCTATCCAGT TGCCGCGATC ACGAGTGCTG GCCGCGAACC ACAAATCGAT
CGCTTGGCGC TCGAAGGCCA AGTTGATGCA ACCACGAATT ATCTTGAAAT GCAGCTTTGG
AATAGTGCCG ATCGTAATCC AGCCAAAGCC CATTCGTTGC GCCTACTGGA TGGTAAGGTT
GAGCAACGCC AAGGGCTAGG GCCATGGCAA GCAGGCGATC TGCAAGATCT CAACGGCATT
GCGCCTAGCG GCAATTTGCT GAGCTTTTTG ATCGGGGCTG ACGATGTGCA ATTGCTTGAC
CAAGCAACTC GCTCGTTTGA TACAGCCGCT GGAGCAGCGC TCGATTTAAC CCTAACCTAC
GAGCATTATG GCTTTAAGCT TGATGGAACA GCGTTGGCTG AAAAACTTGG GCCAATGCTC
GAAGCCCAAA TTCGCGCAAG CCGCAATATT CCAGCCCATG TGGTACTCGA TGCTGGGCGA
GCCTACCGCG ATATGCAGGG CCACGGCGAA ATTTGGATCG ACCAAACAGG CCTGCCCCGC
CGCATCGCAT TAACCCTCGA ATTTCCGGCG CAACAAGGCC AAGGTCGCTC GACAGCCATG
ATTGTTGGCG ATTACAGCGG GTTCGATCAA AGCCGTTTAG CCTTGGCGAC CACGCCTTTT
AGCGCCAACC CAATCAATTG GCTGAGCTTT CGATTAACTG AACAATTGCC AGCTTTGCGC
TTGTTGGTGT TGCAAATCGT GAGTATTGGG CTGGTGCTGG CGCTGATTGG CTGGTGGATG
CGGCGCTTGC ACAACCGCAA AATTCATGCC TTCACCGTGG GCTTGGTGAT TATTTCACTG
CTGACAATGC CTTTATTTCG TGCTGATCTC ACGGCGGCCT TTGCTGCCGA GCAACAAACT
GAGCAGGCTG AATACGACCA ACAACGTGCG CAGCATGAGG CTTTGAGCGA GGCCGTCGCT
GCCCAACAAA CCAGCAATTG GAACCCCCAT CAAAATCCGC GTAGCACGAC CCCCAACGTT
GCGCTGCCCG ATGCCGATTT AACGGCGTTA TTGCAAGCAC GGCCAGAATT GGCGTTGCCA
AGCATGCTGG CTTCAAGTGA TACGACCGAT ACTGATGGCG ATGGCTTAAC CGACTACGAT
GAAGCAATTT GGGGTGCTTG TCCAAGTGCT AGCTCAAGTA GCAACGATTG TATTGGGGTC
GCCGACTCGA CTGATAGTGA TGGCGATGGG CTGAGCGATG GCATCGAAGT TAATCAACTT
GGCACTTTGC CCGACGAAGC CGATAGCGAT GGCGATTTGC TTGATGATCA ATTGGAAGTG
GCGGGATTTA GCTTTGGTGG CACGCAATGG TATCTCGATC CCTACGCCAC TGATAGCAAC
AGCGATGGCT TGACCGATGG CATGGAGTGT CAAGTATGGG TTGAAATTTC CAGCGATTAT
GATCCAAGCG CAGCCTGCCC CGATACTGAT GCTGATGGCA CACCCGACGT GTTTGACCTT
GATAATGATA ATGATGGGGT CAACGATGCG GTTGATAATT CGCCCAATGG TGTGATTGCC
CAAACTTTCA ATGGCGATAC GCCACTTGAA CTAAGTATTA ATCAACTTGA AACTAACAAA
CCGGTCTATG TTGATTTTCA AATTACCCCA ACCAATGCTG ACCATCTTGA TTATTTTGGC
ACGGTGCTCG ATTGGCCAAC TGGTGACAGT GCAGGCCAAA TTCAGCGCCA CCTGGAAACC
AACTTTGCTA ACACCGAAAA TCTGACCTTG CGCAGCAGCG ATACTAACGC TGCCAATGGC
GATGTTCGGC TTGTGCCAAT GCTGCAAATT CGCATGCCCT ACACCAGTGG CCACTACGCC
AATTTGCCAA TCAATGCCAG CTACAGCGGG ATTGATCGTA GTTTGGAGAT GGCGGTTGAT
AACTGGCTCG ATAGTAGCGC CATTGATCGT TATGATCTGA GTGTTTACGA TTCAAACAGC
GGCGATGGCG ACCTTTTGGC CTATCTCCCA GTCAGTTATG TTTCCGATGA AAGCGATGGT
GGAGTGGTTG GCTTTGCGGC GCGGATGTTC TATCAACCGA GCCAAGGCAG CAACGGTCTG
GCAACGTGGG GCGCAGCCCA TGAAGTTCGG CTGGTTTGGT TGGTCGAAAT GTTGACCGAT
AGCTGCACAG ACGACACCGA CCTCAGCACC TGCGAAGACA GCTACGCAAT CATCCAAACC
TATTACGATG AGTGGAAATT GGCTGGCCTG ACCGTGACCG AGGAGCATGG CACCGATACA
GCGATTGTCT ATGAAAATCC GACTGAGGAT ACCAATCTCG CGCTGGATAG CGATTTGTGG
GTTGCCAGCT GGAACATGAG CAACAGCTTT TTGCGCGGTC GCGATTGCTC AAGCATCAAT
GGCAGTGGCA CATGCAGCAG CAACGGCAGC CGCGATGTGA CAATCAGCAA TTTGGCGAGT
AGCATTGATG GCTGGGCAGG CGGCGCGGCC AATCATAGCT TAGCAGTTCA AAGCTTTAGT
TATGACCATA ACGACGATTA TGTTGAAGAT TTGACGATCA CCCGTACTGC CGAATTGCTT
GATAGTGTGT TCACACCCTA TGCCAACCAA ACCAATCCAA CGTTGTTATT TGCCAGCGAA
CATCGCAGCC GCTCGGTCAA TGTTGATGAC AACACCATCA CTAGCGCCGC AATTGCGCTT
GATTTTGACG CTGATACAGT GCCGTTGGTG ACCGCAGCTA GCATGAGCTG GGCACCGTAT
CAATATACTG ATGGCGTTTG GGGCAATTAC GATCCTGAGC AATATTTGCT GCTGCTGAAC
ACCCTGTTGA CCAGCGATGA ATTTTTTCAA GGCGATGGCT CAAGCACAAG CCTCGATGAA
ATCAGCGGCA AGCAAATTTG GGGCCAAAGT TATTATGCCG CTTTGTTGCA AGGCCTCTCG
GAGAGCATCG AAAGCGATGG TGATCTGCTT TGGACGCAAA GCACTGAGGT TCCTGAAAGC
GCCTATACCC CAGCTTGGCC CTCATCGACT CCCAAAGGCT TTACCTTTAT TGGTTCGGCC
TACCTCAACA CAATCATCGA ATCGGTGAGC AAATTTGCTA AATACAAAGC CATGGGCTAC
AGCGGCTATA GCTTCTGGAG CGTGATCAAC CATGCCTACA AAAAGAGTTT TACTCAATAT
ACCTTTTCCT TTGAGCGCTT GCTGCAAAAC AAAAAGAGCA TGGCCTTGCA TGGCCTAATC
GGCATAACCA CAATTGGCTT GGCAGTTGGC GCAACCTTGT TTGCAGTTGG CTACCTGACT
GGCGATGATA CAACCTTCCA AGCGGGCATT TATATCCTGA ATGCGGCCAC GATTGTTGGG
GTTGGCTTGT ATATCGCCAA TATGATGCAT AAATTTTATA CGCTCTACCA GAGTGGCATG
GGCGTAACGG CAATTCTTAA ATCGACCACC ATGGCTAATT TTAAGGCGGT AGGTAAATTA
GGCTTGGTGC TTGGTACAGT TGTACCTTGG ATCATCTTCC TCTCGACATC GGGTAGTGTG
CTTTGGAAGA TGATTACGAC TGGTGAGGGC GGCAGCATTG CGCTCACCGT CTATGTAGCC
TACACCTTGG CCAGTACGAT CATCACGGTG GTGATGTTTG CCTTGGAATA TGTACCTGGT
ATCGGCCAAA TCTTTTCGTT CCTCTTCCTG ATGCTCTATT TTGTCGATGG GATTTTAGCA
ATTTTTGGCG TGCGCACGGT GCAAGATCGC ATGACTGAAG CGCTTGCCAA GCTGCTCTAC
GATGTTGATA TTGTAATTAA AAATATGGAT TCCTCCGATC GCTTGGCGAT CGATATTGTA
GATCGCAGTT TGGCCGAGCC TGAAGATGGC TTTGTGGTCT CCAACAGCAT CACCTACACG
ATGCGCGTGA CCAATACGTT GTTGTATGGC ACGCAATATT CAGCGTCCGA TGCCAATAAA
GCCTCGTTCC TGTATACCCT CGACGATGAA CCAGTTGATT ATCACGATCA AATTAGCAAA
GGCGATATGC GCGGCGATTG GGATGCGATT GGCGGGCACA AAATTCGCTT GAGCAGAACG
ATCGTTTCGG TCGATGCCTT GGATTTGGAT CGAGTTGGCA CGGGGATTAA TCGCTCGCTT
GATGGCTTAC TCTATTTGAA TGAAGCCTAT CAACTGCCCT ACACTGGTTG CTGGCTTGAT
GCCTTTAGTT GTAGCACCGA AACCTATGGC GGCTCCAGCG AATTGAATAT TGGTGCAAGC
GAAACGCTGG ATATACTCCC GTCCACGCTG AGCGAATTCT ATGCAATGGA GTGGAATGAT
CGCGGCCAGT TGAGTTTCCC CAGCCAGCGT GATCACGATG GCGATGGGCT GTTCAGTGTT
GATGCGGGTG GAGTTGACCC CGATGATTTG ACCCGCGATG CTGATGCGGA TTTGTTGCTC
GATACCTACG AATTGGCCCA GGGCACTGAT CCTGAAGCAA TCGATACTGA TGGCGATGGG
CTTGATGATG CCCGCGAATT GAGCCTTGGC ACCAATCCGT TGCTCAACGA TAGCGATGGC
GATGGGCTTG ATGATGGCAC TGAAAGCACA ACTGGTTGGT TGATTAGCTA CAACGATGAT
GCGGGCAATC GCAGCTATAC CCGCGTTTGG TCAAACCCCA ATGTTGGCGA CATTGACGAC
GACGGCTTGA ACGATTTGCA AGAATTCGTC TATGGATTCA ACCCGTGGGT CGCCACTGAT
GCCTCGTTGA TTGATAACTT GGTGCAATTT GCCGACATGG ACGTGAGCGA GCGCGATGCC
GCCGCAGTAT TGCTACGCTT TGAGGAAAGT GCCGATGCAA GCGTATTTGC CAATAGCGCT
AGCTCCAACA ATTTTAGTTG CGCCAGCACG ACAACTTGCC CAATCGCGGC GCAAACTGGG
CGGTATGGCA ATGCGGCCAG CTTCGATGGC CTAAACGATT ATCTACAAGC CAGCCTGAGC
CTTGCGCCAA CAGCCTATAC CCAAGCAGTT TGGGTCTATC CAACCAGCAC TGATAGCAAT
TTCCATGGAA TTGTGGGCTA TGATGGCGGG ATTTTGGCGC AACGAGCACC CAGCATCTAC
ATGTTTCAGG CTGGACGGGT GCAAGTTGGC TTCGGCGATG GCACGAATTG GAACAGTTTA
TCAACCAGCG GAGTGGTGCT GAGCAGCAAC ACCTGGAGCC ACATCGCCAC AACGTTCGAT
GGCACGACCA TGCGTTTGTA TATCAACGGG GTTGAGCAAG CCAATTCGGC GGCTGCCGCT
GGCAAAGTGC CCTATCCAAT TGATACCTTG CGGGTTGGTC GCATCGACAA CTATTTTCAA
GGTTCAATCG ACGAAGTAAG CCTGTTTGAA CGCGCACTTT CGGCCAGTGA AGTTGTAGCG
CTCAAAGATG GCCGCTACAA TCCCAATGAT CTGGTTGTGC AGCCAGGCGC GGCCTTGAGC
TATTCAACCA GCGTCACCAA TACCCTGGCC ACCCAAGGTA TCCATGGCAA TTTGCTTGGC
ACGACCAGCG CTAGCGATCC AGTCGTCACC CAACCAAAGA TTGCCTTACG CTTTGAAGAA
AGTGATCGAA AGAGCGGTTT TAGCCCAGCT TCGGGTGAGA GCGAAGCTGC GACCTGTGTT
GGCGCAAGTT GCCCCGCTAG CGACTTGGTG AGCAGTACTG ATCGCTCAGT GGCCTTCGAT
GGCGTGGATG ATCAGCTGGC GATTGGCACG CTGGCCTATG AAAATGCCTT TCAAGCCTCA
ACCTTCTCGT TCAAAGTCAA ATTGAACGCC CTGCCTAGTG CTGACAAAAC CATGAGCTTG
ATCGCAACCG AATCAACTCA GGCCTATGGC TTGAACGTTT CGGTCAATAG CAGCGGCAAG
TTGGTTGTGG CATTAAATGA TGTTAGCCCA AGTTTGACTG GCGTTATCAC CATGGCCACC
AACACCTGGA TCACCATCAC AATTAATGTG AATGATAAGC AATTGCGGGT TTATCAGAAT
GGCTCGCTTG ATAGTGGCTT GAACAACAAT GTGCGCTTGC GTTTGGTGGT TGGTGCTGGC
ACGTTGGGCA ATAGCATCGA CGGCGCTAGT CCGCTGCATG GCAATCTCAA CGACATCAGC
ATCGTCAACA ATAATGCTGA GACCGTATTT GCGTTTGGCT TTGACGAACA TAATACCAAC
AGCCTGCGCA CCAGTTTCGC CAACACCGCC AGTGGTGGCA GCATCGTCAG CTGTGCCTCA
ACCGCAACCT GCCCAAGTTT GACCGCAGGT GCGACCCACG AAGGCCTGCT TTTTGATGGC
AGCGACGATT ATTTACCACT GCCCGCAACA GCAAGTAGCG CGGCTGGGAC AAGTGGCTCA
TTTAGCTTCA AACTGAAATT GAGTGCCCTG CCAGCCAGCG GCTCCTACTA TTATTTGCTT
GATAATGCTT GTTCTAGCTC AGCCGGAGCA GGCTACTGTT TACGCGCCTA TATCGACTCA
GCGGGCTTGG TTACGCTCGG CCTGATCAAC CGTACTTCGA GCAGCGTGGC GTTTGGCCCA
TTCTCGACGA CTGCTGCTGG TGGTTTTAGC GGCAAGCTTG GCAGTTGGGT CTCCGTGACA
ATCAGCTGGA GCTTGGCGGC AGCGGCTGGC TCAACCAGCA ATTTCGCAAT TGCCACAACC
CACAATGGCT CAACTATCAC CGCAACCAGC AGCGGCACCA GCACCTACTG GCCATTGATC
ACGACCGATA GTAATGCGCG ATTTGGTCGC CGCGTTGGTG GCACGTTGCC GCTCAAGGGT
GCGCTTGATG ATCTGGTGGC AACCAGCTAT AACCTAAGTT TTGACCAACC AAGCTTTAAT
GTCGAGCAAA TCAATCGGGT TAATGATGGG CGGGTCGCAG CTTGTGCCGC TTTCTACAAT
TGCCCCACAT CAAGCAGCGC TGGCAAATTT GGGGCAGCCC TCAATTTCGA TGGCAGTGAC
GATTATTTGC TGCTTGATCA TACGGTTGGC GATGATTTTA CAATTGCCTT TTGGATGCAA
TCGAGCCAAA CCACGGGCAG CGCTAGCGCT TGGTGGCAGG GCAATGGCTT AATCGATGGC
GAAGTTGCTG GTAATGCCAA TGACTTTGGG ATTAGCCTCG GTGATGGCGG CAAAGTGCTG
TTTGGGATTG GGAATCCTGC GGCCAGCGAT ACAACCCTCA AAGGCGGGAG TGTCGCCGAT
GGCACTTGGC ATCATGTGGT GGCGACCCGC GTCAAGCAAA CTGGCGCAAT GCGTTTGTAT
GTTGATGGCG TGCTCGTCGC TAGTGGCACA GGCAATACTG CTAGCCTGAG TGCACCACCC
TATCTGCGCA TCGGTATGAT CCAAACTGGC TACAACGCCT ATGCTGGCTT GCTCGATGAA
ATTGTGATTG TGCCAGCAGC GGTCGATCTG GCTGGGGCTA AATTGCTGAT GCAAACCACC
TATCCAATCA TCGACATTAC CGAAGCTGTT ACAACCTTCC AATTAAATGC GCTTTCGGCT
AGCAGCATCA GCGCCATCGC CAACGTCAGC AGTAATGCCG TCACGAGCCG GCATAGCTTT
ACCCAAGAAG TTGAGGCGGC GATCGATTTA CAATCGGCGA TTGATTATCC GGTGACTGAT
AGCAATGCTG CCAGCTTGCC AATCTTGTTG CCGTTTGAGG AAGTTCCAGG CGAAACCAAC
TTTAGCAACT ATGGCACAGT CACAGGCTAT AGCCAAAACA ACGAGATGAA ATCGCCAACC
TGCTATAGCT CAATTGGCTG CCCAACCGCT GGTTTGCCTG GGGTAGATGG CCGCGCCGCC
TATTTCAATG GCAGCGGCGA TTGGCTGAGC TGTACTTATT CTGGGGTTTT GGGCTTGCCC
TGTGCTTGGA ACGAAGTTAA TATTCGCACG GTGGCCGCGT GGGTCAAAGC CGATCGTGGC
ACGATTGCTG ATTTCCGTGG CTCGATCGGA GCCAACGGAA TCGAGCTAGA TTTCAATAGT
TTCAAGGTGA CTATCAATAG TACAACCTAT CGAATTGCAA TCGACTTGCC TGAAAATGAG
TGGGTGCATG TTGCGGCTAC GGTTGATATG ACCAGCCGAA TTGCCAAGGT TTACGTCAAT
GGAGCCTTGT ATGGCTCGAC CAGCCTTGGC AGCACTGGCA CATACAGCGG TAGTTTCCCA
TCGATTGGGG CAAATCGGGC TGGAACCTTT GGCAACCTTG GAGCCGATGG CGATTTCTTC
CATGGCTCTA TGGATGATCT GCGCGGCTAT ACCGTAACGC TTTCGGCTAG CCAAATCAAA
CAACTCTACA GCGAATCGGC TCCGGCGATG GCCTTTACGT TTGATGCCGA TGCCAGCAGT
GCAAGCACAG TGATCGACAC TTCAGCTAAT AGCTATGCTG GCGTGTTACG CGGCTCAGTC
TGCAACACTG TCACCCTGAA TTCAGCCGAT TTTGGCAATT TTGGCGCTGA TGGGATTAAT
ATTCGTTTGG AGAGTACCGA TCAGCTAATT TTGAGTATGC CGCTAATCAC AACCGCGTTG
CAGCAGTTGA ATGTTAGCGT GCCGATTTGT GGCATCGATA GCCTAATCAT TGAGGAAATT
GCCAGCAATG GCACAACTTC GAGCTATGGT GCGATTAGCC TGAATGCAGC GAATTCGGGG
ACAACCACTG ATGTCAATGT TGGTTCATAT CCATTTGACC ATATTGTGCT GAATTATAGC
TACGCCACGC CGACCAGCAG CAGCGCTCCC AGCCTGACCG ATGGCAAAAT CGGCCAAAAC
ATGACCTTTG GTGGTTCAGG CGCGATTGAA GTGCAAAGTC CAACTGCTGT TTCAGCGCTC
ACCAGCACAT TTACCTTGAT GGGCTGGATC AATCCTGACG ATGTTGCAGG CTTCCAGCAG
CTGATCGCCA GCGGCACAGA TGCCACAAGC AACAATGGGT TTAGCCTTGA GCTGAACGAC
GACCTATTGC AATTCCGTAC CTTGGGGGTC AAGACCTACG CCAGCACTGC CAGCGTGCGA
GCCGAAGTTT GGCAGCATGT GGCCTTGGTC TTTGATAGCA ACTACGATGC CTTGTTCTAT
GTCAACGGCA CCTTGCAACA AACGATCGAC GGTTCTGCGG TTGCCAAAGC CAACAGCGAC
GATCCAACTT ATATCGGTGG CAGCGCTAGT CCAATTGGCG TATTGCAAGG CTTTTTCCGT
GGCCAACTCG ATCAGCTTGC GGTTTACGAC CGCCAAATGA CCACTGGCGA GATCTATAGC
ATTTATTTGC GCGATTTACG CTGGTATAGC GCTCGGAGCA CCGTTGAACT CCAGATTGAT
ACTGATGCGC CAGCGATCGC CTTGCTTAGC GCTGCCGATT ATCTGGCAAA TCAACCAACA
ACCTTGGTGG TATCGACGAT CGATGCGACT TCGGGCGTGC GCTTGCTGGA TGTTGGAGTT
AAACAGCCTA ATGCCAGCAG CTACACCTGG TCGAGTGCTG GCCTGTGTGC CGAATCGCTC
AGCGTCGCGA GCAATGCCGC TTGGTGCTAC AACTTTGATC CAACGAGTTT GGGCGGGGCT
GGCAACTATA GTTTGCAATT CCGCGCTGTC GATGCGGTTG GCAACCAAAC AATCTCATCA
GTCTATACAA TTAATATTGA TACAACCGCC CCGACTGCCG CCAGCAGCTA TAGCGAAACC
TGGCAACAAC TGAGCACCAG CAACAGCGAT GAGCTTGCTT GGACAGTAGC GCTCAACGGA
ACCGTCAATG ATACGGGCAG CGGCATCGAT CCAAACTCAG TGCAACTAAG TTTGCTCGAT
AGCACTGGAG CTTTGGCGGG GCTTGATCAG GCTCAAACTG CCACGGTGAG CGGCAATACC
TGGAGCATCA GTTATGGCTT TGCCAACAAA CGCCCAGCTG GTCGCTATAA CCTGAGCCTC
AGCGCTACCG ACTTGGTTGG CAACAATTGG TCGGGAATCG TTGGCAGCAT CGTGCTAGAT
GGACGTGCTC CCAGTATGCA GCTTGAGCCA AGCCTGCTTA CATCACGGAT TATCAGCACA
ACTCCTAATC TGAATGGCTT GCTAGTTGAG CAACCAGCTT GGGGCGGCGA AGTCGCGGCC
TTCCACTTTG CCGAAGCCAG TGGTGCAACC AACTTTAGCG ATTACTCGGA TACCAATCTA
GTGGCTACGT GTAGTGCAAA TGCATGCCCA AGTGGTGTAA GTGGCCTATT TGGCCGAGCG
CTCAACTTCG ATGGCAACAA CGATAACTTG ACGATTGCCA ACACCACGAC CCTTGATTTG
AGCGTGGCAA GTTTCAGTGC GTGGGTCAAA CCAAGTTGGG TTTCGGGCAC ATTGGGCTAT
GCGCCAACGA TTTTGGCCCA AAGCGCTGGC AGCAATAGTA ATTGGCGCTG GCAGATGAGC
GCCGATTACC GCAGCATGCA ATTACACAAT GGCAGCGCAA CCACCAGTTT GCCACTCACC
TTGGCCTCCA ATCAATGGTC GCACGTGGCC TTGGTGCAAG CAGGCGATCG CTGGACTGGC
TACTTGAATG GGGTGGCAAT TGGCACGATC GAGCAAGCGT TTGGCTCAAG CACTGGCTTG
CCGCTGCACA TCGGTTCAAA TGGCACAAGC CAGTTTTGGG CAGGCCAACT TGATCAGGTT
ACGATCTACG AACGCGATTT ATCAGCGGCT GAAGTGTATG CCTTAGCCCA AAGTAAGGTC
GCAGGGGTGA GCAAGGCCCA AGTTTGGCTA GCTCCCGACG AAACGATGAT CCCAGCTAGC
CCGCTGATCA GCTACGATTT TACTGAAGTG CGTGGCGCAA CCAGCTTTGC CGATGCTAGC
GGCAATGGTC ATGGCGCGAC ATGTACCAGC TGTCCAACCG CAACCAATGG CTGGACTGAT
GCTGAGGCAC TGAGTTTTGA TGGAGTTAAT GATGGGCTGA GCGCCACGAT TACTAATAAT
CTGAGCTTGG CAAACTTTAC CCAAGCCGCT TGGATCTATT CAACGGCAAC CGATAATGGC
TATCATTCAG TGATGGGCTA TCAGCCAGGA TCACTCAATC AGCGCCGCCC GCCAAGCATC
TACATTACCC AGAAAACGCG GATTCATGCT GGGTTTGGCG ATGGCAGCGT GTTCACATCA
TTGGAAACTG GCTCAGTCTT AACGCCCAAT GCCTGGAATC ACATCGCGAC TACCTTTGAT
GGTCAAGCCT ACCGCGTCTA TGTCAATGGC ACGGCGGTTT ATACCTCAAC TGCTAGCGCT
GGACGCACGC CCTATCCCGT TTCCAAGCTG GAGATTGGCA AGGCTGATAC CTATTTCAAA
GGCGCGATCG ACCAAGTGCG TATCTATGAT CGAGCGCTTT CGGCCACCGA TATTGGGTTG
TTGGCTTCAA GCTGGACGAA TGCCAGCCTG ACCGAGACAA ATACTGCTAG CGCCGACTGG
AATTATCAAA TTCCTAATGG CTTGCAAGGC TTATATAAGC TGAGTTTGCG CGGCACTGAT
ACATTCAGCA ACACCGAAAG TATCGCGACG GTTTGGCGTG GTATGCTCGA TAGCGTTGCA
CCTAACGTCT CGATTAGCGC AACCCATCAA GGCGGTGGCT TTGCGGCCTC AACGGTCTAC
ACCATTACCG CCAGCGATCT GTTCCTTGAT CCAGCGACCT TGATCACGCC ATGTCAAACT
GCCAACAGTA CGTTGAGCTA CAACGAAATG GGCGGAGTTA GCTCGCTAAT TGTGACATGC
AGCGTTGTAG GCCATCAGCA AGGGAGCGTC AGTGCAACTA TTCGCGATTT GGCGGGCAAC
CAAGCCAGCA CATCCTTTAA TTTGCCAATG CCGAACACCA GCCCAGCGAT TCTGATTAGC
AGTCCAAGCG GCAGCATCAC TGGAACCGCG CCGATCAGCA TTACTGGGGG AGCGTTTGCA
CCGATTGGGA TTCAAAGCGT GGCAGTGTTG ATTAATGGTC AAAATCTGAG CACGATCAAC
TATGGCGTTG GCATTACCAA TACGCTCTGG GCCACAAATT GGCAACCGCA GGCGAGTGGC
AGCTACACGA TTACGGCGAT ATTAACTGCA AGCAACGGTG GAGTTTACAC CGATACAACC
ACGATCAGCG TGCGCGAAGC CTACCAATTA ATCGTCAATC GTGCAGGCAC GGGTAGCGGC
ACAATTAGCA GCGAGCCAAT AGGTATCAAT TGTGGTGATG TTTGCAGCGT CTATTTTGCC
GAAAGTAATG TTATCACCCT GACCGCTACG CCTGCGAGCA ATTCAGTGTT TAGTGGTTGG
AGTGGCGCGT GTAGTGGCAA TAGCCTGTGT GTTGTGCCCA TGACCCAAGC ACAGAGTGTT
ACCGCCAGCT TTAGCCTGAA GACCTATCCG TTGGGCATTA GTTTTGCGGG CACTGGTGAT
GGCACGGTTA ATATTCTGCC AAATGGGGTT AATTGTACCC GCCAAACTCC AGCTTGTTTG
CTATTCTTCA GTGCGGGAAC CGTGGTGACC TTGACGGCCA CACCGCTTGC GAATGCTAGC
TTTGTTGGTT GGAGCGGTGA TTGTAGCGGA ACTGCCAGTT GTGTGTTGAC GATCGATCGT
GCCCATACCG TATCAGCCCG TTTCGATTTG GTAACGCTAA CCCCAACCGC AACGGCAACC
AGCACATCAA CACCAACGCC AAGCGCGACG GCCACGCCAA CAACGACAGC AACCGCCACC
GCGACGGCAA CCAGCACGCT CACCGTTACA CCAAGTCCAA CCGCGACGGC AATCGCGACG
CATACACCAA GTCCAACGGC CACGAATACG CCAACCGCCA CCGCAACCGT GACAATCACG
GCCACCGCGA CGGCGACCGT AACGCCAAGT CCAACGGTCA CAAATACGCC AAGTCCAACA
CCAACGCTGG GCGCAAGCGA ATATCGGATT TATCTGCCGA TTGCCATGCG CTAA
 
Protein sequence
MHIRSSRSLI VRATWLALLA LLISPFLLGW LDQAAMPQTA IERAWELAGA SGSYRYQATI 
EQTTYPVAAI TSAGREPQID RLALEGQVDA TTNYLEMQLW NSADRNPAKA HSLRLLDGKV
EQRQGLGPWQ AGDLQDLNGI APSGNLLSFL IGADDVQLLD QATRSFDTAA GAALDLTLTY
EHYGFKLDGT ALAEKLGPML EAQIRASRNI PAHVVLDAGR AYRDMQGHGE IWIDQTGLPR
RIALTLEFPA QQGQGRSTAM IVGDYSGFDQ SRLALATTPF SANPINWLSF RLTEQLPALR
LLVLQIVSIG LVLALIGWWM RRLHNRKIHA FTVGLVIISL LTMPLFRADL TAAFAAEQQT
EQAEYDQQRA QHEALSEAVA AQQTSNWNPH QNPRSTTPNV ALPDADLTAL LQARPELALP
SMLASSDTTD TDGDGLTDYD EAIWGACPSA SSSSNDCIGV ADSTDSDGDG LSDGIEVNQL
GTLPDEADSD GDLLDDQLEV AGFSFGGTQW YLDPYATDSN SDGLTDGMEC QVWVEISSDY
DPSAACPDTD ADGTPDVFDL DNDNDGVNDA VDNSPNGVIA QTFNGDTPLE LSINQLETNK
PVYVDFQITP TNADHLDYFG TVLDWPTGDS AGQIQRHLET NFANTENLTL RSSDTNAANG
DVRLVPMLQI RMPYTSGHYA NLPINASYSG IDRSLEMAVD NWLDSSAIDR YDLSVYDSNS
GDGDLLAYLP VSYVSDESDG GVVGFAARMF YQPSQGSNGL ATWGAAHEVR LVWLVEMLTD
SCTDDTDLST CEDSYAIIQT YYDEWKLAGL TVTEEHGTDT AIVYENPTED TNLALDSDLW
VASWNMSNSF LRGRDCSSIN GSGTCSSNGS RDVTISNLAS SIDGWAGGAA NHSLAVQSFS
YDHNDDYVED LTITRTAELL DSVFTPYANQ TNPTLLFASE HRSRSVNVDD NTITSAAIAL
DFDADTVPLV TAASMSWAPY QYTDGVWGNY DPEQYLLLLN TLLTSDEFFQ GDGSSTSLDE
ISGKQIWGQS YYAALLQGLS ESIESDGDLL WTQSTEVPES AYTPAWPSST PKGFTFIGSA
YLNTIIESVS KFAKYKAMGY SGYSFWSVIN HAYKKSFTQY TFSFERLLQN KKSMALHGLI
GITTIGLAVG ATLFAVGYLT GDDTTFQAGI YILNAATIVG VGLYIANMMH KFYTLYQSGM
GVTAILKSTT MANFKAVGKL GLVLGTVVPW IIFLSTSGSV LWKMITTGEG GSIALTVYVA
YTLASTIITV VMFALEYVPG IGQIFSFLFL MLYFVDGILA IFGVRTVQDR MTEALAKLLY
DVDIVIKNMD SSDRLAIDIV DRSLAEPEDG FVVSNSITYT MRVTNTLLYG TQYSASDANK
ASFLYTLDDE PVDYHDQISK GDMRGDWDAI GGHKIRLSRT IVSVDALDLD RVGTGINRSL
DGLLYLNEAY QLPYTGCWLD AFSCSTETYG GSSELNIGAS ETLDILPSTL SEFYAMEWND
RGQLSFPSQR DHDGDGLFSV DAGGVDPDDL TRDADADLLL DTYELAQGTD PEAIDTDGDG
LDDARELSLG TNPLLNDSDG DGLDDGTEST TGWLISYNDD AGNRSYTRVW SNPNVGDIDD
DGLNDLQEFV YGFNPWVATD ASLIDNLVQF ADMDVSERDA AAVLLRFEES ADASVFANSA
SSNNFSCAST TTCPIAAQTG RYGNAASFDG LNDYLQASLS LAPTAYTQAV WVYPTSTDSN
FHGIVGYDGG ILAQRAPSIY MFQAGRVQVG FGDGTNWNSL STSGVVLSSN TWSHIATTFD
GTTMRLYING VEQANSAAAA GKVPYPIDTL RVGRIDNYFQ GSIDEVSLFE RALSASEVVA
LKDGRYNPND LVVQPGAALS YSTSVTNTLA TQGIHGNLLG TTSASDPVVT QPKIALRFEE
SDRKSGFSPA SGESEAATCV GASCPASDLV SSTDRSVAFD GVDDQLAIGT LAYENAFQAS
TFSFKVKLNA LPSADKTMSL IATESTQAYG LNVSVNSSGK LVVALNDVSP SLTGVITMAT
NTWITITINV NDKQLRVYQN GSLDSGLNNN VRLRLVVGAG TLGNSIDGAS PLHGNLNDIS
IVNNNAETVF AFGFDEHNTN SLRTSFANTA SGGSIVSCAS TATCPSLTAG ATHEGLLFDG
SDDYLPLPAT ASSAAGTSGS FSFKLKLSAL PASGSYYYLL DNACSSSAGA GYCLRAYIDS
AGLVTLGLIN RTSSSVAFGP FSTTAAGGFS GKLGSWVSVT ISWSLAAAAG STSNFAIATT
HNGSTITATS SGTSTYWPLI TTDSNARFGR RVGGTLPLKG ALDDLVATSY NLSFDQPSFN
VEQINRVNDG RVAACAAFYN CPTSSSAGKF GAALNFDGSD DYLLLDHTVG DDFTIAFWMQ
SSQTTGSASA WWQGNGLIDG EVAGNANDFG ISLGDGGKVL FGIGNPAASD TTLKGGSVAD
GTWHHVVATR VKQTGAMRLY VDGVLVASGT GNTASLSAPP YLRIGMIQTG YNAYAGLLDE
IVIVPAAVDL AGAKLLMQTT YPIIDITEAV TTFQLNALSA SSISAIANVS SNAVTSRHSF
TQEVEAAIDL QSAIDYPVTD SNAASLPILL PFEEVPGETN FSNYGTVTGY SQNNEMKSPT
CYSSIGCPTA GLPGVDGRAA YFNGSGDWLS CTYSGVLGLP CAWNEVNIRT VAAWVKADRG
TIADFRGSIG ANGIELDFNS FKVTINSTTY RIAIDLPENE WVHVAATVDM TSRIAKVYVN
GALYGSTSLG STGTYSGSFP SIGANRAGTF GNLGADGDFF HGSMDDLRGY TVTLSASQIK
QLYSESAPAM AFTFDADASS ASTVIDTSAN SYAGVLRGSV CNTVTLNSAD FGNFGADGIN
IRLESTDQLI LSMPLITTAL QQLNVSVPIC GIDSLIIEEI ASNGTTSSYG AISLNAANSG
TTTDVNVGSY PFDHIVLNYS YATPTSSSAP SLTDGKIGQN MTFGGSGAIE VQSPTAVSAL
TSTFTLMGWI NPDDVAGFQQ LIASGTDATS NNGFSLELND DLLQFRTLGV KTYASTASVR
AEVWQHVALV FDSNYDALFY VNGTLQQTID GSAVAKANSD DPTYIGGSAS PIGVLQGFFR
GQLDQLAVYD RQMTTGEIYS IYLRDLRWYS ARSTVELQID TDAPAIALLS AADYLANQPT
TLVVSTIDAT SGVRLLDVGV KQPNASSYTW SSAGLCAESL SVASNAAWCY NFDPTSLGGA
GNYSLQFRAV DAVGNQTISS VYTINIDTTA PTAASSYSET WQQLSTSNSD ELAWTVALNG
TVNDTGSGID PNSVQLSLLD STGALAGLDQ AQTATVSGNT WSISYGFANK RPAGRYNLSL
SATDLVGNNW SGIVGSIVLD GRAPSMQLEP SLLTSRIIST TPNLNGLLVE QPAWGGEVAA
FHFAEASGAT NFSDYSDTNL VATCSANACP SGVSGLFGRA LNFDGNNDNL TIANTTTLDL
SVASFSAWVK PSWVSGTLGY APTILAQSAG SNSNWRWQMS ADYRSMQLHN GSATTSLPLT
LASNQWSHVA LVQAGDRWTG YLNGVAIGTI EQAFGSSTGL PLHIGSNGTS QFWAGQLDQV
TIYERDLSAA EVYALAQSKV AGVSKAQVWL APDETMIPAS PLISYDFTEV RGATSFADAS
GNGHGATCTS CPTATNGWTD AEALSFDGVN DGLSATITNN LSLANFTQAA WIYSTATDNG
YHSVMGYQPG SLNQRRPPSI YITQKTRIHA GFGDGSVFTS LETGSVLTPN AWNHIATTFD
GQAYRVYVNG TAVYTSTASA GRTPYPVSKL EIGKADTYFK GAIDQVRIYD RALSATDIGL
LASSWTNASL TETNTASADW NYQIPNGLQG LYKLSLRGTD TFSNTESIAT VWRGMLDSVA
PNVSISATHQ GGGFAASTVY TITASDLFLD PATLITPCQT ANSTLSYNEM GGVSSLIVTC
SVVGHQQGSV SATIRDLAGN QASTSFNLPM PNTSPAILIS SPSGSITGTA PISITGGAFA
PIGIQSVAVL INGQNLSTIN YGVGITNTLW ATNWQPQASG SYTITAILTA SNGGVYTDTT
TISVREAYQL IVNRAGTGSG TISSEPIGIN CGDVCSVYFA ESNVITLTAT PASNSVFSGW
SGACSGNSLC VVPMTQAQSV TASFSLKTYP LGISFAGTGD GTVNILPNGV NCTRQTPACL
LFFSAGTVVT LTATPLANAS FVGWSGDCSG TASCVLTIDR AHTVSARFDL VTLTPTATAT
STSTPTPSAT ATPTTTATAT ATATSTLTVT PSPTATAIAT HTPSPTATNT PTATATVTIT
ATATATVTPS PTVTNTPSPT PTLGASEYRI YLPIAMR