Gene Haur_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2008 
Symbol 
ID5733897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2487561 
End bp2499866 
Gene Length12306 bp 
Protein Length4101 aa 
Translation table11 
GC content53% 
IMG OID641279152 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001544779 
Protein GI159898532 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00517] acyl carrier protein
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGCTG ATTTGTTGTC TTGCATTGCT GCTACGACGA CGCTTCCCGA GCTGCTTCGA 
ACCGCAGCAG AAACCACGCC TGACCAAGTA ATTATCCATA TTGCTGCCGC TGGTAACGAA
CGAAGTATCA GTTACCACGA ACTCTATAGA TCGTCCCAAG CTGTTGGCCA AACGTTGCGA
CGCAGTGGAC TCAGCACAGG CCAAGTAGTC TTGATCGCGC TTGAATCGAG TGTCGATTTC
TTGGTTGGTT TTTGGGGTGC GCTGTTTGCA GGTCTTGTCC CCGCACCGCT GGCCGCCGAA
CCAAAGCGGA TTCTGGCAAT TTGGCAAAGC CTAGAGCAGC CAGCTTTACT TGTCAATCAC
GCAGTTGGCG AGTCAATCAT CGCGCTAGCG GAACAACTAG CACCACCAGC ACAATTAGCG
CAGGAACGAC CCACCGCGCT TGATGGGCCA TGGACTACGG CAGTCCAGCT TTTCAGCCCA
ATAACTAAAC GTCATCGTTC AGGGCACATC AGCGCTGAAG GTAGCGTTCA GCCCCATGAT
CTCGCCTATC TCCAGTTTTC TTCGGGCAGC ACCGGCCAGC CTCGGGGAGT TGAGTTGTCT
CATGCTGGGT TGCTGGCAAA TCTGTACCAA ATGGGGAGTG CCTGTGCCAT CAACTCCCAA
GATAGCGTTG TCAGTTGGAT GCCTTATTAT CATGATATGG GCTTAATTGC TGCGCACTTG
CTACCGCTGG CAGCTGGAAT CAAACAGGTC AAAATTGATG AGTTTTATTT TGCCCGTCGT
CCAGCTATTT GGCTAGAAAT CACCCATCAA CATCAAGCAA GCTTGTTAAC TGCTGCTCCT
TTCGCCCTCG ATTTGGTCAA TCGTCGGGTA AAACCAGCAC AGCTTGTAGG TCTCGATCTG
CGTTGTGTGC GGTTGTTGAT CGTTGGAGCC GAACCAATTG TTGCGGCGAG CTGTCGGGCG
TTTCTTGCTC AACTTGCACC AACTGGCTTA TCGCCACAGG TTTTGCTACC GGTGTATGGT
TTAGCTGAGG CTTGTGTCGG CGTTAGCCTA TCCCCATTGG GCACAGGTAT GACCACCCAT
CATATTAATC GTCACATATT GCTTCATGAA GCACGGGCTA GCTCACCTGA CGAGAACCAA
CATCACAGCA CTGACCCAAT CGACCAAACC GATTGGTTAG AACTGGTTGA TGTTGGCCTG
CCAATTCCCG ATTGTCGTGT GCGGATCGTC GATGATCAGG ATAAGCTGCT CGGCGATGAT
TTGATCGGTC ATATTCAGGT ATCAGGGTCG CAATTGATGC GCGGTTATTA TCGTAGCAAC
GACCCGAGCG CTGCATTTTG CGATGGTTGG CTGCGCACTG GCGATCTTGG TTTTTTACGC
AATGGACGTT TGGTAATTAC TGGACGGGCC AAGGAGATCG TGATTGTTAA TGGGCAAAAG
CATCATGCAC CTGATCTTGA AGATCTCATC AGCACCGTTG ATGGGCTACA TGCCAAACGG
ATCGCAGTGT GTGGCGCTGA GCGAGATGGT CAGCGCGTCG TTGTATTTCT GGCGATTAAC
GCATGGCAAA CGGTACTGCC CGCAATCAAC ACAGCAATCC GCAGGCTGCG ACGCACAACT
GGCACGACAA TAATCGATAT TGTGCCGTTG CGAGCCAGCC AGTTTCCACG TACAAGCAGC
GGCAAACTCA AGCGTAATGT GCTGCGCGAA CGCTATGAGT TGGGTGAATT CGATGCGGTG
ATCGCTGATG TGCAACAAGC CTTAGCAGCC CTTAATTCTC CCCCACGCAT GGCATTAAAC
CACTTGGAAC AAGCGATTAT TATGCTTTGC GCTCAAACGT TAGAGCTTGA TCCAAGTCAG
ATAGGCTTGC ACGATTCGGT CTTTGAACTC GGGGCAACCT CGCTACAATT AATGGATCTT
TTAGCAGAAA TTGGTGATCG CTTCAACCGA GAGCCAGATG CCGCCGTGCT ACGCAACCAT
CCCACACCCG CAGGCCTAAT CGCTTGGATT CAACAACCAG AAATTATGTC AACTAATAGT
ATGCCAGATG CGCGTAGCGA TCGATTCGCT ACCCCCGAGC CAATCGCAAT TATTGGTATG
GCCTGCCGAT TGCCCGATGC TAATACACCA GAACAGTTTT GGTTGAATTT GGCCGCAGGG
GTCGATAGTA TTAAACAATT GCCAAGCCCA CGCCATGACA CAACGGCTAG CCCCGCGCCC
AATGGCCAAG CGTGGGGCAG CCAGTTAGCG ACAGTCAGCT ATTTTGATCA CGACTTTTTT
AACATTAACG CGGATGAAGC CGCAGCCATG GACCCTCAAC AACGCATGTT GCTAGAGTTA
GCCTATCATG CCCTCGAACG AGCGGGGTAT GCAGCCGAGC GACGGAATGG TCGGCGAGTT
GGTGTATTTG TGGGGGTTGG TGAGGCCTCG TATCAAGAGC TATTGTTGCC TCTATTGGCG
CATAGCGAAC AGCTTCATTC ATCAATAGCA ACTGGCAACA TGCGCAATCT GATTGCTGGT
CGAATTGCCC ATTGTCTTGA TTTAAATGGC CCAGCAATTG CGATTGATAC TGCCTGCTCA
TCAAGCTTGG TTGCGCTGCA TATGGCTCGC ACAAGTTTAC TCGTTGGCGA TTGCGATCTC
GCGTTGGTAG GTGGCATTAA CCTTAATTTA ACCGAAACCC CCTATCAACT GTTGGAACGT
GCAGGAGCGC TCTCACCAAG CGGGCGCTGC CAAGCATTTG ATGCCGCAGC CGATGGAATT
GTCCTCGGCG AGGGGGCTGG AGTATTGGTG CTAGAGCGCT TAGGTCATGC CCAGCACAAT
GGCGATAGCA TCTTAGCGCT GATTCGTGGC TCAGCAATCA ATAATGATGG GCACTCGTTG
AGTCCAATGG CTCCCAACCC ATTGCGCCAA ACGGAAGTGC TACGCCAAGC CTACCGCGAA
GCCAACCTTG ATCCAGCAAG TATCTCCTAT ATCGAGGCCC ATGGCACTGG TACTGCCATT
GGCGATCCGA TAGAAGCTCG TTCATTGGCC CAAGCTTTTC CAGCAACAAG CAATCAACCA
CGTCGGATCG GTTCGGTCAA AACCAACCTC GGCCATTTAC TCAATGCTGC GGGCATTGCC
TCCCTGATCA AAGTGATCTT AATGTTTCAG CAGCGCCAAA TTCCCCCATC GCTGCATTAC
ACAACCCCCA ATCAACGCTT TGATCTTGCA GCAGCTGGAA TGACGATCAA CACAACGCTT
GAACCGTGGC ACGGGCCACA GCCACTCCGT GCTGGGGTTA ATAGCTTTGG GTTTGGGGGC
ACAAATGCCC ATGTTATTCT GGAAGCGCCA GCTCCAGCAC CAAATCCTGC GCACATACCA
AACGATTTTC AAATGCTACC AATTTCGGCT CGTACCGAGC AGGCTTTAGC CGAACTTGCA
GCTACTTTAG CCCAACGCAT GCAAACCGAT AAGGCGTTGA AGCTAGCCGA TGTATGCTTT
AGCTTGGCTG AGCGTGAAGT TTTTAGCCAC CGCGCAGTCC TAGCCAGCGA TGGGGCTGAG
CGAGCGCATA GCGAACTTGT CGATGGGTTG GCCTGTTTAG CCGCCGGAGC GGCAAATCCA
GTGCTGATCA CTGCGCCGCC AACCGCGCAA CGCCGTAAAA TTGCCCTGCT ATTTGCGGGA
CAAGGAGCAC AATATCCCCA GCAAGGGGCA CTGCTTTATC AACAAGAAGC AGTCTTCAGG
GCCACCCTGG ATGCTGCCTC GGCGCAACTT GGCCCAATTA ATGGGCGGCC ATTGCTCGAA
TGGTGTTTGG ATGCGGATGT TGATAGTCGT GCCTTGGCCG ATACCGCTGT GACCCAGCCG
CTGCTTGTGG CCTTTGAGGT GGCCCTAGCT CGCCTTGTTA TCAGTTGGGG GCTTAGTCCT
GATGCGCTGG TTGGCCATAG TGTTGGTGAG CTAGCAGCAG CATGTATCGC TGGTGTCCTA
AGTTTTGAGG CAGTTTTAGA GTTGGCGCGG GCACGGGGAC AGCTGATGGC AACATTGGCA
GAGCCGGGCA TGATGGCGGC AGTCTTTGCC CCAGAGCTGA TTGTTGCCAC AGTGGTGGCC
CACTATGCCG CTGACGTAAC AATCGCTGCC TACAATACAC CCAATCAGGT GGTTATTTCA
GGTCAGCGGG TTGCAGTTAT GCAGGCACTC GCAGATTTGG AGCGTGATGG ATTTAATGCG
GTGATTGTTA ACGATCATAT GGCTTATCAT TCACCCTTGA TCCAAGCAGC AGTGCCTGCG
ATTGCCGAGG TCGCCGCCAA GTTTCAGCCA GCAACCCCTA CGCTTCCACT GTTGAGTACG
GTCAGCGTTG AATGGATGCA TGGTACTGCT AAGCTCGATG CCGAGTACTG GGCAACCCAA
GTTGTCGAGC CAGTGCGGTT TGCTGCGGCG CTCGAGCGGT TATTCAATGA AGGGTTCGAT
ACCTTGATTG AGATTGGCCC AGGCAGCACG CTAACAGCCT TTGCTCGCCA AATGGCGGTA
GGGCGAGCTG CCAGCTCAAG CGTCGAGGCA CTGCTCAAGC GCGGCACAAA CGACTACACC
ACTATTCGCA CGGCAATTGG GCGCTTGTGG GTTCATGGCG TTGATTTCAA GCTATCAGCA
TTGGTGGGCA AACAGGGACA ACGTGTGCCC CTGCCAAATT ACCCGTTCGC CCGTATTCGC
CATTGGTTGC CAACCCCCCC CGAGCTTAAG CCACGTCTTC AGACACTTGA ACTTAAACCA
ACACCTGAGC ACCCAGCGTT GCTCCATGGT CAGGCGATTG CCGCAGTTAG TTTGCAAACT
ACTCCACGTG GGTATCGCTT GAATTTGAAG ACTGCCGATG GGCAAACCCT CCTCGAATTA
ACCAATCTTC GCGAGGTAGC CGCCCCCCAG CCTCCGCCCG ATCACGCGCC AGCGCTCCTT
CAACAAGTTG TTTGGACTCA GACTGCATTA GCTGCCCCAA GCGACCAAGC GCTCGAACAA
TGGGTTGTGA TTGCAGATAA TAACAATCCG TTGGCCGACC AGCTGCTTAG ATTGCTGAAT
ACAGCCAATC GAGCCTGTGT TGTAACAACC AGTGCAATGC TCAACAGCGT GCTTCCAACC
ATCTCCAATC GTTATGGATT GATCATCCTA GGTGCGCTGA GTTCCGATGA AGTGATTGCT
CAAACCAACG CGTTTGAGCA AACTTGTTAC GTTGGGGCGC TCCAACTCTT GGATTCAATC
AAGTCAATCT TGGCCTTACC CACATCCAAG CAACCCAACG GCTTATGGGT TGTTACAGCG
GGAGCATACG CGATTAACAA CCATACACAA GTTGTTGCGC CACAAGCATT GCTTGCCGGG
CTAGCAGCAG CCTTGCCCGA TCAGCGGATT AAATTTCCCT GTGTAGCACT TGATCTCGAA
TTGACCGACG ACATCCCAGC CCAAGGCCAA CTACTCTTGG GGGAGCTACA AACCCAACCA
AGCAATGGGG TCGTTGCATG GCGGGCGGGC AAACGGCTAA CCCGCACCTT AGCACCCTTA
GCGGGCGGCC CAAGCAAGCG ACCACCAAGT GAAAAGCCTG GTCGGGTGAT TATCATCGCA
GGTGGAACAG GCGGGGTTGG CGCTCAGCTT GCCCGCCATC TAGCAACCCA CAACCAACCA
ACCCTCATCT TGCTTGGCCG TTCTGCCCTT GATGCACAGC GCAGCAGCTT ACTTGAACAA
CTCAACGACT TGGGCGCAGT GGCTCGCTAC TGCCAAGTTG ATATTTGCGA TCCCCAACAG
GTAGACCAGC TTATTGCCGA GCTGGCAGCC TCCAGCAATG GGATTTTTGG GATCATTCAA
GCAGCAGGAA TTGTCGATGT AGGTTCATTA CAAGCCAAAA GCGCCCAGCA GTTGCTGGCC
GTGCTCGCAC CAAAAGTTAG CGGTACATGG CTGCTCGCGC GAGCGCTTGA GCGCTACCAG
CAACGACCAG CATTTTTTAT CAATTGCTCA TCAATTGCCG CTGTGGTCGC TGGCCTCGGC
GGGGGAATTG CCGATTACGT AGCAGCTAAT GCGTTTCTTG ATGCGTTTGC AGCAAGCGAG
CGCCAAGCTG GACGGCCCAT GACCACCCTA AATTGGGCCG CCTGGGATGG AATTGGCTTG
GCTGCTAATC CAATGTTGGT TGAGCAACTC CGCCAGCGCG GCTTACCACC ACTGCATCCA
AGCCAAGCAT TACAGGCATT TGACCAAGTG CTTTACACCG AGCAACGCCA AGTTGTTATT
TTGGCCCCTG TTGAATCGAC AGCTGAGCCG CAGTATCAAG CAACTAAAGC AGCAATTACT
CCGCCAGTTG CAATCAGTAA TCCTGCAATG AATGTTACCC AGCAGATTCA AGCCTTGGTG
GGTAACGCAT TGAAATTGCC ACCTGAACAG ATTTCTGAGG ATGCTTCGTT TCTGGCGCTC
GGCCTTGACT CGTTGCAAGC GGTCGATTTG GTTAAACAAC TTGAACAAAC CCTTGGCACA
ACCTTGCCAT TAACCCTGTT TTTTGAGTTC CAGACTATTC GTGAATTAGT GGCGTATCTG
AGTAAGCAGC GCCAAGTTGG AATAGCAGAA ACAGTAGCTA CTGATATTGT GATGCAGACT
ATTCCGTCGA TCAATCTAAA TCAAGCATTC CCAATCGCGC CAGCCCAAGT TAGCTTTTAC
GTGGGGCATC AGCTCTATCC GGCCAGCCCA GCCTTTACCC TGATTCGCCA ACAGATTGCG
GGCTTCCTCG ATCAAGTGGC CTTACAGCAG GCACTATGCT ATCTCGTTGA GCGCCATCCA
ATGCTGCGGG CGCAATTTGA GCCAGTTGAT CACGAGCAAC CTGAGCCACG CCAGCGGATC
ATCGCAGCAG ATCTGCTACC ACCTAGTTTG TGGTTTGAAC AACGGGAAGC TCCAGCGGAT
CATGCAGTCT TTGAGCAGCA GTTAGCCCAT TATCAATTTG ATTTGTTCAA GGCTCCGTTA
TTTCGAGTTG TGCTTTGGCC CGATAGCGCT GATCGTTGGG TATTGCTTTT GTTGCTTCAT
CATAGTATTG CCGATGGTTG GAGCACCAGC ATCCTCATCG ACGAGCTATG GCAGGTCTAT
ACCCAGCTGG TACAAAAACA GCCAATTGCT CTCCCAGCCC TAGCCTGTAC CTTTGAACAT
TACACCGAGC ACGCGCTTAA GGCCGCAACC AGCCAGCAAG CCAGCATTGA TCGCGCTTGG
TGGAATAGCT ATTTGGGTCA AAATAACGCT GCCATTACCT GGACACTGCC AACCGATGCT
CCCCTCAGCG AAGCAATCAC CCAGCCAATT GGTAGCTTCC ACCAGCAGCT AGATTTAGCT
ACAAGCAGGG AACTACGCCA GCATGCCGCA GCATTAGGAG TTTCACTCTT TCATCTGCTG
TTGGCGATCT ATGTTCGTCA GCTAGCAGAT TGGAGCCATA CTAATGCCCT TGCGATCAAT
GTGGCTGAAC ATGGGCGGAG CATGCGTTTG GTTGGCATTG AGCAAATAGT TGGCTGCTGC
GCTGATCATC TGCCGTTGCT TTTAACCCTT GACGAAGCTG CTGATATCAA CAGCTTAGCT
GGATTAATTC GTGATCAGTG GACAAGTATC CAACAACATT CCAACATCTC GGCTCTTGAT
TTGGCTCGCT TGTCGGGAGT ACGCCACCAA ACCGGGCCAC GTGCGCCAGG TGCGGCTTCG
TTCAGTCTTG CTCGCTTTCC CGGCAAGCTG CCCGAGGATT GTCCAATCAG CATTCAAGCG
CTAACCGCAA GTACAGCCAC AGCAGCAACG CAACTTTCCT TACTCATTGC CGAAGTCCGT
GGCGTGCTCC AGTGCACGTG GATCTATGCA ACGAGCGCCT TCCAAGCTCA CACCATCGAA
CAGCTAGCCA ACAGCTATCG GCGCGATCTA ATGGCGATCA TTCAGCCAAG CCAACCCCAG
CCAGCGTTAC AATCAAAGCT GGTCGCAGCC AAACCGCAAT CGCTGACTCC AAGCCGAATT
CTTGACCAGT GCTTGCGTCA GCCTGGGCGA GTTGCAGTCA ATGCTGATGG TCAATTATTA
ACCTATGCCC AACTAGCGAG CTATGCAGCA CAGGTGGCAA TTTGGCTTTT GGCCAACGGC
GCAGGCCCAA ATCAACCAGT TGCGCTGCTA ACCCAACCAG GAATTGCCAG CATTGTTGGG
ATGGTAGGGG CGCTATGGGC GGGCGTACCA TGGCTTGGGC TTAATCCCGA TTATCCATTA
GCCCAACTGC ACGATCAGCT TACTCAAGCG GGTGTTCAAC GACTGCTCCA TCACAATCAA
ACCCACCAAA CAGCTTTGCA ACTCCAGCAA AGCGCAATGC CGCAGCTTCA ACTAGGCTGG
CTTGATCAAC TCATTCAGCA AGTGACAGCC CTAACCTCAA TGCCATCGAT TGCCACGCCC
ACGCCGACTG ATCTTGCGTA TGTCATTTTC ACATCAGGCT CAACAGGCCG CCCAAAAGGT
GTGCCGATTA CCCATGGAGC GCTTGCTAAT TATCTTGAGT GGTTGGTTGA ACGCTTCGAT
TACAGCCCGA ATGATCGGCT GCTCCAAACC GCCGCGCTCA GCTTTGATGC GGCTATCAGT
CAGATTCTTG GGCCACTGAC CTCGGGTGGC AGCGTGATTA CCCTTAATGC GTTGGCGGTA
CGCGATCCAC TTGAGTTATT AGAGGTACTT GAGCGTGAGC GTCCGACCAT TTGGCGCTCT
GTCCCAGCAC TTTGGGAACG TGTGATCACG GCAATCGAAC GCCGGATTGC CGATGGGCAA
GCAGCACCCG CCTTGAGCGA ACTACGATTG ATTGGGGTTG GCGGCGAGGC GCTACCCGCG
AGCTATGTCC GACGCTGGAT GGATATCTAC GGCGAACAGC AGCAGATTGT TAATCATTAT
GGGCCAACCG AGGCCACGAT CAACGCAACT GCTTACCAAA TTAGGCAACG ACCAAGCATA
AACGCCCACA TTCCAATTGG CAAAGCAATC ACTGGCACGA TCACTCGTGT GCTTGATCAA
CAAGGGCAAA TCTGCCCGCT CGCAACAATC GGCGAGTTGT ACATCGGCGG CAGCGGGCTA
GCAGCGGGCT ATCTTGGGCG ACCCGATCTG ACGGCACTCC AGTTTGTGCC TGATCCACTA
CAAGCAGGCG CACGACTCTA TCGAACGGGT GATTTAGTTC GTGAGCTAGC CGATGGCAAT
TTGGTGTTTG TTGGTCGGGT TGATGAGCAA ATCAAGCTAC GCGGCTATCG CATCGAGCCA
GCCGAAATCG AGGCCGCATT ACAAGAACAT GAGGCAATTA CCAAAGCGGT TGCTTGTATG
GTTGAGGCTG GCGATCAGTC AATCTTGGCA GCCTATTTGG AAACAAAAGC CGTCTTGCCA
TCCGATCCGG AGCTACGGCG CTGGTTAGCA AAGCGTTTGC CGCCACAGAT GATTCCTCAG
CGCTTTTATG CGGTGGCATC CTTTCCAATC ACAAGTTCAG GCAAGATTGA TCGTGCTCGA
CTACGCTCGC TGCCAATTCC GGCCCCAATC AATGTAGCCC AAGGAGTGCA GCCGGAAACC
GCAACCGAGT TACTGCTGGC CGAAATCTGG CAGAAGGTGC TTAACCTACC ACATGTTTAT
CGCGATGACG ATTTCTTTGA ATTGGGCGGC GATTCGCTGT TGCTGTTGCA GGTGCTAACC
CGTCTAGAGG GCCGTGTCGC GGTACTACCA AGAGCCGCTA GTTTGTATGC CCAAAGTAGC
TTGGTAGGTT TCGCTCAGGC CTTGGATGCA GCGGCTAGCC AGCAACAATC AACTGAGCAA
CCAACGTTCG CCGAACAGCG ATCATCGATC CAAGCCGACA ATCCAACGTT TGCGCTTACC
CCCGCTCAAA TCGGTTTTAT GCTAACCGAG GCGTTTGATC CAGCGGCGGC AACAACCTGG
TGCGCCCGCC TTGCCATCAC AGGCCCACTG GATCAAGCGT TGCTTCAGCA GGCGCTTGGC
ATCCTCGTCA AGCGCCATCA AATGCTGCGC GTGCGCATAT TGACCGATCA ACGCCCGCCG
CTTCAGCAAG AACAACCATT TGAGCTTCCC CACTTGATCG TTCACGATGT GCAAGCGCTG
CTTGCTGCTG GTGCTGATGA ACATCAGCTA ATTGAGCAGC ACTGGTACGC AGAACAAACG
CAGCGCTTTC AACTTGATCA ACCGCCCTTG CTGCGCATGC GGGTATTGCG ATTAGCTCCA
ACTCGCCATA TCTGGCTGAT TGCCGCTCAT CACATTATTG GCGATGGTTG GAGTGCATGG
ATTTTCGGCC AAGAATTACT GCACATATAC GATAGTCTTG GGCGGGGTGA ATCGCCACGT
TTGCCCAATT TACGCTCAAC GTTTCAAGAT TATGTCAAAT TAATCCAACA ATCGAGTGAG
CAATCAGCAC TACACGCAGC ATACTGGCGT AATCAATTTC GCCAAACCTA TCACCGACCG
TTATTACCAG CCAATAATGT CGAGCCAGCT GCAACAACCC TGAATATCAG CCGCAGCCTA
CCAGCAAAGA CCTTGAAACA ACTACGGCAA GTGGCGGCAG TCGAGGGTTT AACCCCTTAT
GTAGTGCTGC TGAGTCTCTT TATTTATCAA TTGCGCCAAC TCACTGCGGC CAACGACCTG
GTAATCGGCA CTGCCCACGC TGGTCGCGAC CTTGCCTTGC CTGATATTGA GCGGATATTC
GGCTGTTTCG CCACCGCATT ACCAATTCGA TTCATTCATA ACCAGCCAGA AGTAGCGATC
CATAGTCTCT TGCAACCAGT CGCCCAAGCC TTCCGAAGCG CCTACCAGCA TGCACTAGCA
CCTACGGAAA TTGCCCGAAT CATTGGTGCT GATAACACAA TCTCAGCCAT TACTGCAACG
GGGGCACAAT TTTTCTTCAC ATTCTTGGAT TTTGAGGCTC TTGGAAGCTT GCAGAGCCAA
ACGCTAACGC TTGATTGGGA CAACTCATAT GCCGAAATTC AACCACCATT TGGCGCGACC
GAGTTGCTGT TTAGCGCACG CGCAACGAAT GGCAATTTGC GGCTCACACT GCAAGCAGCC
CCTACAAAAC TTGATCACAC CACCATGCAA GGCTTTATGG ATGGTTTGCT AGCAGCAATG
CAGCAACTTA TTGCAAAACC ACAGCCCATT AAGCAGCGCC AGATTCAGGT TGGCACACGG
CCAGTCAACA TCCAATCAAA CACCCTCGAT GCTGCATTGA TCGGCTATCT ACCACCAAGC
CGAAGTATTG CAGCGCTGGT TGGGCTACAA GGCGCTGAGT CCAAGCTACG CGAGCAGTTG
CGTGCGATGC TCTTCCCTGC TGGACAGCCG CGCTGGTTTG AACTACTTGA GACTCCCATT
GGATCATCAG CATTACTGTG TTTACCGCGC TTTGCCGAGG AGTTGCAGCC CAACCATGCC
GCCCACATCA CAAACGAAAT CGCTGCTGGA ATGGCTTTAG CCCAAGCGAG GGGCGTGCGC
TGCGTTTCAT TAGCAGGGAT GTTGCCAGCG CTTACTGGCT ATGGGTATGG TGTGCTGCGT
GCGCTAGCCC ACGATACCCA ACCCCAACCA GCCCTCACGA CTGGCCATGC CACAACCGTC
GTCGCCGTGG TACTAACGCT TGAGTCCGCC CTTGTCGTAA CAGGTCACGA ACTGGCACAA
AGCGACGTGG CCATTGTGGG GCTTGGCTCG ATTGGGCAAT CAGTCTTGCA TTTACTCCTC
AAAACCCTGC CACATCCGCG CTCGTTGGTG CTATGTGATC TCGCCAGCAA TCAATCGCGG
CTCAACGACC TCGCCACCCG CTTGCAGCAC GAGGCTGGCT ATACAGGCCC AATTAAGGTA
GTTGCCGCCG ATAGTGGAGT GCCGATCGAC GTTTACCAAA GCCAAATCAT TATTGCGGCC
ACAAGCACAG CCGGAATTAT TGAGGTAGAA CAGTTACAAG CCGGCACAAT TGTGGTTGAT
GATTCGTTTC CACCATGCCT TGACCCAGCC AAAGCCATTC AGCGTATGCA ACACCATGGC
GATGTCTTGA TCGTTGGTGG CGGGCAGCTC GCGTGTGGGC CAAGCCAGCG CACAATCGAT
CTCCCACTCA GCAATCCGGC CCTTTACGAA CGGATTTTAG CCGAAATTCT ACCGAATGCC
GCCGCTAGTT GCCAACTCGA AGCATTGTTA TGGGCAACCG ACCCAAGCTT ACCCCTGACG
CATGGACTGG TTACGGTCGA GGCAGCCTTG CGCTATCGAG CTGCGATAAT GCGGGCTGGC
TTTGGCCCGG CACCGCTCCA TTTACAAGGA TTTCAGCCAG ATCTCCAGCA CTTTACAACC
CTATAA
 
Protein sequence
MVADLLSCIA ATTTLPELLR TAAETTPDQV IIHIAAAGNE RSISYHELYR SSQAVGQTLR 
RSGLSTGQVV LIALESSVDF LVGFWGALFA GLVPAPLAAE PKRILAIWQS LEQPALLVNH
AVGESIIALA EQLAPPAQLA QERPTALDGP WTTAVQLFSP ITKRHRSGHI SAEGSVQPHD
LAYLQFSSGS TGQPRGVELS HAGLLANLYQ MGSACAINSQ DSVVSWMPYY HDMGLIAAHL
LPLAAGIKQV KIDEFYFARR PAIWLEITHQ HQASLLTAAP FALDLVNRRV KPAQLVGLDL
RCVRLLIVGA EPIVAASCRA FLAQLAPTGL SPQVLLPVYG LAEACVGVSL SPLGTGMTTH
HINRHILLHE ARASSPDENQ HHSTDPIDQT DWLELVDVGL PIPDCRVRIV DDQDKLLGDD
LIGHIQVSGS QLMRGYYRSN DPSAAFCDGW LRTGDLGFLR NGRLVITGRA KEIVIVNGQK
HHAPDLEDLI STVDGLHAKR IAVCGAERDG QRVVVFLAIN AWQTVLPAIN TAIRRLRRTT
GTTIIDIVPL RASQFPRTSS GKLKRNVLRE RYELGEFDAV IADVQQALAA LNSPPRMALN
HLEQAIIMLC AQTLELDPSQ IGLHDSVFEL GATSLQLMDL LAEIGDRFNR EPDAAVLRNH
PTPAGLIAWI QQPEIMSTNS MPDARSDRFA TPEPIAIIGM ACRLPDANTP EQFWLNLAAG
VDSIKQLPSP RHDTTASPAP NGQAWGSQLA TVSYFDHDFF NINADEAAAM DPQQRMLLEL
AYHALERAGY AAERRNGRRV GVFVGVGEAS YQELLLPLLA HSEQLHSSIA TGNMRNLIAG
RIAHCLDLNG PAIAIDTACS SSLVALHMAR TSLLVGDCDL ALVGGINLNL TETPYQLLER
AGALSPSGRC QAFDAAADGI VLGEGAGVLV LERLGHAQHN GDSILALIRG SAINNDGHSL
SPMAPNPLRQ TEVLRQAYRE ANLDPASISY IEAHGTGTAI GDPIEARSLA QAFPATSNQP
RRIGSVKTNL GHLLNAAGIA SLIKVILMFQ QRQIPPSLHY TTPNQRFDLA AAGMTINTTL
EPWHGPQPLR AGVNSFGFGG TNAHVILEAP APAPNPAHIP NDFQMLPISA RTEQALAELA
ATLAQRMQTD KALKLADVCF SLAEREVFSH RAVLASDGAE RAHSELVDGL ACLAAGAANP
VLITAPPTAQ RRKIALLFAG QGAQYPQQGA LLYQQEAVFR ATLDAASAQL GPINGRPLLE
WCLDADVDSR ALADTAVTQP LLVAFEVALA RLVISWGLSP DALVGHSVGE LAAACIAGVL
SFEAVLELAR ARGQLMATLA EPGMMAAVFA PELIVATVVA HYAADVTIAA YNTPNQVVIS
GQRVAVMQAL ADLERDGFNA VIVNDHMAYH SPLIQAAVPA IAEVAAKFQP ATPTLPLLST
VSVEWMHGTA KLDAEYWATQ VVEPVRFAAA LERLFNEGFD TLIEIGPGST LTAFARQMAV
GRAASSSVEA LLKRGTNDYT TIRTAIGRLW VHGVDFKLSA LVGKQGQRVP LPNYPFARIR
HWLPTPPELK PRLQTLELKP TPEHPALLHG QAIAAVSLQT TPRGYRLNLK TADGQTLLEL
TNLREVAAPQ PPPDHAPALL QQVVWTQTAL AAPSDQALEQ WVVIADNNNP LADQLLRLLN
TANRACVVTT SAMLNSVLPT ISNRYGLIIL GALSSDEVIA QTNAFEQTCY VGALQLLDSI
KSILALPTSK QPNGLWVVTA GAYAINNHTQ VVAPQALLAG LAAALPDQRI KFPCVALDLE
LTDDIPAQGQ LLLGELQTQP SNGVVAWRAG KRLTRTLAPL AGGPSKRPPS EKPGRVIIIA
GGTGGVGAQL ARHLATHNQP TLILLGRSAL DAQRSSLLEQ LNDLGAVARY CQVDICDPQQ
VDQLIAELAA SSNGIFGIIQ AAGIVDVGSL QAKSAQQLLA VLAPKVSGTW LLARALERYQ
QRPAFFINCS SIAAVVAGLG GGIADYVAAN AFLDAFAASE RQAGRPMTTL NWAAWDGIGL
AANPMLVEQL RQRGLPPLHP SQALQAFDQV LYTEQRQVVI LAPVESTAEP QYQATKAAIT
PPVAISNPAM NVTQQIQALV GNALKLPPEQ ISEDASFLAL GLDSLQAVDL VKQLEQTLGT
TLPLTLFFEF QTIRELVAYL SKQRQVGIAE TVATDIVMQT IPSINLNQAF PIAPAQVSFY
VGHQLYPASP AFTLIRQQIA GFLDQVALQQ ALCYLVERHP MLRAQFEPVD HEQPEPRQRI
IAADLLPPSL WFEQREAPAD HAVFEQQLAH YQFDLFKAPL FRVVLWPDSA DRWVLLLLLH
HSIADGWSTS ILIDELWQVY TQLVQKQPIA LPALACTFEH YTEHALKAAT SQQASIDRAW
WNSYLGQNNA AITWTLPTDA PLSEAITQPI GSFHQQLDLA TSRELRQHAA ALGVSLFHLL
LAIYVRQLAD WSHTNALAIN VAEHGRSMRL VGIEQIVGCC ADHLPLLLTL DEAADINSLA
GLIRDQWTSI QQHSNISALD LARLSGVRHQ TGPRAPGAAS FSLARFPGKL PEDCPISIQA
LTASTATAAT QLSLLIAEVR GVLQCTWIYA TSAFQAHTIE QLANSYRRDL MAIIQPSQPQ
PALQSKLVAA KPQSLTPSRI LDQCLRQPGR VAVNADGQLL TYAQLASYAA QVAIWLLANG
AGPNQPVALL TQPGIASIVG MVGALWAGVP WLGLNPDYPL AQLHDQLTQA GVQRLLHHNQ
THQTALQLQQ SAMPQLQLGW LDQLIQQVTA LTSMPSIATP TPTDLAYVIF TSGSTGRPKG
VPITHGALAN YLEWLVERFD YSPNDRLLQT AALSFDAAIS QILGPLTSGG SVITLNALAV
RDPLELLEVL ERERPTIWRS VPALWERVIT AIERRIADGQ AAPALSELRL IGVGGEALPA
SYVRRWMDIY GEQQQIVNHY GPTEATINAT AYQIRQRPSI NAHIPIGKAI TGTITRVLDQ
QGQICPLATI GELYIGGSGL AAGYLGRPDL TALQFVPDPL QAGARLYRTG DLVRELADGN
LVFVGRVDEQ IKLRGYRIEP AEIEAALQEH EAITKAVACM VEAGDQSILA AYLETKAVLP
SDPELRRWLA KRLPPQMIPQ RFYAVASFPI TSSGKIDRAR LRSLPIPAPI NVAQGVQPET
ATELLLAEIW QKVLNLPHVY RDDDFFELGG DSLLLLQVLT RLEGRVAVLP RAASLYAQSS
LVGFAQALDA AASQQQSTEQ PTFAEQRSSI QADNPTFALT PAQIGFMLTE AFDPAAATTW
CARLAITGPL DQALLQQALG ILVKRHQMLR VRILTDQRPP LQQEQPFELP HLIVHDVQAL
LAAGADEHQL IEQHWYAEQT QRFQLDQPPL LRMRVLRLAP TRHIWLIAAH HIIGDGWSAW
IFGQELLHIY DSLGRGESPR LPNLRSTFQD YVKLIQQSSE QSALHAAYWR NQFRQTYHRP
LLPANNVEPA ATTLNISRSL PAKTLKQLRQ VAAVEGLTPY VVLLSLFIYQ LRQLTAANDL
VIGTAHAGRD LALPDIERIF GCFATALPIR FIHNQPEVAI HSLLQPVAQA FRSAYQHALA
PTEIARIIGA DNTISAITAT GAQFFFTFLD FEALGSLQSQ TLTLDWDNSY AEIQPPFGAT
ELLFSARATN GNLRLTLQAA PTKLDHTTMQ GFMDGLLAAM QQLIAKPQPI KQRQIQVGTR
PVNIQSNTLD AALIGYLPPS RSIAALVGLQ GAESKLREQL RAMLFPAGQP RWFELLETPI
GSSALLCLPR FAEELQPNHA AHITNEIAAG MALAQARGVR CVSLAGMLPA LTGYGYGVLR
ALAHDTQPQP ALTTGHATTV VAVVLTLESA LVVTGHELAQ SDVAIVGLGS IGQSVLHLLL
KTLPHPRSLV LCDLASNQSR LNDLATRLQH EAGYTGPIKV VAADSGVPID VYQSQIIIAA
TSTAGIIEVE QLQAGTIVVD DSFPPCLDPA KAIQRMQHHG DVLIVGGGQL ACGPSQRTID
LPLSNPALYE RILAEILPNA AASCQLEALL WATDPSLPLT HGLVTVEAAL RYRAAIMRAG
FGPAPLHLQG FQPDLQHFTT L