Gene Cag_0529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0529 
Symbol 
ID3746339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp617409 
End bp631502 
Gene Length14094 bp 
Protein Length4697 aa 
Translation table11 
GC content47% 
IMG OID637773063 
Producthypothetical protein 
Protein accessionYP_378845 
Protein GI78188507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTAA AAGCATTTAA TTCGGCACAC TTTATTACAG CTTTAAGAGG ATTTAATTCA 
GTGTACTATT TGGGTGTAAA GCTCGCCCAA CTTCAGGCAA CACCAAGCAG TGGTTGGGAT
AGTTCTAAAA CAATAGATGA TCTTTTAACT ACGTTAAAAG CTGAGGGCTT TACTCCAGAA
ACTCATTATA TGCGGTATGG TTATAGAGAA AATCTTGCGC CAAACGCATT CTTTAATGCG
GCGGAGTACA TACAGGCAAA AGCGAATCAA TTGGTTACGG TGGATCATCG GTATGCCTCT
GTTGAGGCTG CTAAAGCGGC ATTTTTGGCG GCATGGGATG GAGATGTCTA TCAGCACTAT
CTACGGTATG GTGCGGCGGA AAATGTAAAT CCATCAAATG CTTTTGATGA GTCGGCTTAT
TACGCATTAA AGCTTGCTGC GCTTCGGGCT GATCCATTAA CCAGCGCTGA ATGGACACCA
AAAAGTGTTG CTGATTTGCA GAGATATTTT AAAAATGCTG GTTTTACGGC GTTGACACAT
TATGAGGCTT ATGGTAAAGC TGAGGGAATT GTGGTTACGC CTGTTTTATC CTCGCTAACA
CCATCTTTAT TTAATCCCAC AGAGTACACA CAAGCAAAAG CTAATCAGCT ATTTTTACAA
CATGCTTATG ACAGTGTTGA TGCTGCCAAA ACAGCATTCC TTAAAGCTTG GAATCAGAAT
GTCTATCAGC ACTATCTACA GTATGGTGCG GCGGAAAATG TAAATCCATC AAATGCTTTT
GATGAGTCGG CTTATTACGC ATTAAAGCTT GCTGCGCTTC GAGCTGATCC ATTAACCACG
GTTGAGTGGA CATCGAAGAG TGTTGCTGAT TTGCAGAGAT ATTTTAAAAA TGCCGGTTTT
ACGGCGTTGA CACATTATGA GGCTTATGGT AAAGCTGAGG GAATTGTTGT TACGCCTGTT
CCTGTAGGTG AAAAGGTTGC TGATACATTG TTTGCCGTTA CGATAGATGG TGCTGCTACT
CCAACGGTAA CAATTACGAG CAGTTCAAGC GCGTTGAAGG CGGGAGAAAC AGCAACGATC
ACTTTCACCT TCAGCGCAGA TCCCGGTGCA AGCTTTGTGG CTACGGATAT TGTAACCACC
GGTGGCACGC TTGGAGATTT GAGTGGCACC GGACGGGTGA GAACGGCGAC GTTTACGCCA
ACAGCGAGCC TAAAGTTCGG CAGTGCGAGC ATCACGATAG CCGTAAGGAA CTATACCGAT
GCAGCAGGAA ACACCGGTAG CGCAGGTACA ACACCCACGA TAACGATTGA CACGCTGGCG
CCGACAGTAG CGATCACCAG CAGCACCAGT GCTCTGAAGG CGGGCGAAAC AGCGACAATC
ACCTTCACCT TCAGCGAAGA TCCCGGTACA AGCTTTGTTG CTACGGATAT TGTGACCACA
GGTGGCACGC TTGAAGATTT GAGTGGCACC GGACGGGTGA GAACGGCGAA GTTTACGCCA
ACAGCGAACC TAAACTTCGG CAGTGCGAGC ATCACGATAG CCGTAAGGAA CTATAGCGAT
ACAATAGGGA ACACCGGCGG CGCAGGCACG ACACCCAAGA TAACGATTGA CACGCTGGCG
CCGACAGTAG CGATCACCAG CAGCACCAGT GCTCTGAAGG CGGGCGAAAC AGCGACAATC
ACCTTCACCT TCAGCGAAGA TCCCGGTACA AGCTTTGTTG CTACGGATAT TGTGACCACA
GGTGGCACGC TTGAAGATTT GAGTGGCACC GGACGGGTGA GAACGGCGAA GTTTACGCCA
ACAGCGAACC TAAACTTCGG CAGTGCGAGC ATCACGATAG CCGTAAGGAA CTATAGCGAT
ACAATAGGGA ACACCGGCGG CGCAGGCACG ACACCCAAGA TAACGATTGA CACGCTGGCG
CCGATGGTGG TGATCACGAG CAGCGCCAGC GCACTGAAAG CAGGCGAGAC AGCGACGATC
ACGTTCACCT TCAGCGAAGA CCCCGGTACA AGCTTTGTTG CTACGGATAT TGTGACCTCA
GGCGGCACCC TTGGAACGCT GAGTGGCACC GGACTGGTGA GAACAGCGAT GTTTACGCCA
ACAGCAAACC TTGCTAACGG CAGTGCAAGC ATCACGGTAG CCGCCGGGAA CTATGCCGGT
CCAGCAGGCA ACACCGGCAG CGCAGGCACG ACGCCTGTGG TGACGATTGA CACGTTGGCT
CCGACACTCT CCTCCAGTAT TCCAGCAGAT AATGCGATGG CAGTTTTGGT GGGAGCAAAC
ATTGTACTGA ATTTCAGCGA AAGCGTTACA GCCGTTGCTG GTAAGAATAT TGTGCTGCAC
AATGTGACAG ACTCTACCAC TACCACCATT GCCGCCAATG ATGCTCAGAT TTCTATAGTT
GCTGGTGTTG TAACCATTAA CCCGACGGCA GATTTTCTCA ATGGAAAAAA CTATTACGTT
ACAGTTGATG CAGGAGCTTT CATAGATGGT GCTGGTAACG ATTATGCTGG AATTGCAGAT
GCGACGTTAT TAAACTTCAC GATAACTCCC GATGTTACAG CCCCAACACT CTCCTCGAGT
ATTCCTGCCG ACAACGCGGT AGCCGTTGCA GTGGGTGCCA ATATCGTACT GAATTTCAGC
GAAAGCGTTA CAGCCGTTGC TGGCAAGAAT GTTGTGCTTC ACAATGTGAC GGACTCTACT
ATAACCACCA TTGCAGCCAA CGATGCTCAA GTTTCTATTG TTGCTGATGT TGTTATCATT
AACCCCACAG CAGATTTCCT CAATGGAAAA GACTACTATG TTACGGTTGA TGCAGGAGCT
TTCATAGATG GTGCTGGTAA CGGTTATGCT GGAATTACAG ATGCGACGCT ATTAAACTTC
ACGATAACTC CCGATGTTAC TGCCCCAACA CTCTCCTCGA GTATTCCTGC CGACAACGCG
GTAGCCGTTG CAGTGGGTGC CAATATCGTA CTGAATTTCA GCGAAAGCGT TACAGCCGTT
GCTGGCAAGA ATGTTGTGCT TCACAATGTA ACGGACTCTA CTACAACCAC CATTGCCGCT
AATGATGCTC AAGTTTCTAT TGTTGCTGGT GTTGTTATCA TTAACCCCAC AGCAGATTTG
CTCAATGGAA AAGACTACTA TGTTACGGTT GATGCAGGAG CTTTCATAGA TGGTGCTGGT
AACGGCTATG CTGGCATTGC AGATGCGACA CTATTAAACT TCACGATAAC TTCCGATGTT
ACTGCCCCAA CACTCTCCTC GAGTATTCCT GCCGACAACG CGTTGGCAGT TGCAGTGGAT
GCAAACATTG TGCTGAATTT CAGCGAAAGC GTTACAGCCG TTGCAGGCAA GAGTGTTGTG
CTGCACAATG TGACGGACTC TTCTACAACC ACCATTGCCG CCAATGATGC TCAAGTTTCT
ATTGTTGCTG GTGTTGTTAT CATTAACCCC ACAGCAGATT TCCTTAATGG AAAAGACTAC
TACGTTACGG TTGATGCGGG AGCTTTCATT GATGGCGCTG GTAACAGCTA TGCTGGAATT
GCAGATGCGG CAACATTAAA CTTCACGACA ACTCCCGATG TTACAGCCCC AACACTTTCC
TCGAGTATTC CTGCCGACAA CGCGGTAGCC GTTGCAGTGG GTGCCAACAT CGTACTGAAT
TTTAGTGAAA GCGTTACAGC CGTTGCTGGC AAGAATGTTG TGCTGCACAA TTTGACAGAC
TCTACCACAA CCACTATTGC TGCCAACGAT GCTCAGATAT CCATTGTTGG ATCAGTAGTA
ACCATTAACC CGACGGCAGA TTTTCTCAAT GGAAAAAACT ACTACGTTAC GGTTGATGCG
GGAGCTTTCA TTGATGGTGC TGGTAATGGT TATGCTGGTA TTGCAGATGC GGTAACATTC
AACTTCACGA CAACTCCCGA TGTTACAGCC CCAACACTCT CTTCGAGTGT ACCCGCCGAC
AATGCGACGG CAGTTGCATT AGGAACCAAT ATCGTACTGA ATTTCAACGA AAGCATTACA
GCCGTTGCAG GCAAGAGTGT TGTACTGCAC AATGTGACGG ATTCTACTAC TACCACTATT
GCCGCCAACG ATGCTCAGAT ATCCATTATT GGATCAGTAG TAACCATTAA CCCGACGGCA
GATTTTCTCA ATGGAAAAGA CTACTACGTT ACAGTTGATG CAGGAGCTTT CATTGATGGT
GCTGGTAACA GCTATGCAGG TATTGCAGAT GCGGCAACAT TAAACTTCAC GACAACTCCC
GATGTTACAG CCCCAACACT CTCCTCGAGT ATTCCTGCCG ACAACGCGGT AGCCGTTGCT
GTGGGAGCAA ACATTGTACT GAATTTCAAT GAAAGCGTTA CAGCCGTTGC AGGTAAGAGT
GTTGTGCTGC ACAATGTGAC AGACTCTACC ATAACTACGA TTGCCGCCAA CGATGCTCAG
ATCTCCATTG TGGGATCTGT AGTAACCATT AACCCGACGG CAGATTTTCT CAATGGAAAA
GACTATTACG TTACGGTTGA TGCGGGAGCT TTTATAGATG GTGCTGGTAA TGGCTACGCC
GGTATCACAA GCGCGACGGC ATTAAACTTC ACGACAACTC CTGACGTGAC AGCCCCAACA
CTCTCCTCCA GTGTACCCGC CGACAACGCG TTGGCAGTTG CGCTGGGAGC GAACATTGTA
CTGAATTTCA GCGAAAGCGT TACAGCCGTT GCAGGTAAGA ATATTGTGCT GCACAATGTG
ACGGACTCTA CCATAACTAC GATTGCCGCC AACGATGCTC AGATCTCCAT TGTGGGATCT
GTAGTGACGA TTAACCCGAC GACAGATTTT CTCAATGGAA AAGACTACTT CGTCACGGTT
GATGCGGGAG CTTTTATAGA TGGTGCTGGT AATGGCTACG CCGGTATCAC AAGCGCGACG
GCATTAAACT TCACGACAAC TCCTGACGTG ACAGCCCCGA CGCTCTCCTC GAGTGTACCC
GCCGACAACG CGGTAGCCGT TTCAGTGGGA GCCAATATCG TATTGAATTT CAATGAAAGC
GTTACAGCCG TTGCAGGCAA GAATATTGTG CTACACAATG TGACGGACTC TACTACTACC
ACCATTGCCG CCAATGATGC TCAAATATCC ATTATTGGAT CAGTAGTAAC CATTAACCCA
ACAGCAAATT TCCTCAATGG AAAAGACTAC TACGTTACGG TTGATGCGGG AGCTTTTATA
GATGGTGCTG GTAATGGCTA CGCCGGTATC ACAAGCGCGA CGGCATTAAA CTTCACGACA
ACTCCTGACG TGACAGCCCC GACGCTCTCC TCGAGTGTAC CAGCCGACAA CGCGTTGGCA
GTTGCAGTGG GAGCAAACGT CGTATTGAAT TTCAATGAAA GCGTTATAGC CGTTGCAGGC
AAGAATGTTG TGTTGCACAA TGTGACGGAC TCTACCACTA CCACTATTAC AGCCAACGAT
GCTCAGATCT CCATTGTTGG CTCAGTAGTA ACCATTAACC CAACAGCAAA TTTCCTCAAT
GGAAAAGACT ACTATGTTAC GGTTGATGCG GGAGCTTTTA TAGATGGTGC TGGTAACGGT
TATGCAGGAA TTGCAGATGC GGTAACATTA AACTTCACGA CAACTCCAGA CGTTACCGCT
CCAACACTCT CATCAAGTGT ACCAGCCGAC AACGCGTTGG CAGTTGCAGT GGGAGCAAAC
GTCGTATTGA ATTTCAATGA AAGCGTTACA GCCGTTGCTG GCAAGAATGT TGTGCTGCAC
AATGTGACAG ACTCTACCAT AACTACGATT GCCGCCAACG ATGCTCAAGT TTCTATTGTT
GCAGGTGTTG TAACTATTAA CCCCACAGCA GATCTTCTCA ATGGAAAAGA CTATTACGTT
ACGGTGGATA CGGGAGCTTT TATAGATGGT GCTGGTAACG GTTATGCAGG CATTGCAGAT
CCGACGGCAT TAAACTTCAC GATAACTCCT GACGTTACAG CTCCGACGCT CTCCTCCACT
GTTCCCGCCG ACAACGCTAC GGCAGTTGCA TTAGGAGCCA ATATCGTACT GAATTTTAGC
GAAAGCGTTA CAGCCGTTGC AGGCAAGAAT GTTGTGCTGC ACAATGTGAC GGATTCTACT
ACTACCACGA TTGCTACCAA CGATGCTCAA GTTTCTATCG TTGCTGGCGT TGTAACCATT
AACCCGACGG CAGATTTCCT TAATGGAAAA GACTATTACG TTACGGTTGA TGCGGGAGCT
TTTATAGATG GTGCTGGTAA CGGCTATGCT GGCATTGCAG ATACGGTAAC ATTAAACTTC
ACGACAACTC CCGATGTTAC AGCCCCAACA CTCTCCTCGA GTATTCCTGC CGACAACGCG
GCGGCTGTTG CGCTGGGTGC CAATATCGTA TTGAATTTCA ATGAAAGCGT TACAGCCGTT
GCAGGCAAGA ATATTGTGCT GCACAATGTG ACGGACTCTA CTACTACCAC CATTGCCGCC
AATGATGCTC AAATATCCAT TATTGGATCA GTAGTAACCA TTAACCCAAC GGCAAATTTC
CTCAATGGAA AAGACTACTA CGTTACGGTT GATGCGGGAG CTTTTATAGA TGGTGCTGGT
AATGGCTATG CCGGTATCAC AAGCGCGACG GCATTAAACT TCACGACAAC TCCTGACGTG
ACAGCCCCGA CGCTCTCCTC CACTGTTCCC GCCGACAACG CGGCGGCAGT TGCGCTGGGA
GCGAACATCG TACTGAATTT CAATGAAAGC GTTATAGCCG TTGCAGGCAA GAATGTTGTG
CTGCACAATG TGACGGATTC CGCTATCACC ACCATTGCTG CCAACGATGC TCAGATCTCC
ATTGTTGGCT CAGTAGTAAC CATTAACCCA ACGGCAAATT TCCTCAATGG AAAAGACTAC
TACGTTACGG TTGATGCGGG AGCTTTCATT GATGGTGCAA GTAACGGTTA TGCTGGAATT
GCAGATACGG TAACATTAAA TTTCACGACA ACTCCCGATG TTACAGCCCC AACACTCGCC
TCCAGTATTC CTGCGGATAA TGCGATGGCA GTTTTGGTGG AAGCGAACAT TGTACTGAAT
TTTAGCGAAA GCGTTACAGC CGTTGCAGGT AAGAATATTG TGCTGTACAA TATGACGGAT
TCCGCTATTA CCACCATTGC TGCCAACGAT GCTCAGATAT CCATTATTGG ATCAGTAGTA
ACCATTAACC CGACGGCAGA TTTTCTCAAT GGAAAAGACT ACTACGTTAC AGTTGATGCA
GGAGCTTTCA TAGATGGTGC TGGTAACAGC TATGCTGGTA TTGCAGATGC GGCGACATTA
AACTTCACAA CATTTCTTGT TGTGCCTCCA CCAGACCTTA TACCTCCAAC GCTCTCCTCA
AGTGTTCCTG CGGATAATGC GATGGCGGTT TTGGTGGGAG CGAATATTGT GCTGAATTTC
AATGAAAGCG TTACAACCGT TGCAGGCAAG AATGTTGTGT TGCACAATGT AACGGACTCT
ACCATAACTA CGATTGCCGC CAACGATGCT CAGATCTCCA TTGTGGGATC TGTAGTAACA
GTTAACCCGA CGACAGATTT TCTCAATGGA AAAAGTTACT ACGTTACGGT TGATGCGGGA
GCTTTTATAG ATGGTGCTGG TAACAGTTAT GCAGGAATTG CAGATCCGAC GGCATTAAAC
TTCACGATAA CTCCTGACGT TACAGCTCCG ACGCTCTCCT CCACTGTTCC CGCCGACAAC
GCGGTAGCCG TTGCAGTGGG AGCGAACATC GTATTGAATT TCAATGAAAG CGTTACAGCC
GTTGCTGGCA AGAATATTGT GCTGCACAAT GTGACGGATT CCGCTATTAC CACCATTGCT
GCCAATGATG CTCAGATCTC CATTGTTGCT GGTGTTGTGA CCATTAACCC GACGGCAGAT
TTCCTTAATG GAAAAGACTA CTACGTTTCG GTTGATGCGG GAGCTTTTAT AGATGGTGCT
GGTAACGGCT ATGCTGGCAT TGCAGATACG GTAACATTAA ACTTTACGAC AACTCCCGAT
GTTACAGCCC CAACACTCGC CTCGAGTGTT CCCGCCGACA ACGCGGCGGC AGTTGCCATG
GGAGCCAATA TCGTACTGAA TTTCAATGAA AGCGTTACAG CCGTTGCTGG CAAGAATATT
GTGCTGCACA ATGTGACAGA CTCTACCATA ACTACGATTG CCGCCAACGA TGCTCAAATA
TCCATTGTTG CAGGTGTTGT TACCATTAAC CCAACGGCAG ATTTTCTCAA TGGAAAAGAC
TATTACGTTA CGGTTGATGC GGGAGCTTTC ATTGATGGTG CTGGTAACGC TTATGCAGGC
ATTGCAGACC CAACGGCATT AAACTTCACG ACAACTCCCG ATGTTACAGC CCCAACACTC
GCCTCGAGTG TTCCAACGGA TAATGCGGCA GCCGTTGCCG TGGGAGCCAA TATCGTACTG
AATTTCAATG AAAGCGTTAC AGCAGTTGCA GGCAAGAATA TTGTGCTGCA TAATGTGACA
GACTCTACCA CTACCACTAT TGCCGCCAAT GATGCTCAGA TCTCCATTGT TGCTGGTGTT
GTGACCATTA ACCCGACGGC AGATTTTCTC AATGGAAAAG ATTATTACGT TACGGTTGAT
GCTGGAGCTT TCATAGATGG TGCTGGTAAC GGTTATACTG GCATTGCAAA TGCGGCAACA
TTAAACTTTA CGACAACTCC CGACGTTACA GCCCCAACAC TCTCCTCTAG TATTCCTGCC
GACAACGCGG TAGCCGTTGC AGTGGGTGCC AATATCGTAC TGAATTTCAG CGAAAGCGTT
ACAGCCGTTG CAGGCAAGAA TATTGTGCTG CACAATGTGA CGGATTCCGC TATTACCACC
ATTGCCGCCA ATGATGCTCA AGTTTCTATT ATTGCTGGTG TTGTAACCAT TAACCCGGCG
GCAGATTTTC TCAATGGAAA AAACTACTAC GTTACGGTTG ATGCGGGAGC TTTCATAGAT
GGTGCTGGTA ATGGTTATGC TGGTATTGCA GACCCAACGG CATTAAACTT CACGACAACT
CCCGATGTTA CAGCCCCAAC ACTCGCCTCA AGTGTTCCAA CGGATAACGC GGCAGCCGTT
GCCGTGGGAG CCAATATCGT ACTGAATTTC AATGAAAGCG TTACAGCCGT TGCAGGCAAG
AATGTTGTGC TGCACAATGT GACGGACTCT ACTACTACCA CCATTGCCGC CAACGATGCT
CAGATATCCA TTGTGGGATC TGTAGTGACC ATTAACCCGA CGGCAGATTT CCTCAATGGA
AAAGACTACT ACGTTACGGT TGATGCGGGA GCTTTCATTG ATGGTGCTGG TAACAGTTAT
GCTGGCATTG CAGATGTGGC AACATTAAAC TTCACGACAA CTCCCGATGT TACCGCCCCA
ACGCTCTCAT CAAGTGTTCC AGCGGATAAC GCGGCGGCCG TTGCAGTGGG TGCCAATATC
GTACTGAATT TCAACGAAAG CGTTACAGCC GTTGCAGGCA AGAATGTTGT GTTGCACAAT
GTGACGGACT CTACTATTAC CACTATTGCC GCCAATGATG CTCAGATCTC CATTGTTGCT
GGTGTTGTGA CCATTAATCC GACGGCAGAT CTTCTCAATG GAAAAGACTA CTACGTTACG
GTTGATGCAG GAGCTTTCAT TGATGGTGCT GGTAACGCTT ATGCAGGCAT TGCAGATCCA
ACAGCATTAA ACTTCACGAC AACTCCTGAC GTGACAGCCC CAACACTCTC CTCCACTGTT
CCGGCCGATA ACGCGGCAGC AGTTGCGCTG GGAGCCAATA TTGTATTGAA TTTCAATGAA
AGCGTTACAG CCGTTGCTGG CAAGAATGTT GTGCTGCATA ATGTGACGGA TTCCGCTATT
ACCACCATTG CCGCCAATGA TGCTCAAGTT TCTATTATTG CTGGTGTTGT AACCATTAAC
CCGGCGGCAG ATTTTCTCAA TGGAAAAAAC TACTACGTTA CGGTTGATGC GGGAGCTTTC
ATTGATGGTG CTGGTAATGG TTATGCTGGA ATTGCAGATA CGGTAACATT AAACTTCACG
ACAACTCCCG ATGTTACCGC CCCGATGCTC TCCTCGAGTG TACCCGCCGA CAACGCGGCA
GCAGTTGCAT TAGGAGCCAA TATCGTACTG AATTTTAGCG AAAGCGTTAC AGCCGTTGCT
GGCAAGAATA TTGTGCTGCA TAATGTGACA GACTCTACCA CTACCACTAT TACAGCCAAC
GATGCTCAAG TTTCTATCGT TGCAGGTATT GTAACTATTA ACCCGACGAC AGATTTTCTC
AATGGGAAAG ACTACTACGT TACGGTTGAT GCGGGAGCTT TCATTGATGG TGCTGGTAAT
GGTTATGCTG GAATTGCAGA TACGGTAACA TTAAACTTCA CGACAACTCC CGATGTTACC
GCCCCGATGC TCTCCTCGAG TGTACCCGCC GACAACGCGG CAGCAGTTGC ATTAGGAGCC
AATATCGTAC TGAATTTTAG CGAAAGCGTT ACAGCCGTTG CTGGCAAGAA TATTGTGCTG
CATAATGTGA CAGACTCTAC CACTACTACT ATTGCCGCCA ATGATGCTCA GATCTCCATT
GTTGCAGGTG TTGTTACCAT TAACCCGACG ACAGATTTTC TCAATGGAAA AAACTACTAC
GTTACGGTTG ATTCAGGAGC TTTTATAGAT GGTGCAGGTA ACGGCTATAC TGGAATTACA
GATCCAACGG CATTAAACTT TACGACAACT CCCGATGTTA CTGCCCCAAC ACTTTCCTCA
AGTGTACCCG CCGACAATGC AGCAGCAGTT GCAGTGGGAG CGAACATCGT ATTGAATTTC
AATGAAAGCG TTACAGCCGT TGCTGGTAAG AATATTGTGC TGCACAATGT GACAGACTCT
ACCATAACTA CGATTGCCGC CAACGATGCT CAAATATCCA TTGTTGCTGG TGTTGTTACC
ATTAACCCAA CGGCAGATTT CCTCAATGGA AAAGACTACT ACGTTACTGT TGATGCAGGA
GCTTTCATTG ATGGTGCTGG TAACGGTTAT GCAGGCATTG CAGATGCGGC AACATTAAAC
TTCACGACAA CTCCCGACGT GACAGCCCCA ACACTCTCCT CAAGTGTTCC AGCCGACAAC
GCGTTGTCAG TTGCATTAGG AGCCAATATC GTATTGAATT TCAATGAAAG CGTTACAGCC
GTTGCTGGTA AGAATATTGT GCTGCACAAT GTGACGGATT CTACTACTAC TACTATTGCC
GCTAATGATG CTAAAGTTTC TATCGTTGGA GGTGTTGTAA CCATTAACCC GACGGCAGAT
TTTCTCAATG GAAAAAACTA CTACGTTACG GTTGATGCAG GAGCTTTCAT AGATGGTGCT
GGTAATGGTT ATGCTGGCAT TGCAGATGCG GCAACATTAA ACTTCACGAC AACTCCCGAT
GTTACCGCCC CGATGCTCTC CTCAAGTGTA CCCGCCGACA ATGCAGCAGC AGTTGCAGTG
GGAGCCAATA TCGTACTGAA TTTCAACGAA AGCGTTACAG CCGTTGCAGG TAAGAATATT
GTGCTGCACA ATGTGACGGA TTCTACTACT AACACCATTG CAGCCAATGA TACTCAAGTT
TCTATCGTTG CTGGTGTTGT TACCATTAAC CCGACGGCAG ATTTCCTCAA TGGAAAAAAC
TACTACGTTA CGGTTGATGC AGGAGCTTTC ATTGATGGTG CTGGTAACGG TTATGCAGGC
ATGGCAGATA CGACGCTATT AAACTTCACG ACAACTCCTG ACGTGACAGC CCCAACACTC
TCCTCGAGTG TACCCGCCGA CAACGCTACG GCAGTTGCAT TAGGAGCCAA TATCGTACTG
AATTTCAGCG AAAGCGTTAC AGCCGTTGCT GGCAAGAGTG TTGTGCTGCA CAATGTAACG
GACTCTATTA CAACCACCAT TGCAGCCAAC GATGCTCAAG TTTCTATTGT TGCTGGTGTT
GTTATCATTA ACCCCACAGC AGATTTTCTC AATGGAAAAG ACTACTACGT TACGGTTGAT
GCGGGAGCTT TCATTGATGG TGCTGGTAAC AATTATGCTG GAATTGCAGA TGCGGCAACA
TTAAATTTCA CGACAACTCC CGATGTTACC GCCCCAACAC TCTCCTCAAG TATTCCCGCC
GACAACGCGT TGTCAGTTGC AGTGGGAGCA AATATCGTAC TGAATTTCAA TGAAAGCGTT
ACAGCCGTTG CAGGCAAGAA TGTTGTGCTG CACAATGTGA CAGATTCTAC TACTACCACC
ATTGCCGCCA ACGATGCTCA GATATCCATC ATGGGATCTT TAGTAACCAT TAACCCGACG
GCAGATTTTC TCAATGGAAA AGACTACTAC GTTACAGTTG ATGCGGGAGC TTTCATTGAT
GGTGCAGGTA ACGGTTATGC TGGCATTGCA GATCCAACGC TATTAAACTT CATAACGGCT
CCTGATGTTA CAGCCCCCAC ACTCACCTCC AGTGTACCCG CCGACAACGC GACAGCAGTT
TCAGTGGAAG ACAACATCGT ACTGAATTTC AGCGAGAACG TCTTAGCGAA CACTGGTTAT
ATCGTGCTCA AGGCAACGGC GGACAACGCG ATCATTGAAA GCTTTAACAC TGCAACTGGA
CAGGGCAATC ATGGTGGCAC AGTTACGGTT ACAGGCGTAT CGGTAACTGT TGACCCTATG
GCATATCTCA CAGCCAACAC AGGATACTAC GTTACTGTCG ATTCGACTGC CGTTAAAGAT
GTAGTGGGTA ATAATTATGC TGGTATTGTT AGCTCTACAG AATTAAACTT CACCACGCCC
ACACCCACAT CGTATAACCT CACTACGTTT GCGGATATTG CTCCTGCATT TGTAGGAACA
GTAGGGGATG ACATTTTTAA TGGTACTTAT GGTGATGGAG CAGGTCCTTA TACACTTGAC
GCTACCGATG TTTTAAATGG TGGAACTGGA GTAGATACGC TCTCTATTAC AACGGGAGCT
GAAGCTAGTA CGCCGCCTGA CTCTCTGTGG GCTAACAAAA CAAATTTTGA AAAGGTGGAA
TTTCACTCAA CAGGTGCTGG TGCGCAGAGC ATTACCACAG GTGTAAATTT TAATACGGCA
TTTGCTGGGC ATGTAGATCT AATTGTAGAG ACATATAATG GAGCAACAAC GATTGAGATG
CAGGCTTTTG ATGGAACCTC CACGCTTGTA GCAACAACGA CCTTGGATGG TGCGCAGACA
ATTACGACCA GCAACACTCA CGCTGCTATT GTTAAAGCAA TAAATTCTGC TGCTGGTGCG
CAGACAATTA GTGGTCAATT TCTTACTGAG GTACAGGCAA CAATAAATGG AGCAGGCGCT
CAAACTATTG GTAATGCTCT TGGAGGAGGT TCACATCTTA TCAATGTTAC AGCAACTGTT
CTTGGAGCTG GTGATCAAAC GATTACCACT ACCAGCACGG GTAATGCTAC GGTAAATGCA
ACATGCACGA CAGGAACTCA GAGAATTGTA ACTGGTGTGG GTAATGATTC TGTAACTGCA
CATTCGACTA CAGCCTCAAA CAATGTAATT ACGACCGATG CTGGTAACGA TACCATTATT
GCGGGTCAGG GTAATGACTC GATCACTGGC GGTCTCGGTT CTGATAGTAT GACCGGAGGC
GGAGGCACTG ATACGTTTGT ATTTGGTGCG AACGGTTCAA TTGTTGGTGC CTCAATGGAT
ATTATTACAG ATTTCAACAA TGCGGGTGCT GATATTCTCA CCTTTGGTGG CAATACAACG
GTGTTGGCTG CTGATGCGAG TGTTCTTGTT GCTGGTACAA ATGTGCAGAC CTCTGATGGT
GGCTTGATTA CTTTTGATGT TTCAGATAAT ACGTTGGCGT TTAAAATTGC AGCAGTCGAG
GCTGATGCCC AGCTTGATGT GGCTGGCTCT GTTGCAATGT TTGTTGATAG TGGTAATACT
TATCTCTATT ATGCTGGTAT AGCCGCTGGA AATCTGGATG ATCAAGTTAT ACAGTTGACA
GGTATTACCA CGTTTATTAC TATTACGGGC GGACCGACAA CAACTATTAT TTAG
 
Protein sequence
MILKAFNSAH FITALRGFNS VYYLGVKLAQ LQATPSSGWD SSKTIDDLLT TLKAEGFTPE 
THYMRYGYRE NLAPNAFFNA AEYIQAKANQ LVTVDHRYAS VEAAKAAFLA AWDGDVYQHY
LRYGAAENVN PSNAFDESAY YALKLAALRA DPLTSAEWTP KSVADLQRYF KNAGFTALTH
YEAYGKAEGI VVTPVLSSLT PSLFNPTEYT QAKANQLFLQ HAYDSVDAAK TAFLKAWNQN
VYQHYLQYGA AENVNPSNAF DESAYYALKL AALRADPLTT VEWTSKSVAD LQRYFKNAGF
TALTHYEAYG KAEGIVVTPV PVGEKVADTL FAVTIDGAAT PTVTITSSSS ALKAGETATI
TFTFSADPGA SFVATDIVTT GGTLGDLSGT GRVRTATFTP TASLKFGSAS ITIAVRNYTD
AAGNTGSAGT TPTITIDTLA PTVAITSSTS ALKAGETATI TFTFSEDPGT SFVATDIVTT
GGTLEDLSGT GRVRTAKFTP TANLNFGSAS ITIAVRNYSD TIGNTGGAGT TPKITIDTLA
PTVAITSSTS ALKAGETATI TFTFSEDPGT SFVATDIVTT GGTLEDLSGT GRVRTAKFTP
TANLNFGSAS ITIAVRNYSD TIGNTGGAGT TPKITIDTLA PMVVITSSAS ALKAGETATI
TFTFSEDPGT SFVATDIVTS GGTLGTLSGT GLVRTAMFTP TANLANGSAS ITVAAGNYAG
PAGNTGSAGT TPVVTIDTLA PTLSSSIPAD NAMAVLVGAN IVLNFSESVT AVAGKNIVLH
NVTDSTTTTI AANDAQISIV AGVVTINPTA DFLNGKNYYV TVDAGAFIDG AGNDYAGIAD
ATLLNFTITP DVTAPTLSSS IPADNAVAVA VGANIVLNFS ESVTAVAGKN VVLHNVTDST
ITTIAANDAQ VSIVADVVII NPTADFLNGK DYYVTVDAGA FIDGAGNGYA GITDATLLNF
TITPDVTAPT LSSSIPADNA VAVAVGANIV LNFSESVTAV AGKNVVLHNV TDSTTTTIAA
NDAQVSIVAG VVIINPTADL LNGKDYYVTV DAGAFIDGAG NGYAGIADAT LLNFTITSDV
TAPTLSSSIP ADNALAVAVD ANIVLNFSES VTAVAGKSVV LHNVTDSSTT TIAANDAQVS
IVAGVVIINP TADFLNGKDY YVTVDAGAFI DGAGNSYAGI ADAATLNFTT TPDVTAPTLS
SSIPADNAVA VAVGANIVLN FSESVTAVAG KNVVLHNLTD STTTTIAAND AQISIVGSVV
TINPTADFLN GKNYYVTVDA GAFIDGAGNG YAGIADAVTF NFTTTPDVTA PTLSSSVPAD
NATAVALGTN IVLNFNESIT AVAGKSVVLH NVTDSTTTTI AANDAQISII GSVVTINPTA
DFLNGKDYYV TVDAGAFIDG AGNSYAGIAD AATLNFTTTP DVTAPTLSSS IPADNAVAVA
VGANIVLNFN ESVTAVAGKS VVLHNVTDST ITTIAANDAQ ISIVGSVVTI NPTADFLNGK
DYYVTVDAGA FIDGAGNGYA GITSATALNF TTTPDVTAPT LSSSVPADNA LAVALGANIV
LNFSESVTAV AGKNIVLHNV TDSTITTIAA NDAQISIVGS VVTINPTTDF LNGKDYFVTV
DAGAFIDGAG NGYAGITSAT ALNFTTTPDV TAPTLSSSVP ADNAVAVSVG ANIVLNFNES
VTAVAGKNIV LHNVTDSTTT TIAANDAQIS IIGSVVTINP TANFLNGKDY YVTVDAGAFI
DGAGNGYAGI TSATALNFTT TPDVTAPTLS SSVPADNALA VAVGANVVLN FNESVIAVAG
KNVVLHNVTD STTTTITAND AQISIVGSVV TINPTANFLN GKDYYVTVDA GAFIDGAGNG
YAGIADAVTL NFTTTPDVTA PTLSSSVPAD NALAVAVGAN VVLNFNESVT AVAGKNVVLH
NVTDSTITTI AANDAQVSIV AGVVTINPTA DLLNGKDYYV TVDTGAFIDG AGNGYAGIAD
PTALNFTITP DVTAPTLSST VPADNATAVA LGANIVLNFS ESVTAVAGKN VVLHNVTDST
TTTIATNDAQ VSIVAGVVTI NPTADFLNGK DYYVTVDAGA FIDGAGNGYA GIADTVTLNF
TTTPDVTAPT LSSSIPADNA AAVALGANIV LNFNESVTAV AGKNIVLHNV TDSTTTTIAA
NDAQISIIGS VVTINPTANF LNGKDYYVTV DAGAFIDGAG NGYAGITSAT ALNFTTTPDV
TAPTLSSTVP ADNAAAVALG ANIVLNFNES VIAVAGKNVV LHNVTDSAIT TIAANDAQIS
IVGSVVTINP TANFLNGKDY YVTVDAGAFI DGASNGYAGI ADTVTLNFTT TPDVTAPTLA
SSIPADNAMA VLVEANIVLN FSESVTAVAG KNIVLYNMTD SAITTIAAND AQISIIGSVV
TINPTADFLN GKDYYVTVDA GAFIDGAGNS YAGIADAATL NFTTFLVVPP PDLIPPTLSS
SVPADNAMAV LVGANIVLNF NESVTTVAGK NVVLHNVTDS TITTIAANDA QISIVGSVVT
VNPTTDFLNG KSYYVTVDAG AFIDGAGNSY AGIADPTALN FTITPDVTAP TLSSTVPADN
AVAVAVGANI VLNFNESVTA VAGKNIVLHN VTDSAITTIA ANDAQISIVA GVVTINPTAD
FLNGKDYYVS VDAGAFIDGA GNGYAGIADT VTLNFTTTPD VTAPTLASSV PADNAAAVAM
GANIVLNFNE SVTAVAGKNI VLHNVTDSTI TTIAANDAQI SIVAGVVTIN PTADFLNGKD
YYVTVDAGAF IDGAGNAYAG IADPTALNFT TTPDVTAPTL ASSVPTDNAA AVAVGANIVL
NFNESVTAVA GKNIVLHNVT DSTTTTIAAN DAQISIVAGV VTINPTADFL NGKDYYVTVD
AGAFIDGAGN GYTGIANAAT LNFTTTPDVT APTLSSSIPA DNAVAVAVGA NIVLNFSESV
TAVAGKNIVL HNVTDSAITT IAANDAQVSI IAGVVTINPA ADFLNGKNYY VTVDAGAFID
GAGNGYAGIA DPTALNFTTT PDVTAPTLAS SVPTDNAAAV AVGANIVLNF NESVTAVAGK
NVVLHNVTDS TTTTIAANDA QISIVGSVVT INPTADFLNG KDYYVTVDAG AFIDGAGNSY
AGIADVATLN FTTTPDVTAP TLSSSVPADN AAAVAVGANI VLNFNESVTA VAGKNVVLHN
VTDSTITTIA ANDAQISIVA GVVTINPTAD LLNGKDYYVT VDAGAFIDGA GNAYAGIADP
TALNFTTTPD VTAPTLSSTV PADNAAAVAL GANIVLNFNE SVTAVAGKNV VLHNVTDSAI
TTIAANDAQV SIIAGVVTIN PAADFLNGKN YYVTVDAGAF IDGAGNGYAG IADTVTLNFT
TTPDVTAPML SSSVPADNAA AVALGANIVL NFSESVTAVA GKNIVLHNVT DSTTTTITAN
DAQVSIVAGI VTINPTTDFL NGKDYYVTVD AGAFIDGAGN GYAGIADTVT LNFTTTPDVT
APMLSSSVPA DNAAAVALGA NIVLNFSESV TAVAGKNIVL HNVTDSTTTT IAANDAQISI
VAGVVTINPT TDFLNGKNYY VTVDSGAFID GAGNGYTGIT DPTALNFTTT PDVTAPTLSS
SVPADNAAAV AVGANIVLNF NESVTAVAGK NIVLHNVTDS TITTIAANDA QISIVAGVVT
INPTADFLNG KDYYVTVDAG AFIDGAGNGY AGIADAATLN FTTTPDVTAP TLSSSVPADN
ALSVALGANI VLNFNESVTA VAGKNIVLHN VTDSTTTTIA ANDAKVSIVG GVVTINPTAD
FLNGKNYYVT VDAGAFIDGA GNGYAGIADA ATLNFTTTPD VTAPMLSSSV PADNAAAVAV
GANIVLNFNE SVTAVAGKNI VLHNVTDSTT NTIAANDTQV SIVAGVVTIN PTADFLNGKN
YYVTVDAGAF IDGAGNGYAG MADTTLLNFT TTPDVTAPTL SSSVPADNAT AVALGANIVL
NFSESVTAVA GKSVVLHNVT DSITTTIAAN DAQVSIVAGV VIINPTADFL NGKDYYVTVD
AGAFIDGAGN NYAGIADAAT LNFTTTPDVT APTLSSSIPA DNALSVAVGA NIVLNFNESV
TAVAGKNVVL HNVTDSTTTT IAANDAQISI MGSLVTINPT ADFLNGKDYY VTVDAGAFID
GAGNGYAGIA DPTLLNFITA PDVTAPTLTS SVPADNATAV SVEDNIVLNF SENVLANTGY
IVLKATADNA IIESFNTATG QGNHGGTVTV TGVSVTVDPM AYLTANTGYY VTVDSTAVKD
VVGNNYAGIV SSTELNFTTP TPTSYNLTTF ADIAPAFVGT VGDDIFNGTY GDGAGPYTLD
ATDVLNGGTG VDTLSITTGA EASTPPDSLW ANKTNFEKVE FHSTGAGAQS ITTGVNFNTA
FAGHVDLIVE TYNGATTIEM QAFDGTSTLV ATTTLDGAQT ITTSNTHAAI VKAINSAAGA
QTISGQFLTE VQATINGAGA QTIGNALGGG SHLINVTATV LGAGDQTITT TSTGNATVNA
TCTTGTQRIV TGVGNDSVTA HSTTASNNVI TTDAGNDTII AGQGNDSITG GLGSDSMTGG
GGTDTFVFGA NGSIVGASMD IITDFNNAGA DILTFGGNTT VLAADASVLV AGTNVQTSDG
GLITFDVSDN TLAFKIAAVE ADAQLDVAGS VAMFVDSGNT YLYYAGIAAG NLDDQVIQLT
GITTFITITG GPTTTII