Gene Cpha266_1846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1846 
Symbol 
ID4571188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2125986 
End bp2140616 
Gene Length14631 bp 
Protein Length4876 aa 
Translation table11 
GC content53% 
IMG OID639766428 
Productputative outer membrane adhesin like proteiin 
Protein accessionYP_912286 
Protein GI119357642 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat
[TIGR03660] T1SS-143 repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGT ACACTACTAC TCCGATGGCA ATCGCTGCCA CAGTTACCGG TATAGTCTGG 
GTTCAGTCTA AAGATGGAAC CCGGCGTCGG CTTTATGAAG GAGATAAGGT CTATGAAGGG
GAGGTCATCA TTACCGAACA AGGTAGTACG GTCGAGTTCA AAACCCCGAG TGGCAATACG
CTTAAAGTTC TCGGCAATCA GGAGATTGTT CTTGCTGCAG GACTTTTTGA TAATCCCGGT
GATTCTGACC AGAGTGATTC CGGTGCACCG GTTGAGGTAG TGAGTGTTAT CGGCAAGGTC
TGGGTTCTGG ATGAAAACGG AACCCGGCGA CGGCTTCACA AGGGTGACAC GGTTCACGAA
GGTGACACCA TCATCACCCA GAGCGGGAGC AAGGTCAGAC TCAGGAATGC AGATGGAACA
ACGCTGAGCG TTACCGGTAA TAAAGAAACG GTTCTTTCGT CAGGAATTTT TGATAATCAG
AACCTGTTTG ATCCTCTCCA GAGAGATTCC GTCACATTCC CTGAGAGTAC AGCAAGAAAT
ATCAGGGATA ATCAACAGGA TTCGTCAAGC CCCTACACTT CGCCTGACGG AGATCATGGA
CACGGATATA TCCGTGCGCC ACGTATTCTG GAGAGTGTTG TACCTGTGCC TTATCGTTAC
TGGTCAAATA CCGGCAATTA TGTCTCATCA TTTGGTGCCC GCCTTGAATC TCCCGACGCT
CTGCTTGGGG GGCGCGCAAC CACTGATGAA AGGCTCGTCA TGTATACCAC CCCGCTTACT
TTTGATTATG AGCAGTCCGG CTATGTGTTG AGAGAGTTCG AAGGTGAAGG TGAAGAAAGA
ACTCCCAACT ATCTTCCAAA GGCATTTGGT GAAACAGCGG TGGTTGTTGA GGGTGAAAAC
ACGATAACCG GCAATCTTCT TTTGAATGAT GAAAACGGCA ACGGGCCTTC GAAGGTTTCG
GCAATAACCT ATTTCCCTGA AGGCGGAGGT GCTCCTGTTA CCGAATCGGT TCCTGCAGGC
GGTTCATTAA CCGCCAATAC GCAATACGGA TCATTTACCA TCAACAGTGA TGGAACGTGG
AGTTACCTGA GCGATCCTAC GGAAACGCAT GGAGCGGACA ATGTCCTTAA AGATCCGATC
GTCTACACCG TCAGCGATAT TGATGGAGAC ATAGCGTTAT CCAGCCTGGT TATTGATGTT
CTCGACACCT TTCCTGTCAT TGGCACTCCC TTGCCCTCAT CCGTTGACGA GGATGACCTT
GACAATGCAC AATCTGTCGG TACTGATCCT GTCAAGGAGT CGGTTACGGT CGGCGGATCG
CTTGGTGTTG TTCCTGGCGA AGACCCTATC GACACCTATT TCACCTCGCA AGATGCGCCG
ACGGGGTTGA CCTCGGGCGG CAAAGAGGTG AAATATTATG TAAGTGGTGA CGGCCATACC
CTGATTGCCT ATACGGGCAG TTCAGTAAAT CCTGACGGTA CGCCTGTTGA CAAGGTTTTC
AGTGTCGAGA TTATCGACCC GTATGACCAG AGCGGATCAC AGCGCTATGA ATTTACCCTT
CTCGATCAGA TAGACCATCC TGCAGCGGCT GGCGAGAATA CGATGGATTT TACTTTTGAT
TTTCAGGTCA GGGATAGCAA CGAAGGTACT CTCGACACCG ATAATGGGTC GTTTACCGTT
ACTGTGGTTG ACGACGTACC GATTCTTAAA GGTCCATCGA CGAGCCTGAC AGCCATAAGC
GGAAAAGTCT ACGAAGACGC ACTCGGTCAC GAAAGCGACA GCGGCGACCT CTCGACGGGC
AACCAGGAAA CCATCCCCGG CGTAACGCAG GTAGCCCAGT TGAGCGGAAG CGGAACAAGC
GGCAGTCAGG CAAGCCTGTC GACACTGTTC AGCATAGGAG CCGACGAACC AGCGCTCAGC
TACCGCCTGA CGACCGACAG CAGCCTGTTA AGCGCAGCGC AGTCCGGACT GACCTCGAAC
AGCGAAACGG TGTTGTATAG CGTCAACTCC GACGGCAGCC AGTTAACAGC GACAGCCGGA
ACGCGGACGG TATTCACGCT GAACGTCTCA AGCAGCGGCA CCTGGAACTT CGACCTCGAA
GACCAGCTCG ACCACAGCGG AAGCGACAGC GACAGCGAAA CGATGCTGGC CAACAGCCAG
AGCCTGCTGA ACTTCACCAA ACTGATCGAA GTAACCGACG CCGACAACGA CACGGTAAAC
CTCGGAACCC TTGGCGGAAG CAACAGCCAG CTCTTTACGG TAACGGTAGA AAACGACATT
CCGACCCTGA AAGGTCCATC GACGAGCCTG ACAGCCATAA GCGGAAAAGT CTACGAAGAC
GCACTCGGTC ACGAAAGCGA CAGCGGCGAC CTCTCGACGG GCAACCAGGA AACCATCCCC
GGCGTAACGC AGGTAGCCCA GTTGACCGGA AGCGGAACAA GCGGCAGTCA GGCAAGCCTG
TCGACACTGT TCAGCATAGG AGCCGACGAA CCAGCGCTCA GCTACCGCCT GACGACCGAC
AGCAGCCTGT TAAGCGCAGC GCAGTCCGGA CTGACCTCGA ACAGCGAAAC GGTGTTGTAT
AGCGTCAACT CCGACGGCAG CCAGTTAACA GCGACAGCCG GAACGCGGAC GGTATTCACG
CTGAACGTCT CAAGCAGCGG CACCTGGAAC TTCGACCTCG AAGACCAGCT CGACCACAGC
GGAAGCGACA GCGACAGCGA AACGATGCTG GCCAACAGCC AGAGCCTGCT GAACTTCACC
AAACTGATCG AAGTAACCGA CGCCGACAAC GACACGGTAA ACCTCGGAAC CCTTGGCGGA
AGCAACAGCC AGCTCTTTAC GGTAACGGTA GAAAACGACA TTCCGACCCT GAAAGGTCCA
TCGACGAGCC TGACAGCCAT AAGCGGAAAA GTCTACGAAG ACGCACTCGG TCACGAAAGC
GACAGCGGCG ACCTCTCGAC GGGCAACCAG GAAACCATCC CCGGCGTAAC GCAGGTAGCC
CAGTTGAGCG GAAGCGGAAC AAGCGGCAGT CAGGCAAGCC TGTCGACACT GTTCAGCATA
GGAGCCGACG AACCAGCGCT CAGCTACCGC CTGACGACCG ACAGCAGCCT GTTAAGCGCA
GCGCAGTCCG GACTGACCTC GAACAGCGAA ACGGTGTTGT ATAGCGTCAA CTCCGACGGC
AGCCAGTTAA CAGCGACAGC CGGAACGCGG ACGGTATTCA CGCTGAACGT CTCAAGCAGC
GGCACCTGGA ACTTCGACCT CGAAGACCAG CTCGACCACA GCGGAAGCGA CAGCGACAGC
GAAACGATGC TGGCCAACAG CCAGAGCCTG CTGAACTTCA CCAAACTGAT CGAAGTAACC
GACGCCGACA ACGACACGGT AAACCTCGGA ACCCTTGGCG GAAGCAACAG CCAGCTCTTT
ACGGTAACGG TAGAAAACGA CATTCCGACC CTGAAAGGTC CATCGACGAG CCTGACAGCC
ATAAGCGGAA AAGTCTACGA AGACGCACTC GGTCACGAAA GCGACAGCGG CGACCTCTCG
ACGGGCAACC AGGAAACCAT TCCCGGCGTA ACGCAGGTAG CCCAGTTGAG CGGAAGCGGA
ACAAGCGGCA GTCAGGCAAG CCTGTCGACA CTGTTCAGCA TAGGAGCCGA CGAACCAGCG
CTCAGCTACC GCCTGACGAC CGACAGCAGC CTGATAAGCG CAGCGCAGTC CGGACTGACC
TCGAACAGCG AAACGGTGTT GTATAGCGTC AACTCCGACG GCAGCCAGTT AACAGCGACA
GCCGGAACGC GGACGGTATT CACGCTGAAC GTCTCAAGCA GCGGCACCTG GAACTTCGAC
CTCGAAGACC AGCTCGACCA CAGCGGAAGC GACAGCGACA GCGAAACGAT GCTGGCCAAC
AGCCAGAGCC TGCTGAACTT CACCAAACTG ATCGAAGTAA CCGACGCCGA CAACGACACG
GTAAACCTCG GAACCCTTGG CGGAAGCAAC AGCCAGCTCT TTACGGTAAC GGTAGAAAAC
GACATTCCGA CCCTGAAAGG TCCATCGACG AGCCTGACAG CCATAAGCGG AAAAGTCTAC
GAAGACGCAC TCGGTCACGA AAGCGACAGC GGCGACCTCT CGACGGGCAA CCAGGAAACC
ATTCCCGGCG TAACGCAGGT AGCCCAGTTG ACCGGAAGCG GAACAAGCGG CAGTCAGGCA
AGCCTGTCGA CACTGTTCAG CATAGGAGCC GACGAACCAG CGCTCAGCTA CCGCCTGACG
ACCGACAGCA GCCTGTTAAG CGCAGCGCAG TCCGGACTGA CCTCGAACAG CGAAACGGTG
TTGTATAGCG TCAACTCCGA CGGCAGCCAG TTAACAGCGA CAGCCGGAAC GCGGACGGTA
TTCACGCTGA ACGTCTCAAG CAGCGGCACC TGGAACTTCG ACCTCGAAGA CCAGCTCGAC
CACAGCGGAA GCGACAGCGA CAGCGAAACG ATGCTGGCCA ACAGCCAGAG CCTGCTGAAC
TTCACCAAAC TGATCGAAGT AACCGACGCC GACAACGACA CGGTAAACCT CGGAACCCTT
GGCGGAAGCA ACAGCCAGCT CTTTACGGTA ACGGTAGAAA ACGACATTCC GACCCTGAAA
GGTCCATCGA CGAGCCTGAC AGCCATAAGC GGAAAAGTCT ACGAAGACGC ACTCGGTCAC
GAAAGCGACA GCGGCGACCT CTCGACGGGC AACCAGGAAA CCATCCCCGG CGTAACGCAG
GTAGCCCAGT TGACCGGAAG CGGAACAAGC GGCAGTCAGG CAAGCCTGTC GACACTGTTC
AGCATAGGAG CCGACGAACC AGCGCTCAGC TACCGCCTGA CGACCGACAG CAGCCTGTTA
AGCGCAGCGC AGTCCGGACT GACCTCGAAC AGCGAAACGG TGTTGTATAG CGTCAACTCC
GACGGCAGCC AGTTAACAGC GACAGCCGGA ACGCGGACGG TATTCACGCT GAACGTCTCA
AGCAGCGGCA CCTGGAACTT CGACCTCGAA GACCAGCTCG ACCACAGCGG AAGCGACAGC
GACAGCGAAA CGATGCTGGC CAACAGCCAG AGCCTGCTGA ACTTCACCAA ACTGATCGAA
GTAACCGACG CCGACAACGA CACGGTAAAC CTCGGAACCC TTGGCGGAAG CAACAGCCAG
CTCTTTACGG TAACGGTAGA AAACGACATT CCGACCCTGA AAGGTCCATC GACGAGCCTG
ACAGCCATAA GCGGAAAAGT CTACGAAGAC GCACTCGGTC ACGAAAGCGA CAGCGGCGAC
CTCTCGACGG GCAACCAGGA AACCATTCCC GGCGTAACGC AGGTAGCCCA GTTGAGCGGA
AGCGGAACAA GCGGCAGTCA GGCAAGCCTG TCGACACTGT TCAGCATAGG AGCCGACGAA
CCAGCGCTCA GCTACCGCCT GACGACCGAC AGCAGCCTGT TAAGCGCAGC GCAGTCCGGA
CTGACCTCGA ACAGCGAAAC GGTGTTGTAT AGCGTCAACT CCGACGGCAG CCAGTTAACA
GCGACAGCCG GAACGCGGAC GGTATTCACG CTGAACGTCT CAAGCAGCGG CACCTGGAAC
TTCGACCTCG AAGACCAGCT CGACCACAGC GGAAGCGACA GCGACAGCGA AACGATGCTG
GCCAACAGCC AGAGCCTGCT GAACTTCACC AAACTGATCG AAGTAACCGA CGCCGACAAC
GACACGGTAA ACCTCGGAAC CCTTGGCGGA AGCAACAGCC AGCTCTTTAC GGTAACGGTA
GAAAACGACA TTCCGACCCT GAAAGGTCCA TCGACGAGCC TGACAGCCAT AAGCGGAAAA
GTCTACGAAG ACGCACTCGG TCACGAAAGC GACAGCGGCG ACCTCTCGAC GGGCAACCAG
GAAACCATTC CCGGCGTAAC GCAGGTAGCC CAGTTGAGCG GAAGCGGAAC AAGCGGCAGT
CAGGCAAGCC TGTCGACACT GTTCAGCATA GGAGCCGACG AACCAGCGCT CAGCTACCGC
CTGACGACCG ACAGCAGCCT GTTAAGCGCA GCGCAGTCCG GACTGACCTC GAACAGCGAA
ACGGTGTTGT ATAGCGTCAA CTCCGACGGC AGCCAGTTAA CAGCGACAGC CGGAACGCGG
ACGGTATTCA CGCTGAACGT CTCAAGCAGC GGCACCTGGA ACTTCGACCT CGAAGACCAG
CTCGACCACA GCGGAAGCGA CAGCGACAGC GAAACGATGC TGGCCAACAG CCAGAGCCTG
CTGAACTTCA CCAAACTGAT CGAAGTAACC GACGCCGACA ACGACACGGT AAACCTCGGA
ACCCTTGGCG GAAGCAACAG CCAGCTCTTT ACGGTAACGG TAGAAAACGA CATTCCGACC
CTGAAAGGTC CATCGACGAG CCTGACAGCC ATAAGCGGAA AAGTCTACGA AGACGCACTC
GGTCACGAAA GCGACAGCGG CGACCTCTCG ACGGGCAACC AGGAAACCAT TCCCGGCGTA
ACGCAGGTAG CCCAGTTGAC CGGAAGCGGA ACAAGCGGCA GTCAGGCAAG CCTGTCGACA
CTGTTCAGCA TAGGAGCCGA CGAACCAGCG CTCAGCTACC GCCTGACGAC CGACAGCAGC
CTGATAAGCG CAGCGCAGTC CGGACTGACC TCGAACAGCG AAACGGTGTT GTATAGCGTC
AACTCCGACG GCAGCCAGTT AACAGCGACA GCCGGAACGC GGACGGTATT CACGCTGAAC
GTCTCAAGCA GCGGCGCCTG GAACTTCGAC CTCGAAGACC AGCTCGACCA CAGCGGAAGC
GACAGCGACA GCGAAACGAT GCTGGCCAAC AGCCAGAGCC TGCTGAACTT CACCAAACTG
ATCGAAGTAA CCGACGCCGA CAACGACACG GTAAACCTCG GAACCCTTGG CGGAAGCAAC
AGCCAGCTCT TTACGGTAAC GGTAGAAAAC GACATTCCGA CCCTGAAAGG TCCATCGACG
AGCCTGACAG CCATAAGCGG AAAAGTCTAC GAAGACGCAC TCGGTCACGA AAGCGACAGC
GGCGACCTCT CGACGGGCAA CCAGGAAACC ATCCCCGGCG TAACGCAGGT AGCCCAGTTG
AGCGGAAGCG GAACAAGCGG CAGTCAGGCA AGCCTGTCGA CACTGTTCAG CATAGGAGCC
GACGAACCAG CGCTCAGCTA CCGCCTGACG ACCGACAGCA GCCTGATAAG CGCAGCGCAG
TCCGGACTGA CCTCGAACAG CGAAACGGTG TTGTATAGCG TCAACTCCGA CGGCAGCCAG
TTAACAGCGA CAGCCGGAAC GCGGACGGTA TTCACGCTGA ACGTCTCAAG CAGCGGCACC
TGGAACTTCG ACCTCGAAGA CCAGCTCGAC CACAGCGGAA GCGACAGCGA CAGCGAAACG
ATGCTGGCCA ACAGCCAGAG CCTGCTGAAC TTCACCAAAC TGATCGAAGT AACCGACGCC
GACAACGACA CGGTAAACCT CGGAACCCTT GGCGGAAGCA ACAGCCAGCT CTTTACGGTA
ACGGTAGAAA ACGACATTCC GATTGCAACT GACGATTACA AAGTTGGCGA ATCTCCCAAA
AATCAGGCAT GGTGGAGTGT TGAGGAGGGT GGTGTAACAG ATGTTGCCTT GCCTGATTTT
CCTGGGGCAT TGGCAAGTAA TAAAGCCTCA GGCAATCTGC TTTCAGGCAC AGTGATCTCT
GGCAGTTTGA GCGGGGTCGA TCTGCCAGGT GCTGACGAAA CGATCGCACT GGTATCATTC
ACCTACAAGA ATGAAGCTGG AACTGTAGAG ACAGGTACGG TTGGTCAGTG GGCAAATACC
CAGTACGGTT GGCTCAAGGT GATGGCAAAT GGAGATTTTG AGTATATCTC CGATTCTTAC
AGTACTCATA CCGCAAGTAA CTTCCTGAAT GAGGAGTTGA CCTATACGGT TCGGGATGCC
GATGGCGATC TTGCAACAGC GCAGTTTAAA ATAAAAATTA CCGATACCGA TTCAGACATC
GATTCTCCTG ACCAGTCAGA TATTTATGAG AAATATCTGC CGGGAGGAAG TGCGCCAGAT
GCAACAAAGA CATTCGGAAT TGAGGACATA ATAGTTGATC AGGGAAAAGA TCCAAAAGAG
CTCTTTTTTG ATTATGCGGT TGTCGGAGGT GTTCCTGACA TTGGCAGCAA ACCAATCACC
ATAAGTGCAG AGGAAAGTGA TCCTCCTCTT TTTAGTGAGT TGACTTCAAA AGGCAGGGCG
CTGACCTATA CGTTGAGCTC AGACAAGCGA ACCCTGACGG CTTCCACAAC GGTTGGAAGC
GAGACAGTGT TTACGGTCGA GATTTTAGGG AATATGGACT TTAACGGTAC TCCGCAATAC
AAGTTCACGC TTTACCAGCC GCTTGATCAT ATTCCCGATC TGCCAGGCGA TCCGGGTACA
GAGATACGTG AAGACGAGAT CGATCTTATG TTTGATCTGG CAATCTGGGA TAAAAACGAG
CAGCTTGACA AGACGACCGT ACAGATTCCG ATAACGATTT ATGATGACAA TACGTTGCCG
ACACCCAAGG CAATGACCGT TGTGGAGGAT AGCCCGACTG CCGCCAATAC CATTACCACG
AGCGCCGATG CAACATCATT GAACACATCG GTTCCAGCCA AAGGCGACAC AAATGGGCCG
AAATATGGTA CGGCAGTCGT AAACGCGAAT GGTACGATTA CCTATACTCC TGATGGAGAT
TACAGCGGAG AGGATTCCTT CGTTTATACA CATATCGATG AGGACGGAAA TTCTCATCAG
GTGACGGTGA ATGTTACAGT TACGCCGGTA TCGGATGAGC CGGATATGGC GGATACGTCT
ACCGAAACAC GAGAAGATCA GCCCATTCCG CTTGGTCTGG TTGCGCCAGC GATAACGGAC
GAGACAGATC TGAATGGTGC TGCCATAGCA GGTGATGATT CGGAACGGTT CGGGCTGATT
ACCCTTGGAG GAATTCCTGA AGGCGCACAG TTGCTGAAAT CGGATGGAAC GGTGTTGTTT
ACCGGCACAG CGACTGATAA TGATGTTACG ATCCGGTTGA CGGATGGTCT GTATATGAAT
GCAATACCGG TTGCTGAATT GATGACACTG AACACAACGG ATTTTAACGC GCTCAAGATA
TTGCCTCCTC CCGACAGCAA TGTCAATTTT ACGGTCACGA TGTCGGTGAC TGAGTATGAG
GTTGATGGAT CCGGAAATCA GAAAACGGTC GGTGCTGCAC TGGTTCCCGG TGTAACAAAA
ACGGTGAATG ACGTTATTAC CGTCAAGGCG GTCACCGATC GTGTCGATCT TCAATGGAAG
ACGACGGCCC CGGATACCAC GACGGACCCG TCACATCCCG ACTCGTTTGT CGATCCGCTG
CCGACAAGAA CTCTCGATAC TGAGGACAGA GACTATATGT ATACGACTCT TGAGGATGGC
ACGCTGAACA AGTGGATTGC TGAAGATTCG ACATTCAATC TGAAGAGTTT ACTTGAGTAT
ACCGCTGTTG ATCCATTAGA CACCAATAAC ACTGTTCTTG GTAATCCCGC TGAAAATACA
AGAGCTGGAG ATACAAGTGA AACGCGTCAA ATAGTTCTGA GCAATCTGCC TGTTGGAGCC
GTGGTTAACG GGACAACCAT TGACGCCACT GGTACTATTT CCATCCTGCT GGCAGACGGC
AAAACCCTGC CTGATATCAC GATGACGCCT CCGAAAGATT TCAGCGGAGA TCTGAAGGAT
ATCAAGGTTA CCCTGACAAC GATCGATACA GAAAGTGTTG CTGCAGAAAG CGGTCTGATT
GTCCAGGAAG AGGACTATGT CATGCTCAAT CTGTATGTTA AACCGGTAGC AGACGACATT
GTGTCTCCTG ATGTATCGAC ACCCGAAGAT ACCAGCGTGA AGTTCATGCT TGGTCTTGTG
CCGACGGATA CGTCAACCGA TCCGCTCATC GGCGGCGAGG AGGTTATCAC CGGCATAACG
ATAAAGGCTC TCCCTGCGGG ATGGAAGCTG TACGACGAGA ATGGTGTGCT TCTGACAACC
GGCACCGGAG CGGATTACAC AGTCGCTTCC GGGGATGTAA CAACACCGTA TACTGGAGAC
CCGACAGGTA AAACCTTCAA TTATCAATAC TACACGATTA AGCCGCCGGG TCATTCGAGC
GCGGACATTC CTGAACTGAG TATTGAAGTG ACCTCTACTG ATACGAGCGA TCCGGATGGC
GACGGGATTG CGGATATGGT GGATACCAGG ACGGTTATTC ACAAAATGAA GGTTACCGTT
ACCCCGATGG GCGAGAAAAT CGGTGCCGAC ACCGATGAAG ATGGCCAGAC CGATCTTCCG
CTTGTAACGG CTGTTGCTAA TACCGACGAA AACCTCAGTG CCGATCTGAA GATGAATCCG
TCGCACGACT ATGGGGTCGT CACCGCTACG GAGGATGAGT GGTTCAAGCT TCATCGAGTC
AGTATTGCAG GTGATCTGGG TAACTTCGAT CTTAAGACTC CGTGGAGTAA CGAGGACACA
ACGACGTTAT CCTATCAGGA TAGTTCGGAG AAAACGTATG CGCTGTTCAC TCCGCAGGAT
TCGTCCGGCA TTGATCTTAC GGATTCCTGG TTCAAATACA ATGATGGATC AGCAGACCAG
ATCAGGATGT ATTCAGGAAC CCCGATAGAA GTACCCGTTG AATTCCTTGA TACACTGGAG
TTCAAACCTC CGGAACATGT CGCTGCGGAT AATATCAAGA TTATTGTCAA TGCCAAGACC
GTCGATACCG ATCAGGATCC GGGTGGCACA GTCAATACCC AGATTACCGG AAAGGTTGTG
CTGACTCTTG ACATAGTGCC GGCAGCCGAT CAGGTGACGC TTGCTGCCTA CAGTCCGGCC
GGGTATGAGG ACGGATCTTC AAGCGCTCTG AATCCATCAC CAGCTACACC GGCATTGCCG
ATTCCGCTTT ACATCAATCC GCAGAGCGAC GATACGGATG GTTCTGAAAC GTTTGATATT
AAGATCAGCG ATATACCTGA CGGAGCAAAG ATCATGTACG GCGGTTCCGA GTTGTCCATC
AGTTCAGGCA GCGTTGATAT CCCCAATTTC AATTCATCAA TAGAGCTGGG GATTATTCCG
CCGGCAAACA GCGACGATGA TTTTGTTCTT GAGGTGACCG GCACATCAAA AGATGGTTTG
AATCTGAGCC CCGGAGTTGT CCTCAATCTG CCGGTTCAGG TTGCCGGAGT TGCCGATACG
GCAACGTTGT TAGTCAATGA ACCGACATTT GTGGAAAATA ATGTTGATAC GACAGGAAAT
CACAGGATTT TGTTGAATTC AGCTATTACC GGTTCAGCCA TGGTTGATGG TGTTGATGGT
TCAGAGAGCC TGACAGTCAA GATAACAGGT CTCGACAGCA AGTTCGACAT CGAAGGGGCT
ACCTATCTCA GTGGTGATGG CACAGGGAGA ATCTGGGTGA CCTCCGCTCT TTCAGGAGTC
AACATCGTTG TGCCCAACAA CTACAGCGGC GATATCACCT TTACGGGAAC ACCGGTAACC
TCGGAACGGG AAGGAAGCGT CTTGACCGGC ACTGCTCAAA GCTTTACAAT CCATGTTACC
CCGTCGCCGG AATCGGAAAT GGTTCTGGAT ACGGAGTTCA ATGAAGATCA GTTGACGCGG
GTCATGTTCG ATATTGTTCA GGGACCAAGT GCCGATCCGG ATTTGGATGA ACAGTTGCAA
AAGGTTTATC TCAAGGTTGG CGGTACTGGT GTGCCCGACG GCGTTGAAAA CAGGGAGTTT
ACCCTCTATT ACGGTGCGGC GGGCACCAAA ACGCTTGCAC AGGCAGTATT AGCCGACGAT
ATCAGGATCG TGACGATAAG TGGAGTTGTT GCTTATGAAC TGGATGGTAC AGCGGATTAC
AACAACATCT ATGTTCGGTA TGGTGCCGAT ACGGACGGGG AATCTACCTT TAAGGCAAGC
TATGATATTT TCGATCCATC GACAGGTTTG ACAACAACCT GTTGTAACGA CTATACCCTC
ACGGTCAAAG CGGTTACCGA TCCGATAACC GAGAGTCTCG AAACCATTGA TCCTGATACG
AACCATACTG TTACCGTGAA TGCTGGCTAC CAGACCGTAT CAGCAACCGG TTATACGATC
TTTACCGTTC CGGTAACTGT CTCGCAGGTT GACGAGGTTT CCGAAGGGCC AAACGGAGTC
GATGACGACA CGAGTGAAAC GCTGGTGAAG TTTGTGATCG ACGGCGTGCC CCAGGGGGTA
TCGGTAGAGG GTGCCGTTTA TGCCGGCGAT GTGTGGGATG CAGGTACTTT GACCTATGTG
AATTCCGGAC GATGGATTGT GTATGACGGC AGGACGTTTG ATGGATCTCC AGATCTGACA
AAGGATATTG TGTTCAATGT TGATGGTACG GCGGACGAGC TTAAAGGACT CAACCAGCCG
ATCACCATTA TGGCCTATAG TCAGGATATG GGTACGGATC CCGCTCTTGC ATCGTCAGAT
GTAGCGTTTG CTTCGGCAAC CTTTACGCTG GTAACACCAG CTGTTCCGGC CGACTTTAAT
GAAACCGGAC GCCCGACAGA TGTGCCGCCA ACCGTAACAG ACTGGAGAAT CGACGCAAGC
TTTGTTCCGG TTGAAGATAA GCCTGCAGCG CTCAGCGAGC TTGTGAAAAC GCCGACGGTT
TCAGGTACAG GATTAGAGCC CTTCACCGTA ACGCTTTCCG GTCTGCCTGC CGGTTCGGTG
GTCGCCGCAG TGTCAGGCAC CGCATACAGG GTGGATCAGT TCACCCAGGG AGGCAAGACG
GTCTATTCGA TTTCAGGAAG CGGTGGTACT ACAGGTTTAC AGGATCTGCT CTCAAAGGTG
ACCCTGACGA CGCCGCCTGA CGACAACAGC AACAACAGCG CGCCGATAAA CCTGGAAATG
ACCATTACCA CCTATGTTCC GGGCAGTAAT CAGGCCAATG CAGTGGCAAC GAGCACACCA
CTCGAGATCA CTCCGGTAAC GGATGCGACT ACCATAATCA TTGCGGCTCC TGATGTGCCT
GAAGATACCA GTGAAACCTT TACCATTACC TTCAGCAACA GCGCTGATAC AGCAACTTAT
ACCGATGTTC TTGCTGATAA GCTTTATATT CAGCTCGATG AGAGCGGAAT GGTGCCTGAC
GGAACCATTA ATCAGGGGAC GCTCAGCCTC GAATCCGGCG GAACGACAGT AACAGAGGTT
TTAAGCATAC CGGGAGTGAC GCCAGTTGCA GGTATGCGTT ATTTTGAAGT AACAGGTGTA
GACTCGATTT CCGGATCTGT AGAGCTTACC TACAAGCCGT ATAATTCGCC GACAGGTCAT
GCATCTGGTT CTGTTGGGCT TACGGCATAT CTGGATACGC TGGAGTCAAA CGCCAGCAAT
CAGTTGCAAA ACAGCCAGAC AGCCACCTTC GACATTACGC CGGTCAATGA TCTTTACGTT
ATCACTTCAG TGATTGCGAC TGGTTCGGAA GATGATCTTC GGATTCCTCT TGTGATAACC
GGTACCGGTC TGATCGACAC GGATGGTTCC GAAAAGGTGG TAAGCGCCCT GCTTGAAAAT
GTGCCCGATG GTTATCTGGT CTATTATGGT GTCGACAGCG CTTCTGCGAC ATTGGCGCTG
AATGTCGGCG ATGACGGAAC AGGCAATACC TGGGCTCTTC CTCTGAATTC GGGCGCCCTG
CCCGCATATA TAGCGGTCAA GCCCCCTCTG CATGTGAGCG GCGATGTGAG TGGTATGCAG
TTGACGGTGT ACAGCAGGGA GAGTGAGTTG ACTCTGCTTG AAAAGACCTC CGTGCCATTT
ACGCTCAAGG TGGCTCCTGT TGCTGATGAG GTCAAACCGG ACTTTTTCAA ACCGACCAAG
ACGTTCGGCA CGGAAGGTGC GCTAATTCCG CTCAACCTCA ATCTGATTCT TGATGATCAG
GATGGTTCGG AAACGGCGAC CCTGACGTTT GAAGGTCTTG GCGCCCATGC TTCATTCTAT
GATCAAGCAG GCAATGAGGT TGTGCCTGCC GGTTATGATT CCGTTAACGA TATCTATACG
CTTTCCGGCA TACCTGCATA TGACCCGCTG GGTAAATATG ATGTGAACAA TCTCTTTGTG
GTTCAGTCGG CCCGAACCGG AACGGTTGCC GTAACAGCAT ATACCGTCGA TACGGCATCC
GGATATCTGC CTGTTGATTC AAGTGGTTCA ACACCAACCG CCTCGTTTAC CCTCGATATA
TCGGAGAAGG TTCCAACAGC CGGTGCCGAT ACCCTGCTGT ACGACGGGGA TGCTGATGTT
GCCAATACCA GAAGCTATGA CGGGCTTGCC GGCGAGGATA CGCTGGTACT CCGGAAGGGT
GAGGGTATTG ATTTTGCGAC GGATCGATCG ATATTCAATA TCGAGAAAAT TGATATGACG
GTCAGTGGCG CCAATTCACT CACGCATATC ACCTGGGAGG ATGTTGCTGC AATGACCGAT
CCTTCGACCC ATGATCTGTA TATTCTTGGC GACGGAAGCG ATAGCGTTCA GTTTGCGTCA
ACCGGGTGGA GCAAGAGTTC TCCGGGCGGC GCTTATGACG AATACACCAA TACAAATGAT
GTGACGATCA AGGTATATGT TCAAACAGCC ATTAATGATA CAATAGTGTA A
 
Protein sequence
MSTYTTTPMA IAATVTGIVW VQSKDGTRRR LYEGDKVYEG EVIITEQGST VEFKTPSGNT 
LKVLGNQEIV LAAGLFDNPG DSDQSDSGAP VEVVSVIGKV WVLDENGTRR RLHKGDTVHE
GDTIITQSGS KVRLRNADGT TLSVTGNKET VLSSGIFDNQ NLFDPLQRDS VTFPESTARN
IRDNQQDSSS PYTSPDGDHG HGYIRAPRIL ESVVPVPYRY WSNTGNYVSS FGARLESPDA
LLGGRATTDE RLVMYTTPLT FDYEQSGYVL REFEGEGEER TPNYLPKAFG ETAVVVEGEN
TITGNLLLND ENGNGPSKVS AITYFPEGGG APVTESVPAG GSLTANTQYG SFTINSDGTW
SYLSDPTETH GADNVLKDPI VYTVSDIDGD IALSSLVIDV LDTFPVIGTP LPSSVDEDDL
DNAQSVGTDP VKESVTVGGS LGVVPGEDPI DTYFTSQDAP TGLTSGGKEV KYYVSGDGHT
LIAYTGSSVN PDGTPVDKVF SVEIIDPYDQ SGSQRYEFTL LDQIDHPAAA GENTMDFTFD
FQVRDSNEGT LDTDNGSFTV TVVDDVPILK GPSTSLTAIS GKVYEDALGH ESDSGDLSTG
NQETIPGVTQ VAQLSGSGTS GSQASLSTLF SIGADEPALS YRLTTDSSLL SAAQSGLTSN
SETVLYSVNS DGSQLTATAG TRTVFTLNVS SSGTWNFDLE DQLDHSGSDS DSETMLANSQ
SLLNFTKLIE VTDADNDTVN LGTLGGSNSQ LFTVTVENDI PTLKGPSTSL TAISGKVYED
ALGHESDSGD LSTGNQETIP GVTQVAQLTG SGTSGSQASL STLFSIGADE PALSYRLTTD
SSLLSAAQSG LTSNSETVLY SVNSDGSQLT ATAGTRTVFT LNVSSSGTWN FDLEDQLDHS
GSDSDSETML ANSQSLLNFT KLIEVTDADN DTVNLGTLGG SNSQLFTVTV ENDIPTLKGP
STSLTAISGK VYEDALGHES DSGDLSTGNQ ETIPGVTQVA QLSGSGTSGS QASLSTLFSI
GADEPALSYR LTTDSSLLSA AQSGLTSNSE TVLYSVNSDG SQLTATAGTR TVFTLNVSSS
GTWNFDLEDQ LDHSGSDSDS ETMLANSQSL LNFTKLIEVT DADNDTVNLG TLGGSNSQLF
TVTVENDIPT LKGPSTSLTA ISGKVYEDAL GHESDSGDLS TGNQETIPGV TQVAQLSGSG
TSGSQASLST LFSIGADEPA LSYRLTTDSS LISAAQSGLT SNSETVLYSV NSDGSQLTAT
AGTRTVFTLN VSSSGTWNFD LEDQLDHSGS DSDSETMLAN SQSLLNFTKL IEVTDADNDT
VNLGTLGGSN SQLFTVTVEN DIPTLKGPST SLTAISGKVY EDALGHESDS GDLSTGNQET
IPGVTQVAQL TGSGTSGSQA SLSTLFSIGA DEPALSYRLT TDSSLLSAAQ SGLTSNSETV
LYSVNSDGSQ LTATAGTRTV FTLNVSSSGT WNFDLEDQLD HSGSDSDSET MLANSQSLLN
FTKLIEVTDA DNDTVNLGTL GGSNSQLFTV TVENDIPTLK GPSTSLTAIS GKVYEDALGH
ESDSGDLSTG NQETIPGVTQ VAQLTGSGTS GSQASLSTLF SIGADEPALS YRLTTDSSLL
SAAQSGLTSN SETVLYSVNS DGSQLTATAG TRTVFTLNVS SSGTWNFDLE DQLDHSGSDS
DSETMLANSQ SLLNFTKLIE VTDADNDTVN LGTLGGSNSQ LFTVTVENDI PTLKGPSTSL
TAISGKVYED ALGHESDSGD LSTGNQETIP GVTQVAQLSG SGTSGSQASL STLFSIGADE
PALSYRLTTD SSLLSAAQSG LTSNSETVLY SVNSDGSQLT ATAGTRTVFT LNVSSSGTWN
FDLEDQLDHS GSDSDSETML ANSQSLLNFT KLIEVTDADN DTVNLGTLGG SNSQLFTVTV
ENDIPTLKGP STSLTAISGK VYEDALGHES DSGDLSTGNQ ETIPGVTQVA QLSGSGTSGS
QASLSTLFSI GADEPALSYR LTTDSSLLSA AQSGLTSNSE TVLYSVNSDG SQLTATAGTR
TVFTLNVSSS GTWNFDLEDQ LDHSGSDSDS ETMLANSQSL LNFTKLIEVT DADNDTVNLG
TLGGSNSQLF TVTVENDIPT LKGPSTSLTA ISGKVYEDAL GHESDSGDLS TGNQETIPGV
TQVAQLTGSG TSGSQASLST LFSIGADEPA LSYRLTTDSS LISAAQSGLT SNSETVLYSV
NSDGSQLTAT AGTRTVFTLN VSSSGAWNFD LEDQLDHSGS DSDSETMLAN SQSLLNFTKL
IEVTDADNDT VNLGTLGGSN SQLFTVTVEN DIPTLKGPST SLTAISGKVY EDALGHESDS
GDLSTGNQET IPGVTQVAQL SGSGTSGSQA SLSTLFSIGA DEPALSYRLT TDSSLISAAQ
SGLTSNSETV LYSVNSDGSQ LTATAGTRTV FTLNVSSSGT WNFDLEDQLD HSGSDSDSET
MLANSQSLLN FTKLIEVTDA DNDTVNLGTL GGSNSQLFTV TVENDIPIAT DDYKVGESPK
NQAWWSVEEG GVTDVALPDF PGALASNKAS GNLLSGTVIS GSLSGVDLPG ADETIALVSF
TYKNEAGTVE TGTVGQWANT QYGWLKVMAN GDFEYISDSY STHTASNFLN EELTYTVRDA
DGDLATAQFK IKITDTDSDI DSPDQSDIYE KYLPGGSAPD ATKTFGIEDI IVDQGKDPKE
LFFDYAVVGG VPDIGSKPIT ISAEESDPPL FSELTSKGRA LTYTLSSDKR TLTASTTVGS
ETVFTVEILG NMDFNGTPQY KFTLYQPLDH IPDLPGDPGT EIREDEIDLM FDLAIWDKNE
QLDKTTVQIP ITIYDDNTLP TPKAMTVVED SPTAANTITT SADATSLNTS VPAKGDTNGP
KYGTAVVNAN GTITYTPDGD YSGEDSFVYT HIDEDGNSHQ VTVNVTVTPV SDEPDMADTS
TETREDQPIP LGLVAPAITD ETDLNGAAIA GDDSERFGLI TLGGIPEGAQ LLKSDGTVLF
TGTATDNDVT IRLTDGLYMN AIPVAELMTL NTTDFNALKI LPPPDSNVNF TVTMSVTEYE
VDGSGNQKTV GAALVPGVTK TVNDVITVKA VTDRVDLQWK TTAPDTTTDP SHPDSFVDPL
PTRTLDTEDR DYMYTTLEDG TLNKWIAEDS TFNLKSLLEY TAVDPLDTNN TVLGNPAENT
RAGDTSETRQ IVLSNLPVGA VVNGTTIDAT GTISILLADG KTLPDITMTP PKDFSGDLKD
IKVTLTTIDT ESVAAESGLI VQEEDYVMLN LYVKPVADDI VSPDVSTPED TSVKFMLGLV
PTDTSTDPLI GGEEVITGIT IKALPAGWKL YDENGVLLTT GTGADYTVAS GDVTTPYTGD
PTGKTFNYQY YTIKPPGHSS ADIPELSIEV TSTDTSDPDG DGIADMVDTR TVIHKMKVTV
TPMGEKIGAD TDEDGQTDLP LVTAVANTDE NLSADLKMNP SHDYGVVTAT EDEWFKLHRV
SIAGDLGNFD LKTPWSNEDT TTLSYQDSSE KTYALFTPQD SSGIDLTDSW FKYNDGSADQ
IRMYSGTPIE VPVEFLDTLE FKPPEHVAAD NIKIIVNAKT VDTDQDPGGT VNTQITGKVV
LTLDIVPAAD QVTLAAYSPA GYEDGSSSAL NPSPATPALP IPLYINPQSD DTDGSETFDI
KISDIPDGAK IMYGGSELSI SSGSVDIPNF NSSIELGIIP PANSDDDFVL EVTGTSKDGL
NLSPGVVLNL PVQVAGVADT ATLLVNEPTF VENNVDTTGN HRILLNSAIT GSAMVDGVDG
SESLTVKITG LDSKFDIEGA TYLSGDGTGR IWVTSALSGV NIVVPNNYSG DITFTGTPVT
SEREGSVLTG TAQSFTIHVT PSPESEMVLD TEFNEDQLTR VMFDIVQGPS ADPDLDEQLQ
KVYLKVGGTG VPDGVENREF TLYYGAAGTK TLAQAVLADD IRIVTISGVV AYELDGTADY
NNIYVRYGAD TDGESTFKAS YDIFDPSTGL TTTCCNDYTL TVKAVTDPIT ESLETIDPDT
NHTVTVNAGY QTVSATGYTI FTVPVTVSQV DEVSEGPNGV DDDTSETLVK FVIDGVPQGV
SVEGAVYAGD VWDAGTLTYV NSGRWIVYDG RTFDGSPDLT KDIVFNVDGT ADELKGLNQP
ITIMAYSQDM GTDPALASSD VAFASATFTL VTPAVPADFN ETGRPTDVPP TVTDWRIDAS
FVPVEDKPAA LSELVKTPTV SGTGLEPFTV TLSGLPAGSV VAAVSGTAYR VDQFTQGGKT
VYSISGSGGT TGLQDLLSKV TLTTPPDDNS NNSAPINLEM TITTYVPGSN QANAVATSTP
LEITPVTDAT TIIIAAPDVP EDTSETFTIT FSNSADTATY TDVLADKLYI QLDESGMVPD
GTINQGTLSL ESGGTTVTEV LSIPGVTPVA GMRYFEVTGV DSISGSVELT YKPYNSPTGH
ASGSVGLTAY LDTLESNASN QLQNSQTATF DITPVNDLYV ITSVIATGSE DDLRIPLVIT
GTGLIDTDGS EKVVSALLEN VPDGYLVYYG VDSASATLAL NVGDDGTGNT WALPLNSGAL
PAYIAVKPPL HVSGDVSGMQ LTVYSRESEL TLLEKTSVPF TLKVAPVADE VKPDFFKPTK
TFGTEGALIP LNLNLILDDQ DGSETATLTF EGLGAHASFY DQAGNEVVPA GYDSVNDIYT
LSGIPAYDPL GKYDVNNLFV VQSARTGTVA VTAYTVDTAS GYLPVDSSGS TPTASFTLDI
SEKVPTAGAD TLLYDGDADV ANTRSYDGLA GEDTLVLRKG EGIDFATDRS IFNIEKIDMT
VSGANSLTHI TWEDVAAMTD PSTHDLYILG DGSDSVQFAS TGWSKSSPGG AYDEYTNTND
VTIKVYVQTA INDTIV