Gene Daro_3199 details

Gene Information

Locus tag: Daro_3199
Symbol:
ID: 3566869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3435124
End bp: 3449688
Gene Length: 14565 bp
Protein Length: 4854 aa
Translation table: 11
GC content: 64%
IMG OID: 637681670
Product: VCBS
Protein accession: YP_286399
Protein GI: 71908812
COG category:
COG ID:
TIGRFAM ID: [TIGR01965] VCBS repeat
            [TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass)
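
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP = 3435124
END_BP = 3449688

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 14565
print(protein_length_aa(length))  # 4854
```

Under those assumptions the computed values match the listed Gene Length (14565 bp) and Protein Length (4854 aa).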


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCAAG CCCAAGTCGT CGCTAAAATT TCCGAACTGT CTGGCCAGGC GTTTGCGCGA 
GATAGTGCAG GAAACACGCG CCGTCTCAAG CTTGGCGACG TGATTCGCGA AGGCGAGAGC
GTGGTGGCAG CTGATGGTGC CAAGGTCGTG CTGGTCTTGG CTGATGGCCG CGAGATGACC
GTCCGTCCGG GCGAAACGGC CAGGATTGAT GCTGAAGTAG CGGCAGCGGT CATGCCGGAT
GCCTCCGACA GTGCAGTCGT CAATGATCAA AACAGCTTTC AGAAAATCGC CAAGGCCCTG
CAGTCGGGTA GCGATCTCGA TGCCCTGCTC GAAGAAGATG CGCCGGCAGC GGGCTTGGCC
GGCCAAGGCG GCAACGAAGG GCATACCTTT GTCGAACTCC TGCGTATTGT TGAAACAGTC
GATCCGTTGG CGTATCAGTT TGGTACCAAT CGTGGCAGTC CGACTGAGAC CATCGAAGGT
GCGCCAGTCA CGGTGACCGT CGCTAGCGAG CCGGAGCAGG TGCTCTTTGC TCGCGATGAT
GCGAATGCGG TGAGCGAGCG CGTCGATAGC ACCGCTGCCA GCACCATCAG TGGGAATGTC
GTGGCTGCGG GTGCTGCAAG CGATGTTGCT GATTCTGGTT TGCCTGGTGC CGTGCTTACG
GTGACCCAGA TAGCTTTTGG TACAACGGTC GCAGCTCCGG GGACGGTGAT TGCCCTGGCG
CACGGTACGC TCACCATTGC TGCCGACGGC AGCTATACAT ACACCGTCAA TAACGCTGAT
CCAGTCGTCA ATGCCCTTAA TGTCGGCAAT CAACTGACTG AGCTGGTGAC CTACACAATT
ACCGATGGCC AGGGGCATAC CGCCCAGGCG ACGCTGACGC TGACCATCAA TGGTGTGAAC
GACAGCCCGA CCGCTACTGT TGGCGCGGCG ATAGCCTCTG CGGAAGATAC TTCGATCGTT
GTTCCGCTTT CCGGAACCGA TCCAGATAGC CCGATCGCAT TCGTGACGGT TACTACCTTG
CCGACGGACG GTTCCCTGCT TAAGGCTGAC GGTACGCCAG TCGTTGCCGG TGAGCAAATC
GCCGTCGATC CCGCTACGCA TCAGGCGCTA CTGACTTTTG TGCCCAATGC GAATTTCAAT
GGAATGGTTT CCTTCCAGTT CACGGTGACC GACACCGGTG GCCTGAGTTC GGCGCCGGCA
ACCCAGCAAA TCACCATCAA CCCGATCAAC GATGCCCCGA TTGCTGTCGT TGCCCCGGTC
TCGGGTGACG AAGACAGCAC GATTGCGGTC CCGCTGAGCG GGACCGATGT CGATGGCAGC
ATTCAGTACG TCACCGTCAC GGCCTTGCCG CCGGCGGCCC AAGGCATCCT GACCCTGGCC
GACGGCACTA CCGCCGTCGT TGCCGGCGAC CACCTGAGCC CGGCCCAAGC CGCCGGCCTG
CTCTTCAAGC CGGTCGCTGA CTACAACGGC ACGGTCGACA TTGCCTTCAC CGTCACCGAC
AACGCCAACG CCACCTCGGT CCCGGCCACC CAGCAAATCA CGATCAATCC GGTCGCCGAC
ATTGCGAACA ACACCGTCAG CACCGACGAA GACACCGCGA TCACCGTCAA TGTGCTGGGC
AACGACACCT TCGAAGGCAG CCCGAACGTC ACCGGCGTTG GCCCGGCGGC CCACGGCAGC
GTCGTGATCA ATCCCGACAA CACCGTCACC TACACCCCGT CGGCCAACTA CAACGGCAGC
GACAGCTTCA CCTACACCGT CACCAGCCCG ACCGGCATCA CCGAAACGGC GACGGTCAAT
GTCACCATCA ACCCGATCAA CGATGCCCCG ATTGCTGTCG TTGCCCCGGT CTCGGGTGAC
GAAGACAGCA CGATTGCGGT CCCGCTGAGC GGGACCGATG TCGATGGCAG CCTCCAGTAC
GTCACCGTGA CGGCCTTGCC GCCGGCGGCG CAAGGCATCC TGACCCTGGC CGACGGCACC
ACCCCGGTCG TTGCCGGCGA CCACCTGAGC CCGGCCCAAG CCGCCGGCCT GCTCTTCAAG
CCGGTCGCTG ACTACAACGG CACGGTCGAC ATTGCCTTCA CCGTCACCGA CAACGCCAAC
GCCACCTCGG TCCCGGCCAC CCAGCAAATC ACGATCAATC CGGTCGCCGA CATTGCGAAC
AACACCGTCA GCACCGACGA AGACACCGCG ATCACCGTCA ATGTGCTGGG CAACGACACC
TTCGAAGGCA GCCCGAACGT CACCGGCGTT GGCCCGGCGG CCCACGGCAG CGTCGTGATC
AATCCCGACA ACACCGTCAC CTACACCCCG TCGGCCAACT ACAACGGCAG CGACAGCTTC
ACCTACACCG TCACCAGCCC GACCGGCATC ACCGAAACGG CGACGGTCAA TGTCACCATC
AACCCGATCA ACGATGCCCC GATTGCTGTC GTTGCCCCGG TCTCGGGTGA CGAAGACAGC
ACGATTGCGG TCCCGCTGAG CGGGACCGAT GTCGATGGCA GCCTCCAGTA CGTCACCGTG
ACGGCCTTGC CGCCGGCGGC GCAAGGCATC CTGACCCTGG CCGACGGCAC CACCCCGGTC
GTTGCCGGCG ACCACCTGAG CCCGGCCCAA GCCGCCGGCC TGCTCTTCAA GCCGGTCGCT
GACTACAACG GCACGGTCGA CATTGCCTTC ACCGTCACCG ACAACGCCAA CGCCACCTCG
GTCCCGGCCA CCCAGCAAAT CACGATCAAT CCGGTCGCCG ACATTGCGAA CAACACCGTC
AGCACCGACG AAGACACCGC GATCACCGTC AATGTGCTGG GCAACGACAC CTTCGAAGGC
AGCCCGAACG TCACCGGCGT TGGCCCGGCG GCCCACGGCA GCGTCGTGAT CAATCCCGAC
AACACCGTCA CCTACACCCC GTCGGCCAAC TACAACGGCA GCGACAGCTT CACCTACACC
GTCACCAGCC CGACCGGCAT CACCGAAACG GCGACGGTCA ATGTCACCAT CAACCCGGTC
AACCATATCC CGGTAATTGC TGGCAACAGC ACCGGCACGG CGGTTGAAGC CGGCGGCGTG
GCCAACGCCC TCGTTGGCAG CCCGAACGCC TCGGGCACGC TGACCATCAC CGACGCGGAC
CAGAACCAAT CCAGCTTCCA GACCCCGGCC AGCCTGTCGG GCACCTACGG CAACTTCAGC
TTCGACGCCG CCACCGGCGC CTGGACCTAC GCTTTGGACA ACGCCAAGGC GGCGACCCAA
GGGCTGACCG CCGGTCAAGT CGTGCACGAC ACCCTGACCG TGACCAGCCT GGACGGCACG
GCCAGCCGAG CCCTCGACAT CACCATCACC GGCGCCAATG ACAACGCCGC GATCACCGGC
ACGGCGACCG GCAACCTGAC CGAAGACACC AACGTCACGG CCGGCAACCT GACCGCCAGC
GGCACGCTGA CGGTGGCCGA CGTCGACAGC GGCGAAGCCG TGTTCCAGAC CCCGGCCAGC
CTGGCCGGCA CCTACGGCAC CTTCACCTTC AATCCGACGA CCGGTGCCTG GACCTACGCC
GCCAACAACA GCCAGGCCGC GATCCAGTCG CTCGGTGCCG GGGATAGCCT GACCGACAGC
CTGACCGTGG TCAGCCAGGA CGGCACGGCC AGCCAGGCCA TCACGGTGAC CATCCACGGC
ACCAACGACG TACCGACCAT CGGCAGCGGT GCCGGCAACA GCACCGGCAC GGCGGTTGAA
GCCGGCGGCG TGGCCAACGC CCTCGTTGGC AGCCCGAACG CCTCGGGCAC GCTGACCATC
ACCGACGCGG ACCAGAACCA ATCCAGCTTC CAGACCCCGG CCAGCCTGTC GGGCACCTAC
GGCAACTTCA GCTTCGACGC CGCCACCGGC GCCTGGACCT ACGCTTTGGA CAACGCCAAG
GCGGCGACCC AAGGGCTGAC CGCCGGTCAA GTCGTGCACG ACACCCTGAC CGTGACCAGC
CTGGACGGCA CGGCCAGCCG AGCCCTCGAC ATCACCATCA CCGGCGCCAA TGACAACGCC
GCGATCACCG GCACGGCGAC CGGCAACCTG ACCGAAGACA CCAACGTCAC GGCCGGCAAC
CTGACCGCCA GCGGCACGCT GACGGTGGCC GACGTCGACA GCGGCGAAGC CGTGTTCCAG
ACCCCGGCCA GCCTGGCCGG CACCTACGGC ACCTTCACCT TCGACCCGAC GACCGGTGCC
TGGACCTACG CCGCCAACAA CAGCCAGGCC GCGATCCAGT CGCTCGGTGC CGGGGATAGC
CTGACCGACA GCCTGACCGT GGTCAGCCAG GACGGCACGG CCAGCCAGGC CATCACGGTG
ACCATCCACG GCACCAACGA CGTACCGACC ATCGGCAGCG GTGCCGGCAA CAGCACCGGC
ACGGCGGTTG AAGCCGGCGG CGTGGCCAAC GCCCTCGTTG GCAGCCCGAA CGCCTCGGGC
ACGCTGAGCA TCACCGACGC GGACCAGAAC CAATCCAGCT TCCAGACCCC GGCCAGCCTG
TCGGGCACCT ACGGCAACTT CAGCTTCGAC GCCGCCACCG GCGCCTGGAC CTACGCTTTG
GACAACGCCA AGGCGGCGAC CCAAGGGCTG ACCGCCGGTC AAGTCGTGCA CGACACCCTG
ACCGTGACCA GCCTGGACGG CACGGCCAGC CGAGCCCTCG ACATCACCAT CACCGGCGCC
AATGACAACG CCGCGATCAC CGGCACGGCG ACCGGCAACC TGACCGAAGA CACCAACGTC
ACGGCCGGCA ACCTGACCGC CAGCGGCACG CTGACGGTGG CCGACGTCGA CAGCGGCGAA
GCCGTGTTCC AGACCCCGGC CAGCCTGGCC GGCACCTACG GCACCTTCAC CTTCAATCCG
ACGACCGGTG CCTGGACCTA CGCCGCCAAC AACAGCCAGG CCGCGATCCA GTCGCTCGGT
GCCGGGGATA GCCTGACCGA CAGCCTGACC GTGGTCAGCC AGGACGGCAC GGCCAGCCAG
GCCATCACGG TGACCATCCA CGGCACCAAC GACGTACCGA CCATCGGCAG CGGTGCCGGC
AACAGCACCG GCACGGCGGT TGAAGCCGGC GGCGTGGCCA ACGCCCTCGT TGGCAGCCCG
AACGCCTCGG GCACGCTGAG CATCACCGAC GCGGACCAGA ACCAATCCAG CTTCCAGACC
CCGGCCAGCC TGTCGGGCAC CTACGGCAAC TTCAGCTTCG ACGCCGCCAC CGGCGCCTGG
ACCTACGCTT TGGACAACGC CAAGGCGGCG ACCCAAGGGC TGACCGCCGG TCAAGTCGTG
CACGACACCC TGACCGTGAC CAGCCTGGAC GGCACGGCCA GCCGAGCCCT CGACATCACC
ATCACCGGCG CCAATGACAA CGCCGCGATC ACCGGCACGG CGACCGGCAA CCTGACCGAA
GACACCAACG TCACGGCCGG CAACCTGACC GCCAGCGGCA CGCTGACGGT GGCCGACGTC
GACAGCGGCG AAGCCGTGTT CCAGACCCCG GCCAGCCTGG CCGGCACCTA CGGCACCTTC
ACCTTCAATC CGACGACCGG TGCCTGGACC TACGCCGCCA ACAACAGCCA GGCCGCGATC
CAGTCGCTCG GTGCCGGGGA TAGCCTGACC GACAGCCTGA CCGTGGTCAG CCAGGACGGC
ACGGCCAGCC AGGCCATCAC GGTGACCATC CACGGCACCA ACGACGTACC GACCATCGGC
AGCGGTGCCG GCAACAGCAC CGGCACGGCG GTTGAAGCCG GCGGCGTGGC CAACGCCCTC
GTTGGCAGCC CGAACGCCTC GGGCACGCTG AGCATCACCG ACGCGGACCA GAACCAATCC
AGCTTCCAGA CCCCGGCCAG CCTGTCGGGC ACCTACGGCA ACTTCAGCTT CGACGCCGCC
ACCGGCGCCT GGACCTACGC TTTGGACAAC GCCAAGGCGG CGACCCAAGG GCTGACCGCC
GGTCAAGTCG TGCACGACAC CCTGACCGTG ACCAGCCTGG ACGGCACGGC CAGCCGAGCC
CTCGACATCA CCATCACCGG CGCCAATGAC AACGCCGCGA TCACCGGCAC GGCGACCGGC
AACCTGACCG AAGACACCAA CGTCACGGCC GGCAACCTGA CCGCCAGCGG CACGCTGACG
GTGGCCGACG TCGACAGCGG CGAAGCCGTG TTCCAGACCC CGGCCAGCCT GGCCGGCACC
TACGGCACCT TCACCTTCAA TCCGACGACC GGTGCCTGGA CCTACGCCGC CAACAACAGC
CAGGCCGCGA TCCAGTCGCT CGGTGCCGGG CAGAGCCTGA CCGACAGCCT GACCGTGGTC
AGCCAGGACG GCACGGCCAG CCAGGCCATC ACGGTGACCA TCCACGGCAC CAACGACGTA
CCGACCATCG GCAGCGGTGC CGGCAACAGC ACCGGCACGG CGGTTGAAGC CGGCGGCGTG
GCCAACGCCC TCGTTGGCAG CCCGAACGCC TCGGGCACGC TGAGCATCAC CGACGCGGAC
CAGAACCAAT CCAGCTTCCA GACCCCGGCC AGCCTGTCGG GCACCTACGG CAACTTCAGC
TTCGACGCCG CCACCGGCGC CTGGACCTAC GCTTTGGACA ACGCCAAGGC GGCGACCCAA
GGGCTGACCG CCGGTCAAGT CGTGCACGAC ACCCTGACCG TGACCAGCCT GGACGGCACG
GCCAGCCGAG CCCTCGACAT CACCATCACC GGCGCCAATG ACAACGCCGC GATCACCGGC
ACGGCGACCG GCAACCTGAC CGAAGACACC AACGTCACGG CCGGCAACCT GACCGCCAGC
GGCACGCTGA CGGTGGCCGA CGTCGACAGC GGCGAAGCCG TGTTCCAGAC CCCGGCCAGC
CTGGCCGGCA CCTACGGCAC CTTCACCTTC GACCCGACGA CCGGTGCCTG GACCTACGCC
GCCAACAACA GCCAGGCCGC GATCCAGTCG CTCGGTGCCG GGGATAGCCT GACCGACAGC
CTGACCGTGG TCAGCCAGGA CGGCACGGCC AGCCAGGCCA TCACGGTGAC CATCCACGGC
ACCAACGACG TACCGACCAT CGGCAGCGGT GCCGGCAACA GCACCGGCAC GGCGGTTGAA
GCCGGCGGCG TGGCCAACGC CCTCGTTGGC AGCCCGAACG CCTCGGGCAC GCTGAGCATC
ACCGACGCGG ACCAGAACCA ATCCAGCTTC CAGACCCCGG CCAGCCTGTC GGGCACCTAC
GGCAACTTCA GCTTCGACGC CGCCACCGGC GCCTGGACCT ACGCTTTGGA CAACGCCAAG
GCGGCGACCC AAGGGCTGAC CGCCGGTCAA GTCGTGCACG ACACCCTGAC CGTGACCAGC
CTGGACGGCA CGGCCAGCCG AGCCCTCGAC ATCACCATCA CCGGCGCCAA TGACAACGCC
GCGATCACCG GCACGGCGAC CGGCAACCTG ACCGAAGACA CCAACGTCAC GGCCGGCAAC
CTGACCGCCA GCGGCACGCT GACGGTGGCC GACGTCGACA GCGGCGAAGC CGTGTTCCAG
ACCCCGGCCA GCCTGGCCGG CACCTACGGC ACCTTCACCT TCAATCCGAC GACCGGTGCC
TGGACCTACG CCGCCAACAA CAGCCAGGCC GCGATCCAGT CGCTCGGTGC CGGGGATAGC
CTGACCGACA GCCTGACCGT GGTCAGCCAG GACGGCACGG CCAGCCAGGC CATCACGGTG
ACCATCCACG GCACCAACGA CGTACCGACC ATCGGCAGCG GTGCCGGCAA CAGCACCGGC
ACGGCGGTTG AAGCCGGCGG CGTGGCCAAC GCCCTCGTTG GCAGCCCGAA CGCCTCGGGC
ACGCTGACCA TCACCGACGC GGACCAGAAC CAATCCAGCT TCCAGACCCC GGCCAGCCTG
TCGGGCACCT ACGGCAACTT CAGCTTCGAC GCCGCCACCG GCGCCTGGAC CTACGCTTTG
GACAACGCCA AGGCGGCGAC CCAAGGGCTG ACCGCCGGTC AAGTCGTGCA CGACACCCTG
ACCGTGACCA GCCTGGACGG CACGGCCAGC CGAGCCCTCG ACATCACCAT CACCGGCGCC
AATGACAACG CCGCGATCAC CGGCACGGCG ACCGGCAACC TGACCGAAGA CACCAACGTC
ACGGCCGGCA ACCTGACCGC CAGCGGCACG CTGACGGTGG CCGACGTCGA CAGCGGCGAA
GCCGTGTTCC AGACCCCGGC CAGCCTGGCC GGCACCTACG GCACCTTCAC CTTCAATCCG
ACGACCGGTG CCTGGACCTA CGCCGCCAAC AACAGCCAGG CCGCGATCCA GTCGCTCGGT
GCCGGGGATA GCCTGACCGA CAGCCTGACC GTGGTCAGCC AGGACGGCAC GGCCAGCCAG
GCCATCACGG TGACCATCCA CGGCACCAAC GACGTACCGA CCATCGGCAG CGGTGCCGGC
AACAGCACCG GCACGGCGGT TGAAGCCGGC GGCGTGGCCA ACGCCCTCGT TGGCAGCCCG
AACGCCTCGG GCACGCTGAG CATCACCGAC GCGGACCAGA ACCAATCCAG CTTCCAGACC
CCGGCCAGCC TGTCGGGCAC CTACGGCAAC TTCAGCTTCG ACGCCGCCAC CGGCGCCTGG
ACCTACGCTT TGGACAACGC CAAGGCGGCG ACCCAAGGGC TGACCGCCGG TCAAGTCGTG
CACGACACCC TGACCGTGAC CAGCCTGGAC GGCACGGCCA GCCGAGCCCT CGACATCACC
ATCACCGGCG CCAATGACAA CGCCGCGATC ACCGGCACGG CGACCGGCAA CCTGACCGAA
GACACCAACG TCACGGCCGG CAACCTGACC GCCAGCGGCA CGCTGACGGT GGCCGACGTC
GACAGCGGCG AAGCCGTGTT CCAGACCCCG GCCAGCCTGG CCGGCACCTA CGGCACCTTC
ACCTTCGACC CGACGACCGG TGCCTGGACC TACGCCGCCA ACAACAGCCA GGCCGCGATC
CAGTCGCTCG GTGCCGGGCA GAGCCTGACC GACAGCCTGA CCGTGGTCAG CCAGGACGGC
ACGGCCAGCC AGGCCATCAC GGTGACCATC CACGGCACCA ACGACGTACC GACCATCGGC
AGCGGTGCCG GCAACAGCAC CGGCACGGCG GTTGAAGCCG GCGGCGTGGC CAACGCCCTC
GTTGGCAGCC CGAACGCCTC GGGCACGCTG AGCATCACCG ACGCGGACCA GAACCAATCC
AGCTTCCAGA CCCCGGCCAG CCTGTCGGGC ACCTACGGCA ACTTCAGCTT CGACGCCGCC
ACCGGCGCCT GGACCTACGC TTTGGACAAC GCCAAGGCGG CGACCCAAGG GCTGACCGCC
GGTCAAGTCG TGCACGACAC CCTGACCGTG ACCAGCCTGG ACGGCACGGC CAGCCGAGCC
CTCGACATCA CCATCACCGG CGCCAATGAC AACGCCGCGA TCACCGGCAC GGCGACCGGC
AACCTGACCG AAGACACCAA CGTCACGGCC GGCAACCTGA CCGCCAGCGG CACGCTGACG
GTGGCCGACG TCGACAGCGG CGAAGCCGTG TTCCAGACCC CGGCCAGCCT GGCCGGCACC
TACGGCACCT TCACCTTCAA TCCGACGACC GGTGCCTGGA CCTACGCCGC CAACAACAGC
CAGGCCGCGA TCCAGTCGCT CGGTGCCGGG GATAGCCTGA CCGACAGCCT GACCGTGGTC
AGCCAGGACG GCACGGCCAG CCAGGCCATC ACGGTGACCA TCCACGGCAC CAACGACGTA
CCGACCATCG GCAGCGGTGC CGGCAACAGC ACCGGCACGG CGGTTGAAGC CGGCGGCGTG
GCCAACGCCC TCGTTGGCAG CCCGAACGCC TCGGGCACGC TGAGCATCAC CGACGCGGAC
CAGAACCAAT CCAGCTTCCA GACCCCGGCC AGCCTGTCGG GCACCTACGG CAACTTCAGC
TTCGACGCCG CCACCGGCGC CTGGACCTAC GCTTTGGACA ACGCCAAGGC GGCGACCCAA
GGGCTGACCG CCGGTCAAGT CGTGCACGAC ACCCTGACCG TGACCAGCCT GGACGGCACG
GCCAGCCGAG CCCTCGACAT CACCATCACC GGCGCCAATG ACAACGCCGC GATCACCGGC
ACGGCGACCG GCAACCTGAC CGAAGACACC AACGTCACGG CCGGCAACCT GACCGCCAGC
GGCACGCTGA CGGTGGCCGA CGTCGACAGC GGCGAAGCCG TGTTCCAGAC CCCGGCCAGC
CTGGCCGGCA CCTACGGCAC CTTCACCTTC AATCCGACGA CCGGTGCCTG GACCTACGCC
GCCAACAACA GCCAGGCCGC GATCCAGTCG CTCGGTGCCG GGCAGAGCCT GACCGACAGC
CTGACCGTGG TCAGCCAGGA CGGCACGGCC AGCCAGGCCA TCACGGTGAC CATCCACGGC
ACCAACGACG TACCGGTTGC CAATAATGCC AGCGCGACGG GGAATGAAGA CACGCTGATT
CCGATTACCC TGACGGGAAC TGATATCGAT GGAACGGTAG CCAGTTTCAC GCTTTCCTCC
CTGCCGGCTA ATGGCCGCCT GTATCTGGAT GCAGCCATGA CGCAACTGGC TCCAACGGGA
ACGGCGCTTA CGGCAAGCGG CAATGCGCTG ACCTTGTACT TCAAGCCGAA TGCCGACTGG
AACAGCCATA TCGTCAATAC CACGGCCAGT TTGCCGACAT TCAACTACAC GGCGACCGAC
AACTCCGGTG GTGTGTCCAA TGTGGCCACC GCAACGATTG ACGTGCTGGC CGTCAATGAT
GGTGCCCCGG TTGCGGTGAA TGATTCGTTC AATGCCTTGC TGGGTACTCC GATCATCATC
AGCAAGGCTG CGTTGCTCGG CAATGACACG TTGCCTGACC ACGCCACGAT TGTGTCCGTT
GGCTCTCCTT CCAGCGGGGC GCTGGTTGAT AACGGTGACG GAACCTATAC CTATACGCCG
AGCGCGACCG GGACGGCAAG TTTTACCTAT TTGTTGCGGG ATGATGACTC CCAAACCAGT
ACCGGTACGG TCTCAATCAA TACCTACAAC AGCCGGGATG ACCTGGCCAC AGTTAATGAG
TCGGCGTTGG CAACTGGTTC AGGTGGTGGC TCGACGGTAG CGACCGGTAA CCTGATGACC
AACGACGTGA CCAATACCAG CATTACCAGC GTAACGTTCA ATGGTGTGAC CTACACGGCT
TCCGGGGGAG TAATTACCGT TCCGGATACT GCTGCCGGGG CGCATGGCAC CTTGGTGGTC
ACGGCTGCAA CGGGAGCCTA TACCTACACC CTGACCCATG CCGCCACCAA TGGTGCCGCG
AATTCGGCGA CCGACACGTC GCTGGTCGAT AGCTACAGCT ATGCCGGCAA TAGCGTCTCG
GCCAACCTGA AGGTGACGAT CGTCGATGAC AAGCCGGTGG TGGTCAATCA GGTGGTGGAA
GTACCGCAGA GCGTCCTGCC GAAATACACG ATTGCCGTGG TGCTGGATAT TTCCGGCAGT
ATGGCTGCAG CTGTCTCGGC CGATGGTCTG ACAACGCGTC TGGACATGGC CAAGGCTGCC
TTGGCCTCTC TGATCAGCGA GTACTACACC CAAGCCTCCG ATGTCGTCGT CAAGTTCATC
GATTTCAGTA GCGGTGCCAC CCTGATTGGC TCTTATACGA CGGAAGGGAC TGCAATCTCC
GCGCTGACTT CACCGACGAT CGTGGCCGGG GGAGCTACCA ACTATCAGGC GGCGCTCGAT
CTGGTCCGTA GCGCCAGTGG CCTCGGGACG ACAGCGGATG CGTCGCGTCA GAACATCGTC
TACTTCCTGT CGGATGGCGT GCCGACCACA GGGACGACGG CGACCGGCCT GAGCAATTTC
CAGACTTACC TCGCGGCGAA TCCGTCGGTC CAGTCCTATG CGGTGGGGAT CGGGACCGGC
ATCGTTGATT TCACCAGCCT GAATGCCATT CACAATGTTG ATGCGCTGGG CGATGGCGTC
AAGGATCCGG CCATTGTTGT TCCGGATCTC AGCCAGCTTT CCAGCACGCT GTTGTCGACC
GTGCCGAACG CCTTTGGCGG CAATATCATG GCCTCGGCCA ACATGCGGGG TCTCGTGTTT
GGTGCGGATG GCGGCTATAT CAGTTCGATC AGTCTGATGC TGGATAGTGA CGGAAATGGT
ACGGCCGACC AGAAGGTGAC CTTCACCTAC AATCACCTGA CCGACACGAT TACCCAGAAT
AGTACGTTCC TGACCGGCTT CCCGCTTTCC GGCCACCTGT TGTCGCTGAG CAGTACTTCC
GGGTTCATCT ACGGTGATTT GCGTTTCGAC TTCTCCACCG GTGATTACAA GTACTATACC
AAGGGGCTGG CTACGCTGGG AACGCAATTC GACATCGGCT TTACCGCCAG CGACAACGAT
CAGGATGTGG CCTCTGCGGT GCAGACAATC TCCATCATCG ATGGCAAGCC GATTGCCCGT
AACGACACTG ATACACTCTT TGCCAAGGAT ACTTTCCTCG AAGGCAATGT CGTCACCGGA
CTCGGAACGG ATGGCGGCGT TGGCGCAGCG CAGATCACCT CCTTCACGAC GCAAGGGGGA
GGGGTGGATA CGATCGTCGA TAACGCCAAG GTAACGGCGG TCGATTTCGA TGGCTTACAC
ATTGTGCTCG GTACCTGGGC CGGCGGCGTT TATACCGCAG CCAACAGCAG CGGCAGTGGT
ACTGGCTACT CCTATAACGT CGTCAATGGC ACATTGACCT GGACGGCGAC CAGTGGTGGA
CAGAAACTGG TCTTCGACGA CAGTGGTTAT TACAAATACA CGCCACCGAC GGCGGATATT
CCGACCAATG TTCTGGGTGC GCCGGTCACG GTCAACATGA CCAGTGCTGC AAATGTGACG
ACCGGCGGCC TGACCGTGAC GGCCGAAAAC TGGACAAGCA CCACGGTTTC CCTGACCAGT
GCAGCTAATG CGGCAACTGG TAACCATCTG ACGGTTTCCG GTCTGACGGC AAGCGGGCTG
GTCGCCGGTG CTCCGGTTTA CAACGCCACT AACGGTGTCG GCGTTAACAC TGGCGGCGAA
ACGGCGGCCA ATCAGGCCAG CATCAATGGC AAAGAGACCT TGATTCTCGA TTTCTCTGCG
GCGACGCATC CGAACGGCGT TTCGAATATC AGCCTGACGA TTGCCGGAGC CAGCAGCCTG
GCAAATACTG CAGGTGGTAC GCCATCGCTG ACCTACACGC TGTACGATGC CAGCGGCAAC
CTTTTGGGCA GCCTGACCAG TGGTCTGGAA AATACGGTGA CGATGTTGCC CTATTCCGGT
GTGGCTTCGA TTCATGTCAC CGGTAGCGCT ACGGCAACGG CCATGGTGCA CGATGTCAGC
TTCTACGATA CGCCCTCCGC CTCGACCGTC ACCTATAACG CCAATGGCGT TAATGTCACT
GGCGGAACCT CGACTAATAC CTACCTGGAT CACCTGGAGG CGCTCACCAT CAGCTTCAAC
CATGCGACCT ATGCCAACGG TGTTCAGGAT GTCAGCATCA ACGTCAACGC TGGCCGGAGC
AACCTCGCCA GCTCGGGTAG TGATTCGTAT GCCTTGACCT ATACGGTTTA TGGCATTGAT
GGTCACCTGC TGGGGCAGTT CAGCAGCGTT ACCGAAGGCA CCGTCAATCT CAATACCGAT
AACGGTAACG GCGGCGTGCT GGCGACCGCG AGAACCTTCA GCAACATTGG CTCGGTGGTC
GTGACGGCCA GTGATTCCTT TGCTGGCATT ACCATTGCCG ATATCACCGG GGTTACCTTC
ACGCCGGACC TGCTCAACAG TTCGGCTACT GCCGTTGCCC CGGAACATGT GACCTACACC
CTGACCGACA GCAACGGTGA CCAATCCAGC GCTGGCCTGA CGCTCAACGT GATGGCCAAC
ACCATCGTCG GCACGTCGGG TGACGACGGC AGTCTGGTCG GTACATCTGC CAACGACTAT
ATGGATGGCT TGGCCGGCAA CGATACCCTG AGCGGCGGCG CCGGGCATGA CATCCTGCAG
GGCGGCTTGG GCAACGACAT CCTGAGCGGT GATAGCGGTG ATGACGTGCT GGATGGCGGT
GACGGCAGCG ACACCTTGTC TGGCGGCACT GGTAACGACT ACCTCAAGGG GAGTGCCGGC
AATGACACCC TGGATGGTGG TGACGGCAAC GACGTCTTGG TGGGCGGGGC TGGCAACGAT
ATGTTGACCG GTGGCCTCGG GGCCGACACC TTCCGCTGGG AGCTGGGCGA TGCCGGTGCG
AAGGGTAATC CGGCAGTGGA TACGGTGATG GATTTCGACA TTGCTACAAA TTCCAGCCTG
ATGACCCCGA CGGCCGACAT GCTCGACCTG AGAGATTTGC TGATCGGCGA AAACCACAGC
ACCGGCATCA CCGGGAACCT GACCAATTTC CTGCATTTTG AATTGTCAGG TGGCGATACC
AAGGTTCACG TCAGCACGAC GGGTGCTTTT GCTGCCGGCT TCCAGAACTC GCTTGACGAC
CAAGTGATCG TCATGAAGGG GGTTGATCTG GTGACTGCGT TTAACGGCAA CGATCAACAG
ATCATTCTGG ATCTGCTGTC CAAGAACAAG CTGAATGTCG ATTGA
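
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and only the first row of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, as printed in the record.
first_row = "ATGGCTCAAG CCCAAGTCGT CGCTAAAATT TCCGAACTGT CTGGCCAGGC GTTTGCGCGA"

seq = clean_sequence(first_row)
print(len(seq))                   # 60 bases in one printed row
print(round(gc_percent(seq), 1))  # GC percentage of this row only
```

Passing the full sequence text instead of a single row would give a value comparable to the GC content listed for the whole gene.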
 
Protein sequence
MAQAQVVAKI SELSGQAFAR DSAGNTRRLK LGDVIREGES VVAADGAKVV LVLADGREMT 
VRPGETARID AEVAAAVMPD ASDSAVVNDQ NSFQKIAKAL QSGSDLDALL EEDAPAAGLA
GQGGNEGHTF VELLRIVETV DPLAYQFGTN RGSPTETIEG APVTVTVASE PEQVLFARDD
ANAVSERVDS TAASTISGNV VAAGAASDVA DSGLPGAVLT VTQIAFGTTV AAPGTVIALA
HGTLTIAADG SYTYTVNNAD PVVNALNVGN QLTELVTYTI TDGQGHTAQA TLTLTINGVN
DSPTATVGAA IASAEDTSIV VPLSGTDPDS PIAFVTVTTL PTDGSLLKAD GTPVVAGEQI
AVDPATHQAL LTFVPNANFN GMVSFQFTVT DTGGLSSAPA TQQITINPIN DAPIAVVAPV
SGDEDSTIAV PLSGTDVDGS IQYVTVTALP PAAQGILTLA DGTTAVVAGD HLSPAQAAGL
LFKPVADYNG TVDIAFTVTD NANATSVPAT QQITINPVAD IANNTVSTDE DTAITVNVLG
NDTFEGSPNV TGVGPAAHGS VVINPDNTVT YTPSANYNGS DSFTYTVTSP TGITETATVN
VTINPINDAP IAVVAPVSGD EDSTIAVPLS GTDVDGSLQY VTVTALPPAA QGILTLADGT
TPVVAGDHLS PAQAAGLLFK PVADYNGTVD IAFTVTDNAN ATSVPATQQI TINPVADIAN
NTVSTDEDTA ITVNVLGNDT FEGSPNVTGV GPAAHGSVVI NPDNTVTYTP SANYNGSDSF
TYTVTSPTGI TETATVNVTI NPINDAPIAV VAPVSGDEDS TIAVPLSGTD VDGSLQYVTV
TALPPAAQGI LTLADGTTPV VAGDHLSPAQ AAGLLFKPVA DYNGTVDIAF TVTDNANATS
VPATQQITIN PVADIANNTV STDEDTAITV NVLGNDTFEG SPNVTGVGPA AHGSVVINPD
NTVTYTPSAN YNGSDSFTYT VTSPTGITET ATVNVTINPV NHIPVIAGNS TGTAVEAGGV
ANALVGSPNA SGTLTITDAD QNQSSFQTPA SLSGTYGNFS FDAATGAWTY ALDNAKAATQ
GLTAGQVVHD TLTVTSLDGT ASRALDITIT GANDNAAITG TATGNLTEDT NVTAGNLTAS
GTLTVADVDS GEAVFQTPAS LAGTYGTFTF NPTTGAWTYA ANNSQAAIQS LGAGDSLTDS
LTVVSQDGTA SQAITVTIHG TNDVPTIGSG AGNSTGTAVE AGGVANALVG SPNASGTLTI
TDADQNQSSF QTPASLSGTY GNFSFDAATG AWTYALDNAK AATQGLTAGQ VVHDTLTVTS
LDGTASRALD ITITGANDNA AITGTATGNL TEDTNVTAGN LTASGTLTVA DVDSGEAVFQ
TPASLAGTYG TFTFDPTTGA WTYAANNSQA AIQSLGAGDS LTDSLTVVSQ DGTASQAITV
TIHGTNDVPT IGSGAGNSTG TAVEAGGVAN ALVGSPNASG TLSITDADQN QSSFQTPASL
SGTYGNFSFD AATGAWTYAL DNAKAATQGL TAGQVVHDTL TVTSLDGTAS RALDITITGA
NDNAAITGTA TGNLTEDTNV TAGNLTASGT LTVADVDSGE AVFQTPASLA GTYGTFTFNP
TTGAWTYAAN NSQAAIQSLG AGDSLTDSLT VVSQDGTASQ AITVTIHGTN DVPTIGSGAG
NSTGTAVEAG GVANALVGSP NASGTLSITD ADQNQSSFQT PASLSGTYGN FSFDAATGAW
TYALDNAKAA TQGLTAGQVV HDTLTVTSLD GTASRALDIT ITGANDNAAI TGTATGNLTE
DTNVTAGNLT ASGTLTVADV DSGEAVFQTP ASLAGTYGTF TFNPTTGAWT YAANNSQAAI
QSLGAGDSLT DSLTVVSQDG TASQAITVTI HGTNDVPTIG SGAGNSTGTA VEAGGVANAL
VGSPNASGTL SITDADQNQS SFQTPASLSG TYGNFSFDAA TGAWTYALDN AKAATQGLTA
GQVVHDTLTV TSLDGTASRA LDITITGAND NAAITGTATG NLTEDTNVTA GNLTASGTLT
VADVDSGEAV FQTPASLAGT YGTFTFNPTT GAWTYAANNS QAAIQSLGAG QSLTDSLTVV
SQDGTASQAI TVTIHGTNDV PTIGSGAGNS TGTAVEAGGV ANALVGSPNA SGTLSITDAD
QNQSSFQTPA SLSGTYGNFS FDAATGAWTY ALDNAKAATQ GLTAGQVVHD TLTVTSLDGT
ASRALDITIT GANDNAAITG TATGNLTEDT NVTAGNLTAS GTLTVADVDS GEAVFQTPAS
LAGTYGTFTF DPTTGAWTYA ANNSQAAIQS LGAGDSLTDS LTVVSQDGTA SQAITVTIHG
TNDVPTIGSG AGNSTGTAVE AGGVANALVG SPNASGTLSI TDADQNQSSF QTPASLSGTY
GNFSFDAATG AWTYALDNAK AATQGLTAGQ VVHDTLTVTS LDGTASRALD ITITGANDNA
AITGTATGNL TEDTNVTAGN LTASGTLTVA DVDSGEAVFQ TPASLAGTYG TFTFNPTTGA
WTYAANNSQA AIQSLGAGDS LTDSLTVVSQ DGTASQAITV TIHGTNDVPT IGSGAGNSTG
TAVEAGGVAN ALVGSPNASG TLTITDADQN QSSFQTPASL SGTYGNFSFD AATGAWTYAL
DNAKAATQGL TAGQVVHDTL TVTSLDGTAS RALDITITGA NDNAAITGTA TGNLTEDTNV
TAGNLTASGT LTVADVDSGE AVFQTPASLA GTYGTFTFNP TTGAWTYAAN NSQAAIQSLG
AGDSLTDSLT VVSQDGTASQ AITVTIHGTN DVPTIGSGAG NSTGTAVEAG GVANALVGSP
NASGTLSITD ADQNQSSFQT PASLSGTYGN FSFDAATGAW TYALDNAKAA TQGLTAGQVV
HDTLTVTSLD GTASRALDIT ITGANDNAAI TGTATGNLTE DTNVTAGNLT ASGTLTVADV
DSGEAVFQTP ASLAGTYGTF TFDPTTGAWT YAANNSQAAI QSLGAGQSLT DSLTVVSQDG
TASQAITVTI HGTNDVPTIG SGAGNSTGTA VEAGGVANAL VGSPNASGTL SITDADQNQS
SFQTPASLSG TYGNFSFDAA TGAWTYALDN AKAATQGLTA GQVVHDTLTV TSLDGTASRA
LDITITGAND NAAITGTATG NLTEDTNVTA GNLTASGTLT VADVDSGEAV FQTPASLAGT
YGTFTFNPTT GAWTYAANNS QAAIQSLGAG DSLTDSLTVV SQDGTASQAI TVTIHGTNDV
PTIGSGAGNS TGTAVEAGGV ANALVGSPNA SGTLSITDAD QNQSSFQTPA SLSGTYGNFS
FDAATGAWTY ALDNAKAATQ GLTAGQVVHD TLTVTSLDGT ASRALDITIT GANDNAAITG
TATGNLTEDT NVTAGNLTAS GTLTVADVDS GEAVFQTPAS LAGTYGTFTF NPTTGAWTYA
ANNSQAAIQS LGAGQSLTDS LTVVSQDGTA SQAITVTIHG TNDVPVANNA SATGNEDTLI
PITLTGTDID GTVASFTLSS LPANGRLYLD AAMTQLAPTG TALTASGNAL TLYFKPNADW
NSHIVNTTAS LPTFNYTATD NSGGVSNVAT ATIDVLAVND GAPVAVNDSF NALLGTPIII
SKAALLGNDT LPDHATIVSV GSPSSGALVD NGDGTYTYTP SATGTASFTY LLRDDDSQTS
TGTVSINTYN SRDDLATVNE SALATGSGGG STVATGNLMT NDVTNTSITS VTFNGVTYTA
SGGVITVPDT AAGAHGTLVV TAATGAYTYT LTHAATNGAA NSATDTSLVD SYSYAGNSVS
ANLKVTIVDD KPVVVNQVVE VPQSVLPKYT IAVVLDISGS MAAAVSADGL TTRLDMAKAA
LASLISEYYT QASDVVVKFI DFSSGATLIG SYTTEGTAIS ALTSPTIVAG GATNYQAALD
LVRSASGLGT TADASRQNIV YFLSDGVPTT GTTATGLSNF QTYLAANPSV QSYAVGIGTG
IVDFTSLNAI HNVDALGDGV KDPAIVVPDL SQLSSTLLST VPNAFGGNIM ASANMRGLVF
GADGGYISSI SLMLDSDGNG TADQKVTFTY NHLTDTITQN STFLTGFPLS GHLLSLSSTS
GFIYGDLRFD FSTGDYKYYT KGLATLGTQF DIGFTASDND QDVASAVQTI SIIDGKPIAR
NDTDTLFAKD TFLEGNVVTG LGTDGGVGAA QITSFTTQGG GVDTIVDNAK VTAVDFDGLH
IVLGTWAGGV YTAANSSGSG TGYSYNVVNG TLTWTATSGG QKLVFDDSGY YKYTPPTADI
PTNVLGAPVT VNMTSAANVT TGGLTVTAEN WTSTTVSLTS AANAATGNHL TVSGLTASGL
VAGAPVYNAT NGVGVNTGGE TAANQASING KETLILDFSA ATHPNGVSNI SLTIAGASSL
ANTAGGTPSL TYTLYDASGN LLGSLTSGLE NTVTMLPYSG VASIHVTGSA TATAMVHDVS
FYDTPSASTV TYNANGVNVT GGTSTNTYLD HLEALTISFN HATYANGVQD VSINVNAGRS
NLASSGSDSY ALTYTVYGID GHLLGQFSSV TEGTVNLNTD NGNGGVLATA RTFSNIGSVV
VTASDSFAGI TIADITGVTF TPDLLNSSAT AVAPEHVTYT LTDSNGDQSS AGLTLNVMAN
TIVGTSGDDG SLVGTSANDY MDGLAGNDTL SGGAGHDILQ GGLGNDILSG DSGDDVLDGG
DGSDTLSGGT GNDYLKGSAG NDTLDGGDGN DVLVGGAGND MLTGGLGADT FRWELGDAGA
KGNPAVDTVM DFDIATNSSL MTPTADMLDL RDLLIGENHS TGITGNLTNF LHFELSGGDT
KVHVSTTGAF AAGFQNSLDD QVIVMKGVDL VTAFNGNDQQ IILDLLSKNK LNVD