Gene Veis_4997 details


Gene Information

Locus tag: Veis_4997
Symbol:
ID: 4692642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 5530876
End bp: 5544162
Gene Length: 13287 bp
Protein Length: 4428 aa
Translation table: 11
GC content: 66%
IMG OID: 639852732
Product: outer membrane protein
Protein accession: YP_999701
Protein GI: 121611894
COG category:
COG ID:
TIGRFAM ID: [TIGR02059] cyanobacterial long protein repeat
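
The length fields in this record agree with each other if the start and end coordinates are read as 1-based and inclusive, and if the CDS ends in a single stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states. The function names below are illustrative only. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(5530876, 5544162)
print(length_bp)                  # 13287
print(protein_length(length_bp))  # 4428
```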


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCC TGACCAACAT CTCCTTGAAC AATGGCAACG CCACCCTCAA AACCGGCTAC 
TCCTACACCG TCACCCTCAC TTTTGACTCC AGGGTGGGTG GCGTCACTGC GGCCAATGTA
GTGGTGCCCC CGGGCACCAT GCTGACCACC CCGACCGCCT CCGGCCCTGA CAGCAACGGC
CGCAGCACGA CATGGCAGTT CGAACTGACC CCGCCTGCCA ACACCGAAAG GACCAACGAC
ACCCTCGGCA TGAGCCTCAC CGGTGTGACC GATCAGAACG GCTCGGTCCC GACGAACAAC
GCCGCCTCCG TCACCTACAC CGTAGACACC ATAAAACCCA CGCTCCAGAG CGCGAAGGTG
ACCGGCAACC AACTGGTGCT GACCTACAAC GAAAACCTGG ATACGAGCGC AGCCAACGCG
CCGCTCGCAG ACCGCTTTGT CGTGACCGCA GATGACAAGT CGGTGCTCGC CAATGGCACA
AGAATCAACG TCACCGGCAG CCCGGTCGTG AATGGACGGA CCGTCACCTT GACGCTGGCC
AGCGCCGTGG CATCCGGCCA ACAGGTGAAA GTCAGCTACA GAGACTCCGC CCCGGGCGAC
GACACGCGCG CAATACAAGA CACGGCCGGC AACGACGCCG CCAGTTTCAG CAACCAGGTC
GCGACCAACG AGACGCCCCC GGTGATCCGC AGCGCCACGG TCAACGGCAC CCAGTTGGTG
CTCAGCTACA CCACCCTCTC CGGCCTGGAC GCGGTCCACC CACCGCCCGC CAGCGCTTTT
GCGGTGACCA GCGCTGGCAC CACCACCCCC ATCGGCGTCA GCAACGTCAG TGTGGATGCA
GTGAACAATA CCGTCACGCT GACGCTGAAT CGCCCCGTGG CCAACGGCGA GACGGTGTCC
GTCAACTACA CCCGCCCCGC GACCGGCGAC AACGTGATAC AGGACGCTGC CGGCAACGAC
GCGGCCAACA TCCAGAACCG GGCCGTGACC AATGAAACGC CCCCGGTGTG CACCAGCGCC
ACGGTCACCG GCAACCAACT GGTGCTCCGC TACGACACCG ACAATCTGGA TGAAACCGAA
GCCGGCCGGC CCTCTGCCGG CGCCTTTGAG GTGCTGATCG ATGGAACGAC ACGGGCCAGC
GTCCAGGTCG TCACTGTGAA TTCAGCGAAC AAGACCGTCA CGCTGACGCT GGACCGTACC
GTGACACAGG GCCAGAAAGT GACCGTCGCC TACACGGACC CCACGAACGG CAACGACCCG
CGCGCGGTGC AGGACGCCGC CGGCAACGAC GCCGCCAGCT TCAGCGCCAG GCCAGTCACC
AACGACACCA CGGCTCCGGT GCTGGCAGAA GCCACGGTCG ACCGCAATCA ACTGGTGCTG
CGCTATACCG AAAACGACCG CCTGGACACA GTCAATACGG CGCCCGCCAC AGCCTTTACG
GTGACCGCCG GTGGCCAGCC CGTCACCGTC AGCAGCGTCC TGGTGGATGC CACAGCCAAA
ACCGTCACGC TGACGCTGGC CAGCAACGTG GCAGCGAACC AGGCCGTGAC CGTCGCCTAC
CACGCCCCGA CGACCGGCAA CAACGCGATA CAGGACGCCG CCGGCAACGA CGCAGCCAAT
TTCGAAGCGC GCGCTGTCGA CAACATCACC ACGCGCTTTG CGTGCACCAA GGCCGAGGTC
GACCGCGACA AGCTGGTCCT CTCCTTTCCC CAAGGCACCG AGCTCGACGC TATCAAGCCG
ATCGCAACGG CCTTTGCCCT GAGCAGCAGT GGCGGCCTCT CCGTCATCGA CACTATGGTG
GATACGGCCA ACAAAACCGT CATCCTGACG TTGAGCCGCG CCGTAGCCAA CGGCGAAGCG
GTGACCATCA CCTACACCGA CCCGGCCGGG GACAACACCA CCGGCGTGAT ACAGAGCAGC
AGCGGCGCCG ACCTGCCCAC CTTCACCCGC GATGTGATCA ACCGCACGGG CCCGAGGATG
GACGATGTCA GGGTCAACGG CGACAAGGTG ACGATCACCT ACCTGACCAA CAGCCTGGAT
GCGTTCAGCC AGGTGGCGGC CACGGCTTTC GAGGTTCAAA CGGGCGGCAC AGCACCCGAA
ACCATCGGCA TCAAAGGCAT CCAGGTGGAT GCAGCGGCCA AGACCATCAC GCTGACGCTG
GCCCGCGCCG TGGTCAAAGG CGAGCAGGTG ACCGTCAGCT ACACCGACCC CACGGACGGC
AACGACCGCA ACGCGGTACA AGACACGATC GGCAACGATG CGAAATCCGA ATCGTTGCCG
GCGACCAACG ACACGCCCCG CGCCAGCACG CTGGACAGCA TCGCCATCAG CGACCCGAAC
CTGAAAGCCG GCGAAACCGC CATCGTCACC CTCACGTTCA ACACGGCGGT GGATGGCCTG
ACTGCGGCCA ACCTGGTCCT GCCCGCTGGC ACCGCCGTGT CCAACGTCCG CGCCGTCAAC
GGTGCGACCG ACAGCGACGG CCGGCTCAGA AGCGCGACAT GGCAGTTCGA ACTGACCCCG
CCTGCCAACA CCGAATCGAC CGACAACCGC ATCAACGTCA ACCTCAGCGG CGTCAAGGAT
GCCAATGGCA ACGCGGTCAC GAACAACGCC CCCGCCGCCG CCTACACCGT AGACACCCGC
GCGCCTACGT TCACCAGCGC TACGGTGAAC GGCGACCAAC TGGAGCTTGT CTACAGCGAA
GCGCTGGACG GAACCAACAA GCCGGAGATC GGGCGCTTTA TCGTCAACAT CGATGGCCGA
GACCAAGCCG ACGGCGGCGT CAGGGCCGTG GCCGTGGACG GGCGAAAAGT CATCCTGACA
CTGACCACCC CGGTGACATC CGGCCAACAA GTGAAGGTCA CCTACACCGA CCCCAGTTTC
AGCGCCACCC CCTCTAACGA TACCGGCGAC GACGCGCTCG CGATACAGGA CGCCGCCGGC
AACGATGCCG CCAACGTATT GCGCCAACCC GTCACCAACA CCACGCCCCC GACGTTCCGC
AGCGCCACGG TCAACGGCGA CAAACTGGTG CTGCGCTTTG ACACCCAATC CGGCCTGGAC
GCGGTCAACC CGGCGCCCGC CACAGCCTTT GACGTGACCA ACGCCAATGG CACAAGCATC
CGCGTGCTGA GCGTCCGCGT CGATGCAACG GCCAAGACCG TCACGCTGAC GCTGGAGCGC
TCCGTGCCCC GCAGCGAGAC GGTGTTCGTC GCCTACCGCG ACCCCACGCC CGGCAACGAC
ACCAACGCGA TACAGGACGC CGCCGGCAAC GACATGGCCG ATATTGCGCG CCAGCAGGTC
ACCAGCGAAA CGCCCGCGCC GACCGTGACG TACAAAAGCT CCACGGTCGA CGGCAACCAG
TTGGTGGTGT ACTTCAGCGC CAGCTACGAC CTGGACCTCA CCGCCCTGAC AGGCAGCGCA
GGCTTTACGG TGGGCGGCGC CAACAATAGC CCCATCGCCG TCAGCAGCGT TCGGGTGAAT
GCGGACAAGT CCGTCACGCT GACGCTGGCC CGCGCCGTGG TCAGAGGCGA GCAGGTGACC
GTCCGCTACA CCGACCCCAG GCCCTCCGTC GACGACCCCG CCGGCACGGT GATCCAGGAC
AGGAACGGCA CCGACGCGAC CAGTTTCGAG GTCCAGACCG TCACCAACAA CACGCCCGAG
CTCCCGTCCG TGACGGTCAG GGACGCCACG GTCAACGGCA ACCAGTTGGT GGTGGCCTTC
AACGCCAGCA ATGATCTGGA CCTGACCGCC ATCACGGGCA ACCCGGGTTT TACGGTGGCC
AGCACCACCG CCGGCAGCGC CGCCATCACC GTCAGCAGCG TCCGGGTGAA TGCGGACAAG
ACCGTCACAC TGACGCTATC GCGCGCCGTG GCCAATGGCG AGACGGTGAC CGTCAGCTAC
ACCGACGCCG CCGGCGACGG CGCCTCCGGC AGCGTGATCC AGGACTCTGC CGGCACCGAC
GCGAGCAGTT TCCAGAACCA GGCCGTCACC AACAACACGC CCGCGCCTGC GTCCGTGACG
GTCAGGGACG CCACGGTCAA CGGCAACCAG TTGGTGGTGG CCTTCAACGC CAGCAATGAT
CTGGACCTGA CCGCCATCAC GGGCAACCCG GGCTTTGCGG TGACCGGCGC CAACAATAGC
CCCATCACCG TCAGCAGTGT CCGGGTGAAT CCGGACAAGA CCGTCACGCT GACGCTGGCC
CGCGCCGTGA CCAACGGCGA GACGGTGAAA GTCAGCTACA CCGACGCCGC CGGCGACGGC
GCCTCCGGCA GCGTGATCCA GGACTCTGTC GGCACCGACG CGAGCAGTTT CCAGAACCAG
GCCGTCACCA ACAACACGCC CGCGCCCGCG CCCGTGACGG TCAAGGGCGC CACGGTCAAC
GGCAACCAGT TGGTGGTGGC CTTCAACGCC AGCAATGATC TGGACCTGAC CGCCATCACG
GGCAACCCGG GTTTTACGGT GGCCAGCACC ACCGCCGGCA GCGCCGCCAT CACCGTCAGC
AGCGTCCGGG TGAATGCGGA CAAGACCGTC ACGCTGACGC TATCGCGCGC CGTGGTCAAT
GGCGAGACGG TGACCGTCAG CTACACCGAC GCCGCCGGCG ACGGCGCCTC CGGCAGCGTG
ATCCAGGACT CTGCCGGCAC CGACGCGAGC AGTTTCCAGA ACCAGGCCGT CACCAACAAC
ACGCCCGCGC CTGCGCCCGT AACGGTCAAG GGCGCCACGG TCAACGGCAA CCAGTTGGTG
GTGGCCTTCA ACGCCAGCAA TGATCTGGAC CTGACCGCCA TCACGGGCAA CCCGGGTTTT
ACGGTGGCCA GCACCACCGC CGGCAGCGCC GCCATCACCG TCAGCAGCGT CCGGGTGAAT
GCGGACAAGA CCGTCACACT GACGCTATCG CGCGCCGTGG TCAATGGCGA GACGGTGACC
GTCAGCTACA CCGATGCCGC CGGCGACGGC GCCTCCGGCA GCGTGATCCA GGACTCTGTC
GGCACCGACG CGAGCAGTTT CCAGAACCAG GCCGTCACCA ACAACACGCC CGTGATCCCA
CCCGTGACGG TCAGGGGCGC CACGGTCAAC GGCAACCAGT TGGTGGTGGC CTTCAACGCC
AGCAATGATC TGGACCTGAC CGCCATCACG GGCAACCCGG GTTTTTCGGT GGCCAGCACC
ACCGCCGGCA GCGCCGCCAT CACCGTCAGC AGCGTCCGGG TGAATCCGGA CAAGACCGTC
ACGCTGACGC TATCGCGCGC CGTGGTCAAC GGCGAGACGG TGACCGTCAG CTACACCGAC
GCCGCCGGCG ACGGCGCCTC CGGCAGCGTG ATCCAGGACT CTGCCGGCAC CGACGCGAGC
AGTTTCCAAA ACCAGGCCGT CACCAACAAC ACGCCCGCGC CCGCACCCGT GACGGTCAAG
GGCGCCACGG TCAACGGCAA CCAGTTGGTG GTGGCCTTCA ACGCCAGCAA TGATCTGGAC
CTGACCGCCA TCACGGGCAA CCCGGGCTTT GCGGTGGCCA GCACCACCGC CGGCAGCGCC
GCCATCACCG TCAGCAGCGT CCGGGTGAAT GTGGACAAGA CCGTCACGCT GACGCTATCG
CGCGCCGTGG CCAACGGCGA GACGGTGACC GTCAGCTACA CCGACGCCGC CGGCGACGGC
GCCTCCGGCA GCGTGATCCA GGACTCTGCC GGCACCGACG CGAGCAGTTT CCAGAACCAG
GCCGTCACCA ACAACACGCC CGCGCCTGCG CCCGTGACGT TCAGAGGCGC CACGGTCAAC
GGCAACCAGT TGGTGGTGGC CTTCAGCGCC AGCAATGATC TGGACCTCAC CGCCATCACG
GGCAACCCGG GCTTTGCGGT GGCCAGCACC ACCGCAGGCA GCGCCGCCAT CACCGTCAGC
AGCGTCCGGG TGAATGCGGA CAAGACCGTC ACGCTGACGC TATCGCGCGC CGTGAACAAC
GGCGAGACGG TGACCGTCAG CTACACCGAC GCCGCCGGCG ACGGCGCCTC CGGCAGCGTG
ATCCAGGACT CTGTCGGCAC CGACGCGAGC AGTTTCCAGA ACCAGGCCGT CACCAACAAC
ACGCCCGCGC CCGCGCCCGT GACGGTCAAG GGCGCCACGG TCAACGGCAA CCAGTTGGTG
GTGGCCTTCA ACGCCAGCAA CGATCTGGAC CTGACCGCCA TCACGGGCAA CCCGGGCTTT
GCGGTGGCCA GCACTACCGC CGGCAGCGCC GCCATCACCG TCAGCAGCGT CCGGGTGAAT
GCGGACAAGA CCGTCACGCT GACGCTATCG CGCGCCGTGG CCAACGGCGA GACGGTGACC
GTCAGCTACA CCGACGCCGC CGGCGACGGC GCCTCCGGCA GCGTGATCCA GGACTCTGTC
GGCACCGACG CGAGCAGTTT CCAAAACCAG GCCGTCACCA ACAACACGCC CCCGGTGTGC
ACCGGCGCCA CGGTCAGCGG CAACCAGTTG GTGCTGCGCT TCGACCTGGC CGGCAACCTG
GTCACGACCG GCGTGCCGAA CAGCGCCTTT GAACTGGTCG TCGGCTCCGG TAGCCAGCCG
CTGAGCGTCA CGGCCATCGG CGCCTTCAAT GCGACGGACA AGACCCTCAC GCTGACGCTC
AGCCGCGCCG TGACCCCCGG CGAGACGGTG AGCATCCGCT ACACCGACCC GAACCCCGAC
AGCAACGAAG GCAGCGGCGC GCTGGAAGAC AGCGCCAGCC GTGACGTGCC CACCTTCGTC
AAGGAGGTGA CCAACAACAC GCCCGCGACG CCCCCGGCGT TCAGCAGGGC CGAGGTCAAC
GGCAACCAGT TGGTGGTGGC CTTCACCGCC AGCAACGACC TGGACCTTGC CGCCATCACG
GGCAACCCGG GCTTTGCGGT GGCCAGCACC ACCGCCGGCA GCGCCGCCAT CACCGTCAGC
AGCGTCCGGG TGAATGCGGA CAAGACCGTC ACGCTGACGC TATCGCGCGC CGTGAACAAC
GGCGAGACGG TGAAAGTCAG CTACACCGAC GCCGCCGGCG ACGGCGCCTC CGGCAGCGTG
ATCCAGGACT CTGCCGGCAC CGACGCGAGC AGTTTCCAGA ACCAGACCGT CACCAACAAC
ACGCCCCCGG TGTGCACCGG CGCCACGGTC AGCGGCAACC AGTTGGTGCT GCGCTTCGAC
CTGGCCGGCA ACCTGGTCAC GACCGGCGTG TCGAACAGCG CTTTTGAACT GGTCGTCGGC
TCCGGTAGCC AGCCGCTGAG CGTCACGGCC ATCGGCGCCT TCAATGCGAC GGACAAGACC
CTCACACTGA CGCTCAGCCG CGCCGTGACC CCCGGCGAGA CGGTGAGCAT CCGCTACACC
GACCCGAACC CCGACAGCAA CGAAGGCAGC GGCGCGCTGG AAGACAGCGC CAGCCGCGAC
GTGCCCACCT TCGTCAAGGA GGTGACCAAC AACACGCCCG CGACGCCCCC GGCGTTCAGC
AGGGCCGAGG TCAACGGCAA CCAGTTGGTG GTGGTCTTCA CCGCCAGCAA CGACCTGGAC
CTTGCCGCCC TGACAGGCAG CGCAGGTTTT GTGGTGACCG GCGCCAACAA TAGCTCCATC
ACCGTCAGCA GCGTCCGGGT GAATGCGGAC AAGACCGTCA CACTGACGCT GGCCCGCGCC
GTGAACAACG GCGAGACGGT GAAAGTCAGC TACACCGACG CCGCCGGCGA CGGCGCCTCC
GGCAGCGTGA TCCAGGACTC TGCCGGCACC GACGCGAGCA GTTTCCAGAA CCAGGACGTC
ACCAATAACA CGCCCCCGGT GTGCTCCAGC GCCACGGTCA ACGGCAACCA GTTGGTGCTC
CATTTCCCCA ATGCCGGCAG CCTGAGCAAG GCCGGCGTGC CGATCACCGC TTTTGCGCTG
TCCGTCGATG CCGGTGGCCA AGCCCTGAGC GTCACGGCCA TCGGCGACTT CAATGCGACC
AGCAAGACCC TCACGCTGAC GCTCAACCGC ACCGTGGCCA ACGGCGAGAC GGTGCGCATC
CGCTACACCG ACCCGACCCC CGGCAACGAC AGCAACGTGC TGCAGGACGC CACCACCGAC
GGTCGCGATG TGCCCTCCTT CGACATGGCG GCGATCAACA ACACGCCCGC GACGCCCCCG
GCGTTCAGCA GGGCCGAGGT CAACGGCAAC CAGATGGTGG TGACCTTCAC CGCCACCGAC
GGACTGGACA CTACCGCCCT GCCTCCGGGC AACGCGGGCT TTACGGTGGC CAGCGGCACC
ACCGGCAGCG CCGCCATCAC CGTCAACAGC GTCCGGGTGA ATGCGGACAA GACCGTCACG
CTGACACTGA GCCGCGCCGT GGCCCACGGC GAGACGGTGA CCGTCAGCTA CACCGACCCG
AACCCCACCG TCAACGACGA CTCCGGCGTG ATACAGGACA CGACCCCTGC CCACACCGAC
GCGAGCAGTT TCCAGAACCA GGCCGTCACC AACAACACCC CGCCGGTGTG CACCGGCGCC
ACGGTCAGCG GCAACCAGTT GGTGCTGCGC TTCGACCTGG TCGGCAACCT GGTCACGACC
GGCGTGTCGA ACAGCGCTTT TGAACTGATC GTCGGCTCCG GTAGCCAGCC GCTGAGCGTC
ACGGCCATCG GCGCCTTCAA TGCGACGGAC AAGACCCTCA CGCTGACGCT CAACCGCGCC
GTGGCCAACG GCGAGACGGT GAGCATCCGC TACACCGACC CGAACCCCGA CAGCAACGCC
GGCAGCGGCG CGCTGGAAGA CAGCGCCAGC CGCGACGTGC CCTCCTTCGA AAAGAACGTG
GTCAACCACA CGGCCGCGCC GCCCCCGGTG CTCACCAGCG CCTCGGCCAA CGGCCGTGAG
TTGGTGCTCC AGTACTCCGC AGAGAGAAAC CTGGACGGAC AAAACAAGGC CGCCGCCGCA
GACTTTGCGG TGACCGTCAA TGGGGTCGCC AACGCCGTCA CCGAAGTCGT CGTGCATCCA
CAGAACAAGA CCGTTACGCT GAAGCTGACC ACCCCCGTGC CCGCCGGCGC GGTGGTGGAG
GTCACCTACA ACAAGCAAGC CACCGGCAAC AACGTCATAC AGGACGAGGG CGGCACCGAC
GCCGCCAGTT TCACGACCTC GCCGACGGTA AACACCGGAC CGGACGAGAC GCCACCGACC
ATCGACCGCG CCGAGGTCAC CGGCAACAGC CGCAACCAGT TGCTGCTCAG GTACGACGAA
GCGAACCTTC TGCACGCAAA CAGCGGAGCA GGCAACAATG CCTTTACGGT GACCGTCAAT
GGGCAGACCA ACGCCGTCAC CGGGGTCACC GTGGACAGGG CGGCCAAAAC CGTCACGCTG
GCGCTGACCT CCGCCGTGGC CGCCGGGGCG CAGGTGAGCG TCCAGTACAC CCAGCCCGCC
ACTGGCAGCA GCATCAAGGA CGCCTATGGC AATCCGGCCC CCACACAGAC GCTCACGGCG
GTGGACAGCG GCAGCGATGA CACGCCCCCG CTGCTGATCA CCGATCTGAC CGATGCCGCC
CGCCGCCCCC AGGTCACCGG CAACGGCACC CAGGTGACGC TCACCTACAC CGAAGCGAAC
CTCTTGGACG AGGTCAACAA GCCGCTGCCC AGCGCCTTTT CCGTCACCGT CAATGGAGAC
CCCAGGACGG TCACGAATGT CACCGTGAAC CGGACGGCCA AAACCGTCAC GCTGACGCTC
AGCGGCGCCG CCGTGGCCGA GGGTGCGCGA GTGCGCCTGA CCTACACCGA TCCCACGGCC
GGCGACGACA CCGCCGCGAT ACAGGACGCG CGCGGCAACG ATGCCGCCAG CACCACGACG
CCGATCGAGG TGTACAACGG AACCGACAAC ACGCCCCCGC TGCTGATCAC TACGGGCGCT
GACCGCCCCA AGGTCAGCGG CCGCGAACTG ACGCTCAGCT ACAGCGACGT GAACCTTCTG
GACACGGTCA ACAAGCCGGC CCCCGGTGCC TTTACGGTGA CCGTCAACGG ACGCAACAAC
GTCGTCACCG CAGTCAACGT GCATGCGACG AACCGAACCG TCACGCTGAC ACTGACCGAC
CTCGTGCCCG AAGGGGCGGT CGTGCGCCTG TCCTACGCCG ATCCGACGAC CGGCAACGAC
ACCGCCGCGA TACAGGACGT GGTGGGCAAT GACGCAGCCA GCATCACGAA CCTCAGCGTG
GACAGCGGAC AGGACAGCAC GCCCCCGATG CTGATCACCA ACGGCCCTGG CGCCCCGACG
GTCACCGGCA CCACCCGCAC CCAACTGACG CTCACGTACA CCGAAGACAA CCTGATGGAC
GGGAACGCCA CCGGCCTCAA GGATGCCTAT ACGGTGCTGG TCAATGGAGA ACGCGCCGAG
ATCGTCAGCC ACGCGGTGAA TTCGACGGAC AAAACCGTCA CGCTGACGCT GCGCGACCCC
GTGCCCGTGG GTGCGCAAGT GCGCCTGACC TACACCGATC CCTCGACGGG CGCCAACGAC
ACCAGTGCGA TACAGGACGC GGCAGGCAAT GACGCCCCCA GCACCAACTC GCCGGTCGAT
GTGGCCAGCG GAACGGACAG CACCGCCCCG GTGCTGATCA CCAACGGCGC TGGCGCCCCC
GAGGTCACCG GCGACGCCCG TACCCAACTG ACGCTCACCT ACACCGAAGC CAACCTGCTG
GACGATGAGC ACAAGCCGGT TCCGGGGGCT TTTACGGTGA CCGTCAATCG CGCGAACAAC
CAGGTCACCG CAGTCAGCGT GAATGCAACG AACAGAACCG TCACGCTGAC GCTGACCGAC
CCCGTGCCCA GAGGTGCGGA GATGACCGTC GCCTACACCA AGCCCGCGAC CGGCGATGTC
CTGCAAGACA AGGCGGGCAA CCCCGCCGCC AACACCCCGG CGACCACGGT GAACAGCGGA
GAAGACACCA CGCCCCCGCA ACTGCAACAG ATCGCCTCCC CGAGCCTCAA CACCACCCCC
AAGGTCGTCG ACGACAAACT GACGCTCACC TACACCGACA CCAACCTGCT CGACGATCGC
GACGGCAGCA AGCCACTCGA GACGGCCTTC AGGGTGATGG TCAGCGGCAC CCAAGTGAAT
GTCCGCAACG TCGCCGTGAA TGCAACGGCC AAAACCGTCA CGCTGACGCT GGACCGCGCC
GTGGCCCGCG GCGAGGTCGT GACCGTCGAG TACACCGACC CGACCACCGG CAACGACACC
CGCGCCATAC AGGACGCGAA AGGCAATGAC GCCGCCAGCA CCGGGCCGAT CCCCGTCGAC
AGCGGAGTGG ACAACACGGC CCCGGTGCTG GTCACCACGG GCACGCACCG CCCCAAGATC
TCCGACAACG CCCGCAACCA GTTGGTGCTC ACCTACACCG ACGCGAACAA TCTCCATGAA
TCCAACAAGG CGCCCGGGAG CGCCTTCGAG GTGATCGTCA ATGGAGTCTC CAATGAAGTC
ACCAACGTTT CCGTGGTGGG ATTGAACAAA ACCGTCACGC TGACGCTGCG CGACCCGGTG
CCGCTCGGTG CGCACGTGAC CGTCAGATAC ACCAAGCCCG ATGGCACCAC GACCGTCATA
CAGGACGAGG CTGGCAACGA CGCCCTCAGC ACCAGCTCGC CGGTCGATGT GGCCAGCGGA
CAGGACCAGA CCGCCCCGCA ACTGATCACC ACCGGCGATG ACGCCCCCAG GATCGACGGC
AACCGACTGA CGCTCAGCTA CCGCGAAGAC AACCTGCTGG ACAATCGCGA CGGGCACAAG
CCAGGCGGGA CGGCCTTCAC GGTGCTCGTC AATGGCGAGC GCAATGCCGT CACCGGGGTC
GAGGTGAGTG CAACGAACAA AACCGTGACG CTGATACTGA CCCGCGCCGT GACCGGCGGC
GAGACCGTGA CCGTCGCGTA CACCGCCCCG ACCGACAGCA ACGACCCGCG CGACATACAG
GACGCGGCAG GCAATGACGC GGCCAGCATC CCGCAGACCA ACGTGCGCAA CGCTCCCGAC
AGCACGGCCC CGGTGCTGAT CACCGAGGGC GACAACCGCC CCATCATCAC CGGCGCCGCC
CGCACCCAAC TGACGCTCAC CTACACCGAA GCCAACACGC TGGACCCGGA GAACAAGGCG
AGCCCCGGCG CCTTTGCCGT GCTCGTCAAT GGAGTTCGCG CCGAAGTCAC CGAAGTCACC
GTGGACGCAG CGGGCAAGAC CGTCATACTG ACGCTGCGCA CCCCCGTGCC CGAGGGCGCG
CAAGTGACCG TCACCTACAC CAAGCCGGCG AGCGACACCA TCAACGCCAT ACAGGACCGG
ACAGGCAATG ACGCCGCCAG CACGACGAGC CCGGTGGCCG TCAGAAGCGG AACGGACAGG
ACGGCCCCGG TGCTGATCAC CGCAGGCGAC AACGCCCCCC GGGTCCGCGG CCGCGAACTG
ACGCTCACCT TCGCCGAAGA CAACCTGCTG GACGCTGCAA ACAAGCCGGA AATCAACGCC
TTTTCGGTGA GCAGCAATGG GGCCGACATC GGCGTGCTGG ATGTGAGCGT GAACGCACAG
GACAGAACCG TCACGCTGAC ACTGGCGCGC GCCGTAAGCT CCGGCGATAA GGTGTCCGTC
CGCTACACGG ATCGCACGAC CGGCAACGAT ACCGACGCGA TACAGGACGC CACCGGCAAT
GACGTGAGCG ACTTCACGGT CACATCCGTG GTCAACCGCA CGCCCGCTCC CGCGACGCCG
ACGACACCGA CGACGCCGAC CGATCGGCCG GACGGTGACC GCGACGGCAT ATCGAATGAG
CAGGAGGACG CGGCGCACGG CCTTGCCCGC GCGGATGGCA CCACCGTCGC CGGCGACGGC
AACGGCGACG GCATCAAAGA CAGCGAACAA TCGGCAGTCA GTTCGATCAA CGGCATGACC
CTGGTGGCCG GCAGCCAGAA CGGCAAGATC AAACCCGGCA ACCAGACGCA GATATCGAAC
ATGCTCCACA CGCGCGAGGC CCCCGCCGAT TTGCCCAAGG GTCTGGAGAT GCCGATGGGG
ACACTGCACT TCGACGCGAC GATACCCACC GCCGGCGGCA GCGAGAGCTT CAGCCTGTAT
GTGGACCCGG CGCGCGGCAT CAACGGGTAC TGGGTCAAAG ACCAAACCGG TGTCTGGGTC
AACCTGGCCA GCGCACCCTA CGGCGGCCAG ATGGTGATGG AGGGCGATCG ATTGCGGCTG
GACTTTCAGA TCACCGATGG CGGGCAGTTC GATGCCGACG GCCAGGCCAA TGGCGTCATC
GCCATCCCCG GCGGCGCTGC AGCGCAAATG CAGTTGTCCA TCGTCGGGCA GTCGCCCACG
GCGGAGCAAC ACGGGTTTTG GTTCTGA
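
The GC content listed under Gene Information can be rechecked against the gene sequence above. The sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed first. The helper names here are hypothetical, and the sketch assumes the sequence contains only A, C, G and T:

```python
def strip_blocks(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty input)."""
    bases = strip_blocks(seq)
    if not bases:
        return 0.0
    return sum(base in "GC" for base in bases) / len(bases)


# Toy input; paste the full gene sequence here to compare with the reported value.
print(gc_fraction("GGCC AATT"))  # 0.5
```

Multiplying the result by 100 and rounding gives a percentage in the same form as the GC content field.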
 
Protein sequence
MSSLTNISLN NGNATLKTGY SYTVTLTFDS RVGGVTAANV VVPPGTMLTT PTASGPDSNG 
RSTTWQFELT PPANTERTND TLGMSLTGVT DQNGSVPTNN AASVTYTVDT IKPTLQSAKV
TGNQLVLTYN ENLDTSAANA PLADRFVVTA DDKSVLANGT RINVTGSPVV NGRTVTLTLA
SAVASGQQVK VSYRDSAPGD DTRAIQDTAG NDAASFSNQV ATNETPPVIR SATVNGTQLV
LSYTTLSGLD AVHPPPASAF AVTSAGTTTP IGVSNVSVDA VNNTVTLTLN RPVANGETVS
VNYTRPATGD NVIQDAAGND AANIQNRAVT NETPPVCTSA TVTGNQLVLR YDTDNLDETE
AGRPSAGAFE VLIDGTTRAS VQVVTVNSAN KTVTLTLDRT VTQGQKVTVA YTDPTNGNDP
RAVQDAAGND AASFSARPVT NDTTAPVLAE ATVDRNQLVL RYTENDRLDT VNTAPATAFT
VTAGGQPVTV SSVLVDATAK TVTLTLASNV AANQAVTVAY HAPTTGNNAI QDAAGNDAAN
FEARAVDNIT TRFACTKAEV DRDKLVLSFP QGTELDAIKP IATAFALSSS GGLSVIDTMV
DTANKTVILT LSRAVANGEA VTITYTDPAG DNTTGVIQSS SGADLPTFTR DVINRTGPRM
DDVRVNGDKV TITYLTNSLD AFSQVAATAF EVQTGGTAPE TIGIKGIQVD AAAKTITLTL
ARAVVKGEQV TVSYTDPTDG NDRNAVQDTI GNDAKSESLP ATNDTPRAST LDSIAISDPN
LKAGETAIVT LTFNTAVDGL TAANLVLPAG TAVSNVRAVN GATDSDGRLR SATWQFELTP
PANTESTDNR INVNLSGVKD ANGNAVTNNA PAAAYTVDTR APTFTSATVN GDQLELVYSE
ALDGTNKPEI GRFIVNIDGR DQADGGVRAV AVDGRKVILT LTTPVTSGQQ VKVTYTDPSF
SATPSNDTGD DALAIQDAAG NDAANVLRQP VTNTTPPTFR SATVNGDKLV LRFDTQSGLD
AVNPAPATAF DVTNANGTSI RVLSVRVDAT AKTVTLTLER SVPRSETVFV AYRDPTPGND
TNAIQDAAGN DMADIARQQV TSETPAPTVT YKSSTVDGNQ LVVYFSASYD LDLTALTGSA
GFTVGGANNS PIAVSSVRVN ADKSVTLTLA RAVVRGEQVT VRYTDPRPSV DDPAGTVIQD
RNGTDATSFE VQTVTNNTPE LPSVTVRDAT VNGNQLVVAF NASNDLDLTA ITGNPGFTVA
STTAGSAAIT VSSVRVNADK TVTLTLSRAV ANGETVTVSY TDAAGDGASG SVIQDSAGTD
ASSFQNQAVT NNTPAPASVT VRDATVNGNQ LVVAFNASND LDLTAITGNP GFAVTGANNS
PITVSSVRVN PDKTVTLTLA RAVTNGETVK VSYTDAAGDG ASGSVIQDSV GTDASSFQNQ
AVTNNTPAPA PVTVKGATVN GNQLVVAFNA SNDLDLTAIT GNPGFTVAST TAGSAAITVS
SVRVNADKTV TLTLSRAVVN GETVTVSYTD AAGDGASGSV IQDSAGTDAS SFQNQAVTNN
TPAPAPVTVK GATVNGNQLV VAFNASNDLD LTAITGNPGF TVASTTAGSA AITVSSVRVN
ADKTVTLTLS RAVVNGETVT VSYTDAAGDG ASGSVIQDSV GTDASSFQNQ AVTNNTPVIP
PVTVRGATVN GNQLVVAFNA SNDLDLTAIT GNPGFSVAST TAGSAAITVS SVRVNPDKTV
TLTLSRAVVN GETVTVSYTD AAGDGASGSV IQDSAGTDAS SFQNQAVTNN TPAPAPVTVK
GATVNGNQLV VAFNASNDLD LTAITGNPGF AVASTTAGSA AITVSSVRVN VDKTVTLTLS
RAVANGETVT VSYTDAAGDG ASGSVIQDSA GTDASSFQNQ AVTNNTPAPA PVTFRGATVN
GNQLVVAFSA SNDLDLTAIT GNPGFAVAST TAGSAAITVS SVRVNADKTV TLTLSRAVNN
GETVTVSYTD AAGDGASGSV IQDSVGTDAS SFQNQAVTNN TPAPAPVTVK GATVNGNQLV
VAFNASNDLD LTAITGNPGF AVASTTAGSA AITVSSVRVN ADKTVTLTLS RAVANGETVT
VSYTDAAGDG ASGSVIQDSV GTDASSFQNQ AVTNNTPPVC TGATVSGNQL VLRFDLAGNL
VTTGVPNSAF ELVVGSGSQP LSVTAIGAFN ATDKTLTLTL SRAVTPGETV SIRYTDPNPD
SNEGSGALED SASRDVPTFV KEVTNNTPAT PPAFSRAEVN GNQLVVAFTA SNDLDLAAIT
GNPGFAVAST TAGSAAITVS SVRVNADKTV TLTLSRAVNN GETVKVSYTD AAGDGASGSV
IQDSAGTDAS SFQNQTVTNN TPPVCTGATV SGNQLVLRFD LAGNLVTTGV SNSAFELVVG
SGSQPLSVTA IGAFNATDKT LTLTLSRAVT PGETVSIRYT DPNPDSNEGS GALEDSASRD
VPTFVKEVTN NTPATPPAFS RAEVNGNQLV VVFTASNDLD LAALTGSAGF VVTGANNSSI
TVSSVRVNAD KTVTLTLARA VNNGETVKVS YTDAAGDGAS GSVIQDSAGT DASSFQNQDV
TNNTPPVCSS ATVNGNQLVL HFPNAGSLSK AGVPITAFAL SVDAGGQALS VTAIGDFNAT
SKTLTLTLNR TVANGETVRI RYTDPTPGND SNVLQDATTD GRDVPSFDMA AINNTPATPP
AFSRAEVNGN QMVVTFTATD GLDTTALPPG NAGFTVASGT TGSAAITVNS VRVNADKTVT
LTLSRAVAHG ETVTVSYTDP NPTVNDDSGV IQDTTPAHTD ASSFQNQAVT NNTPPVCTGA
TVSGNQLVLR FDLVGNLVTT GVSNSAFELI VGSGSQPLSV TAIGAFNATD KTLTLTLNRA
VANGETVSIR YTDPNPDSNA GSGALEDSAS RDVPSFEKNV VNHTAAPPPV LTSASANGRE
LVLQYSAERN LDGQNKAAAA DFAVTVNGVA NAVTEVVVHP QNKTVTLKLT TPVPAGAVVE
VTYNKQATGN NVIQDEGGTD AASFTTSPTV NTGPDETPPT IDRAEVTGNS RNQLLLRYDE
ANLLHANSGA GNNAFTVTVN GQTNAVTGVT VDRAAKTVTL ALTSAVAAGA QVSVQYTQPA
TGSSIKDAYG NPAPTQTLTA VDSGSDDTPP LLITDLTDAA RRPQVTGNGT QVTLTYTEAN
LLDEVNKPLP SAFSVTVNGD PRTVTNVTVN RTAKTVTLTL SGAAVAEGAR VRLTYTDPTA
GDDTAAIQDA RGNDAASTTT PIEVYNGTDN TPPLLITTGA DRPKVSGREL TLSYSDVNLL
DTVNKPAPGA FTVTVNGRNN VVTAVNVHAT NRTVTLTLTD LVPEGAVVRL SYADPTTGND
TAAIQDVVGN DAASITNLSV DSGQDSTPPM LITNGPGAPT VTGTTRTQLT LTYTEDNLMD
GNATGLKDAY TVLVNGERAE IVSHAVNSTD KTVTLTLRDP VPVGAQVRLT YTDPSTGAND
TSAIQDAAGN DAPSTNSPVD VASGTDSTAP VLITNGAGAP EVTGDARTQL TLTYTEANLL
DDEHKPVPGA FTVTVNRANN QVTAVSVNAT NRTVTLTLTD PVPRGAEMTV AYTKPATGDV
LQDKAGNPAA NTPATTVNSG EDTTPPQLQQ IASPSLNTTP KVVDDKLTLT YTDTNLLDDR
DGSKPLETAF RVMVSGTQVN VRNVAVNATA KTVTLTLDRA VARGEVVTVE YTDPTTGNDT
RAIQDAKGND AASTGPIPVD SGVDNTAPVL VTTGTHRPKI SDNARNQLVL TYTDANNLHE
SNKAPGSAFE VIVNGVSNEV TNVSVVGLNK TVTLTLRDPV PLGAHVTVRY TKPDGTTTVI
QDEAGNDALS TSSPVDVASG QDQTAPQLIT TGDDAPRIDG NRLTLSYRED NLLDNRDGHK
PGGTAFTVLV NGERNAVTGV EVSATNKTVT LILTRAVTGG ETVTVAYTAP TDSNDPRDIQ
DAAGNDAASI PQTNVRNAPD STAPVLITEG DNRPIITGAA RTQLTLTYTE ANTLDPENKA
SPGAFAVLVN GVRAEVTEVT VDAAGKTVIL TLRTPVPEGA QVTVTYTKPA SDTINAIQDR
TGNDAASTTS PVAVRSGTDR TAPVLITAGD NAPRVRGREL TLTFAEDNLL DAANKPEINA
FSVSSNGADI GVLDVSVNAQ DRTVTLTLAR AVSSGDKVSV RYTDRTTGND TDAIQDATGN
DVSDFTVTSV VNRTPAPATP TTPTTPTDRP DGDRDGISNE QEDAAHGLAR ADGTTVAGDG
NGDGIKDSEQ SAVSSINGMT LVAGSQNGKI KPGNQTQISN MLHTREAPAD LPKGLEMPMG
TLHFDATIPT AGGSESFSLY VDPARGINGY WVKDQTGVWV NLASAPYGGQ MVMEGDRLRL
DFQITDGGQF DADGQANGVI AIPGGAAAQM QLSIVGQSPT AEQHGFWF
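
The protein sequence above can be cross-checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, so the sketch below uses the standard codon assignments. It does not handle alternative start codons such as GTG or TTG, which does not matter here because this gene begins with ATG. The function names are illustrative only.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGCAGCC TGACCAACAT CTCCTTGAAC"))  # MSSLTNISLN
```

The printed residues match the first block of the protein sequence. Running the same function on the full gene sequence and comparing the result, with spaces removed, to the full protein sequence extends the check to the whole record.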