Gene Veis_1357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_1357 
Symbol 
ID4691255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp1507139 
End bp1521631 
Gene Length14493 bp 
Protein Length4830 aa 
Translation table11 
GC content66% 
IMG OID639849128 
Productouter membrane protein 
Protein accessionYP_996142 
Protein GI121608335 
COG category 
COG ID 
TIGRFAM ID[TIGR02059] cyanobacterial long protein repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.381537 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCACATC GAAAGGAAAG CACCATGGCC ACCACCATCA CCATCGACAG GCAATACGTC 
AAAATCGGTG AGACTGCCGA GATCAGGTTC ACTTTCACCA ATCCCAATTA CTCCCTCTCC
CTCGAAGGCC TGACCGTCAC CACCCTCATC GGAGAGACCA CGGGCGGCAC CGTACACAGC
CTCGTCAATC GTGGCGTCAT CAACGGTCAG CGTGTCTATA CCGCCACATT CACGCCTACC
GCGAACCTGC AAAGATCCAT CTACCGCCTC ACCTACGATG TTGCCGGCAA AGCCGATGAC
GCAATCAGCG AGAACTTCGC CGTTGACACC GTGCGGCCCA CCCTTGAAAG TGCCTCGATC
GCCAAAAGCG ATCTGCGCCT CGGGGAAAAA ACCACCATCA CCATCACCTT CAGCGAACGG
CTGGCCTTCT ACGACAACTT CACACTCGAA GACCTGCAGG TGGACGCCGG CAAGGGCACT
TTGAGCAACC TGCGCTCCTA CTTCTCCGAC CAAGAGCGCA TTTGGCATGT CGACCTGCAG
GCCCCTACTA CCCGGCCCGC GTCGGGCCTC GATGGCAACC AGATACGGAT CAACCTCGCC
GGCATCACCG ATCGCGCAGG CAACACAATC TGGGACAGTC AGTTGCACGG GACGGGCAAC
CCTTGGGTGA ACCTCCCGAC TGTCAGCTAC AACATCGACA ACGGTGTGCC GCCCATGGTC
GCCATCACCG GGCAGCCTGC CACCACCATC AAGGGCGGCG ATTCCTTTAC CGTCACCTTC
ACCTTCAACG AGCGCGTCAC CGGCTTCGAT CTCGACGACG TTCAGTACGA CACCAGCAAA
GGCACACTGA GCGCTCTGAC GGCGGTCGGC ACCGACGGCA GGGTCTGGAC CGCCACCTAC
ACGCCCCGGC CCAACATCGA GAGCGCCGAA AACACCATCA GCGTGAACCT CGCCGGCGTC
CGGGACGCGC TGGGCAACGC CGGGGTGGGC ACCGGCACCA GCGGCAACTT CAGTATCGAC
ACCAAGCCCC CCGAGGTCGC GGTGACGATC AGCGACGAGC GCCTGACCGC TGGCGAAAGC
GCCACCGTCA CCTTCACCTT CAGCGAGCGC GTCACCGGCT TCGATCTCAA CGACGCTCAA
TACGACACCA GCAAAGGCAC GCTGGGCGCT CTGACGGCGG TCGGCACCGA CGGCAAGGTC
TGGACAGCCA CCTACACGCC CCGCAGCGAC ATCGAGAGCG CCAACAACAC CATCCGCGTG
AACCTGGCCG GCGTCCAGGA CGCGCAGGGC AACGCCGGAG TGGGCACCGG CAGCAGCGGC
AACTTCGTCA TCGACACCAG GCCCACCGTG GTCGACGGGC GGCCCAGCAT CGTCTCTGTG
GTGGGCCCCA CCTCCATCGT CTTGGAGGAC ACCGACGTCA CCATTACCTT CACCTTCAGC
GAGGCAGTCA CCGGCTTCAC GTTGGCCAAC ATCAATTTGG ACAACTCCAG TGCCTCTCCC
TACATCACCT ACAGCCCGAA AGAGCCGGTC AGCGCAGATG GTGGCCGCAC CTGGACCATC
ACCTTCCGTG CCGCTCCGCG TACCACGGAT TCCACCAACA CCGTCAGCAT CCGCAACCTC
GATGGCGTGC GTGACCTCGC AGGCAACCTC GCAGTGCCTA ACTCCAGCGC CAGCACCGAC
AACTACGAGG TCGATACCGA GGATCCCGAT CCCATCAGCG CCACGTTTGA CAAAACCCAC
CTTGCCGCCG GAGAAACCGC CACCGTCACC GTCACCTTCA ACGAGATCGT CAACAACGTC
ACCCAAGGCA CCTTTGATAT CCCCAACGGT TCTGTGAGCA ACCTGAGGCA GGACCCGACC
GACGGCAGAA TCTGGCGGGC CACCTTCACC CCCACGGCCA ACCTGCAGAG CGCCAGCAGC
TTGATCAGCA TCAATCTGAA CGACCTCAGG GACAGCGCCG GCAACGTCAG CAGTGGTCGC
AAGACGTTCT ACGACTCCAC CATCGTCATC GACACCAAGC CCCCCGAGGT CGCGGTGACG
ATCAGCGACG AGCGCCTGAC CGCTGGCGAA AGCGCCACCG TCACCTTCAC CTTCAGCGAG
CGCGTCACCG GCTTCGATCT CAACGACGTT CAGTACGACA CCAGCAAAGG CACGCTGAGC
GCTCTGACGG CGGTCGGCAC CGACGGCAAG GTCTGGAGCG CCACCTACAC GCCCCGGCCC
GACATCGAGA GCGCCGAAAA CACCATCCGC GTGAACCTCG CCGGCGTCCA GGACGCGCAG
GGCAACGCCG GAGTGGGCAC CGGCAGCAGC GGCAACTTCG TCATCGACAC CAGGCCGCCC
GTGGTCGACG GGCGGCCCAG CATCGTCTCT GTGGTGGGCC CCACCTCCAT CGTCTTGGAG
GACACCGACG TCACCATTAC CTTCACCTTC AGCGAGGCAG TCACCGGCTT CACGTTGGCC
AACATCAATT TGGACAACTC CAGTGCCTCT CCCTACATCA CCTACAGCCC GAAAGAGCCG
GTCAGCGCAG ATGGTGGCCG CACCTGGACC ATCACCTACC GTGCCGCTCC GCGTACCACG
GATTCCACCA ACACCGTCAG CATCCGCAAC CTCGATGGCG TGCGTGACCT CGCAGGCAAC
CTCGCAGTGC CTAACTCCAG CGCCAGCACC GACAACTACG AGGTCGATAC CGAGGATCCC
GATCCCATCA GCGCCACGTT TGACAAAACC CACCTTGCCG CCGGAGAAAC CGCCACCGTC
ACCGTCACCT TCAACGAGAT CGTCAACAAC GTCACCGAAG ATAGCTTTCA AATCCCCAAC
GGTTCGGTGA GCAACCTGAG GCAGGACACG ACCGACGGCA GAATCTGGAG GGTCACCTTC
ACCCCCACGG CCAACCTGCA GAGCGCCAGC AGCTTGATCA GCATCAATCT GAACGACCTC
AGGGACAGCG CCGGCAACGT CAGCAGTGGT CGCAAGTCGT TCTACGACTC CACCATCGTC
ATCGACACCA AGCGCCCCGA AGTCACGGTG GTGATCAGCG ACAACCGCCT GACCGCTGGC
GAAACCGCCA CCGTCACCTT CACCTTCAGC GAGCGCGTCA CCGGCTTCGA TCTCAACGAC
GTTCAGTACG ACACCAGCAA AGGCACGCTG GGCGCTCTGA CGGCGGTCGG TACCGACGGC
AAGGTCTGGA CAGCCACCTA CACGCCCCGG CCCAACATCG AGAGCGCCGA CAACACCATC
CGCGTGAACC TCGCCGGCGT CCAGGATGCG CAGGGCAACG CCGGAGAGGG CAGCGTCAGC
AGCGGCAACT TCAGCATCGA CACCAGGCCG CCCGAGGTCG ACAGGCCGCC CGAGGTCACG
GTGACGATCA GCGACAACCG CCTGACCGCT GGCGAAAGCG CCACCGTCAC CTTCACCTTC
AGCAAGAGCG TCACCGGCTT CACGAAAGAT GACATCGACT TGACCCTGGC CAACGGCACG
CTGGGCGACT TGGTGCCGGT GGGCACCGAC GGCAAGGTCT GGAGCGCCAC CTTCACGCCC
CGGCCCGACA CCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTCGCCGG CGTGCTGGAC
GCGCAGGGCA ACGCCGGGGT GGGCACCGGC ACCAGCGGCA ACTTCACTAT CGACACCAAG
CGCCCCGAAG TCGCGGTGGC GATCAGCGAC GAGCGCCTGA CGGCTGGCGA AACCGCCACC
GTCACCTTCA CCTTCACCGA GCGCGTCACC GGCTTCGATC TCAACGACGT TCAGTACGAC
ACCAGCAAAG GCACGCTGGG CGCGCTGACG GCAGTCGGTA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CGACACCGAA AGCGCCACCA ACACCATCCG CGTGAACCTC
GCCGGCGTGC TGGACGCGCT GGGCAACGCC GGGGTGGGCA CCGGCAGCAG CGGCAACTTC
GTCATCGACA CCAGGCCCAC CGTGGTCGAC AGGCCGCCCA GCGCCACCAT TGCCGTGACC
CCCAACCCCG TCACGAACAG CAATGACAGA CTGGTCACCG TCACCATCAC CTTCGACGAG
GCAGTCACCG GCTTCACGGC AGACAACATC GATTTCAGCA ACGCCCATGT CACGCCCTAT
GGCCGCAACC GGATAGGAGC GCTGAACAGC TCAGCCGACG GCCGCACCTA CACCATCACC
TACACGGCAG AGCCGGACGT CGAGGATGCC ACCAACACCA TCAGCCTGCG CAACCTCCAT
ACCATCCGTG ACGCCACAGG CAACGCCGTA GCGGTCAGCC CGACCAGCAA CAACTTCGCG
ATCGACACCA AGGCTCCCGT TCCCATCAGC GCCACGTTTG ACAAAACCCA CCTTGCCGCC
GGAGAAACCG CCACCGTCAC CGTCATCTTC AACGAGATCG TCAACAACGT CACCGAAGAT
ACCTTTCAAA TCCCCAACGG TTCTGTGAGC AACCTGAGGC AGGACACGAC CGACGGCAGA
ATCTGGAGGG CCACCTTCAC CCCCACGGCC AACCTGCAGA GCACCAGCAG CTCGATCAGC
ATCAATCTGG ACGGCCTCAG GGACAGCGCC GGCAACGTCA ACAATGGCAG CCTGCCGTTC
CGCGACTCCA CCATCGTCAT CGACACCAAG CGCCCCGAAG TCACGGTGGC GATCAGCGAC
GAGCGCCTGA CCGCTGGCGA AACTGCCACC GTCACCTTCA CCTTCAGCGA GCGCGTCACC
GGCTTCGATC TCAACGACGT TCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCAGTGGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGGCC CAACACCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTG GCCGGCGTCC AGGACGCGCA GGGCAACGCC
GGAGCGGGCA GCGTCAGCAG CGGCAACTTC AGCATCGACA CCAAGCGCCC CGAAGTCACG
GTGACGATCA GCGACAACCG CCTGGCCGCT GGCCAAACCG CCACCGTCAC CTTCACCTTC
AACGAGCGCG TCACCGGCTT CGATCTCAAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGTGCGC TGACGGCAGT GGGCACCGAC GGCAAGGTCT GGAGCGCCAC CTACACGCCC
CGCAGCGACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTCGCCGG CGTGCTGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAAG
CGCCCCGAGG TCACGGTGAC GATCAGCGAC GAGCGCCTGA GCGCAGGAGA AACCGCCACC
GTCACCTTCA CCTTCAGAGA GAGCGTCACC GGCTTTGGCA CCGAGGACAT CCAGTACGAC
ACCAGCAAAG GCACGCTGAG CGCTCTGACG GCGGTCGGCA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CAACATCGAG AGCGCCGACA ACACCATCCG CGTGAACCTC
GCCGGCGTCC AGGACGCGCA GGGCAACGCC GGAACGGGCA GCGCCAGCAG CGGCAACTTC
AGCATCGACA CCAAGCCGCC CGAAGTCACG GTGACGATCA GCGACGAGCG CCTGAGCGCT
GGCCAAACCG CCACCGTCAC CTTCACCTTC ACCGAGCGCG TCACCGGCTT TGGCACCGAG
GACATCCAGT ACGACACCAG CAAAGGCACG CTGGGCGCGC TGACGGCAGT CGGTACCGAC
GGCAAGGTCT GGAGCGCTAC CTACACGCCC CGCAGCGACA TCGAGAGCGC CGAAAACACC
ATCCGCGTGA ACCTCGCCGG CGTGCTGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGCGCC
AGCAGCGGCA ACTTCAGCAT CGACACCAGG CCGCCCGAGG TCACGGTGAC GATCAGCGAC
AACCGCCTGA GCGCTGGCCA AACCGCCACC GTCACCTTCA CCTTCAACGA GCGCGTCACC
GGCTTCGATC TCAACGACGT TCAATACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCAGTGGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGGCC CGACACCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTG GCCGGCGTGC TGGACGCGCA GGGCAACGCC
GGAACGGGCA GCGTCAGCAG CGGCAACTTC AGCATCGACA CCAAGCCGCC CGAGGTCACG
GTGACGATCA GCGACAACCG CCTGAGCGCT GGCCAAACCG CCACCGTCAC CTTCACCTTC
AACGAGCGCG TCACCGGCTT CGATCTCAAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGTGCGC TGACGGCAGT CGGCACCGAC GGCAAGGTCT GGACAGCCAC CTACACGCCC
CGGCCCGACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTCAGCGG CGTCCTGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAAG
CGCCCCGAGG TCACGGTGAC GATCAGTGAC AACCGCCTGA GCGCTGGCCA AACCGCCACC
TTCACCTTCA CCTTCAACGA GCGCGTCACC GGCTTCGATC TCAACGACGT TCAGTACGAC
ACCAGCAAAG GCACGCTGGG CGCGCTGACG GCGGTGGGCA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CGACACCGAG AGCGCCGACA ACACCATCCG CGTGAACCTC
GCCGGCGTGC TGGACGCGCA GGGCAACGCC GGAACGGGCA GCGTCAGCAG CGGCAACTTC
AGCATCGACA CCAAGCGCCC CGAAGTCACG GTGACGATCA GCGACAACCG CCTGAGCGCT
GGCCAAACCG CCACCTTCAC CTTCACCTTC AGCGAGCGCG TCACCGGCTT CGATCTCAAC
GACGTTCAGT ACGACACCAG CAAAGGCACG CTGGGCGCTC TGACGGCAGT CGGCACCGAC
GGCAAGGTCT GGACAGCCAC CTACACGCCC CGGCCCGACA CCGAGAGCGC CGACAACACC
ATCCGCGTGA ACCTGGCCGG CGTGCTGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC
AGCAGCGGCA ACTTCAGCAT CGACACCAAA CGCCCCGAAG TCACGGTGAC GATCAGCGAC
GAGCGCCTGA GCGCAGGAGA AACCGCCACC GTCACCTTCA CCTTCAGAGA GAGCGTCACC
GGCTTTGGCA CCGAGGACAT CCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCAGTGGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGCAG CAACATCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTG GCCGGCGTGC TGGACGCGCA GGGCAACGCC
GGAACGGGCA GCGTCAGCAG CGGCAACTTC AGCATCGACA CCAAGCGCCC CGAAGTCACG
GTGACGATCA GCGACAACCG CCTGAGCGCT GGCCAAACCG CCACCTTCAC CTTCACCTTC
AGCGAGCGCG TCACCGGCTT CGATCTCAAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGTGCGC TGACGGCAGT GGGCACCGAC GGCAAGGTCT GGAGCGCCAC CTACACGCCC
CGCAGCGACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTGGCCGG CGTGCTGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAAA
CCGCCCGAAG TCACGGTGAC GATCAGCGAC GAGCGCCTGA GCGCAGGAGA AACCGCCACC
GTCACCTTCA CCTTCACCGA GCGCGTCACC GGCTTTGGCA CCGAGGACAT CCAGTACGAC
ACCAGCAAAG GCACGCTGAG CGCTCTAACG GCGGTCGGTA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CAACATCGAG AGCGCCGACA ACACCATCCG CGTGAACCTC
GCCGGCGTCC AGGACGCGCA GGGCAACGCC GGAACGGGCA GCGCCAGCAG CGGCAACTTC
AGCATCGACA CCAAGCCGCC CGAAGTCACG GTGACGATCA GCGACGAGCG CCTCGCCGCT
GGAGAAACCG CCACCGTCAC CTTCACCTTC AACGAGCGCG TCACCGGCTT TGGCACCGAG
GACATCCAGT ACGACACCAG CAAAGGCACG CTGGGTGCGC TGACGGCAGT CGGCACCGAC
GGCAAGGTCT GGAGCACCAC CTACACGCCC CGGCCCAACA TCGAGAGCGC CGACAACACC
ATCCGCGTGA ACCTCGCCGG CGTCCAGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGTGCC
AGCAGCGGCA ACTTCAGCAT CGACACCAGG CCGCCCGAAG TCACGGTGAC GATCAGCGAC
GAGCGCCTGA GCGCAGGAGA AACCGCCACC GTCACCTTCA CCTTCACCGA GCGCGTCACC
GGCTTTGGCA CCGAGGACAT CCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCTCTGACG
GCGGTGGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGGCC CGACATCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTC AGCGGCGTGC TGGACGCGCA GGGCAACGCC
GGGGTGGGCA CCGGCACCAG CGGCAACTTC AGCATCGACA CCAAGCGCCC CGAAGTCACG
GTGACGATCA GCGACAACCG CCTGATCGCT GGCCAAACCG CCACCTTCAC CTTCACCTTC
AGCGAGCGCG TCACCGGCTT CGATCTCAAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGCGCTC TGACGGCGGT CGGCACCGAC GGCAAGGTCT GGAGCGCCAC CTACACGCCC
CGGCCCAACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTGGCCGG CGTCCTGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAAG
CGCCCCGAAG TCACGGTGAC GATCAGCGAC GAGCGCCTGA GCGCAGGAGA AACCGCCACC
GTCACCTTCA CCTTCAGAGA GAGCGTCACC GACTTTGGTA GCGAGGACAT CCAGTACGAC
ACCAGCAAAG GCACGCTGGG CGCTCTGACG GCAGTCGGCA CCGACGGCAA GGTCTGGACA
GCCACCTACA CGCCCCGGCC CAACATCGAG AGCGCCGACA ACACCATCCG CGTGAACCTC
GCCGGCGTGC TGGACGCGCA GGGCAACGCC GGAACGGGCA GCGCCAGCAG CGGCAACTTC
AGCATCGACA CCAGGCCGCC CGAGGTCACG GTGACGATCA GCGACGAGCG CCTGAGCGCA
GGAGAAACCG CCACCGTCAC CTTCACCTTC AGAGAGAGCG TCACCGGCTT TGGCACCGAG
GACATCCAGT ACGACACCAG CAAAGGCACG CTGAGCGCGC TGACGGCGGT CGGCACCGAC
GGCAAGGTCT GGAGCGCCAC CTACACGCCC CGGCCCGACA CCGAGAGCGC CGACAACACC
ATCCGCGTGA ACCTCGCCGG CGTGCTGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC
AGCAGCGGCA ACTTCAGCAT CGACACCAAG CGCCCCGAAG TCACGGTGAC GATCAGCGAC
AACCGCCTGA GCGCTGGCCA AACCGCCACC TTCACCTTCA CCTTCAGCGA GCGCGTCACC
GGCTTCGATC TCAACGACGT TCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCAGTCGGCA CCGACGGCAA GGTCTGGAGC GCCACCTACA CGCCCCGCAG CGACACCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTG GCAGGCGTCC AGGACGCGCA GGGCAACGCC
GGAACGGGCA GCGCCAGCAG CGGCAACTTC AGCATCGACA CCAAACGCCC CGAGGTCACG
GTGGCGATCA GCGACGAGCG CCTGAGTGCG GGAGAAACCG CCACCGTCAC CTTCACCTTC
AGAGAGAGCG TCACCGGCTT CGATCTCAAC GACGTTCAAT ACGACACCAG CAAAGGCACG
CTGAGCGCTC TGACGGCGGT CGGTACCGAC GGCAAGGTCT GGAGCGCCAC CTACACGCCC
CGGCCCAACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTGGCCGG CGTCCAGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAGG
CCGCCCGAGG TCACGGTGAC GATCAGCGAC GAGCGCCTGA GCGCTGGCCA AACCGCCACC
GTCACCTTCA CCTTCAGCGA GCGCGTCACC GGCTTTGGCA CCGAGGACAT CCAGTACGAC
ACCAGCAAAG GCACGCTGAG CGCTCTGACG GCGGTCGGTA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CGACACCGAA AGCGCCACCA ACACCATCCG CGTGAACCTC
GCCGGCGTCC AGGACGCGCA GGGCAACGCC GGAACGGGCA GCGCCAGCAG CGGCAACTTC
AGCATCGACA CCAAGCGCCC CGAAGTCACG GTGACGATCA GCGACGAGCG CCTGGCCGCT
GGCCAAACCG CCACCGTCAC CTTCACCTTC AACGAGCGCG TCACCGGCTT CGATCTCAAC
GACGTTCAGT ACGACACCAG CAAAGGCACG CTGGGCGCGC TGACGGCGGT GGGCACCGAC
GGCAAGGTCT GGAGCGCCAC CTACACGCCC CGGCCCAACA TCGAGAGCGC CGACAACACC
ATCCGCGTGA ACCTCGCAGG CGTCCAGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC
AGCAGCGGCA ACTTCAGCAT CGACACCAAA CGCCCGGAAG TCACGGTGAC GATCAGCGAC
GAGCGCCTGA GCGCAGGAGA AACCGCCACC GTCACCTTCA CCTTCAGAGA GAGCGTCACT
GGCTTTGGCA CCGAGGACAT CCAGTACGAC ACCAGCAAAG GCACGCTGAG CGCTCTGACG
GCGGTCGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGGCC CAACATCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTG GCCGGCGTCC AGGACGCGCA GGGCAACGCC
GGAACGGGCA GCGCCAGCAG CGGCAACTTC AGCATCGACA CCAGGCCGCC CGAAGTCACG
GTGACGATCA GCGACGAGCG CCTGAGCGCT GGAGAAACCG CCACCGTCAC CTTCACCTTC
ACCGAGCGCG TCACCGGCTT TGGCACCGAG GACATCCAGT ACGACACCAG CAAAGGCACG
CTGAGCGCTC TGACGGCAGT CGGCACCGAC GGCAAGGTCT GGACAGCCAC CTACACGCCC
CGGCCCAACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTGGCCGG CGTGCTGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAAA
CGCCCCGAGG TCACGGTGAC GATCAGCGAC GAGCGCCTGA GCGCAGGAGA AACCGCCACC
GTCACCTTCA CCTTCAGCGA GAGCGTCACC GGCTTTGGCA CCGAGGACAT CCAGTACGAC
ACCAGCAAAG GCACGCTGGG CGCTCTGACG GCGGTGGGCA CCGACGGCAA GGTCTGGACA
GCCACCTACA CGCCCCGCAG CAACATCGAG AGCGCCGACA ACACCATCCG CGTGAACCTC
GCCGGCGTCC AGGACGCGCA GGGCAACGCC GGAACGGGCA GCGCCAGCAG CGGCAACTTC
AGCATCGACA CCAAGCCGCC CGAAGTCACG GTGACGATCA GCGACGAGCG CCTGAGCGCA
GGAGAAACCG CCACCGTCAC TTTCACCTTC ACCGAGCGCG TCACCGGCTT TGGCACCGAG
GACATCCAGT ACGACACCAG CAAAGGCACG CTGGGCGCTC TGACGGCGGT CGGCACCGAC
GGCAAGGTCT GGAGCGCCAC CTACACGCCC CGGCCCGACA CCGAGAGCGC CGACAACACC
ATCCGCGTGA ACCTCAGCGG CGTCCTGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC
AGCAGCGGCA ACTTCAGCAT CGACACCAAG CGCCCCGAGG TCACGGTGAC GATCAGCGAC
AACCGCCTGA GCGCTGGCCA AACCGCCACC GTCACCTTCA CCTTCAACGA GCGCGTCACC
GGCTTCGATC TCAACGACGT TCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCAGTCGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGGCC CGACACCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTC GCCGGCGTGC TGGACGCGCA GGGCAACGCC
GGAACGGGCA GCGTCAGCAG CGGCAACTTC AGCATCGACA CCAAGCGCCC CGAAGTCACG
GTGGCGATCA GCGACAACCG CCTGATCGCT GGCCAAACCG CCACCGTCAC CTTCACCTTC
AGAGAGAGCG TCACCGGCTT CGATCTCAAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGCGCGC TGACGGCGGT GGGCACCGAC GGCAAGGTCT GGAGCGCCAC CTACACGCCC
CGGCCCGACA CCGAGAGCGC CGAAAACACC ATCAGCGTGA ACCTCGCCGG CGTCCGGGAC
GCGCTGGGCA ACGCCGGGGT GGGTACCGGC ACCAGCGGCA ACTTCAGTAT CGACACCAAG
CCCCCCGAGG TCGCGGTGAC GATCAGCGAT AACCGCCTCA CCGCTGGCGA AAGCGCCACC
ATCACCTTCA CCTTCAACGA GCGCGTCACC GGCTTCGATC TCGACGACGT TCAGTACGAC
ACCAGCAAAG GCACGCTGGG CGCGCTGACG CCGGTCGGCA CCGACGGCAG GGTCTGGAGC
GCCAGCTACA CGCCCCGGCC CGGCATCGAG AGCGCCGACA ACGCCATCAG CGTGCGCCTC
GCCGGCGTCC GGGACGCGCT GGGCAACGCC GGGGTGGGCA CCGGCACCAG CGGCAACTTC
ACTATCGACA CCAAGCCCCC CGAGGTCACG GTGACGATCA GCGATAACCG CCTCACCGCT
GGCGAAAGCG CCACCGTCAC CTTCACCTTC AGCGAGAGCG TCACCGGCTT CACGAAAGAG
GCCATCGACC TGTCCCAAGC CAACGGCACG CTGGGCGACC TGGTGCCGGT CGGCACCGAC
GGCAAGGTCT GGACAGCCAC CTTCACGCCC ACGGACAGAC TGGCGCGCAC CACCAACCAC
CGGCTCACCC TGAACCTGAC CAATGTCAGG GATGCCGCAG GCAACGCCCC GGCGGTGAAC
ACTTACTCGT TCAACCAGTA CACCGTAGAC ACCATGGTCT TTGCGCTCAG CAACGCCACG
GTGAACCGCA ACCAGTTGGT GTTGTTCTAC AGCGATGAAA CCGCGCTGGA CCCGGACCAG
ACCCATAACG CGCCCAACGA TGCGTTTGTG GTGCTGGTCG ATGGCGTGCG CAACAACGTC
ACCGGCGTGG TCGTGGATGC AGCGGCCAAG ACCGTCACGC TGACGCTGGA GCGCGCGGTG
AGCCATGGCC AGCAGGTGAC CGTCGCCTAC AACGACCCCA GCACCGGCGA CGACCCGCAA
GCGGTGCAGG AAGCCGGCAG CGGCGACGAC GCGGCCAGCT TCGCGGCCAG GCCGGTGACC
AACCTCAGCC CGCGTGCGCC CGCGACGGGC ACGACCGACG CCGACAGTCG TAAATCATCC
GAGGACTCTG ACGGCGATAC GGCGAATGCG CTGGACTCCG ACTACGACAG TGTGCCCAAC
GCCCAGGAGG ACCAGGCCCC CGGCCTGCTG CGCCCCGATG GCTCGGCCGG CACGGATGGC
GACGGCAACG GCGACGGTAT CCGGGACAGC CAGCAGGTCG CCGTCGGCTC GACCCGCGAC
CTGACCCTGG TGGCCGGCAG CCAGGACGGC AAACTGATCC CCGGCAGCAA TGCGCGCATC
AGCGAACTGG TGCGCAAAGA TGCCCCCGCC AGCCTGCCCA AGGGCATGGA AATGCCGCTC
AGTCTGACGC AATTCAGGGT CGGCCTGTCC GAAGGACGCT ACACCGAGAG CTTCAGCCTG
TACGTAGACC CGGCGCTCGG CGTCAACGGC TATTGGGTCA AGGACAGCGC CGGCACCTGG
GTGAACCTGG CCAGCGAACC CTACGGCGGC AAAATGAGCA CCGAAGGCGG GCGCACGCGG
CTGGACTTTC AGATCCAGGA CGGCGGCCAG TACGACACCG ACGGGCTGGC CGATGGCCAC
ATCACCGCCC TTGGCGCGGC AGCGAAGATG CCGCTGTCCA TCGTCGGGCA GGCGCCGCCC
CAGGTCGAGT CGCATCGGGG GTTTTGGTTC TAA
 
Protein sequence
MSHRKESTMA TTITIDRQYV KIGETAEIRF TFTNPNYSLS LEGLTVTTLI GETTGGTVHS 
LVNRGVINGQ RVYTATFTPT ANLQRSIYRL TYDVAGKADD AISENFAVDT VRPTLESASI
AKSDLRLGEK TTITITFSER LAFYDNFTLE DLQVDAGKGT LSNLRSYFSD QERIWHVDLQ
APTTRPASGL DGNQIRINLA GITDRAGNTI WDSQLHGTGN PWVNLPTVSY NIDNGVPPMV
AITGQPATTI KGGDSFTVTF TFNERVTGFD LDDVQYDTSK GTLSALTAVG TDGRVWTATY
TPRPNIESAE NTISVNLAGV RDALGNAGVG TGTSGNFSID TKPPEVAVTI SDERLTAGES
ATVTFTFSER VTGFDLNDAQ YDTSKGTLGA LTAVGTDGKV WTATYTPRSD IESANNTIRV
NLAGVQDAQG NAGVGTGSSG NFVIDTRPTV VDGRPSIVSV VGPTSIVLED TDVTITFTFS
EAVTGFTLAN INLDNSSASP YITYSPKEPV SADGGRTWTI TFRAAPRTTD STNTVSIRNL
DGVRDLAGNL AVPNSSASTD NYEVDTEDPD PISATFDKTH LAAGETATVT VTFNEIVNNV
TQGTFDIPNG SVSNLRQDPT DGRIWRATFT PTANLQSASS LISINLNDLR DSAGNVSSGR
KTFYDSTIVI DTKPPEVAVT ISDERLTAGE SATVTFTFSE RVTGFDLNDV QYDTSKGTLS
ALTAVGTDGK VWSATYTPRP DIESAENTIR VNLAGVQDAQ GNAGVGTGSS GNFVIDTRPP
VVDGRPSIVS VVGPTSIVLE DTDVTITFTF SEAVTGFTLA NINLDNSSAS PYITYSPKEP
VSADGGRTWT ITYRAAPRTT DSTNTVSIRN LDGVRDLAGN LAVPNSSAST DNYEVDTEDP
DPISATFDKT HLAAGETATV TVTFNEIVNN VTEDSFQIPN GSVSNLRQDT TDGRIWRVTF
TPTANLQSAS SLISINLNDL RDSAGNVSSG RKSFYDSTIV IDTKRPEVTV VISDNRLTAG
ETATVTFTFS ERVTGFDLND VQYDTSKGTL GALTAVGTDG KVWTATYTPR PNIESADNTI
RVNLAGVQDA QGNAGEGSVS SGNFSIDTRP PEVDRPPEVT VTISDNRLTA GESATVTFTF
SKSVTGFTKD DIDLTLANGT LGDLVPVGTD GKVWSATFTP RPDTESADNT IRVNLAGVLD
AQGNAGVGTG TSGNFTIDTK RPEVAVAISD ERLTAGETAT VTFTFTERVT GFDLNDVQYD
TSKGTLGALT AVGTDGKVWS ATYTPRPDTE SATNTIRVNL AGVLDALGNA GVGTGSSGNF
VIDTRPTVVD RPPSATIAVT PNPVTNSNDR LVTVTITFDE AVTGFTADNI DFSNAHVTPY
GRNRIGALNS SADGRTYTIT YTAEPDVEDA TNTISLRNLH TIRDATGNAV AVSPTSNNFA
IDTKAPVPIS ATFDKTHLAA GETATVTVIF NEIVNNVTED TFQIPNGSVS NLRQDTTDGR
IWRATFTPTA NLQSTSSSIS INLDGLRDSA GNVNNGSLPF RDSTIVIDTK RPEVTVAISD
ERLTAGETAT VTFTFSERVT GFDLNDVQYD TSKGTLGALT AVGTDGKVWT ATYTPRPNTE
SADNTIRVNL AGVQDAQGNA GAGSVSSGNF SIDTKRPEVT VTISDNRLAA GQTATVTFTF
NERVTGFDLN DVQYDTSKGT LGALTAVGTD GKVWSATYTP RSDIESADNT IRVNLAGVLD
AQGNAGTGSV SSGNFSIDTK RPEVTVTISD ERLSAGETAT VTFTFRESVT GFGTEDIQYD
TSKGTLSALT AVGTDGKVWS ATYTPRPNIE SADNTIRVNL AGVQDAQGNA GTGSASSGNF
SIDTKPPEVT VTISDERLSA GQTATVTFTF TERVTGFGTE DIQYDTSKGT LGALTAVGTD
GKVWSATYTP RSDIESAENT IRVNLAGVLD AQGNAGTGSA SSGNFSIDTR PPEVTVTISD
NRLSAGQTAT VTFTFNERVT GFDLNDVQYD TSKGTLGALT AVGTDGKVWT ATYTPRPDTE
SADNTIRVNL AGVLDAQGNA GTGSVSSGNF SIDTKPPEVT VTISDNRLSA GQTATVTFTF
NERVTGFDLN DVQYDTSKGT LGALTAVGTD GKVWTATYTP RPDIESADNT IRVNLSGVLD
AQGNAGTGSV SSGNFSIDTK RPEVTVTISD NRLSAGQTAT FTFTFNERVT GFDLNDVQYD
TSKGTLGALT AVGTDGKVWS ATYTPRPDTE SADNTIRVNL AGVLDAQGNA GTGSVSSGNF
SIDTKRPEVT VTISDNRLSA GQTATFTFTF SERVTGFDLN DVQYDTSKGT LGALTAVGTD
GKVWTATYTP RPDTESADNT IRVNLAGVLD AQGNAGTGSV SSGNFSIDTK RPEVTVTISD
ERLSAGETAT VTFTFRESVT GFGTEDIQYD TSKGTLGALT AVGTDGKVWT ATYTPRSNIE
SADNTIRVNL AGVLDAQGNA GTGSVSSGNF SIDTKRPEVT VTISDNRLSA GQTATFTFTF
SERVTGFDLN DVQYDTSKGT LGALTAVGTD GKVWSATYTP RSDIESADNT IRVNLAGVLD
AQGNAGTGSV SSGNFSIDTK PPEVTVTISD ERLSAGETAT VTFTFTERVT GFGTEDIQYD
TSKGTLSALT AVGTDGKVWS ATYTPRPNIE SADNTIRVNL AGVQDAQGNA GTGSASSGNF
SIDTKPPEVT VTISDERLAA GETATVTFTF NERVTGFGTE DIQYDTSKGT LGALTAVGTD
GKVWSTTYTP RPNIESADNT IRVNLAGVQD AQGNAGTGSA SSGNFSIDTR PPEVTVTISD
ERLSAGETAT VTFTFTERVT GFGTEDIQYD TSKGTLGALT AVGTDGKVWT ATYTPRPDIE
SADNTIRVNL SGVLDAQGNA GVGTGTSGNF SIDTKRPEVT VTISDNRLIA GQTATFTFTF
SERVTGFDLN DVQYDTSKGT LGALTAVGTD GKVWSATYTP RPNIESADNT IRVNLAGVLD
AQGNAGTGSV SSGNFSIDTK RPEVTVTISD ERLSAGETAT VTFTFRESVT DFGSEDIQYD
TSKGTLGALT AVGTDGKVWT ATYTPRPNIE SADNTIRVNL AGVLDAQGNA GTGSASSGNF
SIDTRPPEVT VTISDERLSA GETATVTFTF RESVTGFGTE DIQYDTSKGT LSALTAVGTD
GKVWSATYTP RPDTESADNT IRVNLAGVLD AQGNAGTGSV SSGNFSIDTK RPEVTVTISD
NRLSAGQTAT FTFTFSERVT GFDLNDVQYD TSKGTLGALT AVGTDGKVWS ATYTPRSDTE
SADNTIRVNL AGVQDAQGNA GTGSASSGNF SIDTKRPEVT VAISDERLSA GETATVTFTF
RESVTGFDLN DVQYDTSKGT LSALTAVGTD GKVWSATYTP RPNIESADNT IRVNLAGVQD
AQGNAGTGSV SSGNFSIDTR PPEVTVTISD ERLSAGQTAT VTFTFSERVT GFGTEDIQYD
TSKGTLSALT AVGTDGKVWS ATYTPRPDTE SATNTIRVNL AGVQDAQGNA GTGSASSGNF
SIDTKRPEVT VTISDERLAA GQTATVTFTF NERVTGFDLN DVQYDTSKGT LGALTAVGTD
GKVWSATYTP RPNIESADNT IRVNLAGVQD AQGNAGTGSV SSGNFSIDTK RPEVTVTISD
ERLSAGETAT VTFTFRESVT GFGTEDIQYD TSKGTLSALT AVGTDGKVWT ATYTPRPNIE
SADNTIRVNL AGVQDAQGNA GTGSASSGNF SIDTRPPEVT VTISDERLSA GETATVTFTF
TERVTGFGTE DIQYDTSKGT LSALTAVGTD GKVWTATYTP RPNIESADNT IRVNLAGVLD
AQGNAGTGSV SSGNFSIDTK RPEVTVTISD ERLSAGETAT VTFTFSESVT GFGTEDIQYD
TSKGTLGALT AVGTDGKVWT ATYTPRSNIE SADNTIRVNL AGVQDAQGNA GTGSASSGNF
SIDTKPPEVT VTISDERLSA GETATVTFTF TERVTGFGTE DIQYDTSKGT LGALTAVGTD
GKVWSATYTP RPDTESADNT IRVNLSGVLD AQGNAGTGSV SSGNFSIDTK RPEVTVTISD
NRLSAGQTAT VTFTFNERVT GFDLNDVQYD TSKGTLGALT AVGTDGKVWT ATYTPRPDTE
SADNTIRVNL AGVLDAQGNA GTGSVSSGNF SIDTKRPEVT VAISDNRLIA GQTATVTFTF
RESVTGFDLN DVQYDTSKGT LGALTAVGTD GKVWSATYTP RPDTESAENT ISVNLAGVRD
ALGNAGVGTG TSGNFSIDTK PPEVAVTISD NRLTAGESAT ITFTFNERVT GFDLDDVQYD
TSKGTLGALT PVGTDGRVWS ASYTPRPGIE SADNAISVRL AGVRDALGNA GVGTGTSGNF
TIDTKPPEVT VTISDNRLTA GESATVTFTF SESVTGFTKE AIDLSQANGT LGDLVPVGTD
GKVWTATFTP TDRLARTTNH RLTLNLTNVR DAAGNAPAVN TYSFNQYTVD TMVFALSNAT
VNRNQLVLFY SDETALDPDQ THNAPNDAFV VLVDGVRNNV TGVVVDAAAK TVTLTLERAV
SHGQQVTVAY NDPSTGDDPQ AVQEAGSGDD AASFAARPVT NLSPRAPATG TTDADSRKSS
EDSDGDTANA LDSDYDSVPN AQEDQAPGLL RPDGSAGTDG DGNGDGIRDS QQVAVGSTRD
LTLVAGSQDG KLIPGSNARI SELVRKDAPA SLPKGMEMPL SLTQFRVGLS EGRYTESFSL
YVDPALGVNG YWVKDSAGTW VNLASEPYGG KMSTEGGRTR LDFQIQDGGQ YDTDGLADGH
ITALGAAAKM PLSIVGQAPP QVESHRGFWF