Gene Veis_4143 details


Gene Information

Locus tag: Veis_4143
Symbol:
ID: 4695187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4544738
End bp: 4556332
Gene Length: 11595 bp
Protein Length: 3864 aa
Translation table: 11
GC content: 66%
IMG OID: 639851890
Product: outer membrane protein
Protein accession: YP_998866
Protein GI: 121611059
COG category:
COG ID:
TIGRFAM ID: [TIGR02059] cyanobacterial long protein repeat
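
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive and that the protein length excludes the stop codon; the page does not state either convention, and the function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record.
start_bp, end_bp = 4544738, 4556332
gene_length_bp, protein_length_aa = 11595, 3864

print(span_length(start_bp, end_bp) == gene_length_bp)                # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under those assumptions the listed gene length and protein length agree with the coordinates.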


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGCCCA CCATTGAAAG TGCCTCGATC GCCAAAAGCG ATCTGCGCCT CGGGGAAAAA 
ACCACCATCA CCATCACCTT CAGCGAACCG CTGACCTATT CCAGCTTCAC ACTTGCAGAC
CTGCAGGTGG ACGCCGGCAA GGGCACTTTG AGCAACCTGC GCCGCGTCCA CTCCGACACC
GTGGCCTCCG CCACCTGGCA GGTCGACCTG CAGGCCCCGA CCACCCGGCC CGCGTCGGGC
CTCGATGGCA ACCAGATACG GATCAACCTC GCCGGCATCA CCGATAAAGC AGGCAACACA
GGCGAGAACA GTGTGGTGAA CGGGGTGGCC ACCTCTTGGA CTAACCTCCC GACTGTCAGC
TACAACATCG ACAACGGTGT GCCGCCCATG GTCGCCATCA CCGGGCAGCC TGCCACCACC
ATCAAGGGCG GCGATTCCTT TACCGTCACC TTCACCTTCA ACGAGCGCGT CACCGGCTTC
GATCTCGACG ACGTTCAGTA CGACACCAGC AAAGGCACGC TGAGCGCTCT GACGGCGGTC
GGCACCGACG GCAGGGTCTG GACCGCCACC TACACGCCCC GGCCCAACAT CGAGAGCGCC
GAAAACACCA TCAGCGTGAA CCTCGCCGGC GTCCGGGACG CAACGGACAA CGCCGGGGTG
GGCACCGGCA GCAGCGGCAA CTTCAGTATC GACACCAAGC CCCCCGAGGT CGCGGTGACG
ATCAGCGACG AGCGCCTGAC CGCTGGCGAA AGCGCCACCG TCACCTTCAC CTTCAGCGAG
CGCGTCACCG GCTTCGATCT CAACGACGTT CAGTACGACA CCAGCAAAGG CACGCTGAGC
GCTCTGACGG CGGTCGGCAC CGACGGCAAG GTCTGGAGCG CCACCTACAC GCCCCGGCCC
GACATCGAGA GCGCCGAAAA CACCATCCGC GTGAACCTCG CCGGCGTCCA GGACGCGCAG
GGCAACGCCG GAGTGGGCAC CGGCAGCAGC GGCAACTTCG TCATCGACAC CAGGCCGCCC
GTGGTCGACG GGCGGCCCAG CATCGTCTCT GTGGTGGGCC CCACCTCCAT CGTCTTGGAG
GACACCGACG TCACCATTAC CTTCACCTTC AGCGAGGCAG TCACCGGCTT CACGTTGGCC
AACATCAATT TGGACAACTC CAGTGCCTCT CCCTACATCA CCTACAGCCC GAAAGAGCCG
GTCAGCGCAG ATGGTGGCCG CACCTGGACC ATCACCTACC GTGCCGCTCC GCGTACCACG
GATTCCACCA ACACCGTCAG CATCCGCAAC CTCGATGGCG TGCGTGACCT CGCAGGCAAC
CTCGCAGTGC CTAACTCCAG CGCCAGCACC GACAACTACG AGGTCGATAC CGAGGATCCC
GATCCCATCA GCGCCACGTT TGACAAAACC CACCTTGCCG CCGGAGAAAC CGCCACCGTC
ACCGTCACCT TCAACGAGAT CGTCAACAAC GTCACCGAAG ATAGCTTTCA AATCCCCAAC
GGTTCGGTGA GCAACCTGAG GCAGGACACG ACCGACGGCA GAATCTGGAG GGTCACCTTC
ACCCCCACGG CCAACCTGCA GAGCGCCAGC AGCTTGATCA GCATCAATCT GAACGACCTC
AGGGACAGCG CCGGCAACGT CAGCAGTGGT CGCAAGTCGT TCTACGACTC CACCATCGTC
ATCGACACCA AGCGCCCCGA AGTCACGGTG GTGATCAGCG ACAACCGCCT GACCGCTGGC
GAAACCGCCA CCGTCACCTT CACCTTCAGC GAGCGCGTCA CCGGCTTCGA TCTCAACGAC
GTTCAGTACG ACACCAGCAA AGGCACGCTG GGCGCTCTGA CGGCGGTCGG TACCGACGGC
AAGGTCTGGA CAGCCACCTA CACGCCCCGG CCCAACATCG AGAGCGCCGA CAACACCATC
CGCGTGAACC TCGCCGGCGT CCAGGATGCG CAGGGCAACG CCGGAGAGGG CAGCGTCAGC
AGCGGCAACT TCAGCATCGA CACCAGGCCG CCCGAGGTCG ACAGGCCGCC CGAGGTCACG
GTGACGATCA GCGACAACCG CCTGACCGCT GGCGAAAGCG CCACCGTCAC CTTCACCTTC
AGCAAGAGCG TCACCGGCTT CACGAAAGAT GACATCGACT TGACCCTGGC CAACGGCACG
CTGGGCGACT TGGTGCCGGT GGGCACCGAC GGCAAGGTCT GGAGCGCCAC CTTCACGCCC
CGGCCCGACA CCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTCGCCGG CGTGCTGGAC
GCGCAGGGCA ACGCCGGGGT GGGCACCGGC ACCAGCGGCA ACTTCACTAT CGACACCAAG
CGCCCCGAAG TCGCGGTGGC GATCAGCGAC GAGCGCCTGA CGGCTGGCGA AACCGCCACC
GTCACCTTCA CCTTCACCGA GCGCGTCACC GGCTTCGATC TCAACGACGT TCAGTACGAC
ACCAGCAAAG GCACGCTGGG CGCGCTGACG GCAGTCGGTA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CGACACCGAA AGCGCCACCA ACACCATCCG CGTGAACCTC
GCCGGCGTGC TGGACGCGCT GGGCAACGCC GGGGTGGGCA CCGGCAGCAG CGGCAACTTC
GTCATCGACA CCAGGCCCAC CGTGGTCGAC AGGCCGCCCA GCGCCACCAT TGCCGTGACC
CCCAACCCCG TCACGAACAG CAATGACAGA CTGGTCACCG TCACCATCAC CTTCGACGAG
GCAGTCACCG GCTTCACGGC AGACAACATC GATTTCAGCA ACGCCCATGT CACGCCCTAT
GGCCGCAACC GGATAGGAGC GCTGAACAGC TCAGCCGACG GCCGCACCTA CACCATCACC
TACACGGCAG AGCCGGACGT CGAGGATGCC ACCAACACCA TCAGCCTGCG CAACCTCCAT
ACCATCCGTG ACGCCACAGG CAACGCCGTA GCGGTCAGCC CGACCAGCAA CAACTTCGCG
ATCGACACCA AGGCTCCCGT TCCCATCAGC GCCACGTTTG ACAAAACCCA CCTTGCCGCC
GGAGAAACCG CCACCGTCAC CGTCATCTTC AACGAGATCG TCAACAACGT CACCGAAGAT
ACCTTTCAAA TCCCCAACGG TTCTGTGAGC AACCTGAGGC AGGACACGAC CGACGGCAGA
ATCTGGAGGG CCACCTTCAC CCCCACGGCC AACCTGCAGA GCACCAGCAG CTCGATCAGC
ATCAATCTGG ACGGCCTCAG GGACAGCGCC GGCAACGTCA ACAATGGCAG CCTGCCGTTC
CGCGACTCCA CCATCGTCAT CGACACCAAG CGCCCCGAAG TCACGGTGGC GATCAGCGAC
GAGCGCCTGA CCGCTGGCGA AACTGCCACC GTCACCTTCA CCTTCAGCGA GCGCGTCACC
GGCTTCGATC TCAACGACGT TCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCAGTGGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGGCC CAACACCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTG GCCGGCGTCC AGGACGCGCA GGGCAACGCC
GGAGCGGGCA GCGTCAGCAG CGGCAACTTC AGCATCGACA CCAAGCGCCC CGAAGTCACG
GTGACGATCA GCGACAACCG CCTGGCCGCT GGCCAAACCG CCACCGTCAC CTTCACCTTC
AACGAGCGCG TCACCGGCTT CGATCTCAAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGTGCGC TGACGGCAGT GGGCACCGAC GGCAAGGTCT GGAGCGCCAC CTACACGCCC
CGCAGCGACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTCGCCGG CGTGCTGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAAG
CGCCCCGAGG TCACGGTGAC GATCAGCGAC GAGCGCCTGA GCGCAGGAGA AACCGCCACC
GTCACCTTCA CCTTCAGAGA GAGCGTCACC GGCTTTGGCA CCGAGGACAT CCAGTACGAC
ACCAGCAAAG GCACGCTGAG CGCTCTGACG GCGGTCGGCA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CAACATCGAG AGCGCCGACA ACACCATCCG CGTGAACCTC
GCCGGCGTCC AGGACGCGCA GGGCAACGCC GGAACGGGCA GCGCCAGCAG CGGCAACTTC
AGCATCGACA CCAAGCCGCC CGAAGTCACG GTGACGATCA GCGACGAGCG CCTGAGCGCT
GGCCAAACCG CCACCGTCAC CTTCACCTTC ACCGAGCGCG TCACCGGCTT TGGCACCGAG
GACATCCAGT ACGACACCAG CAAAGGCACG CTGGGCGCGC TGACGGCAGT CGGTACCGAC
GGCAAGGTCT GGAGCGCTAC CTACACGCCC CGCAGCGACA TCGAGAGCGC CGAAAACACC
ATCCGCGTGA ACCTCGCCGG CGTGCTGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGCGCC
AGCAGCGGCA ACTTCAGCAT CGACACCAGG CCGCCCGAGG TCACGGTGAC GATCAGCGAC
AACCGCCTGA GCGCTGGCCA AACCGCCACC GTCACCTTCA CCTTCAACGA GCGCGTCACC
GGCTTCGATC TCAACGACGT TCAATACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCAGTGGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGGCC CGACACCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTG GCCGGCGTGC TGGACGCGCA GGGCAACGCC
GGAACGGGCA GCGTCAGCAG CGGCAACTTC AGCATCGACA CCAAGCCGCC CGAGGTCACG
GTGACGATCA GCGACAACCG CCTGAGCGCT GGCCAAACCG CCACCGTCAC CTTCACCTTC
AACGAGCGCG TCACCGGCTT CGATCTCAAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGTGCGC TGACGGCAGT CGGCACCGAC GGCAAGGTCT GGACAGCCAC CTACACGCCC
CGGCCCGACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTCAGCGG CGTCCTGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAAG
CGCCCCGAGG TCACGGTGAC GATCAGTGAC AACCGCCTGA GCGCTGGCCA AACCGCCACC
TTCACCTTCA CCTTCAACGA GCGCGTCACC GGCTTCGATC TCAACGACGT TCAGTACGAC
ACCAGCAAAG GCACGCTGGG CGCGCTGACG GCGGTGGGCA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CGACACCGAG AGCGCCGACA ACACCATCCG CGTGAACCTC
GCCGGCGTGC TGGACGCGCA GGGCAACGCC GGAACGGGCA GCGTCAGCAG CGGCAACTTC
AGCATCGACA CCAAGCGCCC CGAAGTCACG GTGACGATCA GCGACAACCG CCTGAGCGCT
GGCCAAACCG CCACCTTCAC CTTCACCTTC AGCGAGCGCG TCACCGGCTT CGATCTCAAC
GACGTTCAGT ACGACACCAG CAAAGGCACG CTGGGCGCTC TGACGGCAGT CGGCACCGAC
GGCAAGGTCT GGACAGCCAC CTACACGCCC CGGCCCGACA CCGAGAGCGC CGACAACACC
ATCCGCGTGA ACCTGGCCGG CGTGCTGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC
AGCAGCGGCA ACTTCAGCAT CGACACCAAA CGCCCCGAAG TCACGGTGAC GATCAGCGAC
GAGCGCCTGA GCGCAGGAGA AACCGCCACC GTCACCTTCA CCTTCAGAGA GAGCGTCACC
GGCTTTGGCA CCGAGGACAT CCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCAGTGGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGCAG CAACATCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTG GCCGGCGTGC TGGACGCGCA GGGCAACGCC
GGAACGGGCA GCGTCAGCAG CGGCAACTTC AGCATCGACA CCAAGCGCCC CGAAGTCACG
GTGACGATCA GCGACAACCG CCTGAGCGCT GGCCAAACCG CCACCTTCAC CTTCACCTTC
AGCGAGCGCG TCACCGGCTT CGATCTCAAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGTGCGC TGACGGCAGT GGGCACCGAC GGCAAGGTCT GGAGCGCCAC CTACACGCCC
CGCAGCGACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTGGCCGG CGTGCTGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAAA
CCGCCCGAAG TCACGGTGAC GATCAGCGAC GAGCGCCTGA GCGCAGGAGA AACCGCCACC
GTCACCTTCA CCTTCACCGA GCGCGTCACC GGCTTTGGCA CCGAGGACAT CCAGTACGAC
ACCAGCAAAG GCACGCTGAG CGCTCTAACG GCGGTCGGTA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CAACATCGAG AGCGCCGACA ACACCATCCG CGTGAACCTC
GCCGGCGTCC AGGACGCGCA GGGCAACGCC GGAACGGGCA GCGCCAGCAG CGGCAACTTC
AGCATCGACA CCAAGCCGCC CGAAGTCACG GTGACGATCA GCGACGAGCG CCTCGCCGCT
GGAGAAACCG CCACCGTCAC CTTCACCTTC AACGAGCGCG TCACCGGCTT TGGCACCGAG
GACATCCAGT ACGACACCAG CAAAGGCACG CTGGGTGCGC TGACGGCAGT CGGCACCGAC
GGCAAGGTCT GGAGCACCAC CTACACGCCC CGGCCCAACA TCGAGAGCGC CGACAACACC
ATCCGCGTGA ACCTCGCCGG CGTCCAGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGTGCC
AGCAGCGGCA ACTTCAGCAT CGACACCAGG CCGCCCGAAG TCACGGTGAC GATCAGCGAC
GAGCGCCTGA GCGCAGGAGA AACCGCCACC GTCACCTTCA CCTTCACCGA GCGCGTCACC
GGCTTTGGCA CCGAGGACAT CCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCTCTGACG
GCGGTGGGCA CCGACGGCAA GGTCTGGACA GCCACCTACA CGCCCCGGCC CGACATCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTC AGCGGCGTGC TGGACGCGCA GGGCAACGCC
GGGGTGGGCA CCGGCACCAG CGGCAACTTC AGCATCGACA CCAAGCGCCC CGAAGTCACG
GTGACGATCA GCGACAACCG CCTGATCGCT GGCCAAACCG CCACCTTCAC CTTCACCTTC
AGCGAGCGCG TCACCGGCTT CGATCTCAAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGCGCTC TGACGGCGGT CGGCACCGAC GGCAAGGTCT GGAGCGCCAC CTACACGCCC
CGGCCCAACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTGGCCGG CGTCCTGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAAG
CGCCCCGAAG TCACGGTGAC GATCAGCGAC GAGCGCCTGA GCGCAGGAGA AACCGCCACC
GTCACCTTCA CCTTCAGAGA GAGCGTCACC GACTTTGGTA GCGAGGACAT CCAGTACGAC
ACCAGCAAAG GCACGCTGGG CGCTCTGACG GCAGTCGGCA CCGACGGCAA GGTCTGGACA
GCCACCTACA CGCCCCGGCC CAACATCGAG AGCGCCGACA ACACCATCCG CGTGAACCTC
GCCGGCGTGC TGGACGCGCA GGGCAACGCC GGAACGGGCA GCGCCAGCAG CGGCAACTTC
AGCATCGACA CCAGGCCGCC CGAGGTCACG GTGACGATCA GCGACGAGCG CCTGAGCGCA
GGAGAAACCG CCACCGTCAC CTTCACCTTC AGAGAGAGCG TCACCGGCTT TGGCACCGAG
GACATCCAGT ACGACACCAG CAAAGGCACG CTGAGCGCGC TGACGGCGGT CGGCACCGAC
GGCAAGGTCT GGAGCGCCAC CTACACGCCC CGGCCCGACA CCGAGAGCGC CGACAACACC
ATCCGCGTGA ACCTCGCCGG CGTGCTGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC
AGCAGCGGCA ACTTCAGCAT CGACACCAAG CGCCCCGAAG TCACGGTGAC GATCAGCGAC
AACCGCCTGA GCGCTGGCCA AACCGCCACC TTCACCTTCA CCTTCAGCGA GCGCGTCACC
GGCTTCGATC TCAACGACGT TCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCAGTCGGCA CCGACGGCAA GGTCTGGAGC GCCACCTACA CGCCCCGCAG CGACACCGAG
AGCGCCGACA ACACCATCCG CGTGAACCTG GCAGGCGTCC AGGACGCGCA GGGCAACGCC
GGAACGGGCA GCGCCAGCAG CGGCAACTTC AGCATCGACA CCAAACGCCC CGAGGTCACG
GTGGCGATCA GCGACGAGCG CCTGAGTGCG GGAGAAACCG CCACCGTCAC CTTCACCTTC
AGAGAGAGCG TCACCGGCTT CGATCTCAAC GACGTTCAAT ACGACACCAG CAAAGGCACG
CTGAGCGCTC TGACGGCGGT CGGTACCGAC GGCAAGGTCT GGAGCGCCAC CTACACGCCC
CGGCCCAACA TCGAGAGCGC CGACAACACC ATCCGCGTGA ACCTGGCCGG CGTCCAGGAC
GCGCAGGGCA ACGCCGGAAC GGGCAGCGTC AGCAGCGGCA ACTTCAGCAT CGACACCAGG
CCGCCCGAGG TCACGGTGAC GATCAGCGAC GAGCGCCTGA GCGCTGGCCA AACCGCCACC
GTCACCTTCA CCTTCAGCGA GCGCGTCACC GGCTTTGGCA CCGAGGACAT CCAGTACGAC
ACCAGCAAAG GCACGCTGAG CGCTCTGACG GCGGTCGGTA CCGACGGCAA GGTCTGGAGC
GCCACCTACA CGCCCCGGCC CGACACCGAA AGCGCCACCA ACACCATCCG CGTGAACCTC
GCCGGCGTCC AGGACGCGCA GGGCAACGCC GGAACGGGCA GCGCCAGCAG CGGCAACTTC
AGCATCGACA CCAAGCGCCC CGAAGTCACG GTGACGATCA GCGACGAGCG CCTGAGTGCG
GGAGAAACCG CCACCGTCAC CTTCACCTTC AGCAAGAGCG TCACCGGCTT TGGCACCGAG
GACATCCAGT ACGACACCAG CAAAGGCACG CTGAGCGCGC TGACGGCAGT CGGCACCGAC
GGCAAGGTCT GGAGCGCCAC CTTCACGCCC CGGCCCGACA CCGAGAGCGC CGACAACACC
ATCCGCGTGA ACCTCGCCGG CGTCCTGGAC GCGCAGGGCA ACGCCGGAAC GGGCAGCGGC
AGCAGCGGCA ACTTCAGCAT CGACACCAGG CCGCCCGAGG TCACGGTGAC GATCAGCGAC
AACCGCCTGA TCGCTGGCCA AACCGCCACC GTCACCTTCA CTTTCAGAGA GAGCGTCACC
GGCTTCGATC TCAACGACGT TCAGTACGAC ACCAGCAAAG GCACGCTGGG CGCGCTGACG
GCGGTGGGCA CCGACGGCAA GGTCTGGAGC GCCACCTACA CGCCCCGCAG CGACATCGAG
AGCGCCGAAA ACACCATCAG CGTGAACCTC GCCGGCGTCC GGGACGCGCT GGGCAACGCC
GGGGTGGGCA CCGGCACCAG CGGCAACTTC AGTATCGACA CCAAGCCCCC CGAGGTCGCG
GTGACGATCA GCGACGAGCG CCTGAGCGCT GGCGAAAGCG CCACCGTCAC CTTCACCTTC
AGCGAGCGCG TCACCGGCTT CGATCTCGAC GACGTTCAGT ACGACACCAG CAAAGGCACG
CTGGGCGCTC TGACGCCGGT CGGCACCGAC GGCAGGGTCT GGAGCGCCAG CTACACGCCC
CGGCCCGGCA TCGAGAGCGC CGACAACGCC ATCAGCGTGC GCCTCGCCGG CGTCCGGGAC
GCGCTCGGCA ACGCCGGGGT GGGCACCGGC ACCAGCGGCA ACTTCACTAT CGACACCAAG
CCCCCCGAGG TCACGGTGAC GATCAGCGAT AACCGCCTCA CCGCTGGCGA AAGCGCCACC
GTCACCTTCA CCTTCAGCGA GAGCGTCACC GGCTTCACGA AAGAGGCCAT CGACCTGTCC
CAGGCCAACG GCACGCTGGG CGAGCTGGTG CCGGTCGGCA CCGACGGCAA GGTCTGGACA
GCCACCTTCA CGCCCACGGA CAGACTGGCG CGCACCACCA ACCACCGGCT CACCCTGAAC
CTGACCAATG TCAGGGATGC CGCAGGCAAC GCCCCGGCGG TGAACACTTA CTCGTTCAAC
CAGTACACCG TAGACACCAT GGTCTTTGCG CTCAGCAACG CCACGGTGAA CCGCAACCAG
TTGGTGTTGT TCTACAGCGA TGAAACCGCG CTGGACCCGG ACCAGACCCA TAACGCGCCC
AACGATGCGT TTGTGGTGCT GGTCGATGGC GTGCGCAACA ACGTCACCGG CGTGGTCGTG
GATGCAGCGG CCAAGACCGT CACGCTGACG CTGGAGCGCG CGGTGAGCAA TGGCCAGCAG
GTGAGCGTCG CCTACAACGA CCCCAGCACC GGCGACGACC GGCAAGCGGT GCAGGAAGCC
GGCAGCGGCG ACGACGCGGC CAGCTTCGCG GCCAGGCCGG TGACCAACCT CAGCCCGCGT
GCGCCCGCGA CGGGCACGAC CGACGCCGAC AGTCGTAAAT CATCCGAGGA CTCTGACGGC
GATACGGCGA ATGCGCTGGA CTCCGACTAC GACAGTGTGC CCAACGCCCA GGAGGACCAG
GCCCCCGGCC TGCTGCGCCC CGATGGCTCG GCCGGCACGG ATGGCGACGG CAACGGCGAC
GGTATCCGGG ACAGCCAGCA GGTCGCCGTC GGCTCGACCC GCGACCTGAC CCTGGTGGCC
GGCAGCCAGG ACGGCAAACT GATCCCCGAC AGCAATGCGC GCATCAGCAA ACTGGTGCGC
AACGATGCCC CCGCCAGCCT GCCCAAGGGC ATGGAGATGC CGATCGGCCT GACGCAATTC
AGGGTCGGCC TGTCCGAAGG ACGCTACACC GAGAGCTTCA GCCTGTACGT AGACCCGGCG
CTCGGCGTCA ACGGCTATTG GGTCAAGGAC AGCGCCGGCA CCTGGGTGAA CCTGGCCAGC
GAACCCTACG GCGGCAAAAT GAGCACCGAA GGCGGGCGCA CGCGGCTGGA CTTCCAGATC
CAGGACGGCG GCCAGTACGA CACCGACGGG CTGGCCGATG GCCACATCAC CGCCCTTGGC
GCGGCAGCGA AGATGCCGCT GTCCATCGTC GGGCAGGCGC CGCCCCAGGT CGAGTCGCAT
GGGTTTTGGT TCTGA
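
The gene sequence above is displayed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of that cleanup and of a GC percentage calculation; the helper names are hypothetical, and the sample is only the first display line, so its value need not match the 66% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for the 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, used here only as a sample.
first_line = "GTGCGGCCCA CCATTGAAAG TGCCTCGATC GCCAAAAGCG ATCTGCGCCT CGGGGAAAAA"
sample = clean_sequence(first_line)
print(len(sample), round(gc_percent(sample), 1))
```

To reproduce the whole-gene figure, pass the full pasted sequence through `clean_sequence` first.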
 
Protein sequence
MRPTIESASI AKSDLRLGEK TTITITFSEP LTYSSFTLAD LQVDAGKGTL SNLRRVHSDT 
VASATWQVDL QAPTTRPASG LDGNQIRINL AGITDKAGNT GENSVVNGVA TSWTNLPTVS
YNIDNGVPPM VAITGQPATT IKGGDSFTVT FTFNERVTGF DLDDVQYDTS KGTLSALTAV
GTDGRVWTAT YTPRPNIESA ENTISVNLAG VRDATDNAGV GTGSSGNFSI DTKPPEVAVT
ISDERLTAGE SATVTFTFSE RVTGFDLNDV QYDTSKGTLS ALTAVGTDGK VWSATYTPRP
DIESAENTIR VNLAGVQDAQ GNAGVGTGSS GNFVIDTRPP VVDGRPSIVS VVGPTSIVLE
DTDVTITFTF SEAVTGFTLA NINLDNSSAS PYITYSPKEP VSADGGRTWT ITYRAAPRTT
DSTNTVSIRN LDGVRDLAGN LAVPNSSAST DNYEVDTEDP DPISATFDKT HLAAGETATV
TVTFNEIVNN VTEDSFQIPN GSVSNLRQDT TDGRIWRVTF TPTANLQSAS SLISINLNDL
RDSAGNVSSG RKSFYDSTIV IDTKRPEVTV VISDNRLTAG ETATVTFTFS ERVTGFDLND
VQYDTSKGTL GALTAVGTDG KVWTATYTPR PNIESADNTI RVNLAGVQDA QGNAGEGSVS
SGNFSIDTRP PEVDRPPEVT VTISDNRLTA GESATVTFTF SKSVTGFTKD DIDLTLANGT
LGDLVPVGTD GKVWSATFTP RPDTESADNT IRVNLAGVLD AQGNAGVGTG TSGNFTIDTK
RPEVAVAISD ERLTAGETAT VTFTFTERVT GFDLNDVQYD TSKGTLGALT AVGTDGKVWS
ATYTPRPDTE SATNTIRVNL AGVLDALGNA GVGTGSSGNF VIDTRPTVVD RPPSATIAVT
PNPVTNSNDR LVTVTITFDE AVTGFTADNI DFSNAHVTPY GRNRIGALNS SADGRTYTIT
YTAEPDVEDA TNTISLRNLH TIRDATGNAV AVSPTSNNFA IDTKAPVPIS ATFDKTHLAA
GETATVTVIF NEIVNNVTED TFQIPNGSVS NLRQDTTDGR IWRATFTPTA NLQSTSSSIS
INLDGLRDSA GNVNNGSLPF RDSTIVIDTK RPEVTVAISD ERLTAGETAT VTFTFSERVT
GFDLNDVQYD TSKGTLGALT AVGTDGKVWT ATYTPRPNTE SADNTIRVNL AGVQDAQGNA
GAGSVSSGNF SIDTKRPEVT VTISDNRLAA GQTATVTFTF NERVTGFDLN DVQYDTSKGT
LGALTAVGTD GKVWSATYTP RSDIESADNT IRVNLAGVLD AQGNAGTGSV SSGNFSIDTK
RPEVTVTISD ERLSAGETAT VTFTFRESVT GFGTEDIQYD TSKGTLSALT AVGTDGKVWS
ATYTPRPNIE SADNTIRVNL AGVQDAQGNA GTGSASSGNF SIDTKPPEVT VTISDERLSA
GQTATVTFTF TERVTGFGTE DIQYDTSKGT LGALTAVGTD GKVWSATYTP RSDIESAENT
IRVNLAGVLD AQGNAGTGSA SSGNFSIDTR PPEVTVTISD NRLSAGQTAT VTFTFNERVT
GFDLNDVQYD TSKGTLGALT AVGTDGKVWT ATYTPRPDTE SADNTIRVNL AGVLDAQGNA
GTGSVSSGNF SIDTKPPEVT VTISDNRLSA GQTATVTFTF NERVTGFDLN DVQYDTSKGT
LGALTAVGTD GKVWTATYTP RPDIESADNT IRVNLSGVLD AQGNAGTGSV SSGNFSIDTK
RPEVTVTISD NRLSAGQTAT FTFTFNERVT GFDLNDVQYD TSKGTLGALT AVGTDGKVWS
ATYTPRPDTE SADNTIRVNL AGVLDAQGNA GTGSVSSGNF SIDTKRPEVT VTISDNRLSA
GQTATFTFTF SERVTGFDLN DVQYDTSKGT LGALTAVGTD GKVWTATYTP RPDTESADNT
IRVNLAGVLD AQGNAGTGSV SSGNFSIDTK RPEVTVTISD ERLSAGETAT VTFTFRESVT
GFGTEDIQYD TSKGTLGALT AVGTDGKVWT ATYTPRSNIE SADNTIRVNL AGVLDAQGNA
GTGSVSSGNF SIDTKRPEVT VTISDNRLSA GQTATFTFTF SERVTGFDLN DVQYDTSKGT
LGALTAVGTD GKVWSATYTP RSDIESADNT IRVNLAGVLD AQGNAGTGSV SSGNFSIDTK
PPEVTVTISD ERLSAGETAT VTFTFTERVT GFGTEDIQYD TSKGTLSALT AVGTDGKVWS
ATYTPRPNIE SADNTIRVNL AGVQDAQGNA GTGSASSGNF SIDTKPPEVT VTISDERLAA
GETATVTFTF NERVTGFGTE DIQYDTSKGT LGALTAVGTD GKVWSTTYTP RPNIESADNT
IRVNLAGVQD AQGNAGTGSA SSGNFSIDTR PPEVTVTISD ERLSAGETAT VTFTFTERVT
GFGTEDIQYD TSKGTLGALT AVGTDGKVWT ATYTPRPDIE SADNTIRVNL SGVLDAQGNA
GVGTGTSGNF SIDTKRPEVT VTISDNRLIA GQTATFTFTF SERVTGFDLN DVQYDTSKGT
LGALTAVGTD GKVWSATYTP RPNIESADNT IRVNLAGVLD AQGNAGTGSV SSGNFSIDTK
RPEVTVTISD ERLSAGETAT VTFTFRESVT DFGSEDIQYD TSKGTLGALT AVGTDGKVWT
ATYTPRPNIE SADNTIRVNL AGVLDAQGNA GTGSASSGNF SIDTRPPEVT VTISDERLSA
GETATVTFTF RESVTGFGTE DIQYDTSKGT LSALTAVGTD GKVWSATYTP RPDTESADNT
IRVNLAGVLD AQGNAGTGSV SSGNFSIDTK RPEVTVTISD NRLSAGQTAT FTFTFSERVT
GFDLNDVQYD TSKGTLGALT AVGTDGKVWS ATYTPRSDTE SADNTIRVNL AGVQDAQGNA
GTGSASSGNF SIDTKRPEVT VAISDERLSA GETATVTFTF RESVTGFDLN DVQYDTSKGT
LSALTAVGTD GKVWSATYTP RPNIESADNT IRVNLAGVQD AQGNAGTGSV SSGNFSIDTR
PPEVTVTISD ERLSAGQTAT VTFTFSERVT GFGTEDIQYD TSKGTLSALT AVGTDGKVWS
ATYTPRPDTE SATNTIRVNL AGVQDAQGNA GTGSASSGNF SIDTKRPEVT VTISDERLSA
GETATVTFTF SKSVTGFGTE DIQYDTSKGT LSALTAVGTD GKVWSATFTP RPDTESADNT
IRVNLAGVLD AQGNAGTGSG SSGNFSIDTR PPEVTVTISD NRLIAGQTAT VTFTFRESVT
GFDLNDVQYD TSKGTLGALT AVGTDGKVWS ATYTPRSDIE SAENTISVNL AGVRDALGNA
GVGTGTSGNF SIDTKPPEVA VTISDERLSA GESATVTFTF SERVTGFDLD DVQYDTSKGT
LGALTPVGTD GRVWSASYTP RPGIESADNA ISVRLAGVRD ALGNAGVGTG TSGNFTIDTK
PPEVTVTISD NRLTAGESAT VTFTFSESVT GFTKEAIDLS QANGTLGELV PVGTDGKVWT
ATFTPTDRLA RTTNHRLTLN LTNVRDAAGN APAVNTYSFN QYTVDTMVFA LSNATVNRNQ
LVLFYSDETA LDPDQTHNAP NDAFVVLVDG VRNNVTGVVV DAAAKTVTLT LERAVSNGQQ
VSVAYNDPST GDDRQAVQEA GSGDDAASFA ARPVTNLSPR APATGTTDAD SRKSSEDSDG
DTANALDSDY DSVPNAQEDQ APGLLRPDGS AGTDGDGNGD GIRDSQQVAV GSTRDLTLVA
GSQDGKLIPD SNARISKLVR NDAPASLPKG MEMPIGLTQF RVGLSEGRYT ESFSLYVDPA
LGVNGYWVKD SAGTWVNLAS EPYGGKMSTE GGRTRLDFQI QDGGQYDTDG LADGHITALG
AAAKMPLSIV GQAPPQVESH GFWF
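
The protein sequence above can be checked against the gene sequence by translating it under translation table 11. The sketch below is an illustration, not the database's own pipeline: the function name is hypothetical, and it assumes that a permitted initiation codon in the first position (here GTG) is read as methionine.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order. Table 11 assigns the same residues
# to codons as the standard code; it differs in which codons may initiate.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(text: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate_cds("GTGCGGCCCA CCATTGAAAG TGCCTCGATC"))  # MRPTIESASI
print(translate_cds("CATGGGTTTTGGTTCTGA"))                # HGFWF
```

Both fragments match the first and last residues of the listed protein sequence.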