Gene Veis_1300 details

Gene Information

Locus tag: Veis_1300
Symbol:
ID: 4691422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 1437797
End bp: 1439377
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 63%
IMG OID: 639849072
Product: extracellular solute-binding protein
Protein accession: YP_996086
Protein GI: 121608279
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
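
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminating stop codon; both are assumptions that happen to agree with the values listed here, not documented properties of the database. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1437797, 1439377)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```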


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.901266
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATGGC ACAAGAAAAT CGCCGTGACC GCCCTCGTGG CCGCGCTGAC GGCGGCCAGC 
GGGGTTGCCG GCGCGCAGAC CGTCAGGATC GCCAACCAGG GCGATGCGCT CTCGATGGAC
CCGCACTCGC TCAACGAAAC GCTGCAACTG AGCATCACCG GCAATGTGTA TGAGGGACTG
ACCGGGCGCA ACAAGGACCT GAGCCTGGCC CCGGCGCTGG CCACCCGTTG GCAGCAGACC
GCGCCCACCG TGTGGCGTTT CGATCTGCGC CGGGGTGTGC TGTTTCATGA CGGCAGCCCC
TTTACCGCCG ACGATGTGCT GTTCAGCCTG GCGCGCACCC AGGTCGAAGG CTCGGACATG
AAGAGCAACA CCAACGACTT CAAGCAGGTG CGCAAGATCG ATGAGCACAC CGTGGAGATC
GAGACCAAGA TTCCGTTCCT GACCCTGCCC GACACGCTCT CGCTGGTGTA CATGATGAGC
AAGAAATGGT GCGAGACCCA CCAGGCCAGC GCGCCGGTGG ATCGCCGCAA GGGCATAGAG
AACGCGGCCT CGTTCCGCGC CAACGGCACC GGGCCTTTTC GCCTGCGCGA GCGCCAGCCC
GGGGTGCGCA CCGTGTTCAC GCGCAATGGC TCGTACTGGG ACAAGATCGA AGGCAACGTC
AGCGAGGTGG TGTTCACGCC GATCGGCAAT GAGGCCACCC GGGTGGCGGC CTTGCTGTCC
GGGCAGGTGG ACGTGATGGA GCCGGTGCCG GTGCAAGACA TCGCGCGCGT CAACCGTGGC
GCCGACACGC GCGTGATCGC CGGCCCCGAA CTGCGCACCG TCTTTCTGGG CATGGACCAA
AGGCGCGACG AACTGCTGTA CTCCAACGTC AAGGGCAAGA ACCCCTTCAA GGACAAGCGC
GTGCGCCAAG CCTTCTACCA GGCCATCGAT ATCGAAGGCA TCAAGCGCAC CGTGATGCGC
GGCGCATCCC TTCCTTCGGC GCTGATGGTA GGCCCGGGCG TGAATGGCTT TCAGCCGGAC
GCCAAGCGCC TGCCCTACGA TGTGGAGGCG GCGAAAAAGC TGATGCTGCA AGCGGGCTAT
CCGAATGGTT TCGAGGTCTC GATGAACTGC CCGAACGACC GCTATGTGAA CGATAGCCGC
ATCTGCCAGG CGGTGGCGGC CAACCTGTCG CGCATCAACG TCAAGATCAA CCTGCAGGCC
GAAACCAAGG GTTCCTACTT CCCCAAACTC CTGCGCCGCG ACACCAGCTT TTACATGCTG
GGCTGGATGC CTCCGACCTA CGACGCGCAC AATGCGCTCA ATGCGCTGAT GCGTTGCTCG
GACGACCGGG GAGCGGGCCA GTACAACTAT GGCGCTTACT GCAATCCGAA GGTGGACGAG
TTGACGCTCA AGATCCAGTC GGAAAACGAC AAGACCCGGC GCAACGCGAT GATCAAAGAG
GTGTTCGAGA TCCACGCGGC AGACATCGGC CATCTGCCGC TGCACCAGCA GATGCTGGCC
TGGGGCGTGA GCAAGAAGGT CCAGTTGGTG CAGACGGCCG ACAACATCAT GCTGTTCAAG
TGGATCAGTA TCCGCCCGTG A
 
Protein sequence
MRWHKKIAVT ALVAALTAAS GVAGAQTVRI ANQGDALSMD PHSLNETLQL SITGNVYEGL 
TGRNKDLSLA PALATRWQQT APTVWRFDLR RGVLFHDGSP FTADDVLFSL ARTQVEGSDM
KSNTNDFKQV RKIDEHTVEI ETKIPFLTLP DTLSLVYMMS KKWCETHQAS APVDRRKGIE
NAASFRANGT GPFRLRERQP GVRTVFTRNG SYWDKIEGNV SEVVFTPIGN EATRVAALLS
GQVDVMEPVP VQDIARVNRG ADTRVIAGPE LRTVFLGMDQ RRDELLYSNV KGKNPFKDKR
VRQAFYQAID IEGIKRTVMR GASLPSALMV GPGVNGFQPD AKRLPYDVEA AKKLMLQAGY
PNGFEVSMNC PNDRYVNDSR ICQAVAANLS RINVKINLQA ETKGSYFPKL LRRDTSFYML
GWMPPTYDAH NALNALMRCS DDRGAGQYNY GAYCNPKVDE LTLKIQSEND KTRRNAMIKE
VFEIHAADIG HLPLHQQMLA WGVSKKVQLV QTADNIMLFK WISIRP
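
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example rather than a full implementation: it reads frame 1 only, stops at the first stop codon, and ignores the alternative start codons that translation table 11 permits (this gene begins with ATG, so that does not matter here). The amino acid assignments of table 11 match those of the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last stretches of the gene sequence are used as input.

```python
# Codon table built from the conventional TCAG ordering. Translation
# table 11 shares its amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 21 bases of the gene sequence listed above.
first_bases = "ATGCGATGGC ACAAGAAAAT CGCCGTGACC"
last_bases = "TGGATCAGTA TCCGCCCGTG A"
print(translate(first_bases))  # MRWHKKIAVT
print(translate(last_bases))   # WISIRP
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the listed protein sequence and GC content to be compared against the nucleotide data.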