Gene Veis_3228 details


Gene Information

Locus tag: Veis_3228
Symbol:
ID: 4693091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 3603054
End bp: 3604004
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 62%
IMG OID: 639850991
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_997976
Protein GI: 121610169
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID: [TIGR03414] choline ABC transporter, periplasmic binding protein
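
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3603054
END_BP = 3604004
GENE_LENGTH_BP = 951
PROTEIN_LENGTH_AA = 316


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 3604004 - 3603054 + 1 gives 951 bp, and 951 / 3 - 1 gives 316 aa, matching the Gene Length and Protein Length fields.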


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.285749
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAGC GCTTTGCCCT TTCCGGTTTC CTCGGCTTGG TCATGCTGTG CGCGCCGCTG 
GCGCAAGCGG CTGCAGCGGA GCCCGCCAGT TGCAAGACCG TGCGTTTTGC CGATGTGGGC
TGGAGCGATA TCGCGGCCAC GACCGGGCTG GCCTCGGTGG TGCTCGAAGG GCTGGGCTAC
CAGCCGAGCG TGACGATCGC CTCGTTGCCG ATCGCGTTCA CCGGCATCAA GTCCAAGCAG
ATCGACGCCT TCCTGGGCTA CTGGTTTCCC AGCATGACGC CCATCATCGA GCCTTTCGTC
AAGGCCGGGC AGATCAAGGT GCTCGACCGG CCCAACCTGG TCGGCGCCAA GTACACACTG
GCGGTTCCGG CCTATCTGTA CGACAAGGGG CTCAAGACCT TCACCGACAT CGCCAAGTTC
CACAAAGAGT TGGACGGCAA GCTCTACGGC ATCGAGCCTG GCAACGACGG CAACGCGCTG
ATGCAGGGCA TGATCGACAA AAACGAATAT GGCCTGAAGG GCTTCAAACT GGTGGAGTCC
AGCGAAGCCG GCATGCTGGC CGAAGTCCAG CGCGCAGCGC GCAGCGGCAA GGCCATCGTC
TTTCTGGGCT GGGAGCCGCA TCCGATGAAT GTGCAGATGA AGATGAAATA CCTGCAAGGC
GGCGATGCCG TGTTTGGCCC CAACCTGGGC GAGGCCAAGG TCTTCACGGC GCTGCCGCCC
GACTACGAGG CACGCTGCCC GAATGTCGCG CGCTTGCTGA AGAATCTGCG CTTTACCACC
GACATCGAAA ATGCGGTGAT GCTGGACATC CTCGAAAAGG TCAAGCCCAG TGATGCCGCC
CGGGCCTATC TGAAGAAAAA CCCCGCCCCG CTGGGCGAAT GGCTCGATGG CGTCAAGACC
TTCTCCGGCC AGGAAGGTCT GCCCGCCGTC ACGGCGGCGC TGAAAAACTG A
 
Protein sequence
MTKRFALSGF LGLVMLCAPL AQAAAAEPAS CKTVRFADVG WSDIAATTGL ASVVLEGLGY 
QPSVTIASLP IAFTGIKSKQ IDAFLGYWFP SMTPIIEPFV KAGQIKVLDR PNLVGAKYTL
AVPAYLYDKG LKTFTDIAKF HKELDGKLYG IEPGNDGNAL MQGMIDKNEY GLKGFKLVES
SEAGMLAEVQ RAARSGKAIV FLGWEPHPMN VQMKMKYLQG GDAVFGPNLG EAKVFTALPP
DYEARCPNVA RLLKNLRFTT DIENAVMLDI LEKVKPSDAA RAYLKKNPAP LGEWLDGVKT
FSGQEGLPAV TAALKN
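
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the tool that produced the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, with its display spacing.
first_line = (
    "ATGACAAAGC GCTTTGCCCT TTCCGGTTTC "
    "CTCGGCTTGG TCATGCTGTG CGCGCCGCTG"
)
print(translate(first_line))  # MTKRFALSGFLGLVMLCAPL
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.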