Gene EcolC_0734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0734 
Symbol 
ID6068703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp786762 
End bp788042 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content51% 
IMG OID641600139 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_001723735 
Protein GI170018781 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.797641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATGA GTGAAAATGA TACAATCCCA AAGAAGTCTA CAAGTCAGAT TAACAAAGCG 
GTATTCTTTA CATCTGCTTT GCTAATTTTC CTTCTTGTCG CCTTTGCCGC CGTATTCCCG
GATGTCGCCG ACAAAAATTT TAAACTACTT CAGCAACAAA TCTTCACGAA TGCCAGCTGG
TTCTACATCC TTGCTGTGGC CCTGATTTTA CTGAGTGTCA CGTTCCTTGG ACTCTCACGC
TACGGTGATA TCAAGCTGGG CCCGGACCAT GCGCAGCCTG ATTTCAGCTA CCACTCCTGG
TTTGCGATGC TTTTTTCGGC AGGGATGGGG ATCGGCCTGA TGTTCTTTGG CGTTGCCGAA
CCTGTAATGC ATTATCTTTC GCCACCCGTT GGCACTCCAG AAACCGTTGC GGCAGCTAAG
GAAGCAATGC GTCTGACCTT TTTCCACTGG GGACTGCACG CATGGGCAAT TTATGCCATT
GTGGCGCTGA TTCTGGCGTT CTTCAGTTAC CGTCACGGTC TGCCTTTAAC TCTGCGCTCC
GCACTCTATC CCATTATTGG CGATCGCATA TACGGACCTG TAGGACATGC GGTTGATATT
TTCGCTGTTA TAGGCACGGT CTTTGGCGTT GCGACATCAC TGGGTTACGG TGTTTTGCAG
GTGAATGCCG GTTTGAACCA TCTTTTCGGG GTGCCCATCA ATGAAACGGT GCAGGTAATT
CTGATCGTGG TCATCACGGG GTTAGCGACG ATTTCAGTGG TGTCCGGTCT GGATAAGGGA
ATACGTATCC TGTCTGAACT CAATCTGGGT CTGGCTTTGT TGCTCCTGGT GCTGGTCCTG
TGTCTGGGAC CAACCGTGCT TCTGCTGAAG TCATTTGTGG AAAATACGGG CGGTTATCTT
TCGGAACTGG TGAGTAAAAC GTTCAACCTT TACGCGTATG AGCCCAAGTC GAGCAACTGG
CTGGGGGGCT GGACATTACT GTACTGGGGA TGGTGGCTTT CATGGTCGCC GTTTGTGGGG
ATGTTCATCG CACGGGTCTC CCGCGGGCGA ACCATTCGCG AGTTTGTCAC CGGCGTGCTG
TTTGTTCCAG CGGGTTTTAC GCTAATGTGG ATGACGGTGT TTGGTAACAG CGCGATCTAT
CTCATTATGA ACCAGGGGGC CACAGACCTC GCCAATACTG TTCAGCAGGA TGTGTCGCTG
GCCCTGTTTA ATTTCCTGGA GCATTTCCCG TTCTCTTCTG TGCTGTCATT CATTGCAATG
GCGATGGTCA TCGTGGACTG A
 
Protein sequence
MIMSENDTIP KKSTSQINKA VFFTSALLIF LLVAFAAVFP DVADKNFKLL QQQIFTNASW 
FYILAVALIL LSVTFLGLSR YGDIKLGPDH AQPDFSYHSW FAMLFSAGMG IGLMFFGVAE
PVMHYLSPPV GTPETVAAAK EAMRLTFFHW GLHAWAIYAI VALILAFFSY RHGLPLTLRS
ALYPIIGDRI YGPVGHAVDI FAVIGTVFGV ATSLGYGVLQ VNAGLNHLFG VPINETVQVI
LIVVITGLAT ISVVSGLDKG IRILSELNLG LALLLLVLVL CLGPTVLLLK SFVENTGGYL
SELVSKTFNL YAYEPKSSNW LGGWTLLYWG WWLSWSPFVG MFIARVSRGR TIREFVTGVL
FVPAGFTLMW MTVFGNSAIY LIMNQGATDL ANTVQQDVSL ALFNFLEHFP FSSVLSFIAM
AMVIVD