Gene EcSMS35_0041 details

Gene Information

Locus tag: EcSMS35_0041
Symbol: caiT
ID: 6144631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 45596
End bp: 47110
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 52%
IMG OID: 641614942
Product: L-carnitine/gamma-butyrobetaine antiporter
Protein accession: YP_001742158
Protein GI: 170680951
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
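
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(45596, 47110)
print(length)                  # 1515
print(protein_length(length))  # 504
```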


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.206145
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAATG AAAAGAGAAA AACGGGAATA GAACCGAAGG TTTTCTTTCC GCCATTAATA 
ATCGTCGGCA TACTTTGTTG GCTTACAGTC AGAGATCTGG ATGCGGCGAA TGTCGTTATT
AATGCTGTAT TCAGTTACGT CACCAATGTA TGGGGATGGG CATTTGAATG GTATATGGTA
GTGATGCTTT TCGGTTGGTT CTGGCTGGTG TTTGGCCCGT ATGCCAAAAA GCGTTTAGGT
AACGAACCAC CTGAATTTAG TACCGCCAGT TGGATCTTTA TGATGTTCGC CTCCTGTACG
TCTGCTGCCG TACTGTTCTG GGGATCGATT GAGATCTACT ACTACATCTC CACCCCGCCG
TTTGGCTTAG AACCGAACTC GACAGGGGCG AAAGAGTTGG GGCTGGCTTA CAGCTTGTTC
CACTGGGGAC CTCTGCCGTG GGCCACTTAC AGCTTCCTTT CTGTCGCCTT CGCTTACTTC
TTCTTTGTCC GCAAAATGGA AGTGATTCGC CCCAGCTCCA CCCTGGTGCC GCTGGTGGGT
GAAAAACACG CCAAAGGGTT GTTCGGTACC ATCGTCGACA ACTTCTATCT CGTCGCCTTG
ATTTTCGCGA TGGGTACCAG TCTGGGCCTT GCCACGCCGC TGGTGACCGA GTGTATGCAA
TGGTTGTTTG GCATTCCGCA TACCCTGCAA CTGGACGCTA TCATCATTAC CTGCTGGATT
ATCCTCAACG CCATTTGCGT CGCCTGCGGT CTGCAAAAAG GGGTACGTAT CGCCAGTGAC
GTGCGTAGTT ACCTGAGCTT CCTGATGCTG GGTTGGGTGT TCATTGTCAG CGGTGCCAGC
TTCATCATGA ACTACTTCAC CGATTCGGTG GGGATGTTGC TGATGTATCT GCCGCGCATG
TTGTTCTATA CCGATCCCAT CGCTAAAGGC GGCTTCCCGC AGGGCTGGAC CGTGTTCTAC
TGGGCATGGT GGGTGATTTA CGCTATCCAG ATGAGTATCT TCCTCGCCCG CATCTCCCGT
GGTCGTACCG TGCGTGAACT GTGCTTCGGC ATGGTGCTGG GGCTGACAGC GTCAACCTGG
ATCCTGTGGA CTGTACTCGG TAGTAACACT CTGCTGTTGA TGGATAAAAA CATCATCAAC
ATTCCAAATC TGATCGAACA GTACGGTGTG GCGCGCGCCA TCATCGAAAC CTGGGCCGCT
CTGCCACTCA GCACCGCCAC CATGTGGGGC TTCTTCATCC TCTGCTTTAT TGCCACCGTT
ACGCTGGTTA ACGCCTGCTC TTATACCCTG GCGATGTCCA CTTGCCGCGA AGTACGCGAT
GGTGAAGAAC CACCGCTGCT GGTGCGTATC GGCTGGTCAA TTCTGGTTGG CATTATCGGT
ATTGTTCTGC TGGCGCTCGG CGGCCTGAAA CCGATTCAAA CCGCCATTAT CGCCGGAGGA
TGCCCGCTGT TCTTCGTCAA CATTATGGTG ACGCTCTCCT TTATTAAAGA CGCGAAACAG
AACTGGAAAG ATTAA
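
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation such as the GC content reported in the gene information. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGAAGAATG AAAAGAGAAA AACGGGAATA GAACCGAAGG TTTTCTTTCC GCCATTAATA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_content(seq), 2))  # GC fraction of this first line only
```

To compare against the reported GC content, the whole sequence would have to be passed through `clean_sequence` rather than a single line.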
 
Protein sequence
MKNEKRKTGI EPKVFFPPLI IVGILCWLTV RDLDAANVVI NAVFSYVTNV WGWAFEWYMV 
VMLFGWFWLV FGPYAKKRLG NEPPEFSTAS WIFMMFASCT SAAVLFWGSI EIYYYISTPP
FGLEPNSTGA KELGLAYSLF HWGPLPWATY SFLSVAFAYF FFVRKMEVIR PSSTLVPLVG
EKHAKGLFGT IVDNFYLVAL IFAMGTSLGL ATPLVTECMQ WLFGIPHTLQ LDAIIITCWI
ILNAICVACG LQKGVRIASD VRSYLSFLML GWVFIVSGAS FIMNYFTDSV GMLLMYLPRM
LFYTDPIAKG GFPQGWTVFY WAWWVIYAIQ MSIFLARISR GRTVRELCFG MVLGLTASTW
ILWTVLGSNT LLLMDKNIIN IPNLIEQYGV ARAIIETWAA LPLSTATMWG FFILCFIATV
TLVNACSYTL AMSTCREVRD GEEPPLLVRI GWSILVGIIG IVLLALGGLK PIQTAIIAGG
CPLFFVNIMV TLSFIKDAKQ NWKD
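
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is illustrative only: it builds the codon table from the standard genetic code, whose amino-acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are assumptions made for this example.

```python
from itertools import product

# Standard genetic code in TCAG order; translation table 11 uses the same
# amino-acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the spaces between ten-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence from the listing above.
print(translate("ATGAAGAATG AAAAGAGAAA AACGGGAATA"))  # MKNEKRKTGI
print(translate("AACTGGAAAG ATTAA"))                  # NWKD
```

Both outputs match the first ten and last four residues of the protein sequence listed above.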