Gene EcSMS35_4862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4862 
Symbol 
ID6145795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4970675 
End bp4972216 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content45% 
IMG OID641619666 
Productputative carnitine transporter CglC 
Protein accessionYP_001746773 
Protein GI170682254 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.905862 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATATT TAAAAAAACG CTTTTCTTTG ATTGAACTGA ATGTGTTTAT CCCAGCAATA 
TTATTTATTG CGGTAATCAT TCTGTGTTTG ACGATATATC CACAAGATAC CAGCCGGTAT
ATAAATAAAA TACATCATTT TTTAACCTGG GAAATGGGCG GGATTTTTCT GGTCATGACT
TTTCTGGTTG TACTTTGTTG TCTGTGGCTG GCCTTCTCAC GTTATGGCGA TATCTTGTTA
GGTCAGTCGG GAGAAAAGCC TGACTTCAGC TTGTTAACCT GGCTTGGACT TATTTTTACT
TCTGGAACGG GGGGCAGCTT GTTATATCTG GCCTCTGTAG AGTGGATTTG GATCATTCAG
CAACCGCCTT TTGGCGCGAC AGCAGGAAGC GCTCAGGCTG CTCGTTGGGC CTCTGCTTAT
GGCATGTTCC ATTGGGGGCC GTCCGCATGG GCATGGTATC TGATTTGTGC CGTTCCCATT
GGTTGGTTTA TGCATGTTAA GAAAACGAAC TCATTAAAGG TCAGTGATTT ATGCCGTGGG
TGTCTGGGGG CACGTGCTGA TGGTTTTTGC GGGCATTGTG TGAATTTTTT CTACATGTTT
GGTTTGCTCG GCGGCGCGGT AACGTCTCTG GCGCTGGGAA CGCCGATGAT TTCTGCCGTA
TTTTGCCATG TGTTCCATCT GGATCCTGCC GGGCAGTTTA TCAATGTCAT GGTTATTTTT
ATCTGGACGC TAGTGCCATT ATTAATTCTC TTTTTTGGAC TTAAAAAAGG TGTGGCATGG
GCCAGTAACT GGAATATTCG TGCCGATATT TTTATGCTAC TGGCAATACT GATTTGTGGA
CCGACAGCTT TTATACTTAA CCAATCAATT GATGGTTTCG GCCTGATGCT GCAAAATTTT
GTAGCGATGA GTTTAAGTAC CGATGCTATT GGTCGTAGCG GATTTCCACA GATGTGGACC
GTATTTTATT TTTCATGGTG GGTCGTGTAT GCCATCCCAT TCGGTTTATT TATCGCCCGT
ATTTCAAAAG GAAGGACGAT CCGGCAATTG ATTGTATGTG GAACTCTGGC AGGTTCATTG
GGATGCATGG TTTTTTACAT GGTACTGGCT AATTTCGGCT TATCGTTGCA GACAACTCAT
GTCATCGATT TTGTTCCCAT ACTTAACGAA CAAGGGCGAG GCGTTGTCGT TTCTCGTTTA
CTGGAGCAGC TACCCGCAAG TCAGGTGTTT TTGGTTGCTT TTGGGGCTAT AGCATTAATT
TCATATATTA CCGGACACTG TACTGTGGGT TATGCCCTCG GTTTTGCGAC GCAAAAACGG
GCAGATAGTG AGAGTGAACC GGCATTCTGG AACGTGGCAT TTTGGTTGAT TATGACCGGA
ATCGTCGCAA TCACACTCTA TCTTCTTGAT GCGCAAAGTC TGCAACCGCT ACAAACGGTC
TCTATCCTGG CCGGACTACC GCTTTGCGGC GTAGTGTTTA TTTTATTGAA GAGTTTTTTG
ACACAGCTTG CGGCTGAAGA GAAAACCGCG AGAGATGAAT AA
 
Protein sequence
MQYLKKRFSL IELNVFIPAI LFIAVIILCL TIYPQDTSRY INKIHHFLTW EMGGIFLVMT 
FLVVLCCLWL AFSRYGDILL GQSGEKPDFS LLTWLGLIFT SGTGGSLLYL ASVEWIWIIQ
QPPFGATAGS AQAARWASAY GMFHWGPSAW AWYLICAVPI GWFMHVKKTN SLKVSDLCRG
CLGARADGFC GHCVNFFYMF GLLGGAVTSL ALGTPMISAV FCHVFHLDPA GQFINVMVIF
IWTLVPLLIL FFGLKKGVAW ASNWNIRADI FMLLAILICG PTAFILNQSI DGFGLMLQNF
VAMSLSTDAI GRSGFPQMWT VFYFSWWVVY AIPFGLFIAR ISKGRTIRQL IVCGTLAGSL
GCMVFYMVLA NFGLSLQTTH VIDFVPILNE QGRGVVVSRL LEQLPASQVF LVAFGAIALI
SYITGHCTVG YALGFATQKR ADSESEPAFW NVAFWLIMTG IVAITLYLLD AQSLQPLQTV
SILAGLPLCG VVFILLKSFL TQLAAEEKTA RDE