Gene EcSMS35_0914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0914 
Symbol 
ID6147226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp922164 
End bp923321 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content57% 
IMG OID641615802 
Productquaternary amine ABC transporter permease 
Protein accessionYP_001742994 
Protein GI170682890 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.664933 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTTATT TACGTATTAA TCCTGTTCTG GCGCTGCTGC TGTTGCTGAC GGCAATCGCA 
GCGGCGCTGC CGTTTATCAG TTACGCGCCT AATCGTTTAG TCTCTGGTGA GGGGCGTCAT
CTCTGGCAGT TGTGGCCGCA AACGATCTGG ATGCTGGTGG GCGTATGTTG CGCCTGGCTG
ACAGCCTGTT TTATTCCCGG TAGAAAAGGC AGCATTTTTG CACTCATTCT GGCGCAATTC
GTCTTCGTAT TGCTGGTGTG GGGAGCTGGA AAGGCGGCGA CACAACTGGC GCAAAATGGC
AGTGCGCTGG CGCGTACCAG CCTCGGCAGT GGTTTCTGGC TGGCTGCGGC GCTGGCATTG
CTTGCCTGTA GCGATGCCAT CCGACGAATC TCCACGCATC CGCTGTGGCG CTGGTTGTTG
CATATGCAGA TTGCCATCAT TCCGCTGTGG TTGCTGTATT CCGGCGCGCT TAACGATCTC
TCGCTAATGA AAGAATACGC CAACCGTCAG GATGTGTTTG ATGACGCGCT GGCGCAACAT
CTGACGTTGC TGTTTGGTGC GGTGCTGCCT GCGTTAGTGA TTGGTGTGCC GTTGGGCATC
TGGTGCTACT TTTCCACCGC GCGGCAGGGG GCGATTTTTT CGCTACTCAA TGTCATTCAG
ACCGTGCCTT CGGTGGCGCT CTTTGGCCTG TTGATTGCGC CGCTTGCCGC GCTGGTTACG
GCCTTTCCGT GGCTGGGTAA GCTGGGCATA GCAGGAACCG GAATGACACC CGCACTGATT
GCGCTGGTGC TCTATGCCTT GCTGCCGCTG GTGCGCGGCG TGGTAGTCGG CTTGAACCAG
ATCCCGCGCG ATGTGCTGGA GAGCGCCAGA GCGATGGGGA TGAGCGGGGC GCAGCGATTC
CTGCATGTTC AGTTACCGCT GGCGTTACCG GTATTTTTGC GCAGCCTGCG GGTGGTGATG
GTGCAAACTG TAGGCATGGC GGTGATTGCG GCGTTAATCG GCGCAGGCGG TTTTGGTGCG
CTGGTTTTCC AGGGGCTGCT AAGCAGCGCC ATTGATTTAG TGTTGCTGGG GGTGATCCCG
GTAATTGTTC TGGCGGTGCT TACCGACGCG CTGTTCGATT TGCTTATCGC ACTGCTGAAG
GTGAAACGTA ATGATTGA
 
Protein sequence
MTYLRINPVL ALLLLLTAIA AALPFISYAP NRLVSGEGRH LWQLWPQTIW MLVGVCCAWL 
TACFIPGRKG SIFALILAQF VFVLLVWGAG KAATQLAQNG SALARTSLGS GFWLAAALAL
LACSDAIRRI STHPLWRWLL HMQIAIIPLW LLYSGALNDL SLMKEYANRQ DVFDDALAQH
LTLLFGAVLP ALVIGVPLGI WCYFSTARQG AIFSLLNVIQ TVPSVALFGL LIAPLAALVT
AFPWLGKLGI AGTGMTPALI ALVLYALLPL VRGVVVGLNQ IPRDVLESAR AMGMSGAQRF
LHVQLPLALP VFLRSLRVVM VQTVGMAVIA ALIGAGGFGA LVFQGLLSSA IDLVLLGVIP
VIVLAVLTDA LFDLLIALLK VKRND