Gene EcSMS35_2400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2400 
Symbol 
ID6145913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2448243 
End bp2449532 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content53% 
IMG OID641617273 
Productmajor facilitator transporter 
Protein accessionYP_001744445 
Protein GI170682702 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG CTTTGCTTGA CGCCGTGGTG AAGAAAAACC GCGCGCGTTT AATTCCGTTT 
ATGTTGGCGC TGTATGTGCT GGCGTTTCTC GACCGTTCGA ATATTGGTTT TGCCAAACAG
ACCTACCAGA TTGATACCGG GCTGAGTAAT GAAGCTTATG CGCTGGGAGC AGGCATTTTC
TTTGTGGTAT ACGCGTTTCT GGGGGTTCCG GCGAATCTTT TGATGCGCAA ACTGGGGGCA
AGAACCTGGA TTGGTACGAC AACACTGCTG TGGGGATTTC TTTCGGCTGC CATGGCATGG
GCCGATACTG AAGCGAAATT TCTGATTGTT CGCACTCTGC TTGGTGCTGC GGAGGCTGGG
TTTTTCCCTG GTATGATTTA TCTCACCTCG CAATGGTTTC CGCAGCGTAA TCGCGCCAGC
ATTATGGGGC TGTTCTATAT GGGCGCACCG CTGGCGTTAA CACTGGGATC ACCGCTTTCT
GGCGCGCTGC TGGAGATGCA TGGATTTATG GGGCATCCCG GCTGGTTCTG GATGTTTGTT
ATTGAAGGAT TGTTGGCAGT CGGCGCTGGG GTATTCACAT TCTTTTGGCT TGATGACACA
CCGGAGCAGG CACGTTTTCT GAGTAAAGAA GAAAAAACGT TGCTTATCAA CCAACTGGCA
AGTGAAGAAC AACAGAAAGT GACTTCCCGA CTGAGCGATG CGCTGCGTAA TGGGCGAGTC
TGGCAACTGG CGATTATCTA CCTGACCATT CAGGTAGCGG TTTACGGATT AATTTTCTTC
CTGCCGACCC AGGTTGCTGC ATTGCTGGGG ACAAAAGTGG GCTTTACGGC GTCGGTGGTC
ACCGCCATTC CGTGGGTTGC GGCCTTGTTT GGAACCTGGC TTATTCCGCG CTATTCCGAT
AAAACCGGCG AACGGCGTAA TGTCGCAGCG CTGACATTAC TGGCGGCGGG CATTGGTATT
GGTCTGTCCG GGCTGCTTTC TCCAGTACTG GCGATCGTAG CGCTGTGTGT TGCAGCCATC
GGGTTTATTG CCGTGCAGCC GGTGTTCTGG ACGATGCCGA CACAACTGCT TTCCGGTACG
GCGCTGGCTG CGGGGATTGG TTTTGTAAAC CTGTTTGGTG CAGTGGGCGG GTTTATTGCC
CCGATCCTGC GCGTGAAAGC AGAAACGTTA TTTTCCAGCG ATGCGGCGGG ATTACTGACG
CTGGCAGCGG TGGCGGTCAT CGGTTCGCTG ATTATTTTCA CTCTGCGTGT AAATCGCACT
GTTGCGCAGA CCGACGTGGC ACATCATTAA
 
Protein sequence
MSTALLDAVV KKNRARLIPF MLALYVLAFL DRSNIGFAKQ TYQIDTGLSN EAYALGAGIF 
FVVYAFLGVP ANLLMRKLGA RTWIGTTTLL WGFLSAAMAW ADTEAKFLIV RTLLGAAEAG
FFPGMIYLTS QWFPQRNRAS IMGLFYMGAP LALTLGSPLS GALLEMHGFM GHPGWFWMFV
IEGLLAVGAG VFTFFWLDDT PEQARFLSKE EKTLLINQLA SEEQQKVTSR LSDALRNGRV
WQLAIIYLTI QVAVYGLIFF LPTQVAALLG TKVGFTASVV TAIPWVAALF GTWLIPRYSD
KTGERRNVAA LTLLAAGIGI GLSGLLSPVL AIVALCVAAI GFIAVQPVFW TMPTQLLSGT
ALAAGIGFVN LFGAVGGFIA PILRVKAETL FSSDAAGLLT LAAVAVIGSL IIFTLRVNRT
VAQTDVAHH