Gene EcSMS35_4877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4877 
Symbol 
ID6143466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4992126 
End bp4993304 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content55% 
IMG OID641619681 
Productmajor facilitator transporter 
Protein accessionYP_001746788 
Protein GI170680267 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.429993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCGA CATCGCATCC CGTAGAACGT TTTTCTTTCA GCACCGCGTT ATTCGGGATG 
CTGGTTCTGA CCTTAGGTAT GGGTTTAGGC CGCTTTCTCT ATACGCCGAT GCTGCCAGTC
ATGCTGGCGG AAGGCGAATT TTCCTTTAGC GAACTCTCAT GGATCGCCAG TGGTAACTAT
GCCGGGTATC TGGCGGGGAG CCTGCTGTTT TCATTCGGCG CGTTTCATTT ACCCTCACGC
CTGCGCCCGT TCTTGTTAGC TTCCGCCCTC GCAACCGGAT TATTAATCCT CGCGATGGCG
TGGCTGCCGC CGTTTCTTCT GGTTTTCATC ATTCGCTTTC TGGCGGGGGT CGCCAGCGCC
GGGATGTTGA TTTTCGGCTC AACGCTCATC ATGCAGCATA CTCGTCATCC CTTTGTCCTT
GCGGCGCTAT TTTCTGGTGT TGGCGTCGGC ATCGCTCTGG GCAATGAATA TGTGCTGGCA
GGCCTGCATT TTGCCCTCTC TTCACAAACG TTGTGGCAAG GTGCCGGAGC ACTTTCTGCC
ATTATATTGC TTGCTCTGGC GCTGCTCATC CCGTCGAATA AACACGTTAT CCCGCCAGCG
CCATTGGCAA AAATCGCGCA ACAACCCATG AGCTGGTGGT TACTGGCGAT TCTGTATGGT
CTGGCGGGTT TTGGTTATAT CATCGTCGCC ACCTACCTGC CGCTCATGGC GAAAGACGCG
GGCCAGCCTG TGCTGACGGC TCACCTCTGG ACACTGGTCG GCTTGTCGAT TGTCCCAGGT
TGCTTTGGCT GGCTGTGGGC AGCCAAACGG TGGGGAGCAT TACCTTGCCT GACCGCGAAT
TTGCTGGTGC AGGCGATCTG CGTGCTGTTA ACCCTCGCCA GCAGCTCTCC TTTATTACTC
ATCATCAGCA GTATTGGTTT TGGCGGCACC TTTATGGGAA CGACCTCGCT GGTGATGACC
ATCGCCCGCC AGCTTAGCGT GCCGGGAAAT CTTAACCTTT TGGGCTTTGT GACACTCATT
TATGGTATCG GGCAAATTCT TGGCCCGGCG CTGACCAGTA TGCTCGGCAA CGGAACGTCG
GCCCTCGCCA GTGCCACGCT CTGCGGCGCG GCGGCGCTAT TTATCGCAGC ATTAATCTGC
GGGATGCAAA TATTCAAATT GCATACGAAT GATTCTTAA
 
Protein sequence
MNSTSHPVER FSFSTALFGM LVLTLGMGLG RFLYTPMLPV MLAEGEFSFS ELSWIASGNY 
AGYLAGSLLF SFGAFHLPSR LRPFLLASAL ATGLLILAMA WLPPFLLVFI IRFLAGVASA
GMLIFGSTLI MQHTRHPFVL AALFSGVGVG IALGNEYVLA GLHFALSSQT LWQGAGALSA
IILLALALLI PSNKHVIPPA PLAKIAQQPM SWWLLAILYG LAGFGYIIVA TYLPLMAKDA
GQPVLTAHLW TLVGLSIVPG CFGWLWAAKR WGALPCLTAN LLVQAICVLL TLASSSPLLL
IISSIGFGGT FMGTTSLVMT IARQLSVPGN LNLLGFVTLI YGIGQILGPA LTSMLGNGTS
ALASATLCGA AALFIAALIC GMQIFKLHTN DS