Gene EcSMS35_1252 details

Gene Information

Locus tag: EcSMS35_1252
Symbol:
ID: 6144581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1249895
End bp: 1251100
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 55%
IMG OID: 641616130
Product: putative inner membrane protein
Protein accession: YP_001743313
Protein GI: 170683280
COG category: [R] General function prediction only
COG ID: [COG2391] Predicted transporter component
TIGRFAM ID:
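
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1249895
END_BP = 1251100
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the coordinates, the gene length, and the protein length are mutually consistent.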


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.480893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.396989
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATGGC AGCAATTCAA ACACGCCTGG TTGATTAAAT TCTGGGCGCC CATCCCCGCG 
GTCATCGCGG CGGGTATTCT CTCCACTTAC TATTTTGGCA TTACTGGCAC CTTTTGGGCT
GTCACGGGTG AATTTACCCG TTGGGGCGGC CAGCTCCTGC AACTGTTCGG CGTCCATGCT
GAAGAGTGGG GTTACTTTAA AATTATCCAT CTGGAAGGAT CGCCATTAAC CCGCATCGAC
GGGATGATGA TCCTCGGTAT GTTTGGCGGC TGCTTTGCCG CAGCGCTGTG GGCCAACAAT
GTCAAACTGC GAATGCCGCG CAGCCGTATC CGCATTATGC AGGCCATCAT TGGCGGCATT
ATCGCCGGTT TTGGCGCGCG TCTGGCAATG GGCTGTAACC TGGCGGCGTT CTTTACCGGT
ATTCCTCAGT TCTCGCTGCA TGCCTGGTTC TTTGCCATCG CCACTGCCAT TGGTTCATGG
TTTGGCGCGC GCTTTACCCT GCTGCCCATC TTCCGTATTC CCGTGAAAAT GCAGAAAGTT
TCTGCCGCCT CACCGCTGAC GCAAAAACCG GATCAGGCGC GGCGTCGTTT TCGTCTCGGG
ATGCTGGTCT TTTTCGGCAT GCTGGGCTGG GCGCTGCTCA CAGCGATGAA CCAACCCAAA
CTGGGGCTGG CAATGCTGTT TGGCGTCGGC TTTGGTTTAC TGATTGAACG TGCGCAAATC
TGCTTTACTT CGGCGTTCCG CGATATGTGG ATCACCGGAC GTACCCATAT GGCGAAAGCA
ATCATTATTG GTATGGCGGT GAGTGCCATC GGGATCTTCA GTTACGTACA GTTAGGCGTT
GAACCCAAAA TCATGTGGGC GGGACCAAAC GCGGTAATTG GTGGTTTACT GTTTGGTTTT
GGCATCGTGC TGGCTGGCGG CTGCGAAACC GGCTGGATGT ACCGCGCGGT AGAAGGCCAG
GTGCACTACT GGTGGGTCGG TCTGGGCAAT GTGATCGGCT CAACGATTCT GGCGTATTAC
TGGGATGATT TCGCTCCGGC GCTGGCCACC GACTGGGACA AAATCAACCT GCTGAAAACC
TTTGGTCCGA TGGGGGGCCT GCTGGTGACA TATTTGCTGT TGTTTGCTGC GCTGATGTTG
ATTATCGGCT GGGAAAAACG CTTCTTCCGC CGTGCGGCAC CGCAGACTGC TAAGGAGATC
GCATGA
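
The GC content reported for this gene can be recomputed from the sequence itself. The following is a minimal sketch, assuming the sequence is pasted as whitespace-separated blocks of A, C, G, and T as shown in this record. `gc_percent` is a hypothetical helper name, and how the database rounds its figure is not stated here.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First 10-base block of the gene sequence, used only as a small demo.
    print(gc_percent("ATGTCATGGC"))
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared against the GC content field of the record.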
 
Protein sequence
MSWQQFKHAW LIKFWAPIPA VIAAGILSTY YFGITGTFWA VTGEFTRWGG QLLQLFGVHA 
EEWGYFKIIH LEGSPLTRID GMMILGMFGG CFAAALWANN VKLRMPRSRI RIMQAIIGGI
IAGFGARLAM GCNLAAFFTG IPQFSLHAWF FAIATAIGSW FGARFTLLPI FRIPVKMQKV
SAASPLTQKP DQARRRFRLG MLVFFGMLGW ALLTAMNQPK LGLAMLFGVG FGLLIERAQI
CFTSAFRDMW ITGRTHMAKA IIIGMAVSAI GIFSYVQLGV EPKIMWAGPN AVIGGLLFGF
GIVLAGGCET GWMYRAVEGQ VHYWWVGLGN VIGSTILAYY WDDFAPALAT DWDKINLLKT
FGPMGGLLVT YLLLFAALML IIGWEKRFFR RAAPQTAKEI A
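
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an assumption-labeled illustration, not the database's own procedure: it uses the standard codon assignments, which translation table 11 shares with table 1, and it does not model the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Whitespace is ignored. A codon containing a character other than
    A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGTCATGGC AGCAATTCAA ACACGCCTGG"))
```

The first thirty bases of the gene translate to MSWQQFKHAW, matching the first block of the protein sequence listed above.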