Gene EcSMS35_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3884 
Symbol 
ID6143397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3952045 
End bp3953040 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content47% 
IMG OID641618710 
Productacyltransferase family protein 
Protein accessionYP_001745849 
Protein GI170680005 
COG category[S] Function unknown 
COG ID[COG3274] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCCA AAATTTACTG GATTGATAAC CTGCGAGGGA TAGCGTGTTT AATGGTGGTG 
ATGATTCACA CCACTACCTG GTATGTGACC AATGCTCATA GTGTTAGCCC CGTCACATGG
GATATCGCCA ATGTTCTGAA TTCTGCCTCT CGTGTCAGCG TGCCGCTATT TTTCATGATT
TCCGGCTATC TCTTTTTTGG CGAACGCAGC GCCCAGCCGC GCCATTTCTT GCGCATCGGC
TTATGCCTGC TGTTTTATAG CACTGTTGCG CTGCTCTACA TTGCGCTGTT TACCTCCATC
AATATGGAGT TAGCGCTGAA AAACCTGCTG CAAAAGCCAG TGTTTTACCA CTTGTGGTTT
TTCTTCGCGA TTGCGGTGAT TTATCTGGTT TCACCGCTGA TTCAGGTGAA GAACGTCGGC
GGAAAAATGT TGCTGGTGCT AATGGTGGTG ATTGGCATTA TCGCTAACCC AAACACAGTG
CCGCAGAAAA TTGACGGTTT TGAATGGCTG CCAATTAACT TATATATCAA TGGCGATACT
TTTTACTACA TCCTGTATGG CATGTTGGGC CGCGCTATAG GGATGATGGA CACACAGCAT
AAAGCACTGT CGTGGGTGTG CGCCGCACTG TTTGCGACGG GGGTATTTAT TATCTCTCGC
GGGACATTAT ATGAATTGCA GTGGCGCGGA AATTTTGCCG ATACCTGGTA TCTTTACTGT
GGGCCGATGG TTTTTATCTG CGCAATCACG CTATTGACTC TGGTTAAAAA CACGCTGGAT
ACACACACTG TTCCCGGGCT TGGGCTGATA TCACGCCATT CTTTAGGTAT ATACGGGTTC
CATGCATTGA TTATCCATGC GCTGCGTACC CGGGGGATTG AGCTTAAAAA CTGGCCAATA
CTCGATATTA TTTGGATCTT TTGCGCGACG TTGGCAGCGA GTTTGTTACT TTCTATGCTG
GTACAACGAA TCGACAGAAA CAGACTAGTG AGTTAA
 
Protein sequence
MQPKIYWIDN LRGIACLMVV MIHTTTWYVT NAHSVSPVTW DIANVLNSAS RVSVPLFFMI 
SGYLFFGERS AQPRHFLRIG LCLLFYSTVA LLYIALFTSI NMELALKNLL QKPVFYHLWF
FFAIAVIYLV SPLIQVKNVG GKMLLVLMVV IGIIANPNTV PQKIDGFEWL PINLYINGDT
FYYILYGMLG RAIGMMDTQH KALSWVCAAL FATGVFIISR GTLYELQWRG NFADTWYLYC
GPMVFICAIT LLTLVKNTLD THTVPGLGLI SRHSLGIYGF HALIIHALRT RGIELKNWPI
LDIIWIFCAT LAASLLLSML VQRIDRNRLV S