Gene EcSMS35_1015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1015 
Symbol 
ID6147317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1034010 
End bp1035404 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content54% 
IMG OID641615902 
Productputative UDP-glucose lipid carrier transferase 
Protein accessionYP_001743094 
Protein GI170682437 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.742651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATC TAAAAAAGCG CGAGCGAGCG AAAACCAATG CATCGTTAAT CTCTATGGTG 
CAACGCTTTT CAGATATCAC CATCATGTTT GCCGGACTAT GGCTGGTTTG CGAAGTCAGC
GGACTGTCAT TCCTCTACAT GCACCTGTTG GTGGCACTGA TTACGCTGGT GGTGTTCCAG
ATGCTGGGCG GCATCACCGA TTTTTATCGC TCATGGCGCG GTGTTCGGGC AGCGACAGAA
TTTGCCCTGT TGCTACAAAA CTGGACCTTA AGCGTGATTT TCAGCGCTGG ACTGGTGGCG
TTCAACAATG ATTTCGACAC GCAACTGAAA ATCTGGCTGG CGTGGTATGG GCTGACCAGT
ATCGGACTGG TGGTTTGCCG TTCATGTATT CGCATTGGGG CGGGCTGGCT GCGTAATCAT
GGCTATAACA AGCGCATGGT CGCGGTGGCG GGGGATTTAG CCGCCGGACA AATGCTGATG
GAGAGCTTCC GTAATCAGCC GTGGTTAGGG TTTGAAGTGG TGGGCGTATA CCACGACCCA
AAACCGGGCG GCGTTTCTAA CGACTGGGCG GGCAATCTGC AACAACTGGT CGAGGATGCG
AAAGCGGGCA AGATTCATAA CGTCTATATC GCGATGCAGA TGTGCGACGG CGCGCGAGTG
AAAAAACTGG TCCATCAACT GGCGGACACC ACCTGTTCGG TGCTGCTGAT CCCCGATGTC
TTTACCTTCA ACATTCTCCA TTCACGCCTC GAAGAGATGA ATGGCGTACC GGTGGTGCCG
CTGTATGACA CGCCACTTTC CGGGGTTAAC CGCCTGATAA AACGTGCGGA AGACATTGTG
CTGGCGACGC TTATCCTGCT GCTGATCTCT CCGGTGCTGT GCTGTATTGC GCTGGCGGTG
AAACTCAGTT CGCCAGGGCC GGTTATTTTC CGCCAGACTC GCTACGGCAT GGATGGCAAA
CCGATCAAAG TGTGGAAGTT CCGTTCGATG AAAGTGATGG AGAACGACAA AGTGGTGACT
CAGGCGACGC AGAACGATCC GCGCGTCACC AAAGTGGGGA ACTTTCTGCG CCGCACCTCG
CTGGATGAAT TGCCGCAGTT TATCAATGTG CTGACCGGCG GGATGTCGAT TGTCGGGCCA
CGTCCACACG CGGTGGCGCA TAACGAACAG TATCGACAGC TCATTGAAGG CTACATGCTG
CGCCATAAGG TGAAACCGGG CATTACCGGC TGGGCGCAGA TTAACGGCTG GCGCGGCGAA
ACCGACACGC TGGAGAAAAT GGAAAAACGC GTCGAGTTCG ACCTTGAGTA CATCCGCGAA
TGGAGCGTCT GGTTCGATAT CAAAATCGTT TTCCTGACGG TATTCAAAGG TTTCGTTAAC
AAAGCGGCAT ATTGA
 
Protein sequence
MTNLKKRERA KTNASLISMV QRFSDITIMF AGLWLVCEVS GLSFLYMHLL VALITLVVFQ 
MLGGITDFYR SWRGVRAATE FALLLQNWTL SVIFSAGLVA FNNDFDTQLK IWLAWYGLTS
IGLVVCRSCI RIGAGWLRNH GYNKRMVAVA GDLAAGQMLM ESFRNQPWLG FEVVGVYHDP
KPGGVSNDWA GNLQQLVEDA KAGKIHNVYI AMQMCDGARV KKLVHQLADT TCSVLLIPDV
FTFNILHSRL EEMNGVPVVP LYDTPLSGVN RLIKRAEDIV LATLILLLIS PVLCCIALAV
KLSSPGPVIF RQTRYGMDGK PIKVWKFRSM KVMENDKVVT QATQNDPRVT KVGNFLRRTS
LDELPQFINV LTGGMSIVGP RPHAVAHNEQ YRQLIEGYML RHKVKPGITG WAQINGWRGE
TDTLEKMEKR VEFDLEYIRE WSVWFDIKIV FLTVFKGFVN KAAY