Gene EcSMS35_2262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2262 
Symbol 
ID6145675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2283048 
End bp2284208 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content45% 
IMG OID641617137 
Productglycosyl transferase, group 2 
Protein accessionYP_001744310 
Protein GI170681557 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA AAGAGAGTAT AGCTGTTTCC GTTATTATCC CTGTGCACAA TGCGGCGGGA 
TATATTTCCG ACACATTGAG CACTGTTTTG TCGCAAACAT TAAATGATAT TGAAATCATT
ATTGTCAACG ATTGTTCTGA CGATAATACC TTAGAGATTG TCTCGGCGCT GGCTGAAACT
GATTCACGAA TTAGAGTCAT TAACAACACG ACAAATCTTG GTGGGGGAGG CTCAAGAAAC
ATCGGGTTGG ATGCTGCTAC CGGTGAGTAC GTTATTTTTC TTGATGACGA CGATTATGTC
GATAACATGA TGTTGGAGCG CATGTACGCA CGTGCTAGCG ATATACAAGC GGATGTCGTT
ATCTGTCGCA GCCAGTCTTT TGATCCCGCC TTGCAGGTTT ATGCTCCGAT GCCGTGGTCA
GTGCGCCAGG AGCTGTTACC TGATCTCAAG TTTTTTTCAT CCCAGGATAT TCCCTCTGAC
TTTTTCCGCA CCTTTGTCTG GTGGCCATGG GACAAGCTTC TCAGGCGTCA ATTTATTACT
AGTCATCAGC TTCGCTTTCA GGAAATCAGG ACGACCAACG ATCTTTTCTT CGTCTGCGCA
TTTATGCTCA TGGCGAACAG AATTTCGGTC TTAAATGAAA CGTTAATTTC TCATACCATC
AACCGTAGTG AATCCCTATC CGCCACGCGA GCAGAGTCGC ACCGCTGTGC TGTTGAAGCA
TTGGTGGCGC TTAAAGCGTT TATCTGTCAG CAGGGGATGA TGGAACACCG TCTCAGAGAT
TATAAAAACT ATGTTGTCGT GTTCCTTGAG TGGCATTTAA ACACGATTTC TGGTCCGGCA
TTTCACCCGT TTTATCAGCA AGTGAAAGAG TTTGTTGTTG CCCTGGATGC AAAGAGTGAT
GATTTTTATG ATGAATTTAT CGCCGCCGCC CATCATAGGA TCACCACGCT GTCAGCTGAA
GAGTACCTAT TCTCTCTTAA AGATCGGGTT CTGAAGGAAC TTGAGTTTTT CCAGGCCAGG
AGCTCCGCAT TACAGCAAGA AGTTGAGACG CTTACCCACT CATTAGCCGG GCAAAAGGAT
GAAAATGCGA TATTGCATAA TCAATTGCAT GAGATTGAAG AGCGGGTAAC GGCTTTGTTG
AATAAATCGA ACTTTTGCTG A
 
Protein sequence
MSEKESIAVS VIIPVHNAAG YISDTLSTVL SQTLNDIEII IVNDCSDDNT LEIVSALAET 
DSRIRVINNT TNLGGGGSRN IGLDAATGEY VIFLDDDDYV DNMMLERMYA RASDIQADVV
ICRSQSFDPA LQVYAPMPWS VRQELLPDLK FFSSQDIPSD FFRTFVWWPW DKLLRRQFIT
SHQLRFQEIR TTNDLFFVCA FMLMANRISV LNETLISHTI NRSESLSATR AESHRCAVEA
LVALKAFICQ QGMMEHRLRD YKNYVVVFLE WHLNTISGPA FHPFYQQVKE FVVALDAKSD
DFYDEFIAAA HHRITTLSAE EYLFSLKDRV LKELEFFQAR SSALQQEVET LTHSLAGQKD
ENAILHNQLH EIEERVTALL NKSNFC