Gene EcSMS35_0391 details


Gene Information

Locus tag: EcSMS35_0391
Symbol:
ID: 6143894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 405115
End bp: 406311
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 50%
IMG OID: 641615287
Product: glycosyl transferase, group 2 family protein
Protein accession: YP_001742494
Protein GI: 170680221
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
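
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 405115
end_bp = 406311

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1197 398
```

Both results agree with the Gene Length (1197 bp) and Protein Length (398 aa) fields.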


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.105387
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCT GGATATTTAT CTGTATGGCC GTAGCAATAT TGCTATGGTT CCTGAGTACG 
TTAAGGCGTA AGCCCAGCCA AAAAAAAGGC TGTATTGACG CCATTATACC TGCTTATAAC
GAAGGCCCGT GTCTGGCGCA GTCACTGGAT AATCTGCTGC GTAACCCTTA TTTTTGCCGG
GTAATTTGCG TTAACGACGG CTCCACGGAC AATACCGAAG CGGTCATGGC GGAAGTCAAA
CGCAAATGGG GCGACCGCTT TATTGCCGTC ACGCAAAAAA ATACCGGTAA AGGTGGTGCG
CTGATGAATG GCCTCAATTA CGCCACCTGC GACCAGGTTT TTTTAAGTGA TGCCGACACC
TATGTTCCGC CCGATCAAGA CGGAATGGGC TATATGCTGG CAGAAATAGA GCGCGGTGCT
GATGCCGTAG GCGGCATTCC CTCTACTGCG TTGAAAGGCG CGGGTCTGTT ACCGCACATC
CGCGCGACCG TAAAGTTGCC GATGATTGTT ATGAAGCGCA CGCTACAGCA GCTCCTGGGC
GGCGCACCGT TTATTATCAG CGGTGCCTGC GGGATGTTCC GTACTGATGT ATTGCGTAAG
TTCGGTTTCT CTGATCGTAC TAAAGTCGAA GACCTTGATC TCACCTGGAC ATTGGTGGCA
AACGGCTATC GTATTCGGCA GGCGAATCGC TGCATCGTAT ACCCACAGGA ATGCAACAGC
CCGCGTGAGG AATGGCGTCG CTGGCGGCGT TGGATTGTGG GCTACGCGGT CTGTATGCGC
CTGCATAAAA GACTTTTATT TAGCCGCTTC GGTATCTTCA GTATATTTCC TATGCTGTTG
GTTGTGCTTT ATGGCGTTGG GATTTATCTC ACTACCTGGT TTAATGAATT CATCACCACC
GGGCCGCATA GTGTGGTGTT GGCAATGTTT CCGCTTATCT GGGTCGGCGT AGTTTGTGTT
ATTGGTGCTT TTAGCGCCTG GTTTCATCGT TGCTGGTTGT TGGTGCCTTT AGCGCCGCTT
TCCGTTGTGT ATGTATTATT AGCTTATGCC ATCTGGATTA TTTATGGACT TATTGCCTTT
TTTACTGGAC GCGAACCTCA GCGCGACAAA CCCACCCGCT ATTCCGCACT GGTGGAAGCG
TCAACCGCTT ATTCCCAACC TTCTGTCACA GGAACTGAAA AACTTTCTGA AGCTTAA
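
The gene sequence is printed in space-separated blocks of 10 bases, which must be joined before any analysis. As a minimal sketch, the helpers below (`clean_sequence` and `gc_content` are hypothetical names, not part of any library) strip the whitespace and compute the GC percentage. Only the first row of the sequence is used here for brevity; running it on the full sequence would be the way to compare against the reported GC content field.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as printed in the record.
first_row = "ATGAAAACCT GGATATTTAT CTGTATGGCC GTAGCAATAT TGCTATGGTT CCTGAGTACG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_content(seq), 1))
```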
 
Protein sequence
MKTWIFICMA VAILLWFLST LRRKPSQKKG CIDAIIPAYN EGPCLAQSLD NLLRNPYFCR 
VICVNDGSTD NTEAVMAEVK RKWGDRFIAV TQKNTGKGGA LMNGLNYATC DQVFLSDADT
YVPPDQDGMG YMLAEIERGA DAVGGIPSTA LKGAGLLPHI RATVKLPMIV MKRTLQQLLG
GAPFIISGAC GMFRTDVLRK FGFSDRTKVE DLDLTWTLVA NGYRIRQANR CIVYPQECNS
PREEWRRWRR WIVGYAVCMR LHKRLLFSRF GIFSIFPMLL VVLYGVGIYL TTWFNEFITT
GPHSVVLAMF PLIWVGVVCV IGAFSAWFHR CWLLVPLAPL SVVYVLLAYA IWIIYGLIAF
FTGREPQRDK PTRYSALVEA STAYSQPSVT GTEKLSEA
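
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds a codon table from the NCBI compact amino-acid string used for translation table 11 and translates until the first stop codon. It is a simplified illustration: alternative start codons are not given special treatment (this gene begins with ATG), ambiguous bases are not handled, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAAAACCT GGATATTTAT CTGTATGGCC"))  # MKTWIFICMA
```

The output matches the first block of the protein sequence listed above.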