Gene EcSMS35_3951 details


Gene Information

Locus tag: EcSMS35_3951
Symbol:
ID: 6147352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4030264
End bp: 4031298
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 46%
IMG OID: 641618778
Product: putative glycosyl transferase
Protein accession: YP_001745917
Protein GI: 170679583
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
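
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(4030264, 4031298)
print(length)                     # 1035
print(protein_length_aa(length))  # 344
```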


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.174357
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.660654
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAACA GCACTAATAA ACTAAGTGTT ATTATCCCGT TATATAATGC GGGCGATGAT 
TTCAGCGCTT GCATGGAATC TTTAATCGCG CAAACCTGGA CTGCTCTGGA AATCATTATT
ATTAACGATG GTTCAACGGA TAATTCTGTT GAAATAGCAA AGCATTACGC AGAAAACTAT
CCGCACGTTC GTTTGTTGCA TCAGGCGAAT GCTGGCGCAT CGGTGGCGCG TAATCGTGGG
ATCGAGGTGG CGACGGGCAA ATATGTCGCT TTTGTCGATG CTGACGATGA GGTGTATCCC
ACCATGTACG AAACGCTGAT GACCATGGCG TTAGAGGACG ACCTCGACGT GGCGCAGTGC
AACGCTGATT GGTGTTTTCG TGAAACGGGT GAAACCTGGC AATCCATCCC TACCGATCGC
CTTCGCTCAA CCGGTGTATT AACCGGTCCG GACTGGCTGC GGATGGGGCT TTCTTCCCGC
CGTTGGACAC ACGTGGTCTG GATGGGGGTT TATCGCCGCG ATGTTATTGT TAAAAATAAC
ATTAAATTTA TTGCCGGATT ACATCATCAG GATATTGTCT GGACAACAGA ATTTATGTTT
AACGCGCTGC GTGCGCGATA TACCGAGCAA TCATTATATA AATATTATCT GCATAATACG
TCAGTGAGTC GGTTGAACAG ACAAGGGAAC AAAAACCTTA ATTATCAACG TCACTATATT
AAGATTACCC GCCTGCTGGA GAAATTAAAT CGAAATTATG CCGACAAAAT TACGATTTAT
CCGGAATTTC ATCAGCAAAT AACCTACGAA GCATTGCGTG TTTGCCATGC GGTGCGCAAA
GAGCCGGATA TTATTACCCG CCAACGGATG ATTGCCGAGA TATTTACTTC CGGTATGTAT
AAGCGCCTGA TTACCAATGT GCGCAGCGTG AAGGTGGGTT ATCAGGCGTT ACTGTGGTCT
TTCCGCTTAT GGCAATGGCG CGACAAAACG CGGTCGCACC ATCGCATTAC GCGTAGCGCC
TTTAATTTGC GCTAG
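
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before its length or GC content can be computed. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first line of the sequence is used as a demonstration, so the printed percentage is not the gene-wide GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a demonstration.
first_line = "ATGATGAACA GCACTAATAA ACTAAGTGTT ATTATCCCGT TATATAATGC GGGCGATGAT"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` in the same way gives a value to compare against the GC content field of the record.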
 
Protein sequence
MMNSTNKLSV IIPLYNAGDD FSACMESLIA QTWTALEIII INDGSTDNSV EIAKHYAENY 
PHVRLLHQAN AGASVARNRG IEVATGKYVA FVDADDEVYP TMYETLMTMA LEDDLDVAQC
NADWCFRETG ETWQSIPTDR LRSTGVLTGP DWLRMGLSSR RWTHVVWMGV YRRDVIVKNN
IKFIAGLHHQ DIVWTTEFMF NALRARYTEQ SLYKYYLHNT SVSRLNRQGN KNLNYQRHYI
KITRLLEKLN RNYADKITIY PEFHQQITYE ALRVCHAVRK EPDIITRQRM IAEIFTSGMY
KRLITNVRSV KVGYQALLWS FRLWQWRDKT RSHHRITRSA FNLR
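
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this under stated assumptions: it uses the codon-to-amino-acid assignments of translation table 11 (which are the same as those of the standard code), reads in frame from the first base, and stops at the first stop codon. It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. `translate` and the table-building code are illustrative, not taken from any particular library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest);
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1 up to the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence from this record.
print(translate("ATGATGAACA GCACTAATAA ACTAAGTGTT"))  # MMNSTNKLSV
print(translate("TTTAATTTGC GCTAG"))                  # FNLR
```

The two fragments match the first ten and last four residues of the protein sequence listed above.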