Gene EcSMS35_1026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1026 
Symbol 
ID6147290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1048051 
End bp1049076 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content33% 
IMG OID641615913 
Productglycosyl transferase, group 1 
Protein accessionYP_001743105 
Protein GI170682395 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAG CGATGATAAT ACCATCTTTA AAAAATACCG CGCCAAATAA TATAGCCCTT 
TCAATATGTG AAGGGGCAAG GAAGCATGGT GTCAATATAG ATATTTATTA TCTTGATGAA
ATTGTAGAGT TGGATATAGA CGAGTCAGAA CTGAATATAA ATAGGCTAAA TGATAGAACC
ATTTTAAACA GTTATGATAT TGTTCACTCT CATGGCTTAA GACCCGATGT TATTAATTGT
AAGACCAGCA AAAATGTCAT TAATGTAAGT ACGCTTCATA GTTTTATTTT AGATGATCTT
AAGAATAAAT ATGGAATAAA AGGATATCTG ATCGCTTTTG CTTGGCTTAA ATTATTAAAG
CGATTTGATT CATGTATTGC AATTAGTGAG GCGACTAAAC GGTTTTATGT TAAACATGGT
CTTAAATGCC ACCATATATA TAATGGAATC GACTTTCCTG ACATCGAATT ATATCAAGAT
AACTCAAGGC TAAATAACGG CAAAATAAAT ATAGTATCGA TCGCGCATCT GGAAAAAATA
AAAGGCTTAG AGCAACTCCT ATATCTAGCC GCGGAGCGAG AAGAGTTCCA TGTTCATATA
ATTGGTGAAG GAACTTACCG TGGGAAATTA ACAAATATTA TTGATAAGTT TGATTTATCT
CAGCGTGTAA CTTTGCATGG ATATATATCA AATGCGAGTT CTATGTTGGG GAAATTTGAT
GTTTATGTGC AACCATCTAA AAGTGAGGGG TTTGGCATTG CAGTCATTGA AGCTCTTGTA
AATAAAATCC CAACAGTTTG CTCAGACATT GAAGTGTTCA GAGAGTTATT TGGCTCAGGA
GAAGTTGAAT TTTTCAAGTT AGACGATATT AATTCACTTT ACGATGCAAT TTCAAGTGCA
TTAACCAAGA CAGATATGTC TCGAGATGCA AGTGCCACAG CGATAAGCCA AAAATTTTCT
TCAGAAGTGA TGTCATTGAA CTATTTAAGT TGGTATAAAA AATTATATGA AGAAAGAAAT
TTATAA
 
Protein sequence
MKIAMIIPSL KNTAPNNIAL SICEGARKHG VNIDIYYLDE IVELDIDESE LNINRLNDRT 
ILNSYDIVHS HGLRPDVINC KTSKNVINVS TLHSFILDDL KNKYGIKGYL IAFAWLKLLK
RFDSCIAISE ATKRFYVKHG LKCHHIYNGI DFPDIELYQD NSRLNNGKIN IVSIAHLEKI
KGLEQLLYLA AEREEFHVHI IGEGTYRGKL TNIIDKFDLS QRVTLHGYIS NASSMLGKFD
VYVQPSKSEG FGIAVIEALV NKIPTVCSDI EVFRELFGSG EVEFFKLDDI NSLYDAISSA
LTKTDMSRDA SATAISQKFS SEVMSLNYLS WYKKLYEERN L