Gene EcSMS35_2266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2266 
Symbol 
ID6144263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2287536 
End bp2289428 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content39% 
IMG OID641617141 
Productglycosyl transferase family protein 
Protein accessionYP_001744314 
Protein GI170679598 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGTA TCAAAATTTA TACATGTCAC CACAAACCCA GTGCTTTTCT AAATGCCTCT 
ATTATTAAAC CACTGCATGT CGGGAAGGCT AATACTTATA ATGATATTGG CTGTGAAGGG
GATGATAGCG GAGACAACAT TTCCTTTAAA AACCCCTTCT ATTGTGAGCT GACGGCCCAC
TACTGGGTGT GGAAGAATGA ATCTCTGGCT GACTATGTGG GATTCATGCA TTACCGCAGA
CACTTGAACT TTGCAGAACA ACAAAATCAT CCTGAGGATA ACTGGGGGGT AGTCAACTAC
CCGCTCATCA ATGCCGAATA CGAAAGCCAG TTTGGATTAA GTGATGAATC CATAAGCACC
TGCGTCGATG GATATGATCT GCTGTTACCC AAAAAATGGT CGGTAACATC GGCAGGAAGT
AAGAATAACC TTGATCATTA TGCAAAAGGT GAGTTTTTAC ACATTAAAGA CTATCAGTCT
GCCCTGGATG TCGTTGAAGA GCTGTATCCA CAATATAAAG CTGCGATTCA ACAATTCAAT
AATGCAACTG ATGGTTATTA TACGAACATG TTTGTTATGC GCAAAGACAT GTTCCTGGAT
TATTCTGAAT GGCTTTTTGC TATTTTGTCT AATCTTGAAG ATCGGATTTC GATGAATAAC
TACAACGCTC AAGAAAAACG AGTTATAGGA CACATTGCTG AGCGTTTATT CAATATCTAT
ATCATCAAAT GTCAACAAGA TAAACAGCTA AAAATAAAAG AACTACAGCG CACGTTTGTT
ACTGCTGAAA CATTTAATGG TAAATTAAAG CCCGTTTTTG ACGAAAGCGT ACCGGTTGTT
ATTAGTTTTG ATAATAACTA TGCATTAAGT GGCGGGGCAT TAATCAATTC AATTGTTCTC
CATTCTGATG CGAGCAGAAA CTATGATATC GTTGTTCTGG AAAATAAGGT CAGTCATTTA
AATAAACAAC GCCTTATCAA GCTAGTTGCT GGTCATAACA ACATATCATT GCGCTTTTTT
GATGTGAATT CATTCACTGA GATGAGTGAT GTTCATACCC GTGCACATTT TAGTGCGTCG
ACCTATGCGC GCTTGTTCAT CCCGCAACTT TTCCGCGAGT ATAAAAAAGT TGTGTTTATC
GATTCTGACA CCGTGGTGAA GGCTGATTTG GCGACACTTC TGGATGTCGA GATCGGCACT
AACCTGGTTG CCGCTGTTAA AGACATCGTC ATGGAAGGGT TTGTGAAGTT TGGTACCATG
TCAGAATCAG ATGATGGCAT TATGCCTGCA GAGCAATACC TGAAAAAGAC ATTAGGAATG
ACTAATCCTG ACGAATATTT TCAGGCCGGG ATTATTGTTT TTAATGTCGA ACAAATGGTT
ACAGAGAATA CCTTTGCTCA ATTGATGTCA GCATTGAAAG CCAAAAAGTA TTGGTTCTTA
GATCAGGATA TCATGAACAA AGTCTTCTTT GGCCGAGTCA AATTCTTACC ATTAGAATGG
AACGTGTACC ATGGTAATGG TAATACCGAT GATTTCTTCC CGAATCTCAA GTTTTCAACC
TATATGCGCT TTTTGCAAGC CAGAAGAAAT CCAAAAATGA TTCACTATGC GGGTGAAAAC
AAACCATGGA ATACTGAGAA AGTCGATTTC TATGATGATT TTCTTGAGAA TGTTTTAAGT
ACGCCATGGG AGAAAGAAAT CTACTATCGC CAGTTACCTG TGGCCACGGT AGTACCTAAC
CAACATACTG AACTGCAGCA AACCGTGTTA CTGCAGACAA AGATTAAAAG AGCTTTAATG
CCATATGTTA ACAAATATGC TCCTGTCGGT TCGCCAAGAA GAAATAAGCT CATCAAATAT
TATTATAAAG TTCGCCGTTC GATTCTTGGC TAA
 
Protein sequence
MNSIKIYTCH HKPSAFLNAS IIKPLHVGKA NTYNDIGCEG DDSGDNISFK NPFYCELTAH 
YWVWKNESLA DYVGFMHYRR HLNFAEQQNH PEDNWGVVNY PLINAEYESQ FGLSDESIST
CVDGYDLLLP KKWSVTSAGS KNNLDHYAKG EFLHIKDYQS ALDVVEELYP QYKAAIQQFN
NATDGYYTNM FVMRKDMFLD YSEWLFAILS NLEDRISMNN YNAQEKRVIG HIAERLFNIY
IIKCQQDKQL KIKELQRTFV TAETFNGKLK PVFDESVPVV ISFDNNYALS GGALINSIVL
HSDASRNYDI VVLENKVSHL NKQRLIKLVA GHNNISLRFF DVNSFTEMSD VHTRAHFSAS
TYARLFIPQL FREYKKVVFI DSDTVVKADL ATLLDVEIGT NLVAAVKDIV MEGFVKFGTM
SESDDGIMPA EQYLKKTLGM TNPDEYFQAG IIVFNVEQMV TENTFAQLMS ALKAKKYWFL
DQDIMNKVFF GRVKFLPLEW NVYHGNGNTD DFFPNLKFST YMRFLQARRN PKMIHYAGEN
KPWNTEKVDF YDDFLENVLS TPWEKEIYYR QLPVATVVPN QHTELQQTVL LQTKIKRALM
PYVNKYAPVG SPRRNKLIKY YYKVRRSILG