Gene EcSMS35_1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1013 
SymbolcpsB 
ID6143034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1031047 
End bp1032483 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content56% 
IMG OID641615900 
Productmannose-1-phosphate guanylyltransferase 
Protein accessionYP_001743092 
Protein GI170682585 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0662] Mannose-6-phosphate isomerase
[COG0836] Mannose-1-phosphate guanylyltransferase 
TIGRFAM ID[TIGR01479] mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGT CGAAACTCTA TCCAGTTGTG ATGGCAGGTG GCTCCGGTAG CCGCTTATGG 
CCGCTTTCCC GCGTACTTTA CCCCAAGCAG TTTTTATGCC TGAAAGGCGA TCTCACCATG
CTGCAAACCA CCATCTGCCG CCTGAACGGC GTGGAGTGCG AAAGCCCGGT GGTGATTTGC
AATGAGCAGC ACCGCTTTAT TGTCGCGGAA CAGCTGCGTC AACTGAACAA ACTCACCGAG
AACATTATTC TCGAACCGGC AGGGCGTAAC ACTGCACCGG CCATTGCGCT GGCGGCGCTG
GCGGCAAAAC GTCATAGCCC GGAGAACGAC CCATTAATGC TGGTGCTGGC GGCGGATCAT
GTGATTGCCG ATGAAGACGC GTTCCGTGCC GCCGTGCGTA ATGCCATGCC GTATGCCGAA
GCGGGCAAGC TGGTGACCTT CGGCATTGTG CCGGATCTAC CTGAAACCGG ATATGGCTAT
ATTCGTCGCG GTGAAGTGTC GGCGGGTGAG CAGGATGCGG TGGCCTTTGA AGTGGCGCAG
TTTGTCGAAA AACCGAATCT GGAAACCGCC CAGGCCTATG TGGCAAGTGG CGAATATTAC
TGGAACAGCG GTATGTTCCT GTTCCGTGCC GGACGCTATC TCGAAGAACT GAAAAAGTAT
CGCCCGGATA TTCTCGATGC CTGTGAAAAA GCGATGAACG CCGTCGATCC GGATCTCGAT
TTTATTCGTG TGGATGAAGA GGCGTTTCTC GCCTGTCCGG AAGAGTCGGT GGATTACGCG
GTCATGGAAC GCACGGCAGA TGCCGTTGTG GTGCCGATGG ATGCGGGCTG GAGCGATGTC
GGTTCCTGGT CTTCATTATG GGAGATCAGC GCCCACACCG CCGAGGGCAA CGTTTGCCAT
GGCGATGTGA TTAATCACAA AACTGAAAAC AGCTATGTGT ACGCCGAATC TGGCCTGGTC
ACCACCGTCG GGGTGAAAGA TTTGGTGGTA GTGCAGACCA AAGATGCAGT GCTGATTGCC
GACCGTAATG CGGTGCAGGA TGTGAAGAAA GTGGTCGAGC AGATCAAAGC TGATGGTCGC
CATGAGCATC GGGTGCATCG CGAAGTGTAT CGTCCGTGGG GCAAATATGA CTCTATCGAC
GCGGGCGACC GCTACCAGGT GAAACGCATC ACCGTGAAAC CGGGCGAAGG CTTGTCGGTA
CAGATGCATC ATCACCGCGC GGAGCACTGG GTGGTGGTCG CGGGAACGGC AAAAGTCACC
ATTGACGGTG ATATCAAACT GCTTGGTGAA AACGAGTCCA TTTATATTCC GCTGGGGGCG
ACGCACTGCC TGGAAAACCC GGGGAAAATT CCGCTCGATT TAATTGAAGT GCGCTCCGGC
TCTTATCTCG AAGAGGATGA TGTGGTGCGC TTCGCGGATC GCTACGGACG GGTGTAG
 
Protein sequence
MAQSKLYPVV MAGGSGSRLW PLSRVLYPKQ FLCLKGDLTM LQTTICRLNG VECESPVVIC 
NEQHRFIVAE QLRQLNKLTE NIILEPAGRN TAPAIALAAL AAKRHSPEND PLMLVLAADH
VIADEDAFRA AVRNAMPYAE AGKLVTFGIV PDLPETGYGY IRRGEVSAGE QDAVAFEVAQ
FVEKPNLETA QAYVASGEYY WNSGMFLFRA GRYLEELKKY RPDILDACEK AMNAVDPDLD
FIRVDEEAFL ACPEESVDYA VMERTADAVV VPMDAGWSDV GSWSSLWEIS AHTAEGNVCH
GDVINHKTEN SYVYAESGLV TTVGVKDLVV VQTKDAVLIA DRNAVQDVKK VVEQIKADGR
HEHRVHREVY RPWGKYDSID AGDRYQVKRI TVKPGEGLSV QMHHHRAEHW VVVAGTAKVT
IDGDIKLLGE NESIYIPLGA THCLENPGKI PLDLIEVRSG SYLEEDDVVR FADRYGRV