Gene EcSMS35_1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1750 
SymbolmdoD 
ID6143428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1752415 
End bp1754034 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content52% 
IMG OID641616626 
Productglucan biosynthesis protein D 
Protein accessionYP_001743804 
Protein GI170679744 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.5781800000000004e-18 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGCCG TGTGCGGTAC CAGCGGCATT GCTTCTCTTT TTTCTCAGGC GGCATTCGCG 
GCAGATTCTG ATATTGCCGA CGGGCAAACC CAGCGTTTTG ACTTCTCCAT TCTACAGTCA
ATGGCGCACG ACTTAGCGCA AACAGCGTGG CGTGGTGCGC CGCGTCCGTT ACCTGACACT
CTGGCGACAA TGACGCCGCA GGCTTATAAC AGTATTCAAT ACGACGCCGA AAAATCGCTC
TGGCATAACG TTGAGAACCG TCAACTGGAC GCTCAGTTCT TCCATATGGG AATGGGATTC
CGTCGCCGCG TTCGTATGTT TTCTGTAGAT CCCGCAACAC ATCTGGCGCG TGAAATTCAC
TTTCGCCCGG AGTTGTTCAA ATACAACGAT GCGGGTGTTG ATACCAAACA ATTAGAAGGG
CAAAGCGATC TCGGTTTTGC CGGTTTTCGC GTGTTTAAAG CCCCCGAACT GGCGCGCCGT
GATGTCGTAT CATTCCTCGG TGCGAGTTAT TTCCGCGCCG TTGACGACAC ATATCAATAC
GGTCTATCGG CTCGCGGCCT GGCGATCGAC ACTTACACCG ACAGTAAAGA AGAGTTCCCC
GACTTTACCG CCTTCTGGTT TGATACGGTA AAACCGGGGG CAACCACCTT TACCGTTTAT
GCGTTGCTCG ATAGCGCCAG CATTACTGGT GCCTATAAGT TCACTATCCA TTGCGAGAAA
AGTCAGGTGA TTATGGATGT GGAAAATCAC CTGTATGCGC GCAAAGACAT TAAACAGCTG
GGCATTGCGC CGATGACCAG TATGTTCAGC TGCGGTACTA ATGAACGTCG GATGTGCGAC
ACCATTCATC CGCAAATCCA TGACTCTGAT CGTTTGTCCA TGTGGCGGGG CAACGGCGAG
TGGATTTGTC GTCCGCTGAA CAATCCGCAA AAATTGCAGT TCAATGCTTA CACCGACAAC
AACCCGAAAG GGTTTGGTTT ATTGCAACTG GATCGTGATT TCTCCCATTA TCAGGACATT
ATGGGCTGGT ATAACAAACG CCCAAGTCTG TGGGTGGAAC CGCGTAACAA GTGGGGTAAG
GGCACCATCG GCCTGATGGA AATCCCAACA ACGGGCGAAA CGCTGGATAA CATTGTCTGC
TTCTGGCAGC CAGAAAAAGC TGTAAAAGCG GGTGATGAGT TTGCATTCCA GTATCGTCTG
TACTGGAGTG CGCAACCGCC TGTTCATTGC CCATTAGCGC GCGTTATGGC GACGCGTACC
GGCATGGGTG GTTTCCCGGA AGGTTGGGCT CCAGGTGAAC ACTATCCCGA AAAATGGGCG
CGTCGTTTTG CCGTCGATTT CGTTGGTGGT GATCTGAAAG CTGCCGCGCC AAAAGGCATT
GAGCCGGTGA TTACGCTTTC CAGTGGGGAA GCGAAGCAAA TCGAAATTCT CTATATTGAA
CCCATTGATG GTTATCGTAT TCAGTTTGAC TGGTATCCGA CTTCGGACTC CACTGATCCG
GTCGATATGC GGATGTATCT GCGTTGTCAG GGCGACGCTA TCAGTGAAAC ATGGCTGTAT
CAGTATTTCC CGCCAGCGCC CGATAAACGT CAGTATGTTG ACGACCGCGT GATGAGTTAA
 
Protein sequence
MAAVCGTSGI ASLFSQAAFA ADSDIADGQT QRFDFSILQS MAHDLAQTAW RGAPRPLPDT 
LATMTPQAYN SIQYDAEKSL WHNVENRQLD AQFFHMGMGF RRRVRMFSVD PATHLAREIH
FRPELFKYND AGVDTKQLEG QSDLGFAGFR VFKAPELARR DVVSFLGASY FRAVDDTYQY
GLSARGLAID TYTDSKEEFP DFTAFWFDTV KPGATTFTVY ALLDSASITG AYKFTIHCEK
SQVIMDVENH LYARKDIKQL GIAPMTSMFS CGTNERRMCD TIHPQIHDSD RLSMWRGNGE
WICRPLNNPQ KLQFNAYTDN NPKGFGLLQL DRDFSHYQDI MGWYNKRPSL WVEPRNKWGK
GTIGLMEIPT TGETLDNIVC FWQPEKAVKA GDEFAFQYRL YWSAQPPVHC PLARVMATRT
GMGGFPEGWA PGEHYPEKWA RRFAVDFVGG DLKAAAPKGI EPVITLSSGE AKQIEILYIE
PIDGYRIQFD WYPTSDSTDP VDMRMYLRCQ GDAISETWLY QYFPPAPDKR QYVDDRVMS