Gene EcolC_2235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2235 
SymbolmdoD 
ID6065912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2452831 
End bp2454450 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content51% 
IMG OID641601640 
Productglucan biosynthesis protein D 
Protein accessionYP_001725199 
Protein GI170020245 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.720704 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCG TGTGCGGTAC CAGCGGCATT GCTTCTCTTT TTTCTCAGGC GGCATTCGCG 
GCAGATTCTG ATATTGCCGA CGGGCAAACC CAGCGTTTTG ACTTCTCCAT TCTACAGTCA
ATGGCGCACG ACTTAGCGCA AACAGCGTGG CGTGGTGCGC CTCGTCCGTT ACCTGACACG
CTGGCGACAA TGACGCCGCA GGCTTATAAC AGTATTCAAT ACGACGCCGA AAAATCGCTC
TGGCATAACG TTGAGAACCG TCAACTGGAC GCTCAGTTCT TCCATATGGG AATGGGATTC
CGTCGCCGCG TTCGTATGTT TTCTGTAGAT CCAGCAACAC ATCTGGCGCG TGAAATTCAC
TTTCGCCCGG AGTTGTTCAA ATACAACGAT GCAGGTGTTG ATACCAAACA ATTAGAAGGG
CAAAGCGATC TCGGCTTTGC CGGTTTTCGC GTGTTTAAAG CCCCCGAACT GGCGCGCCGT
GATGTAGTAT CATTTCTCGG CGCGAGTTAT TTCCGCGCCG TTGATGATAC ATATCAATAC
GGTTTGTCGG CCCGCGGCCT GGCGATCGAC ACTTACACCG ACAGTAAAGA AGAGTTCCCC
GACTTTACCG CCTTCTGGTT TGATACGGTA AAACCGGGGG CAACTACCTT TACCGTTTAT
GCGTTGCTCG ATAGCGCCAG CATTACTGGT GCCTATAAGT TCACTATCCA TTGTGAGAAA
AGTCAGGTGA TTATGGATGT GGAAAATCAC CTGTATGCGC GCAAAGACAT TAAACAGCTG
GGCATTGCGC CGATGACCAG TATGTTCAGC TGCGGTACTA ATGAACGTCG GATGTGCGAT
ACAATTCATC CGCAAATTCA TGACTCTGAT CGTCTGTCCA TGTGGCGGGG CAACGGCGAG
TGGATTTGCC GTCCGCTGAA TAATCCGCAA AAATTGCAGT TCAATGCTTA CACCGACAAC
AACCCGAAAG GGTTTGGTTT ATTGCAACTG GATCGTGACT TCTCCCATTA TCAGGACATT
ATGGGCTGGT ATAACAAACG CCCAAGTCTG TGGGTGGAAC CGCGTAACAA GTGGGGTAAG
GGCACCATCG GCCTGATGGA AATCCCAACA ACGGGCGAAA CGCTGGATAA CATTGTCTGC
TTCTGGCAGC CAGAAAAAGC TGTAAAAGCA GGTGATGAGT TTGCATTCCA GTATCGTCTG
TACTGGAGTG CGCAACCGCC TGTTCATTGC CCATTAGCGC GCGTTATGGC GACGCGTACC
GGCATGGGCG GTTTCCCGGA AGGTTGGGCG CCAGGTGAAC ACTATCCCGA AAAATGGGCG
CGTCGTTTTG CCGTCGATTT CGTTGGTGGT GATCTGAAAG CTGCCGCGCC AAAAGGCATT
GAGCCGGTGA TTACGCTGTC CAGTGGGGAA GCGAAGCAAA TCGAAATTCT CTATATTGAA
CCCATTGATG GTTATCGTAT TCAGTTTGAC TGGTATCCGA CGTCGGACTC CACTGATCCG
GTCGATATGC GGATGTATCT GCGTTGTCAG GGGGACGCTA TCAGTGAAAC ATGGCTGTAT
CAGTATTTCC CGCCAGCGCC GGATAAACGT CAGTATGTTG ACGACCGCGT GATGAGTTAA
 
Protein sequence
MAAVCGTSGI ASLFSQAAFA ADSDIADGQT QRFDFSILQS MAHDLAQTAW RGAPRPLPDT 
LATMTPQAYN SIQYDAEKSL WHNVENRQLD AQFFHMGMGF RRRVRMFSVD PATHLAREIH
FRPELFKYND AGVDTKQLEG QSDLGFAGFR VFKAPELARR DVVSFLGASY FRAVDDTYQY
GLSARGLAID TYTDSKEEFP DFTAFWFDTV KPGATTFTVY ALLDSASITG AYKFTIHCEK
SQVIMDVENH LYARKDIKQL GIAPMTSMFS CGTNERRMCD TIHPQIHDSD RLSMWRGNGE
WICRPLNNPQ KLQFNAYTDN NPKGFGLLQL DRDFSHYQDI MGWYNKRPSL WVEPRNKWGK
GTIGLMEIPT TGETLDNIVC FWQPEKAVKA GDEFAFQYRL YWSAQPPVHC PLARVMATRT
GMGGFPEGWA PGEHYPEKWA RRFAVDFVGG DLKAAAPKGI EPVITLSSGE AKQIEILYIE
PIDGYRIQFD WYPTSDSTDP VDMRMYLRCQ GDAISETWLY QYFPPAPDKR QYVDDRVMS