Gene EcDH1_2222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2222 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2379967 
End bp2381586 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content51% 
IMG OID 
Productperiplasmic glucan biosynthesis protein MdoG 
Protein accessionACX39871 
Protein GI260449449 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.834174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCCG TGTGCGGTAC CAGCGGCATT GCTTCTCTTT TTTCTCAGGC GGCATTCGCG 
GCAGATTCTG ATATTGCCGA CGGGCAAACC CAGCGTTTTG ACTTCTCCAT TCTACAGTCA
ATGGCGCACG ACTTAGCGCA AACAGCGTGG CGTGGTGCGC CTCGTCCGTT ACCTGACACG
CTGGCGACAA TGACGCCGCA GGCTTATAAC AGTATTCAAT ACGACGCCGA AAAATCGCTC
TGGCATAACG TTGAGAACCG TCAACTGGAC GCTCAGTTCT TCCATATGGG AATGGGATTC
CGTCGCCGCG TTCGTATGTT TTCTGTAGAT CCAGCAACAC ATCTGGCGCG TGAAATTCAC
TTTCGCCCGG AGTTGTTCAA ATACAACGAT GCAGGTGTTG ATACAAAACA ATTAGAAGGG
CAAAGCGATC TCGGCTTTGC CGGTTTTCGC GTGTTTAAAG CCCCCGAACT GGCGCGCCGT
GATGTAGTAT CATTTCTCGG CGCGAGTTAT TTCCGCGCCG TTGATGATAC ATATCAATAC
GGTTTGTCGG CCCGCGGCCT GGCGATCGAC ACTTACACCG ACAGTAAAGA AGAGTTCCCC
GACTTTACCG CCTTCTGGTT TGATACGGTA AAACCGGGGG CAACTACCTT TACCGTTTAT
GCGTTGCTCG ATAGCGCCAG CATTACTGGT GCCTATAAGT TCACTATCCA TTGTGAGAAA
AGTCAGGTGA TTATGGATGT GGAAAATCAC CTGTATGCGC GCAAAGACAT TAAACAGCTG
GGCATTGCGC CGATGACCAG TATGTTCAGC TGCGGTACTA ATGAACGTCG GATGTGCGAT
ACAATTCATC CGCAAATTCA TGACTCTGAT CGTCTGTCCA TGTGGCGGGG CAACGGCGAG
TGGATTTGCC GTCCGCTGAA TAATCCGCAA AAATTGCAGT TCAATGCTTA CACCGACAAC
AACCCGAAAG GGTTTGGTTT ATTGCAACTG GATCGTGACT TCTCCCATTA TCAGGACATT
ATGGGCTGGT ATAACAAACG CCCAAGTCTG TGGGTGGAAC CGCGTAACAA GTGGGGTAAG
GGCACCATCG GCCTGATGGA AATCCCAACA ACGGGCGAAA CGCTGGATAA CATTGTCTGC
TTCTGGCAGC CAGAAAAAGC TGTAAAAGCA GGTGATGAGT TTGCATTCCA GTATCGTCTG
TACTGGAGTG CGCAACCGCC TGTTCATTGC CCATTAGCGC GCGTTATGGC GACGCGTACC
GGCATGGGCG GTTTCTCGGA AGGTTGGGCG CCAGGTGAAC ACTATCCCGA AAAATGGGCG
CGTCGTTTTG CCGTCGATTT CGTTGGTGGT GATCTGAAAG CTGCCGCGCC AAAAGGCATT
GAGCCGGTGA TTACGCTTTC CAGTGGGGAA GCGAAGCAAA TCGAAATTCT CTATATTGAA
CCCATCGATG GTTATCGTAT TCAGTTTGAC TGGTATCCGA CTTCGGACTC CACTGATCCG
GTCGATATGC GGATGTATCT ACGTTGTCAG GGGGACGCTA TCAGTGAAAC ATGGCTGTAT
CAGTATTTCC CGCCAGCGCC GGATAAACGT CAGTATGTTG ACGACCGCGT GATGAGTTAA
 
Protein sequence
MAAVCGTSGI ASLFSQAAFA ADSDIADGQT QRFDFSILQS MAHDLAQTAW RGAPRPLPDT 
LATMTPQAYN SIQYDAEKSL WHNVENRQLD AQFFHMGMGF RRRVRMFSVD PATHLAREIH
FRPELFKYND AGVDTKQLEG QSDLGFAGFR VFKAPELARR DVVSFLGASY FRAVDDTYQY
GLSARGLAID TYTDSKEEFP DFTAFWFDTV KPGATTFTVY ALLDSASITG AYKFTIHCEK
SQVIMDVENH LYARKDIKQL GIAPMTSMFS CGTNERRMCD TIHPQIHDSD RLSMWRGNGE
WICRPLNNPQ KLQFNAYTDN NPKGFGLLQL DRDFSHYQDI MGWYNKRPSL WVEPRNKWGK
GTIGLMEIPT TGETLDNIVC FWQPEKAVKA GDEFAFQYRL YWSAQPPVHC PLARVMATRT
GMGGFSEGWA PGEHYPEKWA RRFAVDFVGG DLKAAAPKGI EPVITLSSGE AKQIEILYIE
PIDGYRIQFD WYPTSDSTDP VDMRMYLRCQ GDAISETWLY QYFPPAPDKR QYVDDRVMS