Gene EcolC_2551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2551 
SymbolmdoG 
ID6067535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2799042 
End bp2800577 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content50% 
IMG OID641601957 
Productglucan biosynthesis protein G 
Protein accessionYP_001725509 
Protein GI170020555 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00372186 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.55547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAA TGCGTTGGTT GAGTGCTGCA GTAATGTTAA CCCTGTATAC ATCTTCAAGC 
TGGGCTTTCA GTATTGATGA TGTCGCAAAG CAAGCTCAAT CCTTAGCCGG GAAAGGCTAT
GAGGCGCCCA AAAGCAACTT GCCCTCCGTT TTCCGCGATA TGAAATACGC GGACTATCAG
CAGATCCAGT TTAATCATGA CAAAGCGTAC TGGAACAATC TGAAGACCCC ATTCAAACTC
GAGTTCTACC ATCAGGGTAT GTACTTCGAT ACCCCGGTCA AAATAAATGA AGTGACTGCC
ACCGCAGTCA AACGAATCAA ATACAGCCCG GATTATTTCA CTTTCGGCGA TGTTCAGCAT
GACAAAGACA CGGTAAAAGA CCTTGGTTTT GCCGGTTTTA AAGTGCTTTA CCCGATCAAC
AGCAAAGATA AAAACGATGA AATCGTCAGC ATGCTCGGGG CCAGCTATTT CCGCGTGATT
GGTGCAGGTC AGGTTTATGG CCTTTCTGCC CGCGGCCTGG CAATTGATAC CGCCTTGCCA
TCGGGTGAAG AATTTCCACG CTTCAAAGAG TTCTGGATCG AGCGTCCAAA ACCGACTGAT
AAACGTTTAA CCATTTATGC ATTGCTTGAC TCGCCGCGCG CGACAGGTGC TTACAAATTC
GTGGTTATGC CAGGGCGTGA CACGGTTGTG GATGTGCAGT CGAAAATCTA TCTGCGCGAT
AAAGTCGGCA AACTGGGGGT TGCACCGTTA ACCAGTATGT TCCTGTTTGG GCCGAACCAA
CCGTCGCCTG CAAATAACTA TCGTCCGGAG TTGCACGACT CTAACGGTCT GTCTATCCAT
GCCGGTAATG GCGAATGGAT CTGGCGTCCG TTGAATAACC CGAAACATTT AGCGGTCAGC
AGCTTCTCGA TGGAGAACCC GCAAGGCTTC GGTCTGTTGC AGCGCGGTCG TGATTTCTCC
CGCTTTGAAG ATCTCGATGA TCGTTACGAT CTTCGTCCAA GCGCATGGGT GACTCCGAAA
GGGGAGTGGG GTAAAGGCAG CGTTGAGCTG GTGGAAATTC CAACCAACGA TGAAACCAAC
GATAACATCG TCGCTTACTG GACGCCGGAT CAGCTGCCGG AGCCGGGTAA AGAGATGAAC
TTTAAATACA CCATCACCTT CAGCCGTGAT GAAGACAAAC TGCATGCGCC AGATAACGCA
TGGGTGCAAC AAACGCGTCG TTCAACGGGG GATGTGAAGC AGTCGAACCT GATTCGCCAG
CCTGACGGTA CTATCGCCTT TGTGGTCGAT TTTACCGGCG CAGAGATGAA AAAACTGCCA
GAGGATACCC CGGTCACAGC GCAAACCAGC ATTGGTGATA ATGGTGAGAT AGTTGAAAGC
ACGGTGCGCT ATAACCCGGT TACCAAAGGC TGGCGTCTGG TGATGCGTGT GAAAGTGAAA
GATGCCAAGA AAACCACTGA AATGCGTGCT GCGCTGGTGA ATGCCGATCA GACGTTGAGT
GAAACCTGGA GCTACCAGTT ACCTGCCAAT GAATAA
 
Protein sequence
MMKMRWLSAA VMLTLYTSSS WAFSIDDVAK QAQSLAGKGY EAPKSNLPSV FRDMKYADYQ 
QIQFNHDKAY WNNLKTPFKL EFYHQGMYFD TPVKINEVTA TAVKRIKYSP DYFTFGDVQH
DKDTVKDLGF AGFKVLYPIN SKDKNDEIVS MLGASYFRVI GAGQVYGLSA RGLAIDTALP
SGEEFPRFKE FWIERPKPTD KRLTIYALLD SPRATGAYKF VVMPGRDTVV DVQSKIYLRD
KVGKLGVAPL TSMFLFGPNQ PSPANNYRPE LHDSNGLSIH AGNGEWIWRP LNNPKHLAVS
SFSMENPQGF GLLQRGRDFS RFEDLDDRYD LRPSAWVTPK GEWGKGSVEL VEIPTNDETN
DNIVAYWTPD QLPEPGKEMN FKYTITFSRD EDKLHAPDNA WVQQTRRSTG DVKQSNLIRQ
PDGTIAFVVD FTGAEMKKLP EDTPVTAQTS IGDNGEIVES TVRYNPVTKG WRLVMRVKVK
DAKKTTEMRA ALVNADQTLS ETWSYQLPAN E