Gene Information |
Locus tag | EcE24377A_1169 |
Symbol | mdoG |
ID | 5590258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1186799 |
End bp | 1188334 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640924869 |
Product | glucan biosynthesis protein G |
Protein accession | YP_001462281 |
Protein GI | 157156764 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.163218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA TGCGTTGGTT GAGTGCTGCA GTAATGTTAA CCCTGTATAC ATCTTCAAGC TGGGCTTTCA GTATTGATGA TGTCGCAAAG CAAGCTCAAT CCTTAGCCGG GAAAGGCTAT GAGGCGCCCA AAAGCAACTT GCCCTCCGTT TTCCGCGATA TGAAATACGC GGACTATCAG CAGATCCAGT TTAATCATGA CAAAGCGTAC TGGAACAATC TGAAGACCCC ATTCAAACTC GAGTTCTACC ATCAGGGTAT GTACTTCGAT ACCCCGGTCA AAATAAATGA AGTGACTGCC ACCGCAGTCA AACGAATCAA ATACAGCCCG GATTATTTCA CTTTCGGCGA TGTTCAGCAT GACAAAGACA CGGTAAAAGA CCTTGGTTTT GCCGGTTTTA AAGTGCTTTA CCCGATCAAC AGCAAAGATA AAAACGATGA AATCGTCAGC ATGCTCGGGG CCAGCTATTT CCGCGTGATT GGTGCAGGTC AGGTTTATGG CCTTTCTGCC CGCGGTCTGG CAATTGATAC CGCCTTGCCA TCGGGTGAAG AATTTCCACG CTTCAAAGAG TTCTGGATCG AGCGTCCAAA ACCAACTGAT AAACGTTTAA CCATTTATGC ATTGCTTGAC TCGCCGCGCG CGACAGGTGC TTACAAATTC GTAGTTATGC CAGGGCGTGA CACGGTTGTG GATGTGCAGT CGAAAATCTA TCTGCGCGAT AAAGTCGGCA AACTGGGGGT TGCACCGTTA ACCAGTATGT TCCTGTTTGG GCCGAACCAA CCGTCGCCTG CAAATAACTA TCGTCCGGAG TTGCACGACT CTAACGGTCT GTCTATCCAT GCCGGTAATG GCGAATGGAT CTGGCGTCCG TTGAATAACC CGAAACATTT AGCGGTCAGC AGCTTCTCCA TGGAAAACCC GCAAGGCTTT GGTCTGTTGC AGCGCGGTCG TGATTTCTCC CGCTTTGAAG ATCTCGATGA TCGTTACGAT CTCCGTCCAA GCGCATGGGT GACTCCGAAA GGGGAGTGGG GAAAAGGCAG CGTTGAGCTG GTGGAAATTC CAACCAACGA TGAAACCAAC GATAACATCG TCGCTTACTG GACGCCGGAT CAGCTGCCGG AGCCGGGTAA AGAGATGAAC TTTAAATACA CCATCACCTT CAGCCGTGAT GAAGACAAAC TGCATGCGCC AGATAACGCA TGGGTGCAAC AAACGCGTCG TTCAACGGGG GATGTGAAGC AGTCGAACCT GATTCGCCAG CCTGACGGTA CTATCGCCTT TGTGGTCGAT TTTACCGGCG CAGAGATGAA AAAACTGCCA GAAGATACCC CGGTCACAGC GCAAACCAGC ATTGGTGATA ATGGTGAGAT AGTTGAAAGC ACGGTGCGCT ATAACCCGGT TACCAAAGGC TGGCGTCTGG TGATGCGTGT GAAAGTGAAA GATGCCAAGA AAACCACTGA AATGCGAGCT GCGCTGGTGA ATGCCGATCA GACGTTGAGT GAAACCTGGA GCTACCAGTT ACCTGCCAAT GAATAA
Protein sequence | MMKMRWLSAA VMLTLYTSSS WAFSIDDVAK QAQSLAGKGY EAPKSNLPSV FRDMKYADYQ QIQFNHDKAY WNNLKTPFKL EFYHQGMYFD TPVKINEVTA TAVKRIKYSP DYFTFGDVQH DKDTVKDLGF AGFKVLYPIN SKDKNDEIVS MLGASYFRVI GAGQVYGLSA RGLAIDTALP SGEEFPRFKE FWIERPKPTD KRLTIYALLD SPRATGAYKF VVMPGRDTVV DVQSKIYLRD KVGKLGVAPL TSMFLFGPNQ PSPANNYRPE LHDSNGLSIH AGNGEWIWRP LNNPKHLAVS SFSMENPQGF GLLQRGRDFS RFEDLDDRYD LRPSAWVTPK GEWGKGSVEL VEIPTNDETN DNIVAYWTPD QLPEPGKEMN FKYTITFSRD EDKLHAPDNA WVQQTRRSTG DVKQSNLIRQ PDGTIAFVVD FTGAEMKKLP EDTPVTAQTS IGDNGEIVES TVRYNPVTKG WRLVMRVKVK DAKKTTEMRA ALVNADQTLS ETWSYQLPAN E
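
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the gene sequence above ends in TAA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a sequence; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values copied from the Gene Information fields of this record.
length = gene_length(1186799, 1188334)
print(length, protein_length(length))  # prints "1536 511"
```

Under these assumptions the computed values agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields. Passing the full gene sequence to `gc_percent` would give a figure to compare against the rounded GC content field.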