Gene Information |
Locus tag | EcHS_A1170 |
Symbol | mdoG |
ID | 5591802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1173008 |
End bp | 1174543 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640920329 |
Product | glucan biosynthesis protein G |
Protein accession | YP_001457892 |
Protein GI | 157160574 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
|
|
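The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(1173008, 1174543)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1536 511
```

Under these assumptions the computed values agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields.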
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.0753618 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA TGCGTTGGTT GAGTGCTGCA GTAATGTTAA CCCTGTATAC ATCTTCAAGC TGGGCTTTCA GTATTGATGA TGTCGCAAAG CAAGCTCAAT CCTTAGCCGG GAAAGGCTAT GAGGCGCCCA AAAGCAACTT GCCCTCCGTT TTCCGCGATA TGAAATACGC GGACTATCAG CAGATCCAGT TTAATCATGA CAAAGCGTAC TGGAACAATC TGAAGACCCC ATTCAAACTC GAGTTCTACC ATCAGGGTAT GTACTTCGAT ACCCCGGTCA AAATAAATGA AGTGACTGCC ACCGCAGTCA AACGAATCAA ATACAGCCCG GATTATTTCA CTTTCGGCGA TGTTCAGCAT GACAAAGACA CGGTAAAAGA CCTTGGTTTT GCCGGTTTTA AAGTGCTTTA CCCGATCAAC AGCAAAGATA AAAACGATGA AATCGTCAGC ATGCTCGGGG CCAGCTATTT CCGCGTGATT GGTGCAGGTC AGGTTTATGG CCTTTCTGCC CGCGGCCTGG CAATTGATAC CGCCTTGCCA TCGGGTGAAG AATTTCCACG CTTCAAAGAG TTCTGGATCG AGCGTCCAAA ACCGACTGAT AAACGTTTAA CCATTTATGC ATTGCTTGAC TCGCCGCGCG CGACAGGTGC TTACAAATTC GTGGTTATGC CAGGGCGTGA CACGGTTGTG GATGTGCAGT CGAAAATCTA TCTGCGCGAT AAAGTCGGCA AACTGGGGGT TGCACCGTTA ACCAGTATGT TCCTGTTTGG GCCGAACCAA CCGTCGCCTG CAAATAACTA TCGTCCGGAG TTGCACGACT CTAACGGTCT GTCTATCCAT GCCGGTAATG GCGAATGGAT CTGGCGTCCG TTGAATAACC CGAAACATTT AGCGGTCAGC AGCTTCTCGA TGGAGAACCC GCAAGGCTTC GGTCTGTTGC AGCGCGGTCG TGATTTCTCC CGCTTTGAAG ATCTCGATGA TCGTTACGAT CTTCGTCCAA GCGCATGGGT GACTCCGAAA GGGGAGTGGG GTAAAGGCAG CGTTGAGCTG GTGGAAATTC CAACCAACGA TGAAACCAAC GATAACATCG TCGCTTACTG GACGCCGGAT CAGCTGCCGG AGCCGGGTAA AGAGATGAAC TTTAAATACA CCATCACCTT CAGCCGTGAT GAAGACAAAC TGCATGCGCC AGATAACGCA TGGGTGCAAC AAACGCGTCG TTCAACGGGG GATGTGAAGC AGTCGAACCT GATTCGCCAG CCTGACGGTA CTATCGCCTT TGTGGTCGAT TTTACCGGCG CAGAGATGAA AAAACTGCCA GAGGATACCC CGGTCACAGC GCAAACCAGC ATTGGTGATA ATGGTGAGAT AGTTGAAAGC ACGGTGCGCT ATAACCCGGT TACCAAAGGC TGGCGTCTGG TGATGCGTGT GAAAGTGAAA GATGCCAAGA AAACCACTGA AATGCGTGCT GCGCTGGTGA ATGCCGATCA GACGTTGAGT GAAACCTGGA GCTACCAGTT ACCTGCCAAT GAATAA
|
Protein sequence | MMKMRWLSAA VMLTLYTSSS WAFSIDDVAK QAQSLAGKGY EAPKSNLPSV FRDMKYADYQ QIQFNHDKAY WNNLKTPFKL EFYHQGMYFD TPVKINEVTA TAVKRIKYSP DYFTFGDVQH DKDTVKDLGF AGFKVLYPIN SKDKNDEIVS MLGASYFRVI GAGQVYGLSA RGLAIDTALP SGEEFPRFKE FWIERPKPTD KRLTIYALLD SPRATGAYKF VVMPGRDTVV DVQSKIYLRD KVGKLGVAPL TSMFLFGPNQ PSPANNYRPE LHDSNGLSIH AGNGEWIWRP LNNPKHLAVS SFSMENPQGF GLLQRGRDFS RFEDLDDRYD LRPSAWVTPK GEWGKGSVEL VEIPTNDETN DNIVAYWTPD QLPEPGKEMN FKYTITFSRD EDKLHAPDNA WVQQTRRSTG DVKQSNLIRQ PDGTIAFVVD FTGAEMKKLP EDTPVTAQTS IGDNGEIVES TVRYNPVTKG WRLVMRVKVK DAKKTTEMRA ALVNADQTLS ETWSYQLPAN E
|
| |
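The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a formatted gene sequence could be cleaned and checked against the GC content and CDS fields. The helper names are hypothetical, and the start codon set is an assumption covering only the most common bacterial initiators (translation table 11 permits a few others).

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common bacterial start codons are accepted here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and uppercase the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Length is a multiple of 3, starts with a common start codon, ends with a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence shown above, as a small usage example.
fragment = clean_sequence("ATGATGAAAA TGCGTTGGTT")
print(len(fragment), gc_fraction(fragment))  # 20 0.35
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` should give a value close to the 50% listed under GC content, and `looks_like_complete_cds` offers a quick sanity check that the sequence begins with ATG and ends with the TAA stop codon.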