Gene Information |
Locus tag | B21_01052 |
Symbol | mdoG |
ID | 8114171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1111310 |
End bp | 1112845 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644847312 |
Product | hypothetical protein |
Protein accession | YP_002998885 |
Protein GI | 251784581 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0739536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA TGCGTTGGTT GAGTGCTGCA GTAATGTTAA CCCTGTATAC ATCTTCAAGC TGGGCTTTCA GTATTGATGA TGTCGCAAAG CAAGCTCAAT CTTTAGCTGG GAAAGGCTAC GAGACGCCCA AAAGCAACTT GCCCTCCGTT TTCCGCGATA TGAAATACGC GGACTATCAG CAGATCCAGT TTAATCATGA CAAAGCGTAC TGGAACAATC TGAAGACCCC ATTCAAACTC GAGTTCTACC ATCAGGGTAT GTACTTCGAT ACCCCGGTCA AAATAAATGA AGTGACTGCC ACCGCAGTCA AACGAATCAA ATACAGCCCG GATTATTTCA CTTTCGGCGA TGTTCAGCAT GACAAAGATA CGGTAAAAGA CCTTGGCTTT GCCGGTTTTA AAGTGCTTTA CCCGATCAAC AGCAAAGATA AAAACGATGA AATCGTCAGC ATGCTCGGGG CCAGCTATTT CCGCGTGATT GGTGCAGGTC AGGTTTATGG CCTTTCTGCC CGCGGCCTGG CAATTGATAC CGCCTTGCCA TCGGGTGAAG AATTTCCGCG CTTCAAAGAG TTCTGGATCG AGCGTCCAAA ACCGACTGAT AAACGTTTAA CCATCTATGC ATTGCTTGAC TCGCCGCGTG CGACAGGTGC TTACAAATTC GTGGTTATGC CAGGGCGTGA CACGGTTGTG GATGTGCAGT CGAAAATCTA TCTGCGCGAT AAAGTCGGCA AACTGGGGGT TGCACCGTTA ACCAGTATGT TCCTGTTTGG GCCGAACCAA CCGTCGCCTG CAAATAACTA TCGTCCGGAG TTGCACGACT CTAACGGTCT CTCTATCCAT GCCGGTAATG GCGAATGGAT CTGGCGTCCG TTGAATAACC CGAAACATTT AGCGGTCAGC AGCTTCTCCA TGGAAAACCC GCAAGGCTTT GGTCTGTTGC AGCGCGGTCG TGATTTCTCC CGCTTTGAAG ATCTCGATGA TCGTTACGAT CTCCGTCCAA GCGCATGGGT GACTCCGAAA GGGGAGTGGG GTAAAGGCAG CGTTGAGCTG GTGGAAATTC CAACCAACGA TGAAACCAAC GATAACATCG TCGCTTACTG GACGCCGGAT CAGCTGCCGG AGCCGGGTAA AGAGATGAAC TTTAAATACA CCATCACCTT CAGCCGTGAT GAAGACAAAC TGCATGCGCC AGATAACGCA TGGGTGCAAC AAACGCGTCG TTCAACGGGG GATGTGAAGC AGTCGAACCT GATTCGCCAG CCTGACGGTA CTATCGCCTT TGTGGTCGAT TTTACCGGCG CAGAGATGAA AAAACTGCCA GAGGATACCC CGGTCACAGC GCAAACCAGC ATTGGTGATA ATGGTGAGAT AGTTGAAAGC ACGGTGCGCT ATAACCCGGT TACCAAAGGC TGGCGTCTGG TGATGCGTGT GAAAGTGAAA GATGCCAAGA AAACCACTGA AATGCGTGCT GCGCTGGTGA ATGCCGATCA GACGTTGAGT GAAACCTGGA GCTACCAGTT ACCTGCCAAT GAATAA
Protein sequence | MMKMRWLSAA VMLTLYTSSS WAFSIDDVAK QAQSLAGKGY ETPKSNLPSV FRDMKYADYQ QIQFNHDKAY WNNLKTPFKL EFYHQGMYFD TPVKINEVTA TAVKRIKYSP DYFTFGDVQH DKDTVKDLGF AGFKVLYPIN SKDKNDEIVS MLGASYFRVI GAGQVYGLSA RGLAIDTALP SGEEFPRFKE FWIERPKPTD KRLTIYALLD SPRATGAYKF VVMPGRDTVV DVQSKIYLRD KVGKLGVAPL TSMFLFGPNQ PSPANNYRPE LHDSNGLSIH AGNGEWIWRP LNNPKHLAVS SFSMENPQGF GLLQRGRDFS RFEDLDDRYD LRPSAWVTPK GEWGKGSVEL VEIPTNDETN DNIVAYWTPD QLPEPGKEMN FKYTITFSRD EDKLHAPDNA WVQQTRRSTG DVKQSNLIRQ PDGTIAFVVD FTGAEMKKLP EDTPVTAQTS IGDNGEIVES TVRYNPVTKG WRLVMRVKVK DAKKTTEMRA ALVNADQTLS ETWSYQLPAN E
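
The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the database record: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon (the gene sequence above ends in TAA). The variable names are hypothetical; the numbers are copied from the fields above.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 1111310
end_bp = 1112845
protein_length_aa = 511

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop
# codon and contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)                                # 1536
print(remainder)                                     # 0
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the computed values agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields.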