Gene Information |
Locus tag | EcSMS35_2028 |
Symbol | holB |
ID | 6145190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2049761 |
End bp | 2050765 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616904 |
Product | DNA polymerase III subunit delta' |
Protein accession | YP_001744080 |
Protein GI | 170681498 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2812] DNA polymerase III, gamma/tau subunits |
TIGRFAM ID | [TIGR00678] DNA polymerase III, delta' subunit |
|
|
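The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(2049761, 2050765))  # (1005, 334)
```

Under these assumptions the result agrees with the listed 1005 bp gene length and 334 aa protein length.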
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000033373 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00000172568 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGATGGT ATCCATGGTT ACGACCTGAT TTCGAAAAAC TGGTAGCCAG CTATCAGGCC GGAAGAGGTC ACCATGCGCT ACTCATTCAG GCGTTACCGG GTATGGGCGA GGATGCTTTA ATCTACGCCC TGAGCCGTTA TTTACTCTGC CAACAACCGC AGGGCCACAA AAGTTGCGGT CACTGTCGTG GATGTCAGTT GATGCAGGCT GGTACGCATC CCGATTATTA CACCCTGGCT CCCGAGAAAG GGAAAAATGC GCTGGGCATT GATGCGGTAC GTGAAGTCAC CGAAAAGTTA AATGAGCACG CACGTTTAGG TGGTGCGAAA GTTGTCTGGG TAACCGATGC TGCCTTACTG ACCGACGCCG CGGCTAACGC ATTGCTGAAA ACGCTTGAAG AGCCACCAGC AGAAACCTGG TTTTTCCTGG CTACCCGCGA GCCTGAACGT TTACTGGCAA CATTACGTAG TCGTTGTCGG TTACATTACC TTGCGCCGCC GCCGGAACAG TACGCCGTGA CCTGGCTTTC ACGCGAAGTG ACAATGTCAC AGGATGCATT ACTTGCCGCA TTGCGCTTAA GCGCCGGTTC GCCTGGCGCG GCACTGGCGT TGTTTCAGGG AGATAACTGG CAGGCTCGTG AAACATTGTG TCAGGCGTTG GCATATAGCG TGCCATCGGG CGACTGGTAT TCGCTGCTGG CGGCCCTTAA TCATGAACAA GCTCCGGCGC GTTTACACTG GCTGGCAACG TTGCTGATGG ATGCGCTAAA ACGCCATCAT GGCGCTGCGC AGGTGACCAA TGTTGATGTG CCGGGCCTGG TCGTCGAACT GGCAAACCAT CTATCTCCCT CGCGCCTGCA GGCTATACTG GGGGATGTTT GCCACATTCG TGAACAGTTA ATGTCTGTCA CAGGCATCAA CCGCGAGCTT CTCATCACCG ATCTTTTACT GCGTATTGAG CATTACCTGC AACCGGGCGT TGTGCTACCG GTTCCTCATC TTTAA
|
Protein sequence | MRWYPWLRPD FEKLVASYQA GRGHHALLIQ ALPGMGEDAL IYALSRYLLC QQPQGHKSCG HCRGCQLMQA GTHPDYYTLA PEKGKNALGI DAVREVTEKL NEHARLGGAK VVWVTDAALL TDAAANALLK TLEEPPAETW FFLATREPER LLATLRSRCR LHYLAPPPEQ YAVTWLSREV TMSQDALLAA LRLSAGSPGA ALALFQGDNW QARETLCQAL AYSVPSGDWY SLLAALNHEQ APARLHWLAT LLMDALKRHH GAAQVTNVDV PGLVVELANH LSPSRLQAIL GDVCHIREQL MSVTGINREL LITDLLLRIE HYLQPGVVLP VPHL
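The gene and protein sequences above can be checked against each other by translating the gene sequence. The following is a minimal sketch, assuming the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand) and that the amino acid assignments of translation table 11 match the standard genetic code. Alternative start codons are not handled here, and the names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# amino acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, ignoring the whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Drop a single terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence above.
print(translate_cds("ATGAGATGGT ATCCATGGTT ACGACCTGAT"))  # MRWYPWLRPD
```

Passing the full gene sequence, with or without the spaces between blocks, should return a string that can be compared with the protein sequence once its spaces are removed.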