Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4378 |
Symbol | dnaG |
ID | 6967052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4053721 |
End bp | 4055466 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388101 |
Product | DNA primase |
Protein accession | YP_002272539 |
Protein GI | 209396560 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0358] DNA primase (bacterial type) |
TIGRFAM ID | [TIGR01391] DNA primase, catalytic core |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000022434 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGAC GAATCCCACG CGTATTCATT AATGATCTGC TGGCACGCAC TGACATCGTC GATCTGATCG ATGCCCGTGT GAAGCTGAAA AAGCAGGGCA AGAATTTCCA CGCGTGTTGT CCATTCCACA ACGAGAAAAC CCCGTCCTTC ACCGTTAACG GTGAGAAACA GTTTTACCAC TGCTTTGGAT GTGGCGCGCA CGGCAACGCG ATCGACTTCC TGATGAACTA CGACAAGCTT GAGTTCGTCG AAACGGTCGA AGAGCTGGCG GCAATGCACA ATCTTGAAGT GCCATTTGAA GCAGGTAGCG GCCCCAGCCA GATCGAGCGC CATCAGCGGC AAACGCTTTA TCAGTTGATG GACGGTCTGA ATACGTTTTA CCAACAATCT TTACAACAAC CTGTTGCCAC GTCTGCGCGC CAGTATCTGG AAAAACGCGG ATTAAGCCAC GAGGTTATCG CTCGCTTTGC GATTGGTTTT GCGCCTCCCG GCTGGGACAA CGTCCTGAAG CGGTTTGGCG GCAATCCAGA AAATCGCCAG TCATTGATTG ATGCGGGGAT GTTGGTCACT AACGATCAGG GACGCAGTTA CGATCGTTTC CGCGAGCGGG TGATGTTCCC CATTCGCGAT AAACGCGGTC GGGTGATTGG TTTTGGCGGG CGTGTGCTGG GCAACGATAC CCCCAAATAC CTGAACTCGC CGGAAACAGA CATTTTCCAT AAAGGCCGCC AGCTTTACGG TCTTTATGAA GCGCAGCAGG ATAACGCTGA ACCCAATCGT CTGCTTGTGG TCGAAGGCTA TATGGACGTG GTGGCGCTGG CGCAATACGG CATTAATTAC GCCGTTGCGT CGTTAGGTAC GTCAACCACC GCCGATCACA TACAACTGTT GTTCCGCGCG ACCAACAATG TCATTTGCTG TTATGACGGC GACCGTGCAG GCCGTGATGC CGCATGGCGA GCGCTGGAAA CGGCGCTGCC TTACATGACA GACGGCCGTC AGCTACGCTT TATGTTTTTG CCTGATGGCG AAGACCCTGA CACGCTGGTA CGAAAAGAAG GTAAAGAAGC GTTTGAAGCG CGGATGGAGC AGGCGATGCC GCTTTCCGCA TTTCTGTTTA ACAGCCTGAT GCCGCAAGTT GATCTGAGTA CCCCTGACGG GCGCGCACGT TTGAGTACGC TGGCACTGCC ATTGATATCG CAAGTGCCGG GCGAAACGCT GCGAATATAT CTTCGTCAGG AATTAGGCAA CAAATTAGGC ATACTTGATG ACAGCCAGCT TGAACGATTA ATGCCAAAAG CGGCAGAGAG CGGCGTTTCT CGCCCTGTTC CGCAGCTAAA ACGCACGACC ATGCGTATAC TTATAGGGTT GCTGGTGCAA AATCCAGAAT TAGCGACGTT GGTCCCGCCG CTTGAGAATC TGGATGAAAA TAAGCTCCCT GGACTTGGCT TATTCAGAGA ACTGGTCAAC ACTTGTCTCT CCCAGCCAGG TCTGACCACC GGGCAACTTT TAGAGCACTA TCGTGGTACA AATAATGCTG CCACCCTTGA AAAACTGTCG ATGTGGGACG ATATAGCAGA TAAGAATATT GCTGAGCAAA CCTTCACCGA CTCACTCAAC CATATGTTTG ATTCGCTGCT TGAACTGCGC CAGGAAGAGT TAATCGCTCG TGAGCGCACG CATGGTTTAA GCAACGAAGA ACGCCTGGAG CTCTGGACAT TAAACCAGGA GCTGGCGAAA AAGTGA
|
Protein sequence | MAGRIPRVFI NDLLARTDIV DLIDARVKLK KQGKNFHACC PFHNEKTPSF TVNGEKQFYH CFGCGAHGNA IDFLMNYDKL EFVETVEELA AMHNLEVPFE AGSGPSQIER HQRQTLYQLM DGLNTFYQQS LQQPVATSAR QYLEKRGLSH EVIARFAIGF APPGWDNVLK RFGGNPENRQ SLIDAGMLVT NDQGRSYDRF RERVMFPIRD KRGRVIGFGG RVLGNDTPKY LNSPETDIFH KGRQLYGLYE AQQDNAEPNR LLVVEGYMDV VALAQYGINY AVASLGTSTT ADHIQLLFRA TNNVICCYDG DRAGRDAAWR ALETALPYMT DGRQLRFMFL PDGEDPDTLV RKEGKEAFEA RMEQAMPLSA FLFNSLMPQV DLSTPDGRAR LSTLALPLIS QVPGETLRIY LRQELGNKLG ILDDSQLERL MPKAAESGVS RPVPQLKRTT MRILIGLLVQ NPELATLVPP LENLDENKLP GLGLFRELVN TCLSQPGLTT GQLLEHYRGT NNAATLEKLS MWDDIADKNI AEQTFTDSLN HMFDSLLELR QEELIARERT HGLSNEERLE LWTLNQELAK K
|
| |