Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3359 |
Symbol | dnaG |
ID | 6144369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3436625 |
End bp | 3438370 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618188 |
Product | DNA primase |
Protein accession | YP_001745338 |
Protein GI | 170680884 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0358] DNA primase (bacterial type) |
TIGRFAM ID | [TIGR01391] DNA primase, catalytic core |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000492318 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGAC GAATCCCACG CGTATTCATT AATGATCTGC TGGCACGCAC TGACATCGTC GATCTGATCG ATGCCCGTGT GAAGCTGAAA AAGCAGGGCA AGAATTTCCA CGCGTGTTGT CCATTCCACA ACGAGAAAAC CCCATCATTC ACCGTTAACG GTGAGAAACA GTTTTACCAC TGCTTTGGAT GTGGCGCGCA CGGCAACGCG ATCGACTTCC TGATGAACTA CGACAAGCTT GAGTTCGTCG AAACGGTCGA AGAGCTGGCA GCAATGCACA ATCTTGAAGT GCCATTTGAA GCAGGTAGCG GCCCCAGCCA GATCGAGCGC CATCAACGAC AAACGCTTTA TCAGTTGATG GACGGTCTGA ATACGTTTTA CCAACAATCT TTACAGCAAC CTGTTGCCAC GTCTGCGCGC CAGTATCTGG AAAAACGCGG ATTAAGCCAC GAGGTTATCG CCCGCTTTGC GATTGGTTTT GCCCCCCCTG GCTGGGACAA CGTCCTGAAG CGGTTTGGCG GCAATCCAGA AAATCGCCAG TCATTGATTG ATGCGGGCAT GTTGGTCACT AACGATCAGG GGCGCAGTTA CGACCGTTTC CGCGAGCGGG TGATGTTCCC CATTCGTGAT AAACGCGGTC GGGTGATTGG TTTTGGCGGA CGCGTGCTGG GCAACGATAC CCCCAAATAC CTGAACTCGC CGGAAACGGA CATTTTCCAT AAAGGCCGCC AGCTTTACGG TCTTTATGAA GCGCAGCAGG ATAACGCTGA ACCCAATCGT CTGCTTGTGG TCGAAGGCTA TATGGACGTG GTGGCGCTGG CGCAATACGG CATTAATTAC GCCGTTGCGT CGTTAGGTAC GTCAACCACT GCCGATCACA TACAACTGTT GTTCCGCGCG ACCAACAATG TCATTTGCTG TTATGACGGC GACCGTGCAG GCCGCGATGC CGCCTGGCGA GCGCTGGAAA CGGCGCTGCC TTACATGACA GACGGTCGTC AGCTACGCTT TATGTTTTTG CCTGATGGCG AAGACCCTGA CACGCTGGTA CGAAAAGAAG GTAAAGAAGC GTTTGAAGCG CGGATGGAGC AGGCGATGCC ACTCTCCGCA TTTCTGTTTA ACAGTCTGAT GCCGCAAGTT GATCTGAGTA CCCCTGACGG GCGCGCACGT TTGAGTACGC TGGCACTGCC ATTGATATCG CAAGTGCCGG GCGAAACGCT GCGAATATAT CTTCGTCAGG AATTAGGCAA CAAATTAGGC ATACTTGATG ACAGCCAGCT TGAACGATTA ATGCCAAAAG CGGCAGAGAG CGGCGTTTCT CGCCCTGTTC CGCAGCTAAA ACGCACGACC ATGCGTATAC TTATAGGGTT GCTGGTGCAA AATCCAGAAT TAGCGACGTT GGTCCCGCCG CTTGAGAATC TGGATGAAAA TAAGCTCCCT GGACTTGGCT TATTCAGAGA ACTGGTCAAC ACTTGTCTCT CCCAGCCAGG TCTGACCACC GGGCAACTTT TAGAGCACTA TCGTGGTACA AATAATGCTG CCACCCTTGA AAAACTGTCG ATGTGGGACG ATATAGCAGA TAAGAATATT GCTGAGCAAA CCTTCACCGA CTCACTCAAC CATATGTTTG ATTCGCTGCT TGAACTGCGC CAGGAAGAGT TAATCGCTCG TGAGCGCACG CATGGTTTAA GCAACGAAGA ACGCCTGGAG CTCTGGACAT TAAACCAGGA GCTGGCGAAA AAGTGA
|
Protein sequence | MAGRIPRVFI NDLLARTDIV DLIDARVKLK KQGKNFHACC PFHNEKTPSF TVNGEKQFYH CFGCGAHGNA IDFLMNYDKL EFVETVEELA AMHNLEVPFE AGSGPSQIER HQRQTLYQLM DGLNTFYQQS LQQPVATSAR QYLEKRGLSH EVIARFAIGF APPGWDNVLK RFGGNPENRQ SLIDAGMLVT NDQGRSYDRF RERVMFPIRD KRGRVIGFGG RVLGNDTPKY LNSPETDIFH KGRQLYGLYE AQQDNAEPNR LLVVEGYMDV VALAQYGINY AVASLGTSTT ADHIQLLFRA TNNVICCYDG DRAGRDAAWR ALETALPYMT DGRQLRFMFL PDGEDPDTLV RKEGKEAFEA RMEQAMPLSA FLFNSLMPQV DLSTPDGRAR LSTLALPLIS QVPGETLRIY LRQELGNKLG ILDDSQLERL MPKAAESGVS RPVPQLKRTT MRILIGLLVQ NPELATLVPP LENLDENKLP GLGLFRELVN TCLSQPGLTT GQLLEHYRGT NNAATLEKLS MWDDIADKNI AEQTFTDSLN HMFDSLLELR QEELIARERT HGLSNEERLE LWTLNQELAK K
|
| |