Gene ECH74115_4378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4378 
SymboldnaG 
ID6967052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4053721 
End bp4055466 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content52% 
IMG OID643388101 
ProductDNA primase 
Protein accessionYP_002272539 
Protein GI209396560 
COG category[L] Replication, recombination and repair 
COG ID[COG0358] DNA primase (bacterial type) 
TIGRFAM ID[TIGR01391] DNA primase, catalytic core 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000022434 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGAC GAATCCCACG CGTATTCATT AATGATCTGC TGGCACGCAC TGACATCGTC 
GATCTGATCG ATGCCCGTGT GAAGCTGAAA AAGCAGGGCA AGAATTTCCA CGCGTGTTGT
CCATTCCACA ACGAGAAAAC CCCGTCCTTC ACCGTTAACG GTGAGAAACA GTTTTACCAC
TGCTTTGGAT GTGGCGCGCA CGGCAACGCG ATCGACTTCC TGATGAACTA CGACAAGCTT
GAGTTCGTCG AAACGGTCGA AGAGCTGGCG GCAATGCACA ATCTTGAAGT GCCATTTGAA
GCAGGTAGCG GCCCCAGCCA GATCGAGCGC CATCAGCGGC AAACGCTTTA TCAGTTGATG
GACGGTCTGA ATACGTTTTA CCAACAATCT TTACAACAAC CTGTTGCCAC GTCTGCGCGC
CAGTATCTGG AAAAACGCGG ATTAAGCCAC GAGGTTATCG CTCGCTTTGC GATTGGTTTT
GCGCCTCCCG GCTGGGACAA CGTCCTGAAG CGGTTTGGCG GCAATCCAGA AAATCGCCAG
TCATTGATTG ATGCGGGGAT GTTGGTCACT AACGATCAGG GACGCAGTTA CGATCGTTTC
CGCGAGCGGG TGATGTTCCC CATTCGCGAT AAACGCGGTC GGGTGATTGG TTTTGGCGGG
CGTGTGCTGG GCAACGATAC CCCCAAATAC CTGAACTCGC CGGAAACAGA CATTTTCCAT
AAAGGCCGCC AGCTTTACGG TCTTTATGAA GCGCAGCAGG ATAACGCTGA ACCCAATCGT
CTGCTTGTGG TCGAAGGCTA TATGGACGTG GTGGCGCTGG CGCAATACGG CATTAATTAC
GCCGTTGCGT CGTTAGGTAC GTCAACCACC GCCGATCACA TACAACTGTT GTTCCGCGCG
ACCAACAATG TCATTTGCTG TTATGACGGC GACCGTGCAG GCCGTGATGC CGCATGGCGA
GCGCTGGAAA CGGCGCTGCC TTACATGACA GACGGCCGTC AGCTACGCTT TATGTTTTTG
CCTGATGGCG AAGACCCTGA CACGCTGGTA CGAAAAGAAG GTAAAGAAGC GTTTGAAGCG
CGGATGGAGC AGGCGATGCC GCTTTCCGCA TTTCTGTTTA ACAGCCTGAT GCCGCAAGTT
GATCTGAGTA CCCCTGACGG GCGCGCACGT TTGAGTACGC TGGCACTGCC ATTGATATCG
CAAGTGCCGG GCGAAACGCT GCGAATATAT CTTCGTCAGG AATTAGGCAA CAAATTAGGC
ATACTTGATG ACAGCCAGCT TGAACGATTA ATGCCAAAAG CGGCAGAGAG CGGCGTTTCT
CGCCCTGTTC CGCAGCTAAA ACGCACGACC ATGCGTATAC TTATAGGGTT GCTGGTGCAA
AATCCAGAAT TAGCGACGTT GGTCCCGCCG CTTGAGAATC TGGATGAAAA TAAGCTCCCT
GGACTTGGCT TATTCAGAGA ACTGGTCAAC ACTTGTCTCT CCCAGCCAGG TCTGACCACC
GGGCAACTTT TAGAGCACTA TCGTGGTACA AATAATGCTG CCACCCTTGA AAAACTGTCG
ATGTGGGACG ATATAGCAGA TAAGAATATT GCTGAGCAAA CCTTCACCGA CTCACTCAAC
CATATGTTTG ATTCGCTGCT TGAACTGCGC CAGGAAGAGT TAATCGCTCG TGAGCGCACG
CATGGTTTAA GCAACGAAGA ACGCCTGGAG CTCTGGACAT TAAACCAGGA GCTGGCGAAA
AAGTGA
 
Protein sequence
MAGRIPRVFI NDLLARTDIV DLIDARVKLK KQGKNFHACC PFHNEKTPSF TVNGEKQFYH 
CFGCGAHGNA IDFLMNYDKL EFVETVEELA AMHNLEVPFE AGSGPSQIER HQRQTLYQLM
DGLNTFYQQS LQQPVATSAR QYLEKRGLSH EVIARFAIGF APPGWDNVLK RFGGNPENRQ
SLIDAGMLVT NDQGRSYDRF RERVMFPIRD KRGRVIGFGG RVLGNDTPKY LNSPETDIFH
KGRQLYGLYE AQQDNAEPNR LLVVEGYMDV VALAQYGINY AVASLGTSTT ADHIQLLFRA
TNNVICCYDG DRAGRDAAWR ALETALPYMT DGRQLRFMFL PDGEDPDTLV RKEGKEAFEA
RMEQAMPLSA FLFNSLMPQV DLSTPDGRAR LSTLALPLIS QVPGETLRIY LRQELGNKLG
ILDDSQLERL MPKAAESGVS RPVPQLKRTT MRILIGLLVQ NPELATLVPP LENLDENKLP
GLGLFRELVN TCLSQPGLTT GQLLEHYRGT NNAATLEKLS MWDDIADKNI AEQTFTDSLN
HMFDSLLELR QEELIARERT HGLSNEERLE LWTLNQELAK K