Gene EcSMS35_3359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3359 
SymboldnaG 
ID6144369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3436625 
End bp3438370 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content52% 
IMG OID641618188 
ProductDNA primase 
Protein accessionYP_001745338 
Protein GI170680884 
COG category[L] Replication, recombination and repair 
COG ID[COG0358] DNA primase (bacterial type) 
TIGRFAM ID[TIGR01391] DNA primase, catalytic core 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000492318 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGAC GAATCCCACG CGTATTCATT AATGATCTGC TGGCACGCAC TGACATCGTC 
GATCTGATCG ATGCCCGTGT GAAGCTGAAA AAGCAGGGCA AGAATTTCCA CGCGTGTTGT
CCATTCCACA ACGAGAAAAC CCCATCATTC ACCGTTAACG GTGAGAAACA GTTTTACCAC
TGCTTTGGAT GTGGCGCGCA CGGCAACGCG ATCGACTTCC TGATGAACTA CGACAAGCTT
GAGTTCGTCG AAACGGTCGA AGAGCTGGCA GCAATGCACA ATCTTGAAGT GCCATTTGAA
GCAGGTAGCG GCCCCAGCCA GATCGAGCGC CATCAACGAC AAACGCTTTA TCAGTTGATG
GACGGTCTGA ATACGTTTTA CCAACAATCT TTACAGCAAC CTGTTGCCAC GTCTGCGCGC
CAGTATCTGG AAAAACGCGG ATTAAGCCAC GAGGTTATCG CCCGCTTTGC GATTGGTTTT
GCCCCCCCTG GCTGGGACAA CGTCCTGAAG CGGTTTGGCG GCAATCCAGA AAATCGCCAG
TCATTGATTG ATGCGGGCAT GTTGGTCACT AACGATCAGG GGCGCAGTTA CGACCGTTTC
CGCGAGCGGG TGATGTTCCC CATTCGTGAT AAACGCGGTC GGGTGATTGG TTTTGGCGGA
CGCGTGCTGG GCAACGATAC CCCCAAATAC CTGAACTCGC CGGAAACGGA CATTTTCCAT
AAAGGCCGCC AGCTTTACGG TCTTTATGAA GCGCAGCAGG ATAACGCTGA ACCCAATCGT
CTGCTTGTGG TCGAAGGCTA TATGGACGTG GTGGCGCTGG CGCAATACGG CATTAATTAC
GCCGTTGCGT CGTTAGGTAC GTCAACCACT GCCGATCACA TACAACTGTT GTTCCGCGCG
ACCAACAATG TCATTTGCTG TTATGACGGC GACCGTGCAG GCCGCGATGC CGCCTGGCGA
GCGCTGGAAA CGGCGCTGCC TTACATGACA GACGGTCGTC AGCTACGCTT TATGTTTTTG
CCTGATGGCG AAGACCCTGA CACGCTGGTA CGAAAAGAAG GTAAAGAAGC GTTTGAAGCG
CGGATGGAGC AGGCGATGCC ACTCTCCGCA TTTCTGTTTA ACAGTCTGAT GCCGCAAGTT
GATCTGAGTA CCCCTGACGG GCGCGCACGT TTGAGTACGC TGGCACTGCC ATTGATATCG
CAAGTGCCGG GCGAAACGCT GCGAATATAT CTTCGTCAGG AATTAGGCAA CAAATTAGGC
ATACTTGATG ACAGCCAGCT TGAACGATTA ATGCCAAAAG CGGCAGAGAG CGGCGTTTCT
CGCCCTGTTC CGCAGCTAAA ACGCACGACC ATGCGTATAC TTATAGGGTT GCTGGTGCAA
AATCCAGAAT TAGCGACGTT GGTCCCGCCG CTTGAGAATC TGGATGAAAA TAAGCTCCCT
GGACTTGGCT TATTCAGAGA ACTGGTCAAC ACTTGTCTCT CCCAGCCAGG TCTGACCACC
GGGCAACTTT TAGAGCACTA TCGTGGTACA AATAATGCTG CCACCCTTGA AAAACTGTCG
ATGTGGGACG ATATAGCAGA TAAGAATATT GCTGAGCAAA CCTTCACCGA CTCACTCAAC
CATATGTTTG ATTCGCTGCT TGAACTGCGC CAGGAAGAGT TAATCGCTCG TGAGCGCACG
CATGGTTTAA GCAACGAAGA ACGCCTGGAG CTCTGGACAT TAAACCAGGA GCTGGCGAAA
AAGTGA
 
Protein sequence
MAGRIPRVFI NDLLARTDIV DLIDARVKLK KQGKNFHACC PFHNEKTPSF TVNGEKQFYH 
CFGCGAHGNA IDFLMNYDKL EFVETVEELA AMHNLEVPFE AGSGPSQIER HQRQTLYQLM
DGLNTFYQQS LQQPVATSAR QYLEKRGLSH EVIARFAIGF APPGWDNVLK RFGGNPENRQ
SLIDAGMLVT NDQGRSYDRF RERVMFPIRD KRGRVIGFGG RVLGNDTPKY LNSPETDIFH
KGRQLYGLYE AQQDNAEPNR LLVVEGYMDV VALAQYGINY AVASLGTSTT ADHIQLLFRA
TNNVICCYDG DRAGRDAAWR ALETALPYMT DGRQLRFMFL PDGEDPDTLV RKEGKEAFEA
RMEQAMPLSA FLFNSLMPQV DLSTPDGRAR LSTLALPLIS QVPGETLRIY LRQELGNKLG
ILDDSQLERL MPKAAESGVS RPVPQLKRTT MRILIGLLVQ NPELATLVPP LENLDENKLP
GLGLFRELVN TCLSQPGLTT GQLLEHYRGT NNAATLEKLS MWDDIADKNI AEQTFTDSLN
HMFDSLLELR QEELIARERT HGLSNEERLE LWTLNQELAK K