Gene Information |
Locus tag | SNSL254_A4847 |
Symbol | |
ID | 6483814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4718963 |
End bp | 4721293 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642740060 |
Product | bacteriophage P4 DNA primase |
Protein accession | YP_002043737 |
Protein GI | 194444462 |
COG category | [R] General function prediction only |
COG ID | [COG3378] Predicted ATPase |
TIGRFAM ID | [TIGR01613] phage/plasmid primase, P4 family, C-terminal domain |
|
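The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(4718963, 4721293)
print(length)                     # 2331
print(protein_length_aa(length))  # 776
```

Under these assumptions the computed values agree with the Gene Length (2331 bp) and Protein Length (776 aa) rows.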
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.645663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 0.521901 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGAA TGAAAGTCAG TCAGGCCGTG AAGGCTGCCC GCGGTCACTG GGCTAAGATT CTGCCAGCGC TGGGCGTGAA TATATTGAAA AATCGCCATC AGCCTTGCCC GGTCTGTGGC GGTAAAGATC GTTTTCGCTT CGACGATCAG GAAGGACGCG GAACATGGTT CTGCAATCAG TGCGGTGCCG GGGATGGTTT AGCGCTTGTC ACCAGGGCAC TGAATGTGGA TATCAGTGAA GCAGCTGACA GGATACATGG ACTGACACAT GGTCTGTCTG TAGTTAATTC CGAAGTCAGG GCGTTAACCG CCGATACCGA TAGCGGGAAA GATGCAGCAG CACTTGCCGC GCGACTGCTG CAAGCCTCAC GTGAATCTGC CGGAAACACT TATCTTACAC ATAAAGGTTT CCCGGAGCAT ATTTGCCATG AGCTGACTTC AGCTCATAAA ACCGGTGGGG TGATGTTCCG CCCCGGTGAT TTGATCGTCC CGCTCTATAA CGCAGACGGG GAGCTGGTGA ATATTCAGCT TATCAGTGGC AATGGTAGCA AGTGTTTTCT TAAAGGAGGT CAGGTTAAGG AGGCCTATCA TCTGATAGAA GGGGGCGGGA GTTCTTTAAG AAGGATGTGG ATTGCGGAAG GTTACGCCAC GGCGCTTACC ATTCATCATC TGACAGGAGA AGCCGTCATG GTGGCATTTT CGTCGGTCAA CTTTCTTTCT CTGGCCAGCG TTGCCCATAA CAAGTATCCG GGATATCAGT TAATTATTGC AGCGGACCGC GATCTGAATG GTTCAGGCCA GAACAGGGCT GAAACTGCTG CAAAAGCGTG TCAGTGTGAC ATTGTGCTGC CACCGGTTTT TGGTGACTGG AATGATGCGC TTACCCAATA CGGAGAGGAA TCCACCCGGA AGGCAATTCT TGAGGCTTTG AAGCCACATA ATGCCAGTCC TTTCGACACA ATGAGTGAAG CAGAATTCAC AGCGATGAGT GTCAGCGAAA AAGCTCAGAG GGTTCGGGAG CATTACAGGG ATGCACTGGC GGTTGATCCG AACGGGCAGC TTTTATCCCG CTACGAGTCT GGAGCGTGGA AAGTGATTTC TCAGTCAGAT TTTTCCCGAG ATGTCGCAGC CCTTTTCCAG CGCCTCGGTG CACCGTTTTC CTCAGGAAAA ATTGCCTCAC TGGTGGAAAC ATTAAAGTTA ATTGTTCCGC AGCAGCAAAA TCCGGCACGT CATCTGATTG GTTTTCGTAA TGGCGTCCTC GATACGCGAA CAGGGCTTTT TAGTCCGCAC TGTAAAGAGA ACTGGCTGCG TACCCTGTGT GAAGTCGATT TCACGCCACC GGTAAAGGGG GAAACGCTTG AAACTCATGC CCCGGCATTT TGGCGCTGGC TGGACCGGGC TGCCGGACAC AAGCCAGCTA AACGCGACAT TATTCTTGCG GCGTTATTTA TGGTGCTGGC GAACCGCTAT GACTGGCAGC TCTTTCTGGA AGTGACTGGC CCTGGTGGAA GTGGCAAAAG TATTCTGGCT GAAATTGCGA CCATGCTTGC TGGAGAGGAT AACGCCACCT CTGCGACCAT TGAAACGCTG GAGTCCCCGC GAGAACGTGC GGCTCTGATA GGTTTTTCTC TTATTCGCCT GCCTGATCAG GAGAAGTGGA GCGGCGACGG CGCCGGGCTT AAGGCTATAA CTGGTGGCGA TGCGGTATCG GTGGACCCCA AGTATCAGAA TGCTTATTCA ACACATATTC CGGCGGTTAT CCTGGCCGTA AATAATAATC CTATGCGCTT CACTGATCGT 
AGTGGTGGTG TGTCCCGCAG GAGAGTGATT CTGCATTTTC CTGAGCAGAT AGCACCGGAA GAACGCGATC CGCAGCTGAA AGATAAAATT GCTCGAGAAC TGGCTGTGAT TGTTCGCCAG CTCATGCAGC GTTTCAGCGA TCCGATGAGC GCCAGAACAT TGCTTCAGTC GCAGCAGAAT TCAGATGAGG CTCTCACCAT CAAGCGTGAT GCTGATCCAG CATTTGATTT TTGTGGCTAT CTTGAGGCAT TACCTGACAC CAACGGTATG TTTATGGGGA ACGCTAATAT TGTCCCACGC CAGCCACGTA CCTACCTTTA CCATGCCTAC CTCGTATACA TGGAGGCTAA TGGCTATAAA AACACCCTGA GTCTTACGAT GTTTGGCAAA GGGTTGCCAG TCATGCTGAA GGAGTACGGG CTACATTATG AGAAACGGCG AACTAATCAG GGAATGCAAA CTAACCTCAC TTTGAAAGAG GAGAGTAATG CAGACTGGCT GCCCAAGTGT GATGAGCCAA TATTAAAATA A
|
Protein sequence | MSGMKVSQAV KAARGHWAKI LPALGVNILK NRHQPCPVCG GKDRFRFDDQ EGRGTWFCNQ CGAGDGLALV TRALNVDISE AADRIHGLTH GLSVVNSEVR ALTADTDSGK DAAALAARLL QASRESAGNT YLTHKGFPEH ICHELTSAHK TGGVMFRPGD LIVPLYNADG ELVNIQLISG NGSKCFLKGG QVKEAYHLIE GGGSSLRRMW IAEGYATALT IHHLTGEAVM VAFSSVNFLS LASVAHNKYP GYQLIIAADR DLNGSGQNRA ETAAKACQCD IVLPPVFGDW NDALTQYGEE STRKAILEAL KPHNASPFDT MSEAEFTAMS VSEKAQRVRE HYRDALAVDP NGQLLSRYES GAWKVISQSD FSRDVAALFQ RLGAPFSSGK IASLVETLKL IVPQQQNPAR HLIGFRNGVL DTRTGLFSPH CKENWLRTLC EVDFTPPVKG ETLETHAPAF WRWLDRAAGH KPAKRDIILA ALFMVLANRY DWQLFLEVTG PGGSGKSILA EIATMLAGED NATSATIETL ESPRERAALI GFSLIRLPDQ EKWSGDGAGL KAITGGDAVS VDPKYQNAYS THIPAVILAV NNNPMRFTDR SGGVSRRRVI LHFPEQIAPE ERDPQLKDKI ARELAVIVRQ LMQRFSDPMS ARTLLQSQQN SDEALTIKRD ADPAFDFCGY LEALPDTNGM FMGNANIVPR QPRTYLYHAY LVYMEANGYK NTLSLTMFGK GLPVMLKEYG LHYEKRRTNQ GMQTNLTLKE ESNADWLPKC DEPILK
|
| |
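The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is the standard one, which is assumed to be sufficient here because translation table 11 uses the same codon-to-amino-acid assignments and differs in its permitted start codons.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 21 bases of the gene sequence in the record.
print(translate("ATGTCAGGAA TGAAAGTCAG TCAGGCCGTG"))  # MSGMKVSQAV
print(translate("GATGAGCCAA TATTAAAATA A"))           # DEPILK
```

The two fragments reproduce the first and last residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content row.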