Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B4231 |
Symbol | |
ID | 6793532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 4125691 |
End bp | 4128477 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642778341 |
Product | DNA polymerase I |
Protein accession | YP_002148920 |
Protein GI | 197247609 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000696994 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCAGA TCCCAGAAAA CCCACTTATT CTCGTAGATG GCTCATCCTA TCTCTATCGC GCCTATCATG CGTTTCCGCC GTTAACCAAC AGCGCGGGAG AACCTACGGG CGCAATGTAT GGTGTTCTCA ACATGTTGCG CAGCCTGATC ATGCAGTATC AGCCGACGCA TGCTGCGGTG GTGTTTGACG CCAAAGGAAA AACCTTCCGT GATGAGCTCT TTGAACATTA CAAATCGCAT CGTCCTCCGA TGCCGGATGA TCTGCGAGCG CAAATAGAGC CGTTACATGC CATGGTTAAA GCCATGGGGT TACCTCTGCT GGCGGTCTCT GGCGTAGAAG CGGATGACGT TATCGGTACA CTGGCGCGAG AAGCGGAGAA GGTGGGGCGT CCGGTATTAA TCAGCACCGG CGATAAAGAT ATGGCACAGT TGGTGACGCC GAATATTACG CTGATCAACA CCATGACTAA CACTATCCTC GGCCCGGATG AAGTCGTTAA TAAGTACGGC GTGCCGCCTG AGCTGATTAT CGACTTTCTG GCGCTAATGG GGGACTCCTC GGATAATATT CCAGGCGTAC CAGGCGTGGG TGAGAAGACG GCGCAAGCCT TGCTTCAGGG ATTGGGCGGC CTGGATACGC TGTACGCCGA GCCGGAAAAA ATTGCCGGTC TCACTTTCCG CGGCGCCAAA ACGATGGCCG GTAAATTAGC GCAGAATAAA GACGTAGCGT ACCTGTCTTA TAAACTCGCC ACCATTAAAA CGGATGTTGA GCTGGAGCTG ACCTGCGAAC AGCTTGAAGT GCAGCAGCCG ATTGCGGATG AACTGCTGGG CCTGTTTAAA AAATATGAGT TTAAGCGCTG GACGGCGGAC GTTGAGTCAG GCAAGTGGCT ACAGGCAAAG GGCGCGAAAC CGGCGGCCAA ACCGCAGGAA ACGGTCGTTA TTGATGAATC GCTCAGCGAA CCGGCAGCGG CGCTCTCTTA TGAAAATTAT GTCACTATTC TGGACGACGT TACGCTGGAA AGCTGGATTG AAAAGCTGAA AAAAGCGCCA GTTTTTGCTT TCGACACGGA GACCGACAGT CTGGATAATA TCGCCGCCAA CCTGGTGGGC CTCTCTTTTG CTATCGAACC TGGCGTTGCC GCATATGTAC CTGTCGCGCA TGATTATCTG GACGCTCCGG ATCAAATCTC CCGCCAGCGT GCTCTGGAAC TGCTGAAGCC GCTGCTGGAA GATGAAAAAG TGCGCAAAGT GGGGCAAAAC CTCAAGTACG ATCGCGGCGT CTTGCAAAAT TACGGTATTG AGCTGCGCGG TATCGCCTTC GATACCATGC TTGAGTCTTA CATTCTGAAC AGCGTCGCCG GACGCCATGA TATGGACAGC TTGTCCGATC GTTGGCTGAA GCACAAAACT ATCACCTTTG AAGACATTGC CGGTAAAGGT AAAAACCAGC TCACCTTTAA CCAGATCGCA CTGGAGGAAG CGGGGCGCTA TGCGGCAGAA GATGCGGATG TCACGTTACA GTTGCATCTC AAAATGTGGC CTGAGCTCCA GCAGCACAAA GGCCCGCTGA ATGTTTTCGA AAACATCGAA ATGCCGCTGG TGCCGGTACT GTCACGCGTT GAGCGCAATG GTGTAAAAAT CGATCCTGCC GTCCTGCACA AACATTCGGA AGAAATCACG CTACGTCTGG CGGAACTGGA AAAGAAAGCG CATGACATTG CGGGCGAGGC GTTCAACCTG TCCTCGACGA AGCAGTTGCA GACTATCCTG TTTGAAAAGC AGGGTATTAA GCCGCTGAAG AAAACGCCTG GCGGCGCGCC GTCAACGTCG GAAGAGGTGC TGGAGGAGCT GGCGCTGGAC TATCCGCTGC CGAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTAAA ATCCACCTAT ACCGATAAGC TGCCGCTGAT GATTAACCCG AAAACCGGGC GCGTCCATAC GTCCTATCAT CAGGCGGTAA CGGCGACGGG ACGTTTATCG TCCACCGATC CGAACCTGCA AAATATTCCG GTGCGCAATG AAGAGGGCCG CCGCATTCGT CAGGCATTTA TTGCGCCTGA GGATTATCTC ATCGTGTCTG CGGACTATTC ACAGATTGAG CTGCGTATTA TGGCGCATCT TTCCCGTGAT AAAGGACTAC TCACGGCGTT CGCTGAAGGG AAGGATATTC ACCGCGCAAC GGCGGCGGAA GTCTTTGGCT TGCCGCTGGA TAGCGTGACC GGGGAACAGC GCCGAAGTGC GAAAGCCATT AACTTTGGCC TGATTTACGG GATGAGCGCC TTCGGTCTTT CTCGCCAGCT TAATATTCCG CGTAAAGAGG CGCAGAAGTA TATGGATCTC TACTTCGAAC GCTATCCTGG CGTGCTGGAA TACATGGAGC GCACCCGCGC TCAGGCAAAA GAACAAGGCT ATGTGGAAAC GCTGGAGGGA CGCCGCCTTT ACCTGCCGGA TATTAAATCC AGCAACGCGG CGCGGCGCGC GGGGGCGGAA CGCGCGGCGA TCAATGCTCC CATGCAAGGA ACGGCTGCCG ATATCATCAA GCGCGCCATG ATTGCCGTCG ATGCCTGGCT ACAGGCCGAG CAGCCGCGCG TGCGGATGAT TATGCAGGTA CACGATGAAT TAGTGTTCGA GGTGCATAAA GACGACTTAG ATGCGGTAGC AAAACGTATC CATCAGTTGA TGGAAAACTG CACGCGTATT GATGTGCCGT TGCTGGTAGA AGTCGGTAGC GGGGAAAATT GGGATCAAGC GCACTAA
|
Protein sequence | MVQIPENPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYQPTHAAV VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT LAREAEKVGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPDEVVNKYG VPPELIIDFL ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLTFRGAK TMAGKLAQNK DVAYLSYKLA TIKTDVELEL TCEQLEVQQP IADELLGLFK KYEFKRWTAD VESGKWLQAK GAKPAAKPQE TVVIDESLSE PAAALSYENY VTILDDVTLE SWIEKLKKAP VFAFDTETDS LDNIAANLVG LSFAIEPGVA AYVPVAHDYL DAPDQISRQR ALELLKPLLE DEKVRKVGQN LKYDRGVLQN YGIELRGIAF DTMLESYILN SVAGRHDMDS LSDRWLKHKT ITFEDIAGKG KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPELQQHK GPLNVFENIE MPLVPVLSRV ERNGVKIDPA VLHKHSEEIT LRLAELEKKA HDIAGEAFNL SSTKQLQTIL FEKQGIKPLK KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYL IVSADYSQIE LRIMAHLSRD KGLLTAFAEG KDIHRATAAE VFGLPLDSVT GEQRRSAKAI NFGLIYGMSA FGLSRQLNIP RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLEG RRLYLPDIKS SNAARRAGAE RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHK DDLDAVAKRI HQLMENCTRI DVPLLVEVGS GENWDQAH
|
| |