Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4388 |
Symbol | |
ID | 6875628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4233082 |
End bp | 4235868 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642787311 |
Product | DNA polymerase I |
Protein accession | YP_002217922 |
Protein GI | 198245003 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00514081 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCAGA TCCCAGAAAA CCCACTTATT CTCGTAGATG GCTCATCCTA TCTCTATCGC GCCTATCATG CGTTTCCGCC GTTAACCAAC AGCGCGGGAG AACCTACGGG CGCAATGTAT GGTGTCCTCA ACATGTTGCG CAGCCTGATC ATGCAGTATC AGCCGACGCA TGCTGCGGTG GTGTTTGACG CCAAAGGAAA AACCTTCCGT GATGAGCTCT TTGAACACTA CAAATCGCAT CGTCCTCCGA TGCCGGATGA TCTGCGAGCG CAAATAGAGC CGTTACATGC CATGGTTAAA GCCATGGGGT TACCTCTGCT GGCAGTCTCT GGCGTAGAAG CGGATGACGT TATCGGTACA CTGGCGCGAG AAGCGGAGAA GGTGGGGCGT CCGGTATTAA TCAGCACCGG CGATAAAGAT ATGGCACAGT TGGTGACGCC GAATATTACG CTGATCAACA CCATGACTAA CACCATCCTC GGCCCGGATG AAGTCGTTAA TAAGTACGGC GTGCCGCCTG AGCTGATTAT CGACTTTCTG GCGCTGATGG GGGACTCCTC GGATAATATT CCAGGCGTAC CAGGCGTGGG TGAGAAGACG GCGCAAGCCT TGCTTCAGGG ATTGGGCGGC CTGGATACGC TGTACGCCGA GCCGGAAAAA ATTGCCGGTC TCACTTTCCG CGGCGCCAAA ACGATGGCCG GTAAATTAGC GCAGAATAAA GACGTAGCGT ACCTGTCTTA TAAACTCGCC ACCATTAAAA CGGATGTTGA GCTGGAGCTG ACCTGCGAAC AGCTTGAAGT GCAGCAGCCG ATTGCGGATG AACTGCTGGG CCTGTTTAAA AAATATGAGT TCAAGCGCTG GACGGCGGAC GTCGAGGCAG GCAAGTGGCT ACAGGCAAAG GGCGCGAAAC CGGCGGCCAA ACCGCAGGAA ACGGTCGTTA TTGATGAATC GCCCAGCGAA CCGGCAGCGG CGCTCTCTTA TGAAAATTAT GTCACGATTC TGGACGACGT TACGCTGGAA AGCTGGATTG AAAAGCTGAA AAAAGCGCCA GTTTTTGCTT TCGACACGGA GACCGACAGT CTGGATAATA TCGCCGCCAA CCTGGTGGGC CTCTCGTTTG CTATCGAACC TGGCGTTGCC GCGTATGTAC CTGTCGCGCA TGATTATCTG GACGCTCCGG ATCAAATCTC CCGCCAGCGT GCTCTGGAAC TGCTGAAGCC GCTGCTGGAA GATGAAAAAG TGCGCAAAGT GGGGCAAAAC CTCAAGTACG ATCGCGGCGT CTTGCAAAAT TACGGTATTG AGCTGCGCGG TATCGCCTTC GATACCATGC TTGAGTCTTA CATTCTGAAC AGCGTCGCCG GACGCCATGA TATGGACAGT TTGTCCGATC GTTGGCTGAA GCACAAAACT ATCACCTTTG AAGACATTGC CGGTAAAGGT AAAAACCAGC TCACCTTTAA CCAGATCGCA CTGGAGGAAG CGGGGCGCTA TGCGGCAGAA GATGCGGATG TCACTTTACA GTTGCATCTC AAAATGTGGC CTGAGCTCCA GCAGCACAAA GGCCCGCTGA ATGTTTTCGA AAACATCGAA ATGCCGTTGG TGCCTGTACT GTCACGCGTT GAGCGCAATG GCGTAAAAAT CGATCCTGCC GTCCTGCACA AACATTCGGA AGAAATCACG CTACGTCTGG CGGAACTGGA AAAGAAAGCG CATGACATTG CGGGCGAGGC GTTCAACCTG TCCTCGACGA AGCAGTTGCA GACCATCCTG TTTGAAAAGC AGGGTATTAA GCCGCTGAAG AAAACGCCTG GCGGCGCGCC GTCAACGTCG GAAGAGGTGC TGGAAGAGCT GGCGCTGGAC TATCCGCTGC CGAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTAAA ATCCACCTAT ACCGATAAGC TGCCGCTGAT GATTAACCCG AAAACCGGGC GCGTCCATAC GTCCTATCAT CAGGCGGTAA CGGCGACGGG ACGTTTATCG TCCACCGATC CGAACCTGCA AAATATTCCG GTGCGCAATG AAGAAGGCCG CCGCATTCGT CAGGCATTTA TTGCGCCTGA GGATTATCTC ATCGTGTCTG CGGACTATTC ACAGATTGAG CTGCGTATTA TGGCGCATCT TTCCCGTGAT AAAGGACTGC TCACGGCGTT CGCCGAAGGG AAGGATATTC ACCGCGCAAC GGCGGCGGAA GTCTTTGGCT TGCCGCTGGA TAGCGTGACC GGGGAACAGC GCCGAAGTGC GAAAGCCATT AACTTTGGCC TGATTTACGG GATGAGCGCC TTCGGTCTTT CTCGCCAGCT TAATATTCCG CGTAAAGAGG CGCAGAAGTA TATGGATCTC TACTTCGAAC GCTACCCTGG CGTGCTGGAA TATATGGAGC GCACCCGCGC TCAGGCAAAA GAACAAGGCT ATGTGGAAAC GCTGGAGGGA CGCCGCCTTT ACCTGCCGGA TATTAAATCT AGCAACGCGG CGCGGCGCGC GGGGGCGGAA CGCGCGGCGA TCAATGCTCC CATGCAAGGA ACGGCTGCCG ATATCATCAA GCGCGCCATG ATTGCCGTCG ATGCCTGGCT ACAGGCCGAG CAGCCACGCG TGCGGATGAT TATGCAGGTA CACGATGAAT TAGTGTTCGA GGTGCATAAA GACGACTTAG ATGCGGTAGC AAAACGTATC CATCAGTTGA TGGAAAACTG CACGCGTATT GATGTGCCGT TGCTGGTGGA AGTCGGTAGC GGAGAAAATT GGGATCAAGC GCACTAA
|
Protein sequence | MVQIPENPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYQPTHAAV VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT LAREAEKVGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPDEVVNKYG VPPELIIDFL ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLTFRGAK TMAGKLAQNK DVAYLSYKLA TIKTDVELEL TCEQLEVQQP IADELLGLFK KYEFKRWTAD VEAGKWLQAK GAKPAAKPQE TVVIDESPSE PAAALSYENY VTILDDVTLE SWIEKLKKAP VFAFDTETDS LDNIAANLVG LSFAIEPGVA AYVPVAHDYL DAPDQISRQR ALELLKPLLE DEKVRKVGQN LKYDRGVLQN YGIELRGIAF DTMLESYILN SVAGRHDMDS LSDRWLKHKT ITFEDIAGKG KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPELQQHK GPLNVFENIE MPLVPVLSRV ERNGVKIDPA VLHKHSEEIT LRLAELEKKA HDIAGEAFNL SSTKQLQTIL FEKQGIKPLK KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYL IVSADYSQIE LRIMAHLSRD KGLLTAFAEG KDIHRATAAE VFGLPLDSVT GEQRRSAKAI NFGLIYGMSA FGLSRQLNIP RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLEG RRLYLPDIKS SNAARRAGAE RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHK DDLDAVAKRI HQLMENCTRI DVPLLVEVGS GENWDQAH
|
| |