Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_4887 |
Symbol | |
ID | 5602732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 5417070 |
End bp | 5419874 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640940459 |
Product | DNA polymerase I |
Protein accession | YP_001481107 |
Protein GI | 157373118 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000312634 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000015928 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTCAAA TTGCAGAAAA CCCACTAATC CTGGTTGACG GTTCCTCTTA CCTCTACCGC GCTTATCACG CCTTCCCTCC GCTGACCAAC TCCGCGGGTG AACCGACCGG GGCAATGTAC GGCGTGCTGA ATATGCTGCG TAGCCTATTG CTGCAGTACC AGCCAAGCCA TGTTGCGGTG GTGTTTGATG CCAAAGGAAA AACCTTCCGT GATGAGCTGT TCGCAGAATA CAAATCACAC CGGCCACCTA TGCCGGACGA TCTGCGCGCG CAAATCGAGC CGTTGCACAA AATGGTCAAG GCCATGGGGT TGCCGCTGTT GGTCACGCCC GGCGTCGAAG CCGACGACGT CATAGGCACG CTGGCGCTGG AAGCTGAAAA GGCCGGTCAT GCGGTGCTGA TCAGCACTGG CGATAAAGAC ATGGCGCAGT TGGTCACGTC GAACGTCACC TTGATCAACA CCATGAACAA CACCATTCTC GGCCCGCAGG AAGTGTGCGA CAAATACGGT ATTCCGCCGG AGCTGATTAT CGACTTCCTG GCGCTGATGG GGGATGCCTC GGATAACATC CCAGGCGTAC CAGGCGTGGG TGAGAAGACC GCGCAGGGGC TGTTACAGGG CCTGGGTGGG CTGGATATGC TCTATGCGAA TCTGGACAGT ATCGCCACGC TCAGCTTCCG TGGAGCCAAG ACCATGGCGG CCAAACTCGA GCAGAACAAA GAGATGGCAT ACCTCTCTTA CAAGCTGGCC ACTATCAAAA CTGACGTTGA GCTGGATATT ACCTGCGCCG ATCTCCAGGT GTCTCCGCTG GACGTCGATA CGTTGCAACA ATTGTTCAAA CAGTATGAAT TTAAGCGCTG GCTGGCAGAT GTCGAAGCCG GCGTTTGGCT GGAAGGCAAG AAAGGTGCCG GTGTGAAAGC AACCAGCGCG GCGAAATCTT CTGCCAGTGC AGTGGCAGAA ACTGGAAAAG CCCAGGCAGA AGCAACGCTA TCGCAAGAGG GTTACGTCAC CATTCTGGAT GAAGACACCT TCACTGAGTG GCTGGAAAAA CTGAAAAAAG CCGAAGTGTT CGCGTTTGAT ACCGAAACCG ACGGCCTGGA TACTCTGACC GCTAACCTGA TCGGTCTGTC ATTTGCCATT GCTCCGGGTG AAGCCGCTTA TCTGCCGGTG GCACATGACT ATCTTGATGC GCCAACGCAG TTGGATCGAG CTCATGTCCT GGCTACGCTG AAACCGCTGC TGGAAGACGA GAAAGCGTTG AAGGTCGGGC AAAACCTGAA GTTTGATATG AGCCTGCTGG CGCGTTACGA CATTACGCTG CGCGGTATTG CCTTTGATAC CATGCTGGAG TCCTATGTGC TGGACAGCGT GGGCGGCCGT CACGATATGG ACAGCTTGTC CGATCGTTAC CTTGGTCATA AAACCGTGAC CTTCGAAGAG ATTGCCGGTA AGGGTAAAAA GCAGCTCACC TTCAACCAGA TTGCACTGGA GCAGGCAGCA CCTTACGCCG CTGAAGATGC TGACGTGACG CTGCAATTAC ATTTGGCGAT GTGGCCGCAA TTGAAGGAAA GCGCCGAGCT GTTGACGGTT TTCAATCAGA TTGAAATGCC GCTGTTGCCG GTGTTGTCGC ATATCGAGCG AACCGGGGTG CTGATTGATC AAAGCATTTT GGCCACCCAT TCCATCGAAT TGACCAAGCG CCTGGCTGAG TTGGAAATTC AGGCCCATGA GCTGGCGGAA GAGCCTTTCA ACCTGGCGTC GACCAAACAG TTGCAGGCGA TCCTGTACGA AAAACAAAAG TTGCCAATAC TGAAGAAAAC TCCGGGCGGT GCACCTTCGA CTAATGAAGA AGTGCTGGCC GAGTTGGCGC TGGATTACCC ATTACCGAAG GTAATTCTGG AATACCGTGG CCTGGCGAAG CTGAAAACCA CCTATACCGA CAAGCTGCCG CTGATGATTA ACCCGGTGAG TGGTCGGGTG CATACCTCCT ATCACCAGGC GGTGACGGCT ACCGGGCGTC TCTCTTCCAG CGATCCCAAC CTGCAGAATA TTCCGGTGCG TAACGACGAA GGGCGCCGTA TCCGTCAGGC ATTTATTGCC CCTGAAGGCT ACCGCATTGT TGCTGCCGAC TATTCACAAA TTGAACTGCG TATTATGGCT CACCTGTCAC AGGATGAGGG GTTGCTGAAA GCCTTTGCGG CTGGTGAGGA TATTCACCGC GCCACGGCGG CTGAGGTGTT TGGCCTGCCG CTCGATAAGG TGACCAACGA GCAGCGCCGC AGCGCCAAGG CGATTAACTT CGGCCTGATT TATGGCATGA GCGCATTTGG TCTGGCGCGT CAGTTAGGGA TCCCACGCGG TGAAGCGCAG CGTTACATGG ATCTTTACTT CGAACGTTAT CCGGGCGTGC TGGAGTATAT GGAGCGCACC CGTCAGCAGG CCGCCAGCCA GGGCTACGTC AGCACGCTGG ATGGCCGCCG TCTGTATCTG CCGGATGTCA GCTCCAGCAA CGGTATGCGT CGCAAGGCGG CCGAGCGAGC GGCGATTAAT GCCCCAATGC AGGGGACGGC AGCCGACATC ATCAAACGTG CGATGATCGA AGTGGACGCC TGGCTGCAAG CTCAGGAAAA GCCACTGGTA CGTATGATTA TGCAGGTACA CGATGAACTG GTGTTCGAGG TACATGAGTC GGTGCTTGAG GAATCCAACC AGCGTATTCG TGAGCTGATG GAAAACAGTA TGGCGCTGGC CGTGCCGCTG AAAGTCGACG TTGGCGTGGG TGCCAATTGG GATGAAGCGC ACTGA
|
Protein sequence | MAQIAENPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLL LQYQPSHVAV VFDAKGKTFR DELFAEYKSH RPPMPDDLRA QIEPLHKMVK AMGLPLLVTP GVEADDVIGT LALEAEKAGH AVLISTGDKD MAQLVTSNVT LINTMNNTIL GPQEVCDKYG IPPELIIDFL ALMGDASDNI PGVPGVGEKT AQGLLQGLGG LDMLYANLDS IATLSFRGAK TMAAKLEQNK EMAYLSYKLA TIKTDVELDI TCADLQVSPL DVDTLQQLFK QYEFKRWLAD VEAGVWLEGK KGAGVKATSA AKSSASAVAE TGKAQAEATL SQEGYVTILD EDTFTEWLEK LKKAEVFAFD TETDGLDTLT ANLIGLSFAI APGEAAYLPV AHDYLDAPTQ LDRAHVLATL KPLLEDEKAL KVGQNLKFDM SLLARYDITL RGIAFDTMLE SYVLDSVGGR HDMDSLSDRY LGHKTVTFEE IAGKGKKQLT FNQIALEQAA PYAAEDADVT LQLHLAMWPQ LKESAELLTV FNQIEMPLLP VLSHIERTGV LIDQSILATH SIELTKRLAE LEIQAHELAE EPFNLASTKQ LQAILYEKQK LPILKKTPGG APSTNEEVLA ELALDYPLPK VILEYRGLAK LKTTYTDKLP LMINPVSGRV HTSYHQAVTA TGRLSSSDPN LQNIPVRNDE GRRIRQAFIA PEGYRIVAAD YSQIELRIMA HLSQDEGLLK AFAAGEDIHR ATAAEVFGLP LDKVTNEQRR SAKAINFGLI YGMSAFGLAR QLGIPRGEAQ RYMDLYFERY PGVLEYMERT RQQAASQGYV STLDGRRLYL PDVSSSNGMR RKAAERAAIN APMQGTAADI IKRAMIEVDA WLQAQEKPLV RMIMQVHDEL VFEVHESVLE ESNQRIRELM ENSMALAVPL KVDVGVGANW DEAH
|
| |