Gene Spro_4887 details


Gene Information

Locus tag: Spro_4887
Symbol:
ID: 5602732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 5417070
End bp: 5419874
Gene Length: 2805 bp
Protein Length: 934 aa
Translation table: 11
GC content: 55%
IMG OID: 640940459
Product: DNA polymerase I
Protein accession: YP_001481107
Protein GI: 157373118
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
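
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5417070
end_bp = 5419874

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2805 bp) and Protein Length (934 aa).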


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000312634
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000015928
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTCAAA TTGCAGAAAA CCCACTAATC CTGGTTGACG GTTCCTCTTA CCTCTACCGC 
GCTTATCACG CCTTCCCTCC GCTGACCAAC TCCGCGGGTG AACCGACCGG GGCAATGTAC
GGCGTGCTGA ATATGCTGCG TAGCCTATTG CTGCAGTACC AGCCAAGCCA TGTTGCGGTG
GTGTTTGATG CCAAAGGAAA AACCTTCCGT GATGAGCTGT TCGCAGAATA CAAATCACAC
CGGCCACCTA TGCCGGACGA TCTGCGCGCG CAAATCGAGC CGTTGCACAA AATGGTCAAG
GCCATGGGGT TGCCGCTGTT GGTCACGCCC GGCGTCGAAG CCGACGACGT CATAGGCACG
CTGGCGCTGG AAGCTGAAAA GGCCGGTCAT GCGGTGCTGA TCAGCACTGG CGATAAAGAC
ATGGCGCAGT TGGTCACGTC GAACGTCACC TTGATCAACA CCATGAACAA CACCATTCTC
GGCCCGCAGG AAGTGTGCGA CAAATACGGT ATTCCGCCGG AGCTGATTAT CGACTTCCTG
GCGCTGATGG GGGATGCCTC GGATAACATC CCAGGCGTAC CAGGCGTGGG TGAGAAGACC
GCGCAGGGGC TGTTACAGGG CCTGGGTGGG CTGGATATGC TCTATGCGAA TCTGGACAGT
ATCGCCACGC TCAGCTTCCG TGGAGCCAAG ACCATGGCGG CCAAACTCGA GCAGAACAAA
GAGATGGCAT ACCTCTCTTA CAAGCTGGCC ACTATCAAAA CTGACGTTGA GCTGGATATT
ACCTGCGCCG ATCTCCAGGT GTCTCCGCTG GACGTCGATA CGTTGCAACA ATTGTTCAAA
CAGTATGAAT TTAAGCGCTG GCTGGCAGAT GTCGAAGCCG GCGTTTGGCT GGAAGGCAAG
AAAGGTGCCG GTGTGAAAGC AACCAGCGCG GCGAAATCTT CTGCCAGTGC AGTGGCAGAA
ACTGGAAAAG CCCAGGCAGA AGCAACGCTA TCGCAAGAGG GTTACGTCAC CATTCTGGAT
GAAGACACCT TCACTGAGTG GCTGGAAAAA CTGAAAAAAG CCGAAGTGTT CGCGTTTGAT
ACCGAAACCG ACGGCCTGGA TACTCTGACC GCTAACCTGA TCGGTCTGTC ATTTGCCATT
GCTCCGGGTG AAGCCGCTTA TCTGCCGGTG GCACATGACT ATCTTGATGC GCCAACGCAG
TTGGATCGAG CTCATGTCCT GGCTACGCTG AAACCGCTGC TGGAAGACGA GAAAGCGTTG
AAGGTCGGGC AAAACCTGAA GTTTGATATG AGCCTGCTGG CGCGTTACGA CATTACGCTG
CGCGGTATTG CCTTTGATAC CATGCTGGAG TCCTATGTGC TGGACAGCGT GGGCGGCCGT
CACGATATGG ACAGCTTGTC CGATCGTTAC CTTGGTCATA AAACCGTGAC CTTCGAAGAG
ATTGCCGGTA AGGGTAAAAA GCAGCTCACC TTCAACCAGA TTGCACTGGA GCAGGCAGCA
CCTTACGCCG CTGAAGATGC TGACGTGACG CTGCAATTAC ATTTGGCGAT GTGGCCGCAA
TTGAAGGAAA GCGCCGAGCT GTTGACGGTT TTCAATCAGA TTGAAATGCC GCTGTTGCCG
GTGTTGTCGC ATATCGAGCG AACCGGGGTG CTGATTGATC AAAGCATTTT GGCCACCCAT
TCCATCGAAT TGACCAAGCG CCTGGCTGAG TTGGAAATTC AGGCCCATGA GCTGGCGGAA
GAGCCTTTCA ACCTGGCGTC GACCAAACAG TTGCAGGCGA TCCTGTACGA AAAACAAAAG
TTGCCAATAC TGAAGAAAAC TCCGGGCGGT GCACCTTCGA CTAATGAAGA AGTGCTGGCC
GAGTTGGCGC TGGATTACCC ATTACCGAAG GTAATTCTGG AATACCGTGG CCTGGCGAAG
CTGAAAACCA CCTATACCGA CAAGCTGCCG CTGATGATTA ACCCGGTGAG TGGTCGGGTG
CATACCTCCT ATCACCAGGC GGTGACGGCT ACCGGGCGTC TCTCTTCCAG CGATCCCAAC
CTGCAGAATA TTCCGGTGCG TAACGACGAA GGGCGCCGTA TCCGTCAGGC ATTTATTGCC
CCTGAAGGCT ACCGCATTGT TGCTGCCGAC TATTCACAAA TTGAACTGCG TATTATGGCT
CACCTGTCAC AGGATGAGGG GTTGCTGAAA GCCTTTGCGG CTGGTGAGGA TATTCACCGC
GCCACGGCGG CTGAGGTGTT TGGCCTGCCG CTCGATAAGG TGACCAACGA GCAGCGCCGC
AGCGCCAAGG CGATTAACTT CGGCCTGATT TATGGCATGA GCGCATTTGG TCTGGCGCGT
CAGTTAGGGA TCCCACGCGG TGAAGCGCAG CGTTACATGG ATCTTTACTT CGAACGTTAT
CCGGGCGTGC TGGAGTATAT GGAGCGCACC CGTCAGCAGG CCGCCAGCCA GGGCTACGTC
AGCACGCTGG ATGGCCGCCG TCTGTATCTG CCGGATGTCA GCTCCAGCAA CGGTATGCGT
CGCAAGGCGG CCGAGCGAGC GGCGATTAAT GCCCCAATGC AGGGGACGGC AGCCGACATC
ATCAAACGTG CGATGATCGA AGTGGACGCC TGGCTGCAAG CTCAGGAAAA GCCACTGGTA
CGTATGATTA TGCAGGTACA CGATGAACTG GTGTTCGAGG TACATGAGTC GGTGCTTGAG
GAATCCAACC AGCGTATTCG TGAGCTGATG GAAAACAGTA TGGCGCTGGC CGTGCCGCTG
AAAGTCGACG TTGGCGTGGG TGCCAATTGG GATGAAGCGC ACTGA
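
The gene sequence is printed in blocks of ten bases, so the spacing has to be stripped before any calculation. As a hedged illustration, the sketch below removes the whitespace and computes the GC fraction of a sequence; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the GC content reported in this record was rounded is not known.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Small illustrative input, not the full gene.
example = "ATGC GCTA"
print(clean_sequence(example))
print(gc_fraction(example))
```

Pasting the full gene sequence into `gc_fraction` gives a value that can be compared with the GC content field of the record.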
 
Protein sequence
MAQIAENPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLL LQYQPSHVAV 
VFDAKGKTFR DELFAEYKSH RPPMPDDLRA QIEPLHKMVK AMGLPLLVTP GVEADDVIGT
LALEAEKAGH AVLISTGDKD MAQLVTSNVT LINTMNNTIL GPQEVCDKYG IPPELIIDFL
ALMGDASDNI PGVPGVGEKT AQGLLQGLGG LDMLYANLDS IATLSFRGAK TMAAKLEQNK
EMAYLSYKLA TIKTDVELDI TCADLQVSPL DVDTLQQLFK QYEFKRWLAD VEAGVWLEGK
KGAGVKATSA AKSSASAVAE TGKAQAEATL SQEGYVTILD EDTFTEWLEK LKKAEVFAFD
TETDGLDTLT ANLIGLSFAI APGEAAYLPV AHDYLDAPTQ LDRAHVLATL KPLLEDEKAL
KVGQNLKFDM SLLARYDITL RGIAFDTMLE SYVLDSVGGR HDMDSLSDRY LGHKTVTFEE
IAGKGKKQLT FNQIALEQAA PYAAEDADVT LQLHLAMWPQ LKESAELLTV FNQIEMPLLP
VLSHIERTGV LIDQSILATH SIELTKRLAE LEIQAHELAE EPFNLASTKQ LQAILYEKQK
LPILKKTPGG APSTNEEVLA ELALDYPLPK VILEYRGLAK LKTTYTDKLP LMINPVSGRV
HTSYHQAVTA TGRLSSSDPN LQNIPVRNDE GRRIRQAFIA PEGYRIVAAD YSQIELRIMA
HLSQDEGLLK AFAAGEDIHR ATAAEVFGLP LDKVTNEQRR SAKAINFGLI YGMSAFGLAR
QLGIPRGEAQ RYMDLYFERY PGVLEYMERT RQQAASQGYV STLDGRRLYL PDVSSSNGMR
RKAAERAAIN APMQGTAADI IKRAMIEVDA WLQAQEKPLV RMIMQVHDEL VFEVHESVLE
ESNQRIRELM ENSMALAVPL KVDVGVGANW DEAH
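
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration; it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the sequence is given in frame on the coding strand. The function name `translate` is illustrative, not part of any library.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; amino acids listed in the same order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(blocked_dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGGCTCAAA TTGCAGAAAA CCCACTAATC"))  # MAQIAENPLI
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GATGAAGCGC ACTGA"))  # DEAH
```

Both fragments match the start and end of the listed protein sequence.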