Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_3906 |
Symbol | |
ID | 5081336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | + |
Start bp | 4531281 |
End bp | 4534049 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640501118 |
Product | DNA polymerase I |
Protein accession | YP_001185411 |
Protein GI | 146294987 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000887796 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTACCA TTGCAAACAA CCCACTTGTC CTTGTGGATG GATCTTCTTA TTTATACCGC GCCTATTATG CGCCTCCTCA CCTGACAAAC TCAAAGGGCG AAGCAACGGG TGCCGTCTAT GGCGTAGTAA ATATGTTACG CAGTCTATTG AGCCGTTACC AACCGAGCCA TATCGCCGTC GTATTCGATG CCAAAGGTAA AACCTTCCGT AATGATATGT ACAGCGAATA CAAAGCACAG CGCCCACCGA TGCCGGATGA TCTTCGTTCA CAAATCGAAC CTCTGCATAG GATTATTCAC GCCCTTGGTT TACCCCTTAT CTCTATCCCT GGCGTTGAAG CGGATGATGT TATCGGCACA ATTGCCCGCC AAGCAAGCCG TGAGAATCGC GCCGTACTCA TTAGTACTGG TGATAAAGAT ATGGCCCAAT TGGTTGATGA AAATATCACG CTGATCAATA CCATGACAGA TACCATCATG GGGCCCGATG AAGTTGCAGC AAAATATGGT GTAGGTCCAG ATCGTATTAT TGATTTCTTA GCCTTGATGG GCGATAAGGC CGATAATATT CCAGGCTTAC CTGGTGTCGG CGAAAAAACG GCATTAGCTA TGCTGACTGG CGCAGGAAGC GTGGCCAACC TACTCGCAGC TCCCGAAAAG GTCTCTGACT TAGGCTTCAG GGGCGCAAAA ACGATGGCGG CAAAAATCAT TGAAAATGCC GACATGCTTA GACTCTCCTA CGAACTTGCT ACCATTAAAA CCGATGTGGA ACTTGAGCAA GATTGGCATG AGCTCACTGC CAAACCCGCA GATAGAGACG AACTGATCAA ATGCTACAGC GAGATGGAGT TTAAGCGCTG GCTTGCCGAG GTCTTAGATA ATAAAGCACC AACGACCGTA GTCGCGGATG CGGCTAAAGT CAAAACCCAA GAAAGCTCTA TGCCAAATCC TGCACTTGAA ACTCAATACG ACACGATTTT AACCGAAGCT CAGCTTGATG AATGGATTGC CAAGCTGAAA CAAGCATCAT TAATAGCCGT GGATACAGAG ACCACGAGTC TCGACTATAT GGTCGCGGAA CTGGTTGGCA TGTCTTTCGC AGTGGAAGCA GGCAAAGCCG CTTACCTGCC TTTAGCTCAC GATTATGTTG GCGCGCCTCA ACAGTTAGAT AAGCAATTTG CGCTCGAAAA ACTACGCCCT ATTTTAGAAG ACGATAAGGT CAAAAAAGTC GGCCAGAACC TGAAATATGA CATCAGCGTT TTGGCTAATG CGGGCATTAA ATTACAGGGT GTCGTATTTG ATACCATGCT CGAATCCTAT GTTTTTAACT CCGTGGCTTC ACGCCATGAT ATGGATGGAT TAGCGATCAA GTATCTAGGC CATAAAAACA TTGGCTTTGA AGATATTGCA GGCAAAGGGG CGAAACAACT TACCTTCAAT CAAATCCCGT TGGAAGTTGC AGCCCCGTAT GCTGCGGAGG ATGCGGATAT TACTCTGCGT TTACACCAGC ACCTATGGCC AAGGCTAGAA AAAGAACCCC AATTGGCCTC AGTATTTACT GAAATTGAAT TACCCTTAAT CCAAGTACTA TCAGACATTG AGCGCCAAGG TGTATTAATT GATAGCATGT TACTCGGCCA ACAGAGTGAT GAGCTAGCGC GTAAAATCGA TACCTTAGAA GAAAAAGCCT ACGAGATTGC AGGTGAAAAA TTTAATCTCG GTTCACCCAA ACAACTGCAA GTGCTATTTT TTGAAAAACT AGGTTATCCA ATCACCAAAA AAACACCAAA GGGCGCGCCA TCGACCGCGG AAGAAGTATT AGTCGAATTA GCCTTAGATT TCCCACTGCC AAAGGTGATC CTCGAGCATC GCAGCCTATC TAAGTTAAAA AGTACTTACA CAGATAAACT ACCGCTGATG GTCAATGCCA AGACTGGGCG CGTTCACACA AGTTACCACC AAGCTAATGC GGCAACAGGG CGCTTATCGT CAAGCGAGCC TAACCTGCAG AATATTCCTA TTCGTACCGA AGAAGGTCGC CGCATTCGCC AAGCCTTTAT CGCCCCTGCA GGTCGTAAAA TTTTGGCCGC TGATTATTCA CAAATCGAAC TGCGGATCAT GGCACATCTA TCCCAAGATG CGGGTCTGCT CAAAGCCTTC GCCGAGGGTA AAGACATTCA CAGAGCCACC GCTGCCGAAG TATTTGGAAC TGACTTTGAT GCTGTAACAA CAGAACAGCG ACGCCGCGCT AAAGCCGTAA ACTTCGGCCT TATTTATGGT ATGTCAGCCT TTGGTTTAGC TCGTCAACTG GATATTCCTC GTAACGAAGC TCAAACCTAT ATTGATACTT ACTTCGCCCG CTACCCGGGA GTGTTAAGGT ATATGGAAGA AACGCGCGCA GGAGCAGCAG AATTAGGATA TGTTTCAACA CTATTTGGTC GTCGCCTCTA CTTGCCCGAA ATCCGGGATC GTAATGCAAT GCGCCGCCAA GGCGCAGAAC GTGCTGCGAT TAACGCACCG ATGCAAGGCA CGGCTGCCGA TATCATCAAA AAAGCCATGA TCAATATTGC CCAGTGGATC AAGACAGAAA CCCAAGGCGA AATCGCCATG ATTATGCAAG TCCACGACGA ATTAGTATTC GAAGTCGATG CCGATAAAGC AGAGACGCTC AAACAAAAAG TATGTGAACT CATGGCGAAG GCTGCAATCC TCGATGTAGC ATTACTCGCA GAAGCTGGCA TTGGTGATAA TTGGGATGAA GCCCACTAA
|
Protein sequence | MPTIANNPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL SRYQPSHIAV VFDAKGKTFR NDMYSEYKAQ RPPMPDDLRS QIEPLHRIIH ALGLPLISIP GVEADDVIGT IARQASRENR AVLISTGDKD MAQLVDENIT LINTMTDTIM GPDEVAAKYG VGPDRIIDFL ALMGDKADNI PGLPGVGEKT ALAMLTGAGS VANLLAAPEK VSDLGFRGAK TMAAKIIENA DMLRLSYELA TIKTDVELEQ DWHELTAKPA DRDELIKCYS EMEFKRWLAE VLDNKAPTTV VADAAKVKTQ ESSMPNPALE TQYDTILTEA QLDEWIAKLK QASLIAVDTE TTSLDYMVAE LVGMSFAVEA GKAAYLPLAH DYVGAPQQLD KQFALEKLRP ILEDDKVKKV GQNLKYDISV LANAGIKLQG VVFDTMLESY VFNSVASRHD MDGLAIKYLG HKNIGFEDIA GKGAKQLTFN QIPLEVAAPY AAEDADITLR LHQHLWPRLE KEPQLASVFT EIELPLIQVL SDIERQGVLI DSMLLGQQSD ELARKIDTLE EKAYEIAGEK FNLGSPKQLQ VLFFEKLGYP ITKKTPKGAP STAEEVLVEL ALDFPLPKVI LEHRSLSKLK STYTDKLPLM VNAKTGRVHT SYHQANAATG RLSSSEPNLQ NIPIRTEEGR RIRQAFIAPA GRKILAADYS QIELRIMAHL SQDAGLLKAF AEGKDIHRAT AAEVFGTDFD AVTTEQRRRA KAVNFGLIYG MSAFGLARQL DIPRNEAQTY IDTYFARYPG VLRYMEETRA GAAELGYVST LFGRRLYLPE IRDRNAMRRQ GAERAAINAP MQGTAADIIK KAMINIAQWI KTETQGEIAM IMQVHDELVF EVDADKAETL KQKVCELMAK AAILDVALLA EAGIGDNWDE AH
|
| |