Gene Information |
Locus tag | Shewmr4_3905 |
Symbol | |
ID | 4254468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 4655692 |
End bp | 4658460 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638120550 |
Product | DNA polymerase I |
Protein accession | YP_736025 |
Protein GI | 113972232 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
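The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, and that the terminal stop codon counts toward the gene length but not the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Shewmr4_3905
length_bp = gene_length(4655692, 4658460)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2769 922
```

Both results match the Gene Length (2769 bp) and Protein Length (922 aa) fields of the record.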
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0001632 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.292845 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTACCA TAGCCAATAA CCCACTTGTC CTTGTGGATG GATCTTCTTA TTTATATCGC GCCTATTATG CGCCTCCTCA CCTGACAAAC TCAAAGGGCG AAGCTACTGG TGCTGTTTAT GGCGTAGTGA ATATGCTCCG CAGTTTATTA AGCCGTTATC AACCTAGCCA TATCGCTGTG GTGTTCGACG CTAAAGGCAA AACCTTCCGC AATGACTTAT ATGAAGAATA CAAGGCACAT CGCCCGCCTA TGCCGGATGA CCTGCGCTCA CAAATTGAAC CACTACACCG TATTATCCGT GCCTTAGGCC TGCCCTTAAT CTCTATTCCT GGTGTTGAGG CGGACGATGT TATCGGCACA ATCGCTCGCC AAGCGAGCCG CGAAAACCGC GCTGTACTCA TCAGCACTGG TGATAAAGAC ATGGCGCAGC TGGTTGATGA AAATATCACG CTGATCAACA CCATGACAGA TACCATCATG GGTCCTGAAG AGGTTGCGGC TAAATATGGT GTTGGCCCAG ACAGAATTAT CGATTTCTTG GCGTTGATGG GCGACAAGGC AGATAACATT CCCGGTTTAC CCGGCGTAGG CGAAAAAACC GCATTAGCTA TGCTCACAGG GGCGGGTAGT GTCGCCAATT TGCTTGCAGA GCCCGAAAAA GTCACCGAAT TAGGCTTTAG GGGCGCAAAA ACCATGGCGG CGAAAATCAT CGACAATGCC GACATGCTAA AACTGTCCTA TGAGCTTGCC ACCATTAAAA CCGATGTTGA GCTCGAACAA GATTGGCATG AACTCGAAGC CAAACCCGCA GACAGGGATG AGCTGATCAA ATGCTATGGC GAAATGGAAT TTAAACGCTG GCTTGCCGAA GTCTTAGATA ATAAGGCGCC AACGACAGCC GCAGCAAAAG TCGAAACGAC AGAAACCCAA GAAGAAACAG CGCCCAGCGT CACGATTGAA ACCCAATACG ATACGATTCT GACCGAAGCT CAGCTCGATG AGTGGATTGC CAAACTCAAG CAAGCGCCAT TAATGGCCGT GGATACCGAG ACCACCAGTC TCGACTATAT GGTTGCGGAA TTGGTTGGCC TGTCCTTTGC TGTTGAAGCC GGTAAAGCCG CTTATCTGCC CTTAGCCCAC GATTATGTTG GCGCGCCTCA ACAATTAGAC AAGCAAACTG CACTCGAAAA ACTGCGCCCG ATACTCGAAG ACGCCAAGCT CAAAAAAGTC GGTCAAAACC TGAAATATGA TATCAGCGTA TTGGCAAATG CAGGCATACA ACTCCAAGGT GTGGTATTCG ACACTATGCT CGAATCCTAT GTGTTTAACT CGGTCGCCTC ACGCCATGAT ATGGATGGGT TGGCGCTTAA ATACCTAGGC CATAAAAATA TCGCCTTTGA AGATATCGCA GGTAAAGGTG CTAAACAGCT GACCTTCAAC CAAATTCAGT TGGAAACAGC AGCCCCTTAT GCGGCGGAAG ATGCCGATAT AACCCTACGT CTACATCAAC ATTTGTGGCC AAGACTCGAA AAAGAGACCG AATTAGCCTC GGTCTTTACC GATATTGAAC TGCCGCTGAT CCAAATACTG TCCGATATTG AACGCCAAGG TGTGTTTATC GATAGTATGT TGCTCGGCCA ACAGAGTGAT GAACTTGCCC GCAAAATCGA TGAGTTAGAA ACAAAAGCTT ATGATATTGC AGGTGAAAAA TTCAATTTAA GCTCACCAAA GCAACTACAA GTGCTGTTTT TTGAAAAGCT GGGTTATCCG GTCATCAAAA AAACCCCTAA GGGCGCCCCC 
TCTACCGCGG AAGAAGTACT GGTTGAGTTG GCATTGGATT TCCCTCTGCC TAAAGTGATC CTTGAGCATA GAAGCCTAAC CAAGCTAAAG AGTACTTACA CCGACAAGCT CCCTCTAATG GTGAACGCGA AAACGGGTCG AGTACACACA AGCTACCATC AGGCCAACGC GGCAACGGGG CGTTTGTCCT CGAGCGAACC AAACCTACAG AATATTCCTA TCCGCACCGA GGAAGGTCGC CGTATTCGCC AAGCCTTTAT TGCGCCTCAA GGACGTAAGA TTTTGGCCGC CGACTATTCG CAGATTGAAT TACGGATCAT GGCGCATTTA TCCCAAGATG CGGGCTTACT CAAAGCCTTC GCTGAAGGTA AAGACATTCA CAGAGCCACC GCCGCCGAAG TATTTGGCAC CGACTTTGAC AGCGTCACCT CGGAGCAGCG TCGCCGCGCC AAAGCCGTTA ACTTTGGCCT TATCTATGGC ATGTCCGCCT TTGGATTGGC GCGTCAGCTC GACATTCCCC GCAACGAGGC ACAAACTTAC ATCGACACTT ACTTCGCTCG CTATCCAGGC GTATTAAGGT ATATGGAAGA AACACGAGCC AGTGCAGCAG AACTTGGCTA TGTCTCTACG TTATTTGGGC GCCGCCTGTA TTTACCCGAA ATTCGCGACC GCAATGCAAT GCGCCGCCAA GCAGCAGAAA GAGCCGCGAT TAACGCCCCA ATGCAAGGCA CCGCTGCGGA TATCATTAAA AAAGCCATGA TCAGCATTGC CGATTGGATA AAAACCGATA CCCAAGGTGA AATCGCAATG ATCATGCAAG TCCACGACGA ATTAGTATTC GAAGTCGATG CCGATAAAGC CGAAACTCTC AAGCTCAAGG TGTGTGAACT CATGGCAAAA GCGGCCAATC TGGACGTGGA ACTTCTGGCA GAAGCTGGTA TTGGCGATAA CTGGGACCAA GCCCACTAG
|
Protein sequence | MPTIANNPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL SRYQPSHIAV VFDAKGKTFR NDLYEEYKAH RPPMPDDLRS QIEPLHRIIR ALGLPLISIP GVEADDVIGT IARQASRENR AVLISTGDKD MAQLVDENIT LINTMTDTIM GPEEVAAKYG VGPDRIIDFL ALMGDKADNI PGLPGVGEKT ALAMLTGAGS VANLLAEPEK VTELGFRGAK TMAAKIIDNA DMLKLSYELA TIKTDVELEQ DWHELEAKPA DRDELIKCYG EMEFKRWLAE VLDNKAPTTA AAKVETTETQ EETAPSVTIE TQYDTILTEA QLDEWIAKLK QAPLMAVDTE TTSLDYMVAE LVGLSFAVEA GKAAYLPLAH DYVGAPQQLD KQTALEKLRP ILEDAKLKKV GQNLKYDISV LANAGIQLQG VVFDTMLESY VFNSVASRHD MDGLALKYLG HKNIAFEDIA GKGAKQLTFN QIQLETAAPY AAEDADITLR LHQHLWPRLE KETELASVFT DIELPLIQIL SDIERQGVFI DSMLLGQQSD ELARKIDELE TKAYDIAGEK FNLSSPKQLQ VLFFEKLGYP VIKKTPKGAP STAEEVLVEL ALDFPLPKVI LEHRSLTKLK STYTDKLPLM VNAKTGRVHT SYHQANAATG RLSSSEPNLQ NIPIRTEEGR RIRQAFIAPQ GRKILAADYS QIELRIMAHL SQDAGLLKAF AEGKDIHRAT AAEVFGTDFD SVTSEQRRRA KAVNFGLIYG MSAFGLARQL DIPRNEAQTY IDTYFARYPG VLRYMEETRA SAAELGYVST LFGRRLYLPE IRDRNAMRRQ AAERAAINAP MQGTAADIIK KAMISIADWI KTDTQGEIAM IMQVHDELVF EVDADKAETL KLKVCELMAK AANLDVELLA EAGIGDNWDQ AH
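The protein sequence can be reproduced from the gene sequence using translation table 11, the table named in the record. The following is a minimal sketch, not the database's own pipeline: the helper name `translate` is hypothetical, whitespace between the 10-base blocks is ignored, and alternative start codons (which table 11 permits) are not handled because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, listed in TCAG codon order
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence as shown in the record
print(translate("ATGCCTACCA TAGCCAATAA CCCACTTGTC"))  # MPTIANNPLV
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should give the complete protein.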