Gene Shewmr4_3905 details

Gene Information

Locus tag: Shewmr4_3905
Symbol:
ID: 4254468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 4655692
End bp: 4658460
Gene Length: 2769 bp
Protein Length: 922 aa
Translation table: 11
GC content: 49%
IMG OID: 638120550
Product: DNA polymerase I
Protein accession: YP_736025
Protein GI: 113972232
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
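
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the gene record above.
START_BP = 4655692
END_BP = 4658460
GENE_LENGTH_BP = 2769
PROTEIN_LENGTH_AA = 922


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of codons, excluding one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 4658460 - 4655692 + 1 = 2769 bp, and 2769 / 3 = 923 codons, of which one is the stop codon, leaving 922 amino acids.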


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0001632
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.292845
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTACCA TAGCCAATAA CCCACTTGTC CTTGTGGATG GATCTTCTTA TTTATATCGC 
GCCTATTATG CGCCTCCTCA CCTGACAAAC TCAAAGGGCG AAGCTACTGG TGCTGTTTAT
GGCGTAGTGA ATATGCTCCG CAGTTTATTA AGCCGTTATC AACCTAGCCA TATCGCTGTG
GTGTTCGACG CTAAAGGCAA AACCTTCCGC AATGACTTAT ATGAAGAATA CAAGGCACAT
CGCCCGCCTA TGCCGGATGA CCTGCGCTCA CAAATTGAAC CACTACACCG TATTATCCGT
GCCTTAGGCC TGCCCTTAAT CTCTATTCCT GGTGTTGAGG CGGACGATGT TATCGGCACA
ATCGCTCGCC AAGCGAGCCG CGAAAACCGC GCTGTACTCA TCAGCACTGG TGATAAAGAC
ATGGCGCAGC TGGTTGATGA AAATATCACG CTGATCAACA CCATGACAGA TACCATCATG
GGTCCTGAAG AGGTTGCGGC TAAATATGGT GTTGGCCCAG ACAGAATTAT CGATTTCTTG
GCGTTGATGG GCGACAAGGC AGATAACATT CCCGGTTTAC CCGGCGTAGG CGAAAAAACC
GCATTAGCTA TGCTCACAGG GGCGGGTAGT GTCGCCAATT TGCTTGCAGA GCCCGAAAAA
GTCACCGAAT TAGGCTTTAG GGGCGCAAAA ACCATGGCGG CGAAAATCAT CGACAATGCC
GACATGCTAA AACTGTCCTA TGAGCTTGCC ACCATTAAAA CCGATGTTGA GCTCGAACAA
GATTGGCATG AACTCGAAGC CAAACCCGCA GACAGGGATG AGCTGATCAA ATGCTATGGC
GAAATGGAAT TTAAACGCTG GCTTGCCGAA GTCTTAGATA ATAAGGCGCC AACGACAGCC
GCAGCAAAAG TCGAAACGAC AGAAACCCAA GAAGAAACAG CGCCCAGCGT CACGATTGAA
ACCCAATACG ATACGATTCT GACCGAAGCT CAGCTCGATG AGTGGATTGC CAAACTCAAG
CAAGCGCCAT TAATGGCCGT GGATACCGAG ACCACCAGTC TCGACTATAT GGTTGCGGAA
TTGGTTGGCC TGTCCTTTGC TGTTGAAGCC GGTAAAGCCG CTTATCTGCC CTTAGCCCAC
GATTATGTTG GCGCGCCTCA ACAATTAGAC AAGCAAACTG CACTCGAAAA ACTGCGCCCG
ATACTCGAAG ACGCCAAGCT CAAAAAAGTC GGTCAAAACC TGAAATATGA TATCAGCGTA
TTGGCAAATG CAGGCATACA ACTCCAAGGT GTGGTATTCG ACACTATGCT CGAATCCTAT
GTGTTTAACT CGGTCGCCTC ACGCCATGAT ATGGATGGGT TGGCGCTTAA ATACCTAGGC
CATAAAAATA TCGCCTTTGA AGATATCGCA GGTAAAGGTG CTAAACAGCT GACCTTCAAC
CAAATTCAGT TGGAAACAGC AGCCCCTTAT GCGGCGGAAG ATGCCGATAT AACCCTACGT
CTACATCAAC ATTTGTGGCC AAGACTCGAA AAAGAGACCG AATTAGCCTC GGTCTTTACC
GATATTGAAC TGCCGCTGAT CCAAATACTG TCCGATATTG AACGCCAAGG TGTGTTTATC
GATAGTATGT TGCTCGGCCA ACAGAGTGAT GAACTTGCCC GCAAAATCGA TGAGTTAGAA
ACAAAAGCTT ATGATATTGC AGGTGAAAAA TTCAATTTAA GCTCACCAAA GCAACTACAA
GTGCTGTTTT TTGAAAAGCT GGGTTATCCG GTCATCAAAA AAACCCCTAA GGGCGCCCCC
TCTACCGCGG AAGAAGTACT GGTTGAGTTG GCATTGGATT TCCCTCTGCC TAAAGTGATC
CTTGAGCATA GAAGCCTAAC CAAGCTAAAG AGTACTTACA CCGACAAGCT CCCTCTAATG
GTGAACGCGA AAACGGGTCG AGTACACACA AGCTACCATC AGGCCAACGC GGCAACGGGG
CGTTTGTCCT CGAGCGAACC AAACCTACAG AATATTCCTA TCCGCACCGA GGAAGGTCGC
CGTATTCGCC AAGCCTTTAT TGCGCCTCAA GGACGTAAGA TTTTGGCCGC CGACTATTCG
CAGATTGAAT TACGGATCAT GGCGCATTTA TCCCAAGATG CGGGCTTACT CAAAGCCTTC
GCTGAAGGTA AAGACATTCA CAGAGCCACC GCCGCCGAAG TATTTGGCAC CGACTTTGAC
AGCGTCACCT CGGAGCAGCG TCGCCGCGCC AAAGCCGTTA ACTTTGGCCT TATCTATGGC
ATGTCCGCCT TTGGATTGGC GCGTCAGCTC GACATTCCCC GCAACGAGGC ACAAACTTAC
ATCGACACTT ACTTCGCTCG CTATCCAGGC GTATTAAGGT ATATGGAAGA AACACGAGCC
AGTGCAGCAG AACTTGGCTA TGTCTCTACG TTATTTGGGC GCCGCCTGTA TTTACCCGAA
ATTCGCGACC GCAATGCAAT GCGCCGCCAA GCAGCAGAAA GAGCCGCGAT TAACGCCCCA
ATGCAAGGCA CCGCTGCGGA TATCATTAAA AAAGCCATGA TCAGCATTGC CGATTGGATA
AAAACCGATA CCCAAGGTGA AATCGCAATG ATCATGCAAG TCCACGACGA ATTAGTATTC
GAAGTCGATG CCGATAAAGC CGAAACTCTC AAGCTCAAGG TGTGTGAACT CATGGCAAAA
GCGGCCAATC TGGACGTGGA ACTTCTGGCA GAAGCTGGTA TTGGCGATAA CTGGGACCAA
GCCCACTAG
 
Protein sequence
MPTIANNPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL SRYQPSHIAV 
VFDAKGKTFR NDLYEEYKAH RPPMPDDLRS QIEPLHRIIR ALGLPLISIP GVEADDVIGT
IARQASRENR AVLISTGDKD MAQLVDENIT LINTMTDTIM GPEEVAAKYG VGPDRIIDFL
ALMGDKADNI PGLPGVGEKT ALAMLTGAGS VANLLAEPEK VTELGFRGAK TMAAKIIDNA
DMLKLSYELA TIKTDVELEQ DWHELEAKPA DRDELIKCYG EMEFKRWLAE VLDNKAPTTA
AAKVETTETQ EETAPSVTIE TQYDTILTEA QLDEWIAKLK QAPLMAVDTE TTSLDYMVAE
LVGLSFAVEA GKAAYLPLAH DYVGAPQQLD KQTALEKLRP ILEDAKLKKV GQNLKYDISV
LANAGIQLQG VVFDTMLESY VFNSVASRHD MDGLALKYLG HKNIAFEDIA GKGAKQLTFN
QIQLETAAPY AAEDADITLR LHQHLWPRLE KETELASVFT DIELPLIQIL SDIERQGVFI
DSMLLGQQSD ELARKIDELE TKAYDIAGEK FNLSSPKQLQ VLFFEKLGYP VIKKTPKGAP
STAEEVLVEL ALDFPLPKVI LEHRSLTKLK STYTDKLPLM VNAKTGRVHT SYHQANAATG
RLSSSEPNLQ NIPIRTEEGR RIRQAFIAPQ GRKILAADYS QIELRIMAHL SQDAGLLKAF
AEGKDIHRAT AAEVFGTDFD SVTSEQRRRA KAVNFGLIYG MSAFGLARQL DIPRNEAQTY
IDTYFARYPG VLRYMEETRA SAAELGYVST LFGRRLYLPE IRDRNAMRRQ AAERAAINAP
MQGTAADIIK KAMISIADWI KTDTQGEIAM IMQVHDELVF EVDADKAETL KLKVCELMAK
AANLDVELLA EAGIGDNWDQ AH
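
The gene and protein sequences above are related through translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the tables differ in which codons may act as start codons). The following is a minimal sketch of that translation plus a GC-fraction helper. The function names are hypothetical, the code handles only unambiguous A/C/G/T codons, and it does not model alternative start codons (this gene begins with ATG, so that does not matter here).

```python
import itertools

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGCCTACCA TAGCCAATAA CCCACTTGTC"))  # MPTIANNPLV
# The final 9 bases end in the TAG stop codon.
print(translate("GCCCACTAG"))  # AH
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and GC content.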