Gene Nwi_0357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0357 
Symbol 
ID3674911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp399513 
End bp402545 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content65% 
IMG OID637711897 
ProductDNA polymerase I 
Protein accessionYP_316976 
Protein GI75674555 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.577843 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAGA AGCCCGCCGC AGCCGAAATC CCGGCCATCA TCCCCGCGAC GATTCCCCCG 
GTGAAAGCGG CGGGGACCAC TTTGCAGGGC AAGCACGTCT TTCTGGTGGA CGGCTCCTCC
TACATCTTCC GCGCCTATCA CGCGCTGCCG CCGCTGAACC GCAAGTCCGA CGGGTTGCAG
GTCAACGCCG TGCTCGGCTT CTGCAACATG CTTTGGAAGC TGTTGCGCGA CATGCCGAAG
AACGACAGGC CGACGCACCT CGCGATCATC TTCGACAAGT CCGAGGTGAC TTTCCGTAAC
AAGCTCTATC CCGCTTACAA GGCGCACCGG CCGCCCGCGC CCGACGACCT GATTCCGCAA
TTCGCGCTGA TCCGCGAGGC GGTGAAGGCG TTCGACCTGC CCTGCCTCGA ACAGATCGGC
TTCGAGGCCG ACGACCTGAT CGCCACTTAC GTTCGGCAGG CGTGCGAGCG CGGCGCGACC
GCGACCATCG TCTCCTCGGA CAAGGATCTG ATGCAGCTCG TCAACGGCTG TGTCACCATG
TACGACACCA TGAAGGATCG TCGCCTCGGC ATCGCCGAGG TGATCGAGAA ATTCGGCGTG
CCGCCGGAGA AGGTGGTCGA GGTGCAGGCT TTGGCCGGCG ACAGCGTCGA CAACGTTCCG
GGCGTGCCGG GCATCGGCGT CAAGACAGCC GCGCAACTCA TCACCGAATA CGGCGACCTC
GACACGTTGC TGGCCCGCGC CACCGAGATC AAGCAGCCGA AGCGCCGCGA GGCGCTGATC
GAGAACGCGG AGAAGGCGCG CATCTCGCGG CAGCTCGTGC TGCTCGACGA CCACGTCGCG
CTCGATGTGC CGCTGGACGA TCTCGCGGTG CAGGAGCCCG ACGCCCGCAA GCTGATCGCT
TTCCTCAAGG CCATGGAATT CACCACGCTC ACGAAGCGCG TCGCGGATTA TTCCGAGGTC
AACGCGGCCG AGATCGAGCC GGATGCGAAG CACACGAGCG GCCGCTCAGC GATGGTGGCC
AAGGCGTCCG CAACCGCCCT CACCGGCGAT CTGTTCGGCA GCAGCGATAG CGTCGCGAAG
GGCACGCCGG CCGGCACGGC AAAAGCGACA GGCGACACCA AGATGAAAAC GCCTGCGGCC
CTTGCGGCCG CACGCCTCGA AACGGTGCGC GCGTTGCCGG TCCGGCCCGA CGCCTACGAA
ACCATCCGCA CGCTCGACCG GCTGCGCGAC TGGATCGCCC GCATCGATGA CAACGGCAGT
TTCGCCTTCG AGGCCCTGGC GCCGACCATC GATCCCATGC AGACCGAGCT GTCCGGCATC
GCGCTGGCGC TCGCCCCGAA CCATGCGTGC TACATTCCGC TCAGCCACAA GCAGTCCGGC
GACAGCGCGG GACTGTTCGC CGCCGGGCTT GCGCCCGATC AGATCGCGAT TCGCGATGCG
CTGGAGCTCC TGAAGCCGAT CCTGGAATCC CGTGGCCACC TGAAGATCGG CCTCAACGTC
AAGTTCAGCG CCGTGCTGCT CGCGCAGCAT GGCATCGTCA TGCAGAATCA CGACGACATC
GAACTGATCT CCTACGCGCT CGACGCCGGG CGCGGCTCCC ATGATCTCGC CTCGCTGGCG
AAGCGCTGGC TCGGCCACGC CACGGTGAGC CACGGCGAAC TGACCGGCAG CGGCAGGAAC
AGACTCGCCT TCGATCAGGT GGCGATCGAT CGCGCCACGG CTTACGCGGC GGAATATGCG
GATATGACCT TGCGGCTATG GCAGGTGCTC AAGCCGCGGC TGGTCGCCGA GCGCATGAAT
ACGGTCTACG AGACTCTGGA GCGGCCGATG ATCGCGACTC TGGCGCAAAT GGAGCGGCGC
GGCATCACCA TCGACCGGCA GGTGCTGTCG CGCCTGTCGG GCGAGTTCGC GCAGAGCGCG
GCCCGGCTGG AAGCCGAGAT CCAGCAGCTC GCCGGCGAGC CGATCAATGT CGGCAGTCCG
AAGCAGATCG GCGAGATCAT GTTCGGAAAG ATGGGCCTGC CGGGCGGCAG CAAGACCAAG
ACCGGCGCAT GGTCAACCTC GGCGCAGATC CTCGACGAAC TCGCCGAGCA GGGCCACGAC
TTCCCGCGCA AGATCCTGGA CTGGCGGCAG GTCTCAAAAC TCAAATCCAC CTATACCGAC
GCCCTGCCCG AATACGTTCA TCCCCAGACA AGGCGCGTGC ACACGACCTA TGCGCTCGCA
GCCACCACCA CGGGCCGGTT GTCGTCCAAC GAGCCTAACC TGCAGAACAT CCCGGTGCGC
AACGAGGAGG GCCGAAAAAT TCGCCGCGCC TTCATCGCCG CTCCCGGACA CAAACTGGTG
TCAGCCGACT ATTCTCAAAT TGAACTGCGG CTGCTCGCTG AGATCGCCGA TATCCCGGTG
CTGAAACAGG CGTTCCGCGA CGGTCTCGAC ATCCACGCCA TGACCGCGTC GGAAATGTTC
GGCGTGCCGG TGCAGGGAAT GCCCGGCGAA ATCCGCCGCC GGGCCAAGGC CATCAACTTC
GGCATCATCT ACGGCATCTC GGCGTTCGGC CTCGCCAACC AGCTTGGCAT CCCGCGCGAG
GAAGCAGCCG CCTACATCAA GAAGTATTTC GAGCGCTTTC CCGGCATTCG CGCCTACATG
GACGCGACCC GTGAATTTTG CCGCGAGCAC GGCTATGTCA CGACCCTGTT CGGCCGCAAA
TGCCATTACC CGGACATCAA GTCGCCGAAC CCGTCGCATC GCGCCTTCAA CGAGCGCGCC
GCGATCAATG CGCGTTTGCA GGGATCGGCC GCCGATATCA TCCGCCGCGC CATGGCGCGG
ATGGATGATG CTCTCGCGGC GAGGAATCTG TCCGCGCGGA TGCTGCTGCA GGTCCACGAC
GAACTGATTT TCGAGGTGCC CGACGATGAG GTCGCGGCGA CGCTGCCTGT CGTGCAGCAT
GTGATGCAGG ACGCGCCGTT CCCGGCTGTG TTGCTGTCGG TGCCGTTGCA GGTCGACGCC
CGCGCCGCCG ACAACTGGGA CGAAGCCCAT TAG
 
Protein sequence
MSKKPAAAEI PAIIPATIPP VKAAGTTLQG KHVFLVDGSS YIFRAYHALP PLNRKSDGLQ 
VNAVLGFCNM LWKLLRDMPK NDRPTHLAII FDKSEVTFRN KLYPAYKAHR PPAPDDLIPQ
FALIREAVKA FDLPCLEQIG FEADDLIATY VRQACERGAT ATIVSSDKDL MQLVNGCVTM
YDTMKDRRLG IAEVIEKFGV PPEKVVEVQA LAGDSVDNVP GVPGIGVKTA AQLITEYGDL
DTLLARATEI KQPKRREALI ENAEKARISR QLVLLDDHVA LDVPLDDLAV QEPDARKLIA
FLKAMEFTTL TKRVADYSEV NAAEIEPDAK HTSGRSAMVA KASATALTGD LFGSSDSVAK
GTPAGTAKAT GDTKMKTPAA LAAARLETVR ALPVRPDAYE TIRTLDRLRD WIARIDDNGS
FAFEALAPTI DPMQTELSGI ALALAPNHAC YIPLSHKQSG DSAGLFAAGL APDQIAIRDA
LELLKPILES RGHLKIGLNV KFSAVLLAQH GIVMQNHDDI ELISYALDAG RGSHDLASLA
KRWLGHATVS HGELTGSGRN RLAFDQVAID RATAYAAEYA DMTLRLWQVL KPRLVAERMN
TVYETLERPM IATLAQMERR GITIDRQVLS RLSGEFAQSA ARLEAEIQQL AGEPINVGSP
KQIGEIMFGK MGLPGGSKTK TGAWSTSAQI LDELAEQGHD FPRKILDWRQ VSKLKSTYTD
ALPEYVHPQT RRVHTTYALA ATTTGRLSSN EPNLQNIPVR NEEGRKIRRA FIAAPGHKLV
SADYSQIELR LLAEIADIPV LKQAFRDGLD IHAMTASEMF GVPVQGMPGE IRRRAKAINF
GIIYGISAFG LANQLGIPRE EAAAYIKKYF ERFPGIRAYM DATREFCREH GYVTTLFGRK
CHYPDIKSPN PSHRAFNERA AINARLQGSA ADIIRRAMAR MDDALAARNL SARMLLQVHD
ELIFEVPDDE VAATLPVVQH VMQDAPFPAV LLSVPLQVDA RAADNWDEAH