Gene Nham_0453 details


Gene Information

Locus tag: Nham_0453
Symbol:
ID: 4030088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 499092
End bp: 502136
Gene Length: 3045 bp
Protein Length: 1014 aa
Translation table: 11
GC content: 65%
IMG OID: 637968981
Product: DNA polymerase I
Protein accession: YP_575803
Protein GI: 92116074
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
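
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


length_bp = gene_length(499092, 502136)
print(length_bp)                  # 3045
print(protein_length(length_bp))  # 1014
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.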


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCAAGA AGCCCACCCC TGCCACCAAG CCCGTACCCA CTCCCGCCGC CGCGGAAGCT 
GTGACCGTGA AGTCCGCGGC CGCGACCAAG TCAGACATGC AGGGCAAGCA CGTCTTTCTG
GTCGACGGCT CCTCCTACAT CTTCCGCGCC TATCACGCGC TGCCGCCGCT GAACCGCAAA
TCCGACGGCT TGCAGGTCAA CGCGGTGCTC GGCTTCTGCA ACATGCTGTG GAAGCTGTTG
CGCGACATGC CGAAGGACGA CAAGCCGACC CACCTTGCGA TCATCTTCGA CAAGTCCGAG
GTGACGTTCC GCAACAAGCT CTATCCCGCC TACAAGGCGC ATCGGCCGCC CGCGCCCGAC
GACCTGATCC CGCAATTCGC GCTGATTCGC GAGGCCGTGA AGGCGTTCGA TCTGCCCTGC
ATCGAGCAGG GCGGGTTCGA GGCCGACGAC CTGATCGCGA CCTATGTGCG GCAGGCGTGC
GAACGCGGCG CAACCGCGAC CATCGTCTCC TCGGATAAGG ATCTGATGCA GCTCGTCACC
GATTGCGTCA CCATGTTCGA CACCATGAAG GACCGCCGCC TCGGCATCGC CGAGGTGATC
GAGAAATTCG GCGTACCGCC GGAAAAAGTC GTCGAGGTGC AGGCACTGGC CGGCGACAGC
GTCGACAACG TGCCGGGCGT GCCGGGCATC GGCGTCAAGA CCGCGGCGCA GCTCATCACC
GAATACGGCG ACCTCGAAAC GCTGCTGGCG CAGGCCTCCG AGATCAAGCA GCCGAAGCGA
CGCGAGGCGC TGATCGAAAA CGCCGAGAAG GCGCGCATTT CGCGGCAACT GGTGCTGCTC
GACGACCACG TCGCGCTCGA CGTGCCGCTG GACGACCTCG CCGTGCAGGA GCCCGATGCA
CGCAAGCTGA TCGCTTTCTT GAAGGCGATG GAATTCACCA CGCTGACCAA GCGCGTCGCC
GACTATTCCG AGGTCAACGC GGCCGAGATC GAGCCAGACA GGAAAAACGC CAGCGGCGCG
TCTTCCACAG CAGCCAAGGC ATCCGCAGAG GCCGTCACCA GCGACTTGTT CGGCAGCGAC
GGTGTCGCGA AGGCGACATC GGCCGGCAGG ACAAGGGCGA CGAGCGACAC GGCGATCAAG
ACGCCGCAGG CCCTCGCCGC GGCGCGCCTC GAAGCGGTTC GAAAACTGCC GGTCGATCGC
ACACAATACG AAACCATCCG CACGCTCGAC CGGCTGCAGC ACTGGATCGC CCGCATCGCG
AACCATGGCA GTTTCGTCGT CGAAGCGCTG GCGCCGACAA TAGACCCTAT GCAGGCCGAA
TTGTCCGGCA TCGCGCTGGC GCTCGCGCCG AACGCGGCGT GCTACGTCCC GCTCAACCAC
AAGCAGGCCG GCGACAGCGC CGGTCTGTTC GCCGCCGGCC TTGCGCCCGA TCAGATCGCG
ATCCGTGACG CGCTCGACCT GCTAAAGCCG CTCCTCGAAT CCGGCGGCCA CCTGAAGGTC
GGCTTCAACG TCAAGTTCAC CGCCGTGCTG CTCGCGCAAC ACGGCATCGT CATGCAGAAC
AACGACGACG TCGAGCTGAT TTCCTACGCG CTCGATGCCG GACGCGGCGC CCACGATCTC
GAAGCGCTGG CGCAGCGCTG GCTCGATCAC ACGGCCTTGA ACTATGGCGA ACTGATCGGC
AGCGGCAGGA ACAAGCTTGC CTTCGATCAG GTGACGATCG ATCGCGCCAC GACTTACGCG
GCGGAGTACG CCGCCCTGAC CTTGCGGCTG TGGCAGGTGT TGAAGCCGCG GCTGGTCGCC
GAGCGTATGA ATTCTGTCTA CGAGACGCTG GAACGGCCGA TGATTGCGAC GCTGGCGCGG
ATGGAGCGGC GCGGCATCAC CATCGACCGG CAGGTGCTGT CGCGCCTGTC TGGCGAATTC
GCGCAGACCG CGGCGCGACT GGAAGCCGAA ATCCAGAAGC TTGCCGGCGA GCCGATCAAT
GTCGGCAGCC CGAAGCAGAT CGGCGAGATC ATGTTCGGCA AGATGGGCTT GCCGGGCGGC
AGCAAGACCA AGACCGGCGC ATGGTCCACC TCGGCGCAAA TCCTGGACGA CCTCGCCGAG
CAGGGCCACG ACTTCCCGCG CAAGATTCTC GACTGGCGGC AGGTTTCAAA ACTGAAATCG
ACCTATACCG ACGCGCTGCC GGAATACGTC AATCCGCAGA CCAGCCGCGT GCACACCACC
TATGCGCTCG CCGCCACCAC CACCGGGCGG CTGTCGTCGA ACGAGCCCAA CCTGCAGAAC
ATTCCGGTGC GCAATGAGGA AGGGCGAAAA ATCCGCCGCG CCTTCATCGC CACGCCCGGC
CACAAGCTGG TCTCGGCCGA CTACTCCCAG ATCGAACTGC GGCTGCTCGC CGAGATCGCC
GACATCCCGG TGTTGAAACA AGCGTTCCGC GACGGGCTCG ACATTCACGC CATGACGGCG
TCGGAAATGT TCGGCGTGCC GGTGACGGGC ATGCCGGGCG AAATCCGCCG CCGCGCCAAG
GCCATCAATT TCGGCATCAT CTACGGTATC TCGGCGTTCG GCCTCGCCAA CCAGCTCGGC
ATCCCGCGCG AGGAAGCCGG CACCTACATC AAGAAATATT TCGAGCGCTT TCCCGGCATC
CGCGCCTACA TGGACGCGAC CCGCGACTTC TGCCGCGAGC ACGGTTATGT CGAAACGCTG
TTCGGACGCA AATGTCACTA TCCGGACATC AAGTCGCCGA ACCCGTCGCA CCGCGCCTTT
AACGAGCGCG CCGCGATCAA TGCGCGATTG CAGGGCACCG CCGCCGACAT CATCCGCCGC
GCCATGGTGC GGATGGACGA TGCGCTGGCG GCGAAGAAGC TGTCCGCGCG AATGCTGCTG
CAGGTCCACG ACGAACTGAT TTTTGAAGTG CCAGACGACG AGGTGGCCGC GACACTGCCG
GTCGTCCAGC ATGTGATGCA GGACGCGCCG TTCCCGGCGA TGCTGCTGTC GGTGCCGTTG
CAGGTCGACG CCCGCGCCGC CGACAACTGG GACGAGGCGC ATTAA
 
Protein sequence
MPKKPTPATK PVPTPAAAEA VTVKSAAATK SDMQGKHVFL VDGSSYIFRA YHALPPLNRK 
SDGLQVNAVL GFCNMLWKLL RDMPKDDKPT HLAIIFDKSE VTFRNKLYPA YKAHRPPAPD
DLIPQFALIR EAVKAFDLPC IEQGGFEADD LIATYVRQAC ERGATATIVS SDKDLMQLVT
DCVTMFDTMK DRRLGIAEVI EKFGVPPEKV VEVQALAGDS VDNVPGVPGI GVKTAAQLIT
EYGDLETLLA QASEIKQPKR REALIENAEK ARISRQLVLL DDHVALDVPL DDLAVQEPDA
RKLIAFLKAM EFTTLTKRVA DYSEVNAAEI EPDRKNASGA SSTAAKASAE AVTSDLFGSD
GVAKATSAGR TRATSDTAIK TPQALAAARL EAVRKLPVDR TQYETIRTLD RLQHWIARIA
NHGSFVVEAL APTIDPMQAE LSGIALALAP NAACYVPLNH KQAGDSAGLF AAGLAPDQIA
IRDALDLLKP LLESGGHLKV GFNVKFTAVL LAQHGIVMQN NDDVELISYA LDAGRGAHDL
EALAQRWLDH TALNYGELIG SGRNKLAFDQ VTIDRATTYA AEYAALTLRL WQVLKPRLVA
ERMNSVYETL ERPMIATLAR MERRGITIDR QVLSRLSGEF AQTAARLEAE IQKLAGEPIN
VGSPKQIGEI MFGKMGLPGG SKTKTGAWST SAQILDDLAE QGHDFPRKIL DWRQVSKLKS
TYTDALPEYV NPQTSRVHTT YALAATTTGR LSSNEPNLQN IPVRNEEGRK IRRAFIATPG
HKLVSADYSQ IELRLLAEIA DIPVLKQAFR DGLDIHAMTA SEMFGVPVTG MPGEIRRRAK
AINFGIIYGI SAFGLANQLG IPREEAGTYI KKYFERFPGI RAYMDATRDF CREHGYVETL
FGRKCHYPDI KSPNPSHRAF NERAAINARL QGTAADIIRR AMVRMDDALA AKKLSARMLL
QVHDELIFEV PDDEVAATLP VVQHVMQDAP FPAMLLSVPL QVDARAADNW DEAH
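
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers introduced for illustration; only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace and upper-casing it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence as printed in this record.
first_line = "ATGCCCAAGA AGCCCACCCC TGCCACCAAG CCCGTACCCA CTCCCGCCGC CGCGGAAGCT"
fragment = clean_sequence(first_line)

print(len(fragment))          # 60
print(gc_fraction(fragment))  # GC fraction of this 60-base fragment only
```

Applying `clean_sequence` to the full gene and protein blocks should yield strings whose lengths can be compared with the Gene Length and Protein Length fields, assuming the page reproduces the sequences completely.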