Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_0453 |
Symbol | |
ID | 4030088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | - |
Start bp | 499092 |
End bp | 502136 |
Gene Length | 3045 bp |
Protein Length | 1014 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637968981 |
Product | DNA polymerase I |
Protein accession | YP_575803 |
Protein GI | 92116074 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAAGA AGCCCACCCC TGCCACCAAG CCCGTACCCA CTCCCGCCGC CGCGGAAGCT GTGACCGTGA AGTCCGCGGC CGCGACCAAG TCAGACATGC AGGGCAAGCA CGTCTTTCTG GTCGACGGCT CCTCCTACAT CTTCCGCGCC TATCACGCGC TGCCGCCGCT GAACCGCAAA TCCGACGGCT TGCAGGTCAA CGCGGTGCTC GGCTTCTGCA ACATGCTGTG GAAGCTGTTG CGCGACATGC CGAAGGACGA CAAGCCGACC CACCTTGCGA TCATCTTCGA CAAGTCCGAG GTGACGTTCC GCAACAAGCT CTATCCCGCC TACAAGGCGC ATCGGCCGCC CGCGCCCGAC GACCTGATCC CGCAATTCGC GCTGATTCGC GAGGCCGTGA AGGCGTTCGA TCTGCCCTGC ATCGAGCAGG GCGGGTTCGA GGCCGACGAC CTGATCGCGA CCTATGTGCG GCAGGCGTGC GAACGCGGCG CAACCGCGAC CATCGTCTCC TCGGATAAGG ATCTGATGCA GCTCGTCACC GATTGCGTCA CCATGTTCGA CACCATGAAG GACCGCCGCC TCGGCATCGC CGAGGTGATC GAGAAATTCG GCGTACCGCC GGAAAAAGTC GTCGAGGTGC AGGCACTGGC CGGCGACAGC GTCGACAACG TGCCGGGCGT GCCGGGCATC GGCGTCAAGA CCGCGGCGCA GCTCATCACC GAATACGGCG ACCTCGAAAC GCTGCTGGCG CAGGCCTCCG AGATCAAGCA GCCGAAGCGA CGCGAGGCGC TGATCGAAAA CGCCGAGAAG GCGCGCATTT CGCGGCAACT GGTGCTGCTC GACGACCACG TCGCGCTCGA CGTGCCGCTG GACGACCTCG CCGTGCAGGA GCCCGATGCA CGCAAGCTGA TCGCTTTCTT GAAGGCGATG GAATTCACCA CGCTGACCAA GCGCGTCGCC GACTATTCCG AGGTCAACGC GGCCGAGATC GAGCCAGACA GGAAAAACGC CAGCGGCGCG TCTTCCACAG CAGCCAAGGC ATCCGCAGAG GCCGTCACCA GCGACTTGTT CGGCAGCGAC GGTGTCGCGA AGGCGACATC GGCCGGCAGG ACAAGGGCGA CGAGCGACAC GGCGATCAAG ACGCCGCAGG CCCTCGCCGC GGCGCGCCTC GAAGCGGTTC GAAAACTGCC GGTCGATCGC ACACAATACG AAACCATCCG CACGCTCGAC CGGCTGCAGC ACTGGATCGC CCGCATCGCG AACCATGGCA GTTTCGTCGT CGAAGCGCTG GCGCCGACAA TAGACCCTAT GCAGGCCGAA TTGTCCGGCA TCGCGCTGGC GCTCGCGCCG AACGCGGCGT GCTACGTCCC GCTCAACCAC AAGCAGGCCG GCGACAGCGC CGGTCTGTTC GCCGCCGGCC TTGCGCCCGA TCAGATCGCG ATCCGTGACG CGCTCGACCT GCTAAAGCCG CTCCTCGAAT CCGGCGGCCA CCTGAAGGTC GGCTTCAACG TCAAGTTCAC CGCCGTGCTG CTCGCGCAAC ACGGCATCGT CATGCAGAAC AACGACGACG TCGAGCTGAT TTCCTACGCG CTCGATGCCG GACGCGGCGC CCACGATCTC GAAGCGCTGG CGCAGCGCTG GCTCGATCAC ACGGCCTTGA ACTATGGCGA ACTGATCGGC AGCGGCAGGA ACAAGCTTGC CTTCGATCAG GTGACGATCG ATCGCGCCAC GACTTACGCG GCGGAGTACG CCGCCCTGAC CTTGCGGCTG TGGCAGGTGT TGAAGCCGCG GCTGGTCGCC GAGCGTATGA ATTCTGTCTA CGAGACGCTG GAACGGCCGA TGATTGCGAC GCTGGCGCGG ATGGAGCGGC GCGGCATCAC CATCGACCGG CAGGTGCTGT CGCGCCTGTC TGGCGAATTC GCGCAGACCG CGGCGCGACT GGAAGCCGAA ATCCAGAAGC TTGCCGGCGA GCCGATCAAT GTCGGCAGCC CGAAGCAGAT CGGCGAGATC ATGTTCGGCA AGATGGGCTT GCCGGGCGGC AGCAAGACCA AGACCGGCGC ATGGTCCACC TCGGCGCAAA TCCTGGACGA CCTCGCCGAG CAGGGCCACG ACTTCCCGCG CAAGATTCTC GACTGGCGGC AGGTTTCAAA ACTGAAATCG ACCTATACCG ACGCGCTGCC GGAATACGTC AATCCGCAGA CCAGCCGCGT GCACACCACC TATGCGCTCG CCGCCACCAC CACCGGGCGG CTGTCGTCGA ACGAGCCCAA CCTGCAGAAC ATTCCGGTGC GCAATGAGGA AGGGCGAAAA ATCCGCCGCG CCTTCATCGC CACGCCCGGC CACAAGCTGG TCTCGGCCGA CTACTCCCAG ATCGAACTGC GGCTGCTCGC CGAGATCGCC GACATCCCGG TGTTGAAACA AGCGTTCCGC GACGGGCTCG ACATTCACGC CATGACGGCG TCGGAAATGT TCGGCGTGCC GGTGACGGGC ATGCCGGGCG AAATCCGCCG CCGCGCCAAG GCCATCAATT TCGGCATCAT CTACGGTATC TCGGCGTTCG GCCTCGCCAA CCAGCTCGGC ATCCCGCGCG AGGAAGCCGG CACCTACATC AAGAAATATT TCGAGCGCTT TCCCGGCATC CGCGCCTACA TGGACGCGAC CCGCGACTTC TGCCGCGAGC ACGGTTATGT CGAAACGCTG TTCGGACGCA AATGTCACTA TCCGGACATC AAGTCGCCGA ACCCGTCGCA CCGCGCCTTT AACGAGCGCG CCGCGATCAA TGCGCGATTG CAGGGCACCG CCGCCGACAT CATCCGCCGC GCCATGGTGC GGATGGACGA TGCGCTGGCG GCGAAGAAGC TGTCCGCGCG AATGCTGCTG CAGGTCCACG ACGAACTGAT TTTTGAAGTG CCAGACGACG AGGTGGCCGC GACACTGCCG GTCGTCCAGC ATGTGATGCA GGACGCGCCG TTCCCGGCGA TGCTGCTGTC GGTGCCGTTG CAGGTCGACG CCCGCGCCGC CGACAACTGG GACGAGGCGC ATTAA
|
Protein sequence | MPKKPTPATK PVPTPAAAEA VTVKSAAATK SDMQGKHVFL VDGSSYIFRA YHALPPLNRK SDGLQVNAVL GFCNMLWKLL RDMPKDDKPT HLAIIFDKSE VTFRNKLYPA YKAHRPPAPD DLIPQFALIR EAVKAFDLPC IEQGGFEADD LIATYVRQAC ERGATATIVS SDKDLMQLVT DCVTMFDTMK DRRLGIAEVI EKFGVPPEKV VEVQALAGDS VDNVPGVPGI GVKTAAQLIT EYGDLETLLA QASEIKQPKR REALIENAEK ARISRQLVLL DDHVALDVPL DDLAVQEPDA RKLIAFLKAM EFTTLTKRVA DYSEVNAAEI EPDRKNASGA SSTAAKASAE AVTSDLFGSD GVAKATSAGR TRATSDTAIK TPQALAAARL EAVRKLPVDR TQYETIRTLD RLQHWIARIA NHGSFVVEAL APTIDPMQAE LSGIALALAP NAACYVPLNH KQAGDSAGLF AAGLAPDQIA IRDALDLLKP LLESGGHLKV GFNVKFTAVL LAQHGIVMQN NDDVELISYA LDAGRGAHDL EALAQRWLDH TALNYGELIG SGRNKLAFDQ VTIDRATTYA AEYAALTLRL WQVLKPRLVA ERMNSVYETL ERPMIATLAR MERRGITIDR QVLSRLSGEF AQTAARLEAE IQKLAGEPIN VGSPKQIGEI MFGKMGLPGG SKTKTGAWST SAQILDDLAE QGHDFPRKIL DWRQVSKLKS TYTDALPEYV NPQTSRVHTT YALAATTTGR LSSNEPNLQN IPVRNEEGRK IRRAFIATPG HKLVSADYSQ IELRLLAEIA DIPVLKQAFR DGLDIHAMTA SEMFGVPVTG MPGEIRRRAK AINFGIIYGI SAFGLANQLG IPREEAGTYI KKYFERFPGI RAYMDATRDF CREHGYVETL FGRKCHYPDI KSPNPSHRAF NERAAINARL QGTAADIIRR AMVRMDDALA AKKLSARMLL QVHDELIFEV PDDEVAATLP VVQHVMQDAP FPAMLLSVPL QVDARAADNW DEAH
|
| |