Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0568 |
Symbol | |
ID | 3784788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 653959 |
End bp | 656676 |
Gene Length | 2718 bp |
Protein Length | 905 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810650 |
Product | DNA polymerase I |
Protein accession | YP_411268 |
Protein GI | 82701702 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAC TACTGCTGGT CGATGGTTCA TCTTATCTGT ATCGCGCTTT TCACGCGCTG CCCGATTTCC GCAACCGCAA TAACGAGCCG ACCGGTGCAG TCTATGGCGT GCTGAATATG CTGCGCCGCT TGCACAAGGA TTATCAGGCC GATTATAGCG CTTGCGTATT TGATGCAAAA GGCAAGACTT TCCGCGATGA GCTCTACGCC GAGTACAAGG CGAACCGGCC GCCCATGCCC GACGAGCTTG CCGCCCAGAT TGCGCCGCTC CTGGAATGCA TAAATGCAAT GGGATGGCCG ATGCTTTCTG TGGAAGGAGT AGAGGCGGAC GATGTGATCG GCACGCTGGT GAAACAGGCC GAAGGCGAGA ACATGCGCTG CATCATTTCG ACTGGCGATA AAGACATTGC GCAACTGGTG AACCCCCAAG TGACCCTCGT CAATACCATG ACGAATGAAA TGCTTGATGA AGCCGGTGTG CTTGGACGTT TCGGCGTACC TCCGGAACGC ATGCTCGACT ATCTGGCGCT GGTGGGTGAC GCGGTCGACA ACATCCCCGG CGTAGCGAAG GTGGGACCAA AAACCGCGGT GAAGTGGCTC AATCAATATG GGACACTCGA TAACTTGCTT GTCCATGCTG GCGAAATAGG CGGCGTAGTA GGGGAGAATC TGCGCAATGC GCTGGGCTGG TTGAGCAGAT CGCGGCAACT GCTCTCCATC AAGTGCGACG TTTCGTTGCC GGTCAGCCTG CATGATCTCG GACCCCAGCC CCCTGACATG GTAAAGCTCG CGGAGTTGTA CGAGCGGCTG GACTTCAAGA GCTGGTTGCG CGAATTGCAA CAGGAATCCC AGGCGGAAGG AGCCGCTGAT GCGCTTTCTC GGGGTTTTGC CCCGGCAGGC TCGAATCTGG TGCAGGCCGA TTATCAGACC ATTCTTACTC CCGAGCAACT CGATGAGTGG ATGATGCGAA TCGCCGCGGC GCCGCTCGCC TCACTCGACA CCGAAACGAC GGGGCTTGAT CCGATGCGCG CCGAGCTGGT GGGTATCTCG TTTTCCGTGG AACCGCATCA CGGGGCCTAT ATACCGCTGG GACATCGTTA TACCGGTGTC CCCCGCCAGT TGCCGCTCGA CTTTGTGCTC GAAAAGCTCA AACCCTGGCT GACAGACCCT TCCGCCCCCA AACTGGGGCA GAACCTGAAG TTTGACAGGC ACGTGTTCGC CAACCACGGC ATCGGGTTGA AAGGAATCGT TCACGATACG CTGCTGCAAT CCTATGTCTT CGAATCCCAT CGCCCGCACG ATATGGACAA CCTTGCGCTG CGGCATCTGG GGATCAAAAC CATCACCTAT GATGAAGTCA CCGGCAAGGG AGCCGCCCGA ATAGGTTTCG AGCAGGTGGA TATCGAACGC GCTGCGCAAT ATGCAGCCGA GGACGCGGAT GTCACGCTGC AGCTGCACCA GCACCTGTAT GCTGAAGTTG GCAAGGATGC AAAGCTCGAT CATATCTATC GCACGCTGGA AATGCCCGTG ATGGATGTCC TGTTCGAGAT GGAGCGCAAC GGGGTGCTCC TGGATATCAA ATTGCTGGAA ACGCAAAGCC GCGAGCTCGG CGAGAAAATG CTGGCGCTGG AGGAGCGCGC CTGCACCATT GCCGGGCTGC CATTCAACCT GAATTCACCC AAACAGATTC AGGAAATCCT GTTTGACAGA CTCAAGCTGC CTGTCATGAA AAAAACACCG AGCGGCGTTC CTTCCACCGA CGAAGATGTA CTGCAGAAAC TTGCACTTGA TTATCCCCTC GCCAGGGCTC TGCTGGATTA TCGGGGTCTT GCCAAACTCA AGTCGACTTA CACCGACAAG CTGCCGCGCA TGGTGCATCC CGTTACTGGA AGAGTGCACA CCAATTATGC CCAGGCTGTG GCGGTGACGG GCAGACTCGC CAGTAACGAA CCGAACTTGC AGAACATACC CATCCGCACT GCCGAAGGGC GGCGTATCCG GGAAGCGTTC ATTGCCCCGA GCGGATACAG AATCATTTCC GCGGATTATT CACAGATCGA ATTACGCATC ATGGCGCACA TTTCGCAGGA TGCCGGACTG CTCAAGGCCT TTGCACAAGG GGAAGACATC CATCGCGCCA CGGGCTCCGA GATATTCGGC GTTCCCCTGG AAACAGTGAA CAGCGAACAG CGCAGGTATG CAAAAATCAT CAATTTCGGC CTGATCTACG GTATGTCGGA ATTCGGACTC GCGACGCAAC TGGGGATTGA GCGGGCCGCG GCCAGAACCT ATATGGACCG CTACTTTTCC CGCTATCCGG GAGTGGCGGA CTACATGCAG CGGACACGGA AAACCGCCAG GGAAAACCGC TATGTGGAGA CCGTTTTCGG GCGGCGGCTG TGGTTGTCGG AGATCAACAG TTCCAACGGC ATGCGTCGTC AGGCGGCGGA ACGCGCTGCC ATCAACGCGC CCATGCAGGG TACTGCCGCC GATATCATCA AGCTTGCCAT GATAGAAGTG CATGACTGGC TGCGGCGGCA TGAACTGCGC AGCAAACTCG TCATGCAGGT CCATGACGAA CTGGTGTTGG AGGTGCCGGA AAACGAAATC GAGACCATAA AGCGCGAGTT GCCGTTGCTC ATGGGCAATG TGGCGCAACT TCAGGTGCCG TTATTGGTGG AAGTAGGCGT CGGGCCAAAT TGGGAGCAGG CCCACTGA
|
Protein sequence | MKTLLLVDGS SYLYRAFHAL PDFRNRNNEP TGAVYGVLNM LRRLHKDYQA DYSACVFDAK GKTFRDELYA EYKANRPPMP DELAAQIAPL LECINAMGWP MLSVEGVEAD DVIGTLVKQA EGENMRCIIS TGDKDIAQLV NPQVTLVNTM TNEMLDEAGV LGRFGVPPER MLDYLALVGD AVDNIPGVAK VGPKTAVKWL NQYGTLDNLL VHAGEIGGVV GENLRNALGW LSRSRQLLSI KCDVSLPVSL HDLGPQPPDM VKLAELYERL DFKSWLRELQ QESQAEGAAD ALSRGFAPAG SNLVQADYQT ILTPEQLDEW MMRIAAAPLA SLDTETTGLD PMRAELVGIS FSVEPHHGAY IPLGHRYTGV PRQLPLDFVL EKLKPWLTDP SAPKLGQNLK FDRHVFANHG IGLKGIVHDT LLQSYVFESH RPHDMDNLAL RHLGIKTITY DEVTGKGAAR IGFEQVDIER AAQYAAEDAD VTLQLHQHLY AEVGKDAKLD HIYRTLEMPV MDVLFEMERN GVLLDIKLLE TQSRELGEKM LALEERACTI AGLPFNLNSP KQIQEILFDR LKLPVMKKTP SGVPSTDEDV LQKLALDYPL ARALLDYRGL AKLKSTYTDK LPRMVHPVTG RVHTNYAQAV AVTGRLASNE PNLQNIPIRT AEGRRIREAF IAPSGYRIIS ADYSQIELRI MAHISQDAGL LKAFAQGEDI HRATGSEIFG VPLETVNSEQ RRYAKIINFG LIYGMSEFGL ATQLGIERAA ARTYMDRYFS RYPGVADYMQ RTRKTARENR YVETVFGRRL WLSEINSSNG MRRQAAERAA INAPMQGTAA DIIKLAMIEV HDWLRRHELR SKLVMQVHDE LVLEVPENEI ETIKRELPLL MGNVAQLQVP LLVEVGVGPN WEQAH
|
| |