Gene Information |
Locus tag | Noc_0554 |
Symbol | |
ID | 3706747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 597587 |
End bp | 600298 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637737063 |
Product | DNA polymerase A |
Protein accession | YP_342604 |
Protein GI | 77164079 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
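The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, so add 1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and is not counted as a residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record for Noc_0554.
gene_len, protein_len = feature_lengths(597587, 600298)
print(gene_len, protein_len)  # 2712 903
```

The result matches the Gene Length (2712 bp) and Protein Length (903 aa) fields. The strand does not affect the length, only the direction in which the sequence is read.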
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.266566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATGG CGGAAAACCC GGTGCTTATT CTAATCGACG GCTCCTCCTA TCTCTTTCGT GCTTTCCATG CACTGCCTTC TTTAACGACT TCTAAAGGTC AACCTACCGG GGCAATCTAT GGGGTCATCA ATATGCTGCG CAAGTTACTT GATGAGTATC AGCCCCAATA CATAGCGGTA GTTTTTGATG CCAAGGGCAA AACATTTCGC CATGAACTCT TTGAGCAGTA CAAAGACCAT CGCCCTCCTA TGCCAGAGGA ACTTGCCTGT CAGATTCAGC CCCTCCACGA TCTCATCCGA GCCCTTGGCT TGCCGCTGCT CTGTGTTAAA GGGGTTGAGG CTGACGATGT CATCGGCACC TTGGCCCGGC AAGCAACCGC GCAACGGCTG GAGACCCTCA TTTCTAGCGG CGATAAGGAC TTAGCCCAGC TAGTCAATCC CCATGTGAGC CTGGTCAACA CCATGAATCT ATCTAAATTG GACCCCGCTG GCGTAAAGGC CAAATTCAAT GTTTCCCCGG AACAAATAGT CGATTACCTA GCACTGGTGG GCGATACGGT AGACAATATC CCCGGTATCC CTGGCATAGG TCCCAAAACA GCCGCCAAAC TGCTCTGCCA ATACCACAGC CTGGATCAAA TCATGGCTTA TGCCTCTGAA ATTAAAGGCA AGATGGGGGA AAGTCTACGA TCCCATCTCA CACAACTTCC CTTGGCAAAG GAATTGGCCA CCGTTCGCCA GGATCTGCTC CTAGATTTAG GCCCCAAAGA CTTGCGGTGC GCGCCACCCA ATATTCCCGC CCTGCGTGAA TTGTACGCCG CCCTGGAATT CAAAAGCTGG CTCCGTGAAC TCTTGGACAA TGAGAACGCC CACTCCTCAA TTTCTAATTC CTCAACGAAC TCTGCTCCCG CCTATGAGAC CGTGTTTTCC GAGGAAAGCT TTGAGAATTG GGTTGCCCGC CTGGAAAAGG CTGAATTATT TGCCTTCGAT CTGGAAACCA ATAACTTGGA CTATATAGAA GCCGAAATCG TCGGTCTCTC TTTTGCCATC CAGCCCCATG AAGCTATGTA TATTCCCCTC GGTCACGAGG ATGCCACCGC CCCCCCCCAG CTACCCCGTG AACAGGTTTT AGCGCGGCTC AAGCCACTGC TAGAAGACCC TCGCCACGGC AAAGTAGGAC AAAACCTTAA ATTCGATTGT AATGTGCTCG CCAATTACGG TATCGAACTG CAAGGCATTC GTCACGACTC CATGCTGGAA TCTTATGTGC TTGACAGTAC CGCTACCCGG CACAATATGG ATTCTTTAGC CCTAAAATAC CTCCAACGAA CTACTATTAC CTATGAGATG GTAGCGGGCA AAGGGGCCAA GCAATTACCC TTCAACCAGG TTACCATTGA AAAGGCTGCC CCCTATGCGG CGGAAGATGC CGATATCAGT CTCCAGCTTC ACCATTGTTT CTGGCCCCGC CTGCAACAAG AAGAGGGTCT TCGCCAGCTC TACCAAGAGC TGGAAATCCC CCTCATCCCC GTGCTTTCCC GTATGGAACG CAATGGAGTC CAGGTGAATA CGGAGCAACT TAAAGCCCAA AGCGATGAAT TGGCAGCGCG CTTGAAGAGA CTCGAACAAG AAGCTTTTGA ACTGGCTGGA GAGTCCTTTA ATCTCGCTTC CCCCAAACAG ATTCAGGCCA TTCTCTACGA GAAGTTAAAA CTGCCGGTAA CCCGCAAAAC CCCTACCGGC CAACCCTCGA CAGCAGAAAC GGTCCTGCAG GAACTGGCAC TTGATTACCC CCTCCCCCAA 
TTGCTCCTCG AATACCGTAC CCTGAGCAAG CTCAAATCCA CCTATACGGA TCGCTTGCCG TTGCAGGTCA ACTCCCATAC TGGGCGAGTC CATACTTCCT ATCATCAAGC CGTTACTGCC ACTGGACGCC TTTCTTCTTC CGATCCTAAC TTGCAAAACA TTCCTATCCG CAGTACTGAG GGCCGGCGAA TTCGCCAAGC ATTTATCGCC CCACCCGGTT ACCGTTTGGT AGCCGCCGAT TACTCTCAGA TCGAATTGCG CATCATGGCC CATCTCTCGG AAGATGAAGG ACTGCTGGCT GCTTTCGAAG CGGAAGAAGA TATCCACCAG CGAACAGCCA CCGAAATTTT CCGCACGCCC CTGGAGGATG TGACTCCTGA ACAACGGCGC AGCGCTAAGG CCATCAATTT TGGCCTCATC TATGGTATGT CCGCCCATGG GCTTGGCCGG CAACTGGGAA TCAACCGTAC CGCCGCACAG CACTATATAG AACGCTACTT TCAACGCTAC CCTGGCGTTA AAGCCTATAT GGAGAATATC TGCCAGCAAG CCCGCCAGAA AGGTTATGTG GAAACTCTCT TTGGCCGACG GCTTTATCTG CCTGAAATCC ACTCCCGGCA AACCCAACGC CGCAATCAGG CCGAACGCAC TGCCATTAAT GCCCCCATGC AGGGCAGCGC AGCGGATATC ATCAAGCGGG CAATGATCCA CGCTGACCGC TGGTTGCAGG AACAAAAAGC TAACGCCCGA ATGATCATGC AAGTTCACGA TGAGCTGGTG TTTGAGGTGG CAGAAGATAA GCTGGAGGCT ACAATTAGGG CAATCCGGGA GAACATGGCC GCAGCCGCCC AGCTCAAGGT ACCGTTAATC GTGGAGATAG GCAGTGGCAC TAACTGGGAC GAAGCCCACT GA
|
Protein sequence | MNMAENPVLI LIDGSSYLFR AFHALPSLTT SKGQPTGAIY GVINMLRKLL DEYQPQYIAV VFDAKGKTFR HELFEQYKDH RPPMPEELAC QIQPLHDLIR ALGLPLLCVK GVEADDVIGT LARQATAQRL ETLISSGDKD LAQLVNPHVS LVNTMNLSKL DPAGVKAKFN VSPEQIVDYL ALVGDTVDNI PGIPGIGPKT AAKLLCQYHS LDQIMAYASE IKGKMGESLR SHLTQLPLAK ELATVRQDLL LDLGPKDLRC APPNIPALRE LYAALEFKSW LRELLDNENA HSSISNSSTN SAPAYETVFS EESFENWVAR LEKAELFAFD LETNNLDYIE AEIVGLSFAI QPHEAMYIPL GHEDATAPPQ LPREQVLARL KPLLEDPRHG KVGQNLKFDC NVLANYGIEL QGIRHDSMLE SYVLDSTATR HNMDSLALKY LQRTTITYEM VAGKGAKQLP FNQVTIEKAA PYAAEDADIS LQLHHCFWPR LQQEEGLRQL YQELEIPLIP VLSRMERNGV QVNTEQLKAQ SDELAARLKR LEQEAFELAG ESFNLASPKQ IQAILYEKLK LPVTRKTPTG QPSTAETVLQ ELALDYPLPQ LLLEYRTLSK LKSTYTDRLP LQVNSHTGRV HTSYHQAVTA TGRLSSSDPN LQNIPIRSTE GRRIRQAFIA PPGYRLVAAD YSQIELRIMA HLSEDEGLLA AFEAEEDIHQ RTATEIFRTP LEDVTPEQRR SAKAINFGLI YGMSAHGLGR QLGINRTAAQ HYIERYFQRY PGVKAYMENI CQQARQKGYV ETLFGRRLYL PEIHSRQTQR RNQAERTAIN APMQGSAADI IKRAMIHADR WLQEQKANAR MIMQVHDELV FEVAEDKLEA TIRAIRENMA AAAQLKVPLI VEIGSGTNWD EAH
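The relationship between the gene sequence, the translation table and the protein sequence can be illustrated with a short sketch. It assumes the sequence is pasted as shown in this record (blocks of ten letters separated by spaces) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. The helper names `clean`, `gc_content` and `translate` are hypothetical, and the sketch does not handle the alternative start codons that table 11 allows, which is not an issue here because this gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAACATGG CGGAAAACCC GGTGCTTATT"))  # MNMAENPVLI
```

The ten residues produced from the first thirty bases agree with the start of the Protein sequence field. Passing the full Gene sequence field to `gc_content` gives a value that can be compared with the GC content field.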