Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Maqu_0541 |
Symbol | |
ID | 4654155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinobacter aquaeolei VT8 |
Kingdom | Bacteria |
Replicon accession | NC_008740 |
Strand | - |
Start bp | 613761 |
End bp | 616487 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639810493 |
Product | DNA polymerase I |
Protein accession | YP_957829 |
Protein GI | 120553478 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAC AAACCAACCC TCCCGTGGTT CTGGTGGACG GATCGTCTTA CCTGTTCCGT GCCTACCATG CCCTGCCGCC ACTGACTACC AGCAAGAATC ACCCGACTGG CGCCATCAAA GGCGTAATCA GCATGTTGCG GCGGATTGAA CAGGACTTTC CGGGCTCAAA AACCGTGGTG GTATTTGATG CCAAGGGTAA GACTTTCCGG CACGACATGT ATGAGGAATA CAAAGCCAAT CGCCCCCCCA TGCCGGATGA TCTGGCGGTA CAAATTGAAC CCATTCACGA GATTGTCAGG GCGATGGGCT TACCACTGTT GATCGTGCCC GGCGTCGAGG CCGACGATGT CATCGGCACC CTGGCCCATG AAGCCACCAG CAAGGGCATC GATGTGGTCG TGTCTACCGG CGATAAAGAC ATGGCTCAAC TGGTCAGCGA CCACGTTACC CTCATCAACA CCATGACCGA AACCGCCATG GACCGGGACG GGGTCGTTGA AAAGTTTGGT GTTACCCCGG AGCAGATCAT CGATTACCTC GCCCTGGTGG GTGACAAAGT CGACAACATT CCCGGTGTAA ACAAATGCGG CCCGAAAACC GCCGTCAAAT GGCTACAGAG CTACGACAAC CTCGACAACC TCATTGAGCA TTCCGACGAG ATCAAAGGCA AAATCGGCGA ATACCTGAGG GAAGCAACCG ACACCCTGCC CCTGAGCCGG GAGCTGGCCA CCATCCGGAC CGATGTGGAA CTCGAATTCG GGCTGGAGGA TCTTCAGGAA CGCCAGCAGG ACGACGCCAG TCTGTTGGAG CTCTTCAAAG AGTACGAATT CCGCACCTGG ATCGCGGAAC TGGAAAACGG CTCCTCATCA GACGAAGGCT CCCAGAGCAG CCAACCACAG CCGAAACCTT CGGTGGAAAA GCAGTATCAG GTCATTACCG AGCAGGCGGA CCTGGACCAG TGGCTGAAGA AACTCAAAGA TTCCGATCTG TTCGCCTTCG ACACCGAAAC CACCAGCCTC CGCTACATGG ATGCCGAAAT TGTCGGCGTT TCGTTTGCTG TTGAGCCGGG AAAGGCCGCT TACGTGCCAC TGGGCCATGA TTATATGGGC GCGCCGGAGC AACTGGACCG CGACTCTGTG CTGGAACAGC TGAAGCCGTT GCTTGAAGAC CCGAAACACA AAAAGGTCGG GCAGAACCTC AAGTACGACA AAAACGTGCT GGCCAACCAT GATATACAGC TGGAGGGCAT CGCCGAGGAC ACCATGGTGG AGTCCTACGT ACTGAATTCG GTCGGCACCC GCCACGACAT GGACAGCCTG GCCCGAACCT ATCTTGATGA AGAAACCATC ACCTACGAAT CGATTGCGGG CAAAGGCGCC AAGCAACTGA CGTTCAACCA AATCGATCTG GAAAGCGCGG GGCCCTATGC CGCCGAAGAC GCTGACATCA CCCTGCGCCT GCACCAGACT CTCGCCCCGA GGCTGAAAGA CACCGGCAAG CTGGAATCCG TCTACCGGGA AATCGACCTG CCGCTGGTGC CGGTGCTGTC CCGGATGGAA CAGCGGGGCA CCCTGATCAG TGCCAGCACG CTACGCCAGC ACAGCCAGGA ACTTGCAGAG CGCATGGCTG AACTGGAGAA GGAAGCCCAC GAGGTTGCCG GCGAAGCTTT CAACCTGGGC TCCACCAAAC AATTGCAGGC CATCCTTTAC GACAAACTTG GCCTGCGGGT CATCAAGAAG ACACCAAAGG GCGCGCCATC GACGGCCGAG CCGGTTCTGC AGGAGCTGGC CCACGAGCAC GAACTGCCCC GGCTGATTGT CGAGCACCGC AGCCTGAGCA AGCTGAAGTC CACTTATACT GACACCCTGC CCGAGCTTAT CCACCACCGC ACCGGGCGAG TGCACACTTC CTATCATCAG GCGGTGACAG CGACAGGACG CTTGTCTTCA TCGGAACCCA ACCTGCAGAA CATTCCGATT CGGACCCAGG AAGGCCGACG CATACGCCAG GCATTTATTG CCCCGAAAGG CTACAAGTTG CTGGCCGCTG ACTACTCCCA GATTGAGCTT CGCATCATGG CTCACCTGTC TGGCGATAGA GGTCTACTGA CCGCGTTCGA ACACGGCGAA GACATTCACA AGGCCACCGC AGCCGAAGTC TTCGGCGTGA CAGTGGATGA GGTAACCGGC GACCAGCGTC GGAGTGCCAA GGCGATCAAC TTCGGGCTGA TCTACGGCAT GTCGGCATTC GGACTTTCGC GCCAGCTGGA AGTGGACCGA AAAACCGCCC AGGAATACAT CGACCGGTAT TTCGAGCGTT ACCCCGGCGT ACTCAAATAC ATGGACAACA TCCGCAAACA GGCCCATGAC GACGGCTACG TGGAAACGCT CTACGGCCGC CGCCTGTACC TGCCGGAGAT CAACGCACGT AACAAGCAGC TGCAGCAGGC TGCCGAGCGC ACCGCCATCA ATGCGCCCAT GCAGGGCACG GCCGCCGATA TCATCAAGCG GGCCATGATC GAGGTAGACA ACTGGCTGCG CAGCGAACAC GCGAGCGATG CCTGCATGAC CATGCAGGTC CACGACGAAC TGATCATCGA AGTTCGCGAA GAAGCCGTAG ACAAAGTCAA AGATGGCCTG GTCAAGCGCA TGTCCGCCGC CGCCAGCCTG GATGTGCCAC TGCTGGTGGA AGCCGGTGTG GGTGACAACT GGGACCAGGC TCACTAA
|
Protein sequence | MTEQTNPPVV LVDGSSYLFR AYHALPPLTT SKNHPTGAIK GVISMLRRIE QDFPGSKTVV VFDAKGKTFR HDMYEEYKAN RPPMPDDLAV QIEPIHEIVR AMGLPLLIVP GVEADDVIGT LAHEATSKGI DVVVSTGDKD MAQLVSDHVT LINTMTETAM DRDGVVEKFG VTPEQIIDYL ALVGDKVDNI PGVNKCGPKT AVKWLQSYDN LDNLIEHSDE IKGKIGEYLR EATDTLPLSR ELATIRTDVE LEFGLEDLQE RQQDDASLLE LFKEYEFRTW IAELENGSSS DEGSQSSQPQ PKPSVEKQYQ VITEQADLDQ WLKKLKDSDL FAFDTETTSL RYMDAEIVGV SFAVEPGKAA YVPLGHDYMG APEQLDRDSV LEQLKPLLED PKHKKVGQNL KYDKNVLANH DIQLEGIAED TMVESYVLNS VGTRHDMDSL ARTYLDEETI TYESIAGKGA KQLTFNQIDL ESAGPYAAED ADITLRLHQT LAPRLKDTGK LESVYREIDL PLVPVLSRME QRGTLISAST LRQHSQELAE RMAELEKEAH EVAGEAFNLG STKQLQAILY DKLGLRVIKK TPKGAPSTAE PVLQELAHEH ELPRLIVEHR SLSKLKSTYT DTLPELIHHR TGRVHTSYHQ AVTATGRLSS SEPNLQNIPI RTQEGRRIRQ AFIAPKGYKL LAADYSQIEL RIMAHLSGDR GLLTAFEHGE DIHKATAAEV FGVTVDEVTG DQRRSAKAIN FGLIYGMSAF GLSRQLEVDR KTAQEYIDRY FERYPGVLKY MDNIRKQAHD DGYVETLYGR RLYLPEINAR NKQLQQAAER TAINAPMQGT AADIIKRAMI EVDNWLRSEH ASDACMTMQV HDELIIEVRE EAVDKVKDGL VKRMSAAASL DVPLLVEAGV GDNWDQAH
|
| |