Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Oant_0137 |
Symbol | |
ID | 5381169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ochrobactrum anthropi ATCC 49188 |
Kingdom | Bacteria |
Replicon accession | NC_009667 |
Strand | + |
Start bp | 150127 |
End bp | 153057 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640832786 |
Product | DNA polymerase I |
Protein accession | YP_001368697 |
Protein GI | 153007482 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.909529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG GCGATCATCT CTTTCTCGTT GACGGCTCGG GCTATATTTT CCGCGCCTAT CATGCCCTGC CACCGCTGAC CCGCAAGACG GATGGTCTGC CGGTCGGCGC CGTCTCGGGC TTCTGCAACA TGCTATGGAA GCTCTTGAAA GATGCGCGCA ACACCGATGT GGGCGTGGTG CCGACCCATT TTGCGGTCAT CTTCGATTAT TCGTCGAAGA CGTTCCGCAA TGAAATCTAT CCTGAATACA AGGCCAATCG CACCGCGCCG CCGGAAGATC TCATTCCTCA GTTCGGCCTC ATCCGTCAGG CAACGCGCGC GTTCAATCTG CCCTGCATCG AGAAGGAAGG TTTCGAGGCT GACGACCTGA TCGCGACCTA TGCCCGTATT GCCGAGCAGG CTGGCGGCGA TGTCACCATC GTTTCGTCCG ACAAGGATCT CATGCAGCTC GTGACGCCAA GCGTATCGAT GTATGACAGC ATGAAGGACA AGCAGATTTC GATCCCGGAA GTGATCGAGA AATGGGGCGT TCCGCCGGAA AAGATGATCG ACCTGCAATC GCTGACCGGC GACAGCACCG ACAATGTTCC CGGCATTCCC GGCATCGGGC CGAAGACGGC GGCGCAATTG CTGGAAGAAT TCGGCGATCT CGATACGCTT CTGGCCCGCG CTTCCGAAAT CAAGCAGAAC AAGCGTCGCG AGAACATTCT GGCGTTTGCC GACCAGACGA AGATTTCGCG CGAGCTGGTT ACACTCAAGA CCGACGTGCC GCTGGATGTC GATCTGGACA GTCTGGTGCT GGAGCCGCAG AACGGCCCGA AGCTGATCGG CTTCCTGAAG GCTATGGAAT TCACTTCGCT CACTCGCCGG GTGGCCGAAG CGACTGACAC CGACGCATCG GCTGTCGAGC CGTGCCATGT GGAGACAGAC TGGGGCGCGG ACGCCCATGG TCCCGATGTC GATGTTCCCG CCAAGGCTGA TAATGCTGCG TCGCAACCGA CATCCGCTGT CGCTGCCTCG GATCAGGGCT ACACGCCGAA GGCGCTCGCA GAAAAGCGGG CGACAGAAGC AGCCGCACAG ACAATCGATA CGAGCGCTTA TACCTGCATT CGCGATATCG CCACGCTGAA ACTCTGGCTC GCAGAGGCGG TTGAAACCGG TGTTTTGGCC TTCGATACGG AAACGACGTC GCTTGATCCG ATGCAGGCGG AGCTGGTTGG TTTTTCTCTG GCCTTGGCGC CGGGCAGAGC CGCTTATATT CCGCTCCAAC ACAAGTCTGG CGCGGGTGAT CTTCTCGGCG GCGGCATGGT CGAAGGACAA ATTCCGCTGG ATGAGGCGCT GGCCGCTTTG AAGATCGTGC TTGAGGATGC TTCCGTTCTC AAGATCGCGC AGAACATGAA ATATGACTGG CTGGTCATGC GCCGCCATGG GATCAATACA GTCTCGTTCG ACGATACGAT GCTGATTTCC TATGTACTTG ATGCTGGCAC GGGTAGCCAT GGCATGGACC CGTTGTCCGA GCGCTGGCTC GGTCATACGC CAATTCCCTA TAAGGATGTG GCGGGAAGCG GCAAGAGCGC TGTCAGCTTC GACATGGTCG ATCTCGACCG TGCCACGGCT TATGCGGCAG AAGACGCCGA TGTGACCTTG CGCCTGTGGC AAGTTCTGAA ACCGCGTCTT GCAGCTGAAG GGCTGATGTC GGTCTATGAA CGTCTGGAAC GTCCGCTGGT CGATGTTCTG GCGTGCATGG AAGAACGCGG CATTGCCGTT GACCGCCAGG TACTGTCGCG TCTTTCTGGC GATCTGGCGC AGGCTGCCGC TGCCTATGAA GACGAGATTT ACGAACTGGC AGGCGAAAGA TTCAATATCG GTTCGCCCAA GCAGCTTGGC GATATTCTGT TTGGCAAGAT GAGCTTGTCG GGCGCGTCGA AGACCAAGAC CGGTCAATGG TCCACGTCTG CGCAGGTGCT GGAAGACCTC GCTGCCGAAG GCCATCCGCT GCCGCGCAAG ATCGTCGACT GGCGTCAGCT CACCAAGCTC AAATCCACCT ATACCGATGC GCTACCGGGG TTCATCAACC CGGAGACAAA GCGCGTTCAT ACGTCCTATG CGATGGCGTC CACTTCAACC GGACGTCTTT CGTCATCCGA CCCGAACCTG CAAAATATTC CGGTTCGCAC AGCCGAAGGC CGCAAGATCA GGACGGCCTT CATCGCCGAA CCCGGCAACA AGCTGGTTTC TGCCGATTAC AGCCAGATCG AACTGCGCGT GCTGGCCCAT GTGGCCGATA TCGCGCAACT GAAGCAGGCC TTTGCCGACG GCATAGACAT TCACGCCATG ACGGCATCCG AAATGTTCGG CGTGCCGGTG GAAGGTATGC CGTCGGAAGT GCGCCGCCGT GCCAAGGCGA TCAATTTCGG CATCATCTAC GGCATTTCCG CATTCGGCCT TGCCAACCAG TTGTCGATTC CGCGCGAAGA AGCAGGGCAA TATATCCGCA CCTATTTCGA GCGCTTCCCC GGCATCAAGG ACTACATGGA GGCGACCAAA GCTTTCGCGC GCGAGAATGG CTATGTCGAA ACGATTTTCG GGCGTCGTGC GCATTATCCC GATATCAGGG CATCCAACCC GCAGGTTCGT GCGTTCAATG AGCGCGCGGC CATCAATGCG CCGATACAGG GTTCCGCCGC CGACATTATC CGCCGCGCCA TGATCCGCAT GGAAGATGCG CTGGCGGAGC AAAATCTTGC CGCGCGCATG TTGTTGCAAG TGCACGATGA ACTGATCTTC GAAGTGCCTG ATAATGAAGT CGAAAAGACC ATACCGGTCG TTCGTCACAT TATGGAAAAT GCGGCGATGC CTGCCGTCTC GCTTGCGGTG CCGCTGCATG TTGATGCGCG TGCAGCGCAT AACTGGGATG AGGCGCATTA G
|
Protein sequence | MKKGDHLFLV DGSGYIFRAY HALPPLTRKT DGLPVGAVSG FCNMLWKLLK DARNTDVGVV PTHFAVIFDY SSKTFRNEIY PEYKANRTAP PEDLIPQFGL IRQATRAFNL PCIEKEGFEA DDLIATYARI AEQAGGDVTI VSSDKDLMQL VTPSVSMYDS MKDKQISIPE VIEKWGVPPE KMIDLQSLTG DSTDNVPGIP GIGPKTAAQL LEEFGDLDTL LARASEIKQN KRRENILAFA DQTKISRELV TLKTDVPLDV DLDSLVLEPQ NGPKLIGFLK AMEFTSLTRR VAEATDTDAS AVEPCHVETD WGADAHGPDV DVPAKADNAA SQPTSAVAAS DQGYTPKALA EKRATEAAAQ TIDTSAYTCI RDIATLKLWL AEAVETGVLA FDTETTSLDP MQAELVGFSL ALAPGRAAYI PLQHKSGAGD LLGGGMVEGQ IPLDEALAAL KIVLEDASVL KIAQNMKYDW LVMRRHGINT VSFDDTMLIS YVLDAGTGSH GMDPLSERWL GHTPIPYKDV AGSGKSAVSF DMVDLDRATA YAAEDADVTL RLWQVLKPRL AAEGLMSVYE RLERPLVDVL ACMEERGIAV DRQVLSRLSG DLAQAAAAYE DEIYELAGER FNIGSPKQLG DILFGKMSLS GASKTKTGQW STSAQVLEDL AAEGHPLPRK IVDWRQLTKL KSTYTDALPG FINPETKRVH TSYAMASTST GRLSSSDPNL QNIPVRTAEG RKIRTAFIAE PGNKLVSADY SQIELRVLAH VADIAQLKQA FADGIDIHAM TASEMFGVPV EGMPSEVRRR AKAINFGIIY GISAFGLANQ LSIPREEAGQ YIRTYFERFP GIKDYMEATK AFARENGYVE TIFGRRAHYP DIRASNPQVR AFNERAAINA PIQGSAADII RRAMIRMEDA LAEQNLAARM LLQVHDELIF EVPDNEVEKT IPVVRHIMEN AAMPAVSLAV PLHVDARAAH NWDEAH
|
| |