Gene Information |
Locus tag | Dole_0237 |
Symbol | |
ID | 5693055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 260928 |
End bp | 263939 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641262817 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_001528124 |
Protein GI | 158520254 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
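The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 260928
end_bp = 263939

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 3012
print(remainder)       # 0
print(protein_length)  # 1003
```

Both results agree with the Gene Length (3012 bp) and Protein Length (1003 aa) fields.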
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCCCC TGACAGTCAG ATCCGGTTAT TCGCTGATGT GGGGCACAGC CCCCATCAAG GAGGTGTGCC GCGTGGCCCG GCAGCAGGGA TATGATCGCC TGGCCCTGAC CGACACGGAC AACCTTTACG GGCTGTGGCA TTTTCTGGAC GCCTGCCGCC GGAACGATAT CACGCCGGTT GTCGGGGCGG AACTTACCGA CCCCGCCACA ACAGACCGGG CCGTTCTTCT GGTAAAAAAC CCGGAAGGCT ACAAAAACCT GTGCCGGCTC ATAACCGCCC GGCACCTTGA ACCGGACTTT AAACTGGAAA CAGCCATTGC CCGGCATGCT GATGGCCTGG CCGTGCTGAC CCGCAATCCC TGCCATTTGA AGCAGTGGCA TGAATCCGGT GTGGCGGTGT ATGGCGCCAT GCCCCGGCGG CCGATGTCGC TGTCCAGCCC GCTGTGTCAG ACCGCCAGGT ACTTAAATGT TCCCCTGGTG GCCACGCCCG GCAGCTTTTT TCTTTCTCCG CAAGACATGG ATGCCCACCG GATGCTGCGC GCCATTGACC TGAACACCAC GGTTGACAGG CTTGGCCCGT TAGAAGCGGC GCCCGGGGAA GCCTGGCTGG CATCCCCCCG GGAGTATGAG AAACGTTTTG AAGCCTGCCC GGAAGCTCTG GCCAACACCC GTCGCCTTTC GGAAACCCTG ACGTTTACCG GCCCCGAATT CGGCCTGGTC ATGCCCCCCT GGGAAAGCAA AACGGGCACG GACCCTGGCT CCGCCCTTCG GCAGGCCGCC TACCGGGGCG CTCAGGCGCG GTATGGCCGG GACCTTCCGG AGCCTGTCGC ACGGCGCTTG GAGCACGAAC TGAGTATCAT TATTCAAATG CGCTTTGCCG CTTACTTTCT GGTGGTGCAA GACATTGTGC GGCTGAGCCC CAGGATTTGC GGCCGGGGAT CGGGCGCGGC CTCCCTGGTG GCCTATTGCC TGCGTATCAC CAATGTCTGC CCGATAAAGC ACAACCTGTA TTTCGAGCGC TTTTTAAACC CGGGCCGAAA GGACGCCCCG GATATTGACG TGGATTTTGC CTGGGATGAA CGCGACACCG TCCTGGCAGC GGTGTTCGAG CGGTTTGGCG ATCATTGTGC CATGGTCTGC AACCACGTGC GAATGCAGCC GAGAATGGCG GTCCGGGAGG TGGCCCGGGT ATATGGCCTG ACCGAAGCGG AAATCAGCCG GGTGACCAAA AAAATGCCGT GGTTCTGGCG GCAGGACACC GCCGGCCTGG ACATCCTGCG GGAACTGTCA GAGCGCCCGG AATCCCGGGA TCTGGACCTG CCCGCGCCCT GGCCCGGGAT CATGGCCGTT GCCCGGAAAA TCTGCGGCGC GCCCCGGTAT CTTTCCGTCC ATCCCGGCGG CGTGATCATC ACCCCCCGGC CTGTCTGCGA ATATGTTCCG GTGGAAAGGG CCGCCAAGGG TGTCCGGATT ATCCAGTGGG AAAAAGACCA GGCCGAGGAT GCCGGGCTGG TGAAAATCGA CCTGCTGGGC AACCGGAGCC TGGCCGTGAT CCGGGACGCC ATTTACAACC TGCGGTCAAA CAACCACTTG CTCGACGAAT CCACGTGGGC GCCGGAGGAC GACTTTGCCA CCCAGGAGGC GGTGTCCCAG GGCAAAACCA TGGGCTGTTT TTACATAGAG AGCCCGGCCA CCCGGCTGTT GCAGAAAAAG TCCCGGGTGG GGGACTTTGA ACACCTGGTG ATCCACAGCA GCATTATCCG GCCGGCAGCC AATGATTATA TCCAGGAATA CCTCCGCCGC 
CTGCACGGCG CGTCCTGGGA CCCCATTCAC CCGCTTCTGG CGGATGTGCT GGACGAGACC TTTGGCATCA TGGTGTACCA GGAAGACGTG TCCCGGGCCG CTGTGTCCGT GGCCGGGTTT TCTCACGCCG ATGCCGACGG CCTGCGCAAG GTCATGTCCA AAAAAGACAA GGACCATGTT CTGGCCGATT TTTACCGGCG CTTTGCGGAC GGGGCCGCCA AAAACGGGGT GACCCCGGAG AAGATTCGCG AGATATGGCG GATGATGATG AGCTTTTCCG GGTATTCCTT CTGCAAGCCC CACAGCGCCT CCTATGCACG GGTGTCGTTT CAGGCGGCCT GGCTCAAGAC CCACTATCCT GCCGCGTTCA TGGCCGCCGT AATCAGCAAC CAGGGCGGGT TTTACAGCAC CTTTGCCTAT GTCTCCGAGG CCCGTCGCTC CGGCATTACC ATCCTGGCCC CGGACGTCAA TGCAAGCGAC ATCCGATGGC AGGGCAACGG CAACACCATC CGGGTGGGGC TTTCTGCTGT CGGTCATCTG TCCCTTAAAA CCATGGAAAA AACGCTTGCC CACCGCAAGG CCGCACGTTT TACCGGCCTG GAAGACTACA TGGACCGGGT CCGGCCGGAT GAGCCTGAAG CTCAGGCCCT TATTCAGAGC GGCGCCTTTG ACGCGTTGTG CCCGGGCCGG TCCAGAAGCA TGATCCGGTG GCAATGGGCC TGCTGGAAAA ACAACCGCCC GCCAGCATCG TCCTCGCCCC TTTTGTTTGA CACGGCTCCC GGGCCGGTGG TCCCGCCCCC GCAGATAACC GAAGACCCGC TGGAAAGATA CCGGCAGGAG TTTTCTGTTT TGGGCTTTTT GTGTGACACC CACCCCATGA CGCTGTATAA GACGGAACTG GAGGGCCGGC AGACCATCAA GGCCAAACAC CTGGGCCGAT ACGCGGGCAA ACGCGTCTGC GTGGCCGGCT GGTTGATCAC CGGCAAGGTC GTGCGCACGA AAAACGGCGA TCCCATGGAG TTTTTGACCT TTGAAGATGA AACCGACATC TTTGAAGCCA CCTTTTTCCC AAAGACGTAT ACCCGGTTCT GCCACATGAT CGACAGGGGA AGCCCCTTCC TCCTCTGGGG CACGGTGGAG ACCAATTGGG GGGCTGTGAC ACTCACGGTG GAAAAAATAG AAGGCCTGCC GGACTACAAT AACCGACTAT GA
|
Protein sequence | MIPLTVRSGY SLMWGTAPIK EVCRVARQQG YDRLALTDTD NLYGLWHFLD ACRRNDITPV VGAELTDPAT TDRAVLLVKN PEGYKNLCRL ITARHLEPDF KLETAIARHA DGLAVLTRNP CHLKQWHESG VAVYGAMPRR PMSLSSPLCQ TARYLNVPLV ATPGSFFLSP QDMDAHRMLR AIDLNTTVDR LGPLEAAPGE AWLASPREYE KRFEACPEAL ANTRRLSETL TFTGPEFGLV MPPWESKTGT DPGSALRQAA YRGAQARYGR DLPEPVARRL EHELSIIIQM RFAAYFLVVQ DIVRLSPRIC GRGSGAASLV AYCLRITNVC PIKHNLYFER FLNPGRKDAP DIDVDFAWDE RDTVLAAVFE RFGDHCAMVC NHVRMQPRMA VREVARVYGL TEAEISRVTK KMPWFWRQDT AGLDILRELS ERPESRDLDL PAPWPGIMAV ARKICGAPRY LSVHPGGVII TPRPVCEYVP VERAAKGVRI IQWEKDQAED AGLVKIDLLG NRSLAVIRDA IYNLRSNNHL LDESTWAPED DFATQEAVSQ GKTMGCFYIE SPATRLLQKK SRVGDFEHLV IHSSIIRPAA NDYIQEYLRR LHGASWDPIH PLLADVLDET FGIMVYQEDV SRAAVSVAGF SHADADGLRK VMSKKDKDHV LADFYRRFAD GAAKNGVTPE KIREIWRMMM SFSGYSFCKP HSASYARVSF QAAWLKTHYP AAFMAAVISN QGGFYSTFAY VSEARRSGIT ILAPDVNASD IRWQGNGNTI RVGLSAVGHL SLKTMEKTLA HRKAARFTGL EDYMDRVRPD EPEAQALIQS GAFDALCPGR SRSMIRWQWA CWKNNRPPAS SSPLLFDTAP GPVVPPPQIT EDPLERYRQE FSVLGFLCDT HPMTLYKTEL EGRQTIKAKH LGRYAGKRVC VAGWLITGKV VRTKNGDPME FLTFEDETDI FEATFFPKTY TRFCHMIDRG SPFLLWGTVE TNWGAVTLTV EKIEGLPDYN NRL
|
| |
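The protein sequence above follows from the gene sequence by codon-wise translation. The following is a minimal sketch in plain Python, not the database's own pipeline. It assumes the record's translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE`, `clean`, `translate` and `gc_fraction` are hypothetical helpers introduced for illustration, and only short fragments copied from the start and end of the gene sequence are used.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as printed in the record.
gene_start = "ATGATCCCCC TGACAGTCAG ATCCGGTTAT"
# Last 21 bases of the gene sequence (seven in-frame codons, including the stop).
gene_end = "GACTACAATAACCGACTATGA"

print(translate(gene_start))  # MIPLTVRSGY
print(translate(gene_end))    # DYNNRL
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the Protein sequence and GC content fields.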