Gene Information |
Locus tag | Tbd_2453 |
Symbol | |
ID | 3671772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | + |
Start bp | 2522451 |
End bp | 2525174 |
Gene Length | 2724 bp |
Protein Length | 907 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637711159 |
Product | DNA polymerase I |
Protein accession | YP_316211 |
Protein GI | 74318471 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGATC CTAGTGTAAG TCAGCCCGGC AAGACCCTGT TGTTGGTGGA CGGCTCGTCC TATCTGTACC GCGCCTTTCA TGCGCTGCCG GACCTGCGCA ACGCGGCGGG CGAACCGACC AGTGCGATCA AGGGCGTTCT CGCCATGCTG CACAGGCTGC GCAAGGATTA CGCGGCCGAT TATATCGCCT GCGTGTTCGA CGCGAAGGGC AAGACCTTCC GCGACGATCT CTATCCTGCG TACAAGGCGA ACCGCCCGGC GATGCCGGAC GATCTGCGCA GCCAGATCGA ACCTCTGCAC GAGGCCATCC GCGCCGAGGG CTGGCCGCTC GTCGTCATCG ACGGCGTCGA GGCGGACGAC GTGATCGGCA CACTCGTGCG CCACGCCGCA CAAAAAGGCA TCCGCAGCAT CGTCTCGACC GGCGACAAGG ACATGGCCCA GCTCGTCGAC GGCCACGTCA CGCTCGTCAA CACCATGAGC CTCGAGACGC TCGACGAAGC CGGCGTCGTC GCCAAGTTCG GCGTGGCGCC CGACCGCATC ATCGACTACC TCACGCTCGT CGGCGACGCG GTCGACAACG TGCCCGGGGT CGCCAAATGC GGGCCGAAGA CCGCCGCGAA ATGGCTCGCC GAATACGGCA CGCTCGACAA CCTGCTCGCC CACGCCGACG CGGTCAAAGG CGCGGTCGGA GAGAACCTGC GCGCGGTCGT CGACTGGCTG CCGCAGGCCC GCGCGCTGCT CACGATCAAG ACCGACGTCG CGCTGCCCTT CGATTTCGAC GACCTCGTGC TGAAGCCGCG CGACATCGAC GCCCTGCGCG GGCTGTTCGA GCGCTACGAA TTCCGTACCT GGCTGCGCGA ACTCGACGCC TCGCCGGCGA CGGCCGTTTC GTCCGAACCC GACTGCAGCG GATCGCACCG CGCCGGCTAC GAGATCATCC TGAGCGAAGC GCAGCTCGAC GCCTGGGTCG CGCGGGTCGC GACCGCCGAC GCCTTCGCGC TCGACGTCGA GGCGAGCAGC GCGAGCCCGC TGCAGTGCGA ACTGATCGGC CTGGCGTTCG CGGTCTGCGA AGGCGAGGCT GCGTATCTGC CGCTCGGCCA CCATTACGCC GGCGCGCCGG CGCAGCTCTC GCGCGCCGCG GCTTTCGCCA GAATCAAGCC CCTGCTCGAG GAGCGCGCGG CGAAAATCGT CGGCCAGAAT CTCAAATACG CGCGCCACGT GCTCGCCAAC TGCGGCATCG CGCTCGGCGG CGCGGCCGAC GACACGCTGC TGCAATCCTA CGTGCTCGAA GCGCATCAGC CGCACGAACT CGCGAGCCTC GCCAATCGCC ATCTCGGGCT CGCGACGCTC GCCTGCGACG AACTCACCGG CAAGGGCGCG GCGCGCATCG GCTTCGATCA GGTCGCGGTC GAGCGCGCCG GCGAATACGC GGCCGAGCAG GCGGACGTCA CGCTGCGCGT CCGCGGCCAG CTCGCAGCGC GCATCGCGGC GTCGGAGAAA CTCGAATACG TCTACCGCGA AATCGAGCTC CCGGTTTCCG AAGTGCTCTT CCGCATGGAA CGCGCGGGCG TGCTGCTCGA TCGCGCGCTC CTCGCGGCGC AAAGCGGCGA GCTCGGCCGC AAGATGCTCG AACTCGAGCA ACGCGCCTTC CAGGAAGCGG GCCAGCCTTT CAATCTCGGC TCGCCCAAGC AGATCGGCGA CATTCTGTTC GCGCAAAAAG GCCTTCCGAT CCTCAAGAAA ACACCGGGAG GGGCGCCGTC GACCGACGAG GAAACGCTCG AACTGCTCGC ACTCGACCAC 
CCGATCGCGC GTGCGATCCT CGACTACCGC GGACTCGCCA AACTGAAGTC GACCTACACC GACAAGCTGC CGCAGATGGT GCATCCGTTG ACCGGCCGGC TGCACACAAG CTATTCGCAG GCGACCGCGG TGACCGGCCG GCTCGCGAGC GTGGAGCCGA ACCTGCAGAA CATCCCGGCG CGGACGGCGG AAGGGCGCCG CATCCGTGAG GCCTTCATCG CGCCGCCCGG CCACCTGCTC GTCTCGGCCG ACTATTCGCA GATCGAGCTC AGGATCATGG CGCATCTATC GGACGACGCC GGCCTGCTTC ACGCCTTCGC CAACGATCTC GACATCCACA CCGCGACCGC CGCCGAGGTG TTCGGCGTCC CGCTCGACGC GGTCAGCGCC GAGCAGCGCC GGATCGCCAA GGTCATCAAT TTCGGCCTGA TCTACGGCAT GTCGGCCTTC GGCCTGGCGA GCCAGCTCAA TCTCGAGCGC AGCGCCGCGC AGGCGTGGAT CGACCGTTAC TTCACGCGCT ATCCCGGCGT CGCGAACTAC ATGCTGCGCA CCCGCGAATC GGCGCGCGCG CAGGGCTACG TCGAGACCGT CTTCGGCCGC CGCCTCTATC TCCCTGAAAT CAACGCGAGG AATCCGCAAC GCCGACAAGG CGCCGAACGC GCCGCGATCA ATGCGCCGAT GCAGGGCACG GCCGCCGACC TCATCAAGCT GGCGATGATC GCAGTGCAGC GGTGGATCGA CAGCGAAAGA CTCGGCACGC GCCTACTGCT GCAGGTCCAC GACGAACTGA TCCTCGAGGT GCCTGAACAC GAGCTCGAAC GCGTCCGCGC CGAACTGCCG TCGCACATGT GCGACGTCGC CGCGCTCAAG GTACCGCTGC GGGTCGGCGT CGGCGTCGGC GGCAACTGGG AAGCGGCGCA CTGA
|
Protein sequence | MNDPSVSQPG KTLLLVDGSS YLYRAFHALP DLRNAAGEPT SAIKGVLAML HRLRKDYAAD YIACVFDAKG KTFRDDLYPA YKANRPAMPD DLRSQIEPLH EAIRAEGWPL VVIDGVEADD VIGTLVRHAA QKGIRSIVST GDKDMAQLVD GHVTLVNTMS LETLDEAGVV AKFGVAPDRI IDYLTLVGDA VDNVPGVAKC GPKTAAKWLA EYGTLDNLLA HADAVKGAVG ENLRAVVDWL PQARALLTIK TDVALPFDFD DLVLKPRDID ALRGLFERYE FRTWLRELDA SPATAVSSEP DCSGSHRAGY EIILSEAQLD AWVARVATAD AFALDVEASS ASPLQCELIG LAFAVCEGEA AYLPLGHHYA GAPAQLSRAA AFARIKPLLE ERAAKIVGQN LKYARHVLAN CGIALGGAAD DTLLQSYVLE AHQPHELASL ANRHLGLATL ACDELTGKGA ARIGFDQVAV ERAGEYAAEQ ADVTLRVRGQ LAARIAASEK LEYVYREIEL PVSEVLFRME RAGVLLDRAL LAAQSGELGR KMLELEQRAF QEAGQPFNLG SPKQIGDILF AQKGLPILKK TPGGAPSTDE ETLELLALDH PIARAILDYR GLAKLKSTYT DKLPQMVHPL TGRLHTSYSQ ATAVTGRLAS VEPNLQNIPA RTAEGRRIRE AFIAPPGHLL VSADYSQIEL RIMAHLSDDA GLLHAFANDL DIHTATAAEV FGVPLDAVSA EQRRIAKVIN FGLIYGMSAF GLASQLNLER SAAQAWIDRY FTRYPGVANY MLRTRESARA QGYVETVFGR RLYLPEINAR NPQRRQGAER AAINAPMQGT AADLIKLAMI AVQRWIDSER LGTRLLLQVH DELILEVPEH ELERVRAELP SHMCDVAALK VPLRVGVGVG GNWEAAH
|
| |
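The length fields in the Gene Information table follow from the coordinates by simple arithmetic, and the GC content can be recomputed from the gene sequence. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon while the protein length does not; both assumptions agree with the values listed above. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    length = gene_length(2522451, 2525174)   # Start bp and End bp from the record
    print(length)                            # 2724, matching the Gene Length field
    print(protein_length(length))            # 907, matching the Protein Length field
```

Passing the full Gene sequence value (with its spaces and line break) to `gc_percent` should give a figure comparable to the GC content field, although the record does not state how that percentage was rounded.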