Gene Information |
Locus tag | Daud_0562 |
Symbol | |
ID | 6027328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 600494 |
End bp | 603259 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641593398 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_001716735 |
Protein GI | 169830753 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative |
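The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single untranslated stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Daud_0562
gene_len = gene_length_bp(600494, 603259)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 2766 921
```

Both results agree with the Gene Length (2766 bp) and Protein Length (921 aa) fields listed in the record.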
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGTCCCGCA CGTTTGTGGT GCTGGACTTG GAAACCACCG GCCTCGATCC TCAAGCAGAC GAGATTGTTG AGGTCGGCTT GGTGCGTGTC GAGGACGGGA AGCCCGGAGC GGTTTTCCAC GCCCTGGTGC GCCCCTCGCG ACCTTTGCCG GCGCGGATCA AGAGCCTGAC CGGCCTGGAC GATGCCGACC TGGCGGAAAG GCCGGATTGG TCCGAAGTGC GCCCGGCGGT GACCGCCTTC CTCGGGAATG AGCCGGTGGT CGGGCACCAC GTGCATTTTG ACCTGGCCTT CCTGGAAAGG CACGCCGGGT ACCGGGCAAC GCAGGCGTAC GACACCGTGG ACCTGGCGCG GCTGGTGCTG CCCGGACTGC CGTCGTACCG CCTGGAGTTG CTGTGTGCGC ACCTGGACCT GCCTGAACGG CCGGGCCACC GCGCCATGGA GGACGCCCGG GCGGCGGCCG GGCTGTTCAC GGCGCTCCTG GAAAGGTTTT GCCGCCTCGA ATTCACCACC CAGGCCACGG TGCACCGCAT CCTGTCCCAG ACCCCCGGTT CTCCCTGGTT CCCGCTGGTG GACGCGGCGG TGCGGTTGGG GATCAAGACC CTTCCGGACC ACGGAATCCA GGCGGCAGGC GACGGTGATT GGCCCGCCGC GGTTCCGGCG GTCGGGGCCG GGGTCAAGAC CGTACCCGGG TTCGGCCCGC AGGCCGTTGA AGCGTTCCTG GGGCCCGGCG GTGTGCTGAC CCGTCACTTC GCCGACTTTG AATACCGGCG GCCGCAAATG CAGGTTGCAC TGGCGGTGGG CCGGGCGCTG GAGGAGCAGA CCGTCCTCAC GGTGGAAGCC GGCACCGGGA CCGGCAAGTC TTTCGCCTAC CTGGTGCCGG CCGCGCTCTG GGCCCTGGAC GGGGGACGCC GGGTGGTGGT GTCGACCCAT ACGGTCAACC TTCAGGACCA GTTGATGGAA AAAGACCTAC CGCTTCTCGG CCGGGCGCTA GACGCGCCGC TCAAGACCGT GCTCCTGAAA GGCCGGGGGC ACTATCTCTG CCTGCGGCGT TGGGGAGCGG CCGTGAATGA ACAGAGCTTT TTACCCGACG AGGGCTTCTT GTATGCCCGC ATTGCGGTGT GGCTGGCCCG GACGAAAACC GGAGACCGGG CCGAACTCAG CCTGCGGCCG GAGGAAAAGG GATTCTGGGA CGGAGTGGCC GCGGGAGACT GGGGCTGCGC GCACGCGTAC TGCCGTTTCA ACAGTGTCTG TTTCTTGCAG CGGGCCCGGC GGGCCGCCGA ACAGGCGCAC CTGGTGATCA CCAACCATTC ACTCCTGCTT TCCGACCTGA AGATGGAAAA CCGACTGCTG CCGGCATACG ACGCCCTGGT GCTGGACGAA GCACACCACC TCGAGGGGGT GGCCACCGAG CAACTGGGGA CCACCGTATC GCGGGGCGCG CTGGAACGGT GGGTGGAGGG GCTGAACCGG CTGACCGGAA GAATCGGCGA GTTCTACCCG GACAACCGGG ATGAGTTTCT CCGGGACGGG GAGGCACTGC TGGCGGCGGT ACGGCGTTTT TTCGCCCTGC TGGGTGCGCG TCTGCGGCGG CGGGAACTTC CGGAGGAAGA CTTTGCGGCG GAGCGCCTCG GTCCGCGGGC CCTGGACGCG TGCCCGGAAA CGGCGGGAAG TTACCTTGAA CTTGCCGGGA GTCTGAAAGC CTACGTGCAA CGGCTGTCTA GGGCGGTCGA GCGGCTGACC GACAGCACCG TGCCGGCGGT TCAGGAGGAA CTGCGGGTGG GGCTCGAGCA GGAGTTGGCG 
GTTGGCAAGC GCTTGGCCAG CGACCTGGAG TTCGTATACG AGGCCGCCGA TCCCGGGTAT GTTTTCTGGC TCGAGGGCGG GGGGCGGCAG ACCGCCGAGG GCGTGCTCCG CGCGGCGCCG GTCCACGTAG GGGAGCTCTT GCACGACGGC TTCTTCCGCT CCGGCAAGCC GGTCATTCTC ACTTCGGCCA CACTCACCGT CCAAAGTTCG TTCTCCTTTT TTGACGAGCG GGTGGGGCTT GAGCATTTGC CGCGGGACCT GCGGGGCAGC CTGATCGTTG GTTCTCCTTT CGCCTATGCC GAACAGGCCC TGCTGTACGT GGTGACCGAC CTGCCCGACC CGGGAGACGG GGACGGGGCC TACCTGGACG CCGTCAGCGA TGCCCTCGGG CGGATCGTCG GGGTCACCCG GGGCCGGACG CTTGCACTAT TTACCGCACA TAAAGCGTTG CGCACCGCAT ATAGAAGACT GAAACCCGTC TTGGAAAAGC AAGGCTTGGA ACTGTTGGGT CACGGCCTGG ACGGGGGGCG GGCGCGTCTT TTGCAGTATT TCCGCAACAC ACCCCAGGCG GTGCTGTTTG GGGCCTCCAG TTTTTGGGAA GGGGTGGACG TTCCGGGGGA CGCGCTGTCC TGCCTGGTGA TTGTGAAACT CCCGTTTGAA CCGCCCAACC GCCCTGTGCT CCAGGCGCGG CGCGAGGAGG TCCGCCGGCG GGGCCGGAGT GACTTCAACG ACCTGTGCCT ACCGCAAGCC GTGCTCCGCC TGAAACAGGG TTTTGGACGC CTGATCCGTA CCACCGCGGA CCGGGGCGTG GTGATTATTT TGGATAACCG GCTGGTACGG AAAAGGTATG GCGCCCTGTT TTTGGAGTCC CTGCCGGCGG CGCCGATGCA CGGGTCCCTG GAGGCAGGGC TGGAAGCGAC GGCCGCGTTT CTTTTTGACG GCACGTTTGC GGCTTCAAAG CCATAA
Protein sequence | MSRTFVVLDL ETTGLDPQAD EIVEVGLVRV EDGKPGAVFH ALVRPSRPLP ARIKSLTGLD DADLAERPDW SEVRPAVTAF LGNEPVVGHH VHFDLAFLER HAGYRATQAY DTVDLARLVL PGLPSYRLEL LCAHLDLPER PGHRAMEDAR AAAGLFTALL ERFCRLEFTT QATVHRILSQ TPGSPWFPLV DAAVRLGIKT LPDHGIQAAG DGDWPAAVPA VGAGVKTVPG FGPQAVEAFL GPGGVLTRHF ADFEYRRPQM QVALAVGRAL EEQTVLTVEA GTGTGKSFAY LVPAALWALD GGRRVVVSTH TVNLQDQLME KDLPLLGRAL DAPLKTVLLK GRGHYLCLRR WGAAVNEQSF LPDEGFLYAR IAVWLARTKT GDRAELSLRP EEKGFWDGVA AGDWGCAHAY CRFNSVCFLQ RARRAAEQAH LVITNHSLLL SDLKMENRLL PAYDALVLDE AHHLEGVATE QLGTTVSRGA LERWVEGLNR LTGRIGEFYP DNRDEFLRDG EALLAAVRRF FALLGARLRR RELPEEDFAA ERLGPRALDA CPETAGSYLE LAGSLKAYVQ RLSRAVERLT DSTVPAVQEE LRVGLEQELA VGKRLASDLE FVYEAADPGY VFWLEGGGRQ TAEGVLRAAP VHVGELLHDG FFRSGKPVIL TSATLTVQSS FSFFDERVGL EHLPRDLRGS LIVGSPFAYA EQALLYVVTD LPDPGDGDGA YLDAVSDALG RIVGVTRGRT LALFTAHKAL RTAYRRLKPV LEKQGLELLG HGLDGGRARL LQYFRNTPQA VLFGASSFWE GVDVPGDALS CLVIVKLPFE PPNRPVLQAR REEVRRRGRS DFNDLCLPQA VLRLKQGFGR LIRTTADRGV VIILDNRLVR KRYGALFLES LPAAPMHGSL EAGLEATAAF LFDGTFAASK P
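The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the alternative start codons and is read as methionine when it initiates translation. The sketch below illustrates this on the first ten codons of the gene. The codon table is deliberately partial (only the codons needed here), the start-codon set is a subset of those allowed by table 11, and the function name is hypothetical.

```python
# Partial codon table: only the codons occurring in the first 30 bases
# of Daud_0562. A full table 11 implementation would list all 64 codons.
PARTIAL_CODON_TABLE = {
    "GTG": "V", "TCC": "S", "CGC": "R", "ACG": "T", "TTT": "F",
    "CTG": "L", "GAC": "D", "TTG": "L",
}

# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix, reading an allowed start codon as M."""
    dna = dna.replace(" ", "").upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is always methionine
        else:
            residues.append(PARTIAL_CODON_TABLE[codon])
    return "".join(residues)


# First three 10-base groups of the gene sequence, as printed in the record
prefix = "GTGTCCCGCA CGTTTGTGGT GCTGGACTTG"
print(translate_prefix(prefix))  # MSRTFVVLDL
```

The output matches the first ten residues of the protein sequence given in the record.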