Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2156 |
Symbol | |
ID | 7267664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2647569 |
End bp | 2650373 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643566988 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_002463476 |
Protein GI | 219849043 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases [COG2176] DNA polymerase III, alpha subunit (gram-positive type) |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.814989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00934468 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACAACC GTATTTACGT TGCGATTGAT GTTGAAACGA CCGGTCTACA AAGCGGTCTT GATGAAATTA TCGAAGTAGC GGCCGTTACT TTTCGTGGAC GCGACATTCT CGACCGCTTT GAGCGACTGG TACGTCCACG ACAATCGGTA CCACTCAAAA TTACGCGCTT GACCGGCATT GACCCGGCGG CGCTGGCGCA GGCTCCTCGT TTCAACGAAA TTGGCGCCGA TCTGGCCCGC TTTATCGGTA ATCGCCCGAT TGTTGGCCAT TCAATTGGGT TTGATCTGAT GATGCTGCGG GCGCAGGGCA TGAACTTCAA CCAGCCTATC TATGACACGT TTGAGCTGGC GACCCTCCTT TTACCGCAAG CAACAAGCTA TAAGTTATCG GCATTGGCCG CCCGACTCGG TATTCCTCAT CCTGATGCCC ATCGTGCCCT TAATGATGCT GAAGTCGCGG CCCAGCTCTT TGCGGTGCTG AGCGAGCGGA TGCTCCAACT CGATCTGGCG ACGTTGGGCG AGATGGTGCG ATTGATGAGC AAGATCGGGT TGCCATTACG CGATCTCTTT GAAGAGGCCT TGCGCCAAAA GGCGCGGAAT GCTTTTCTCG AACCAATCGC TGCGCCACCG GAACCAGCCG ATGCGCTGAC TCCTGAACCA ACGCCTTTAC GGCCAACCGG TGATCAGCGT CCGCTCGATC TCGACGCCAT CGGTGCATTC TTCAGCCCCG ATGGCCCGTT CGGTCGGACC TTCCCCGGCT ACGAAGCGCG TCCACCACAA GTCGAGATGG CCCGTGCAGT AGCCGCAGCA TTCAACTGCA GTGAGCCATT AATGGTGGAA GCAGGTACCG GTACCGGTAA AAGTATGGCC TATCTTGTGC CGGCAGCGTT GTACGCAACC CAACGAGGCG AGCGGGTTGT GATTTCGACC AACACGATCA ATCTGCAAGA TCAACTTTAC AATAAAGACA TTCCCGACTT GCAACGAATC TTTGCCGCTG CCGGTTTGCC ACCGTTTCGG GCTGCGCTGC TCAAGGGGCG GAGCAATTAC CTCTGCCTCA AACGCTACCA TGAATTACGT CGTAGTGAGA ATCTTACGGT CGAAGAGGTT CGCGCGCTGC TCAAGATTCA ACTCTGGCTC CCCTCTACCA CCAGTGGCGA TCGTACCGAA TTACCGTTGA TCGATCGTGA ACAGGCGGCT TGGAATAAGC TGAACGTGAC GGTGGAGACA TGCACCGGTA GTCGTTGTCC GCATTTTCGT GAGTGCTATT TTTTCCGCGC CCGCCGAGCC GCCGAGGCAT CCCACTTGGT GGTGGTGAAT CATGCTCTTC TGATTGCGGA TTTGGCAGCG GCGTCGCAAG TACTGCCACC TTACGATCAT CTGATTATCG ATGAAGCGCA TAACTTGGAG GATGTGGCAA CCGATCAGCT CAGCTTCAAT CTCGACCAAG CCGGCTTGCT GAAGTTTCTC GATGATCTGT TCCAGACCGG TGGCGTGCAG GTGGTAAGTG GCTTGCTTAG CGAGCTGCCG ACGGTGTTGA ACGAAATCGG TGGCGGTGGT GCGGCTGGTG AACGGATCAA TGCAGCGATT GAACGGATGC GTCCGACGTT AATTCGGGCC CGTACTGCGA TCTACGATTG TTTTAACCTG CTTACCCGCT TCGTCCAGCT TGATACCGAG GCTGGTCAAT ACGATCCGCG CTTGCGGTTG ACGAAGAGTG TACGGCAGCG GCCAGAATGG CAAGCGATTA CGCAGGCCTG GCAGAATTTG AACGATATAC TGGCCGCAAT TGGCAACGAG CTGGCCGTGA TCGAAGAGCA GGTGCGTGAG TTGAACGAGA CAACCGGCGC ACTCAACGAT GTGCTGGTAC GCACCGAGGT GCTACGCCGG TTTGCGACCG ATGTTCGGGT ACGTAGCGGT CACATCATCT TCGGCGATGA TGATAGTATC TGCTGGTTGA CGTATGATCG TCAACGCGAT ACGCTTACGC TAACGGCAGC CCCGCTAAGT GTAGCAGAGA TTTTGCAGAG CCAACTGTTT GCACAAAAGC AGACGAGTAT TCTCGCTTCG GCTACCCTGA GCATTACCGG CGATTTCAGC TTCGTCAAGA GTCGGATCGG GCTTGACGAA TGCACCGAGC TGATGCTAGA TTCGCCGTTC GATTATGCTC AGCAAGCGCT GGTCTATATT CCCAATGATA TTCCTGAACC GAATCAGCGT GGGTATCAGC AGATGATCGA GCAGGCCATC GTCGATCTCG CCGTGGCTGC CGAGGGGCGG ATGTTGGCGT TGTTTACTGC ATCAAATGCA TTACGCCAAA CTTACACAGC TATTCAAGAG CCGCTTGAAG ACCACAGGAT CGGGGTTATG GCCCAAGGGA TTGACGGCTC GCGGCGGGCA TTGGTTGATC GCCTGAAGGA GTTTCCCCGT TCGGTGCTGC TCGGTACGAA TAGCTTCTGG GAAGGGGTGG ATGTTGTTGG CGACGCGCTA TCGGTGCTTG TGATTACGAA GCTCCCCTTC GCTGTACCGA CCGATCCGGT GGTGGCAGCA CGGAGTGAGC AATTTGCCGA TCCGTTTAAC GAATACAGTG TACCGCAGAG TATTCTGCGG TTTAAGCAGG GTTTTGGGCG TTTGATCCGC TCGCGTGACG ATCGCGGGAT TGTCGTAGTA CTTGACCGAC GCCTGCTTAC GAAGAAGTAT GGGCAGCAAT TCCTCGATTC ATTGCCGCAT ACGCGGGTAC GGACCGGGCC GCTGGCACAG TTGCCGGGAT TGGTAGCGCG ATTTTTGCGC GGTGGGCGCT CGTAG
|
Protein sequence | MNNRIYVAID VETTGLQSGL DEIIEVAAVT FRGRDILDRF ERLVRPRQSV PLKITRLTGI DPAALAQAPR FNEIGADLAR FIGNRPIVGH SIGFDLMMLR AQGMNFNQPI YDTFELATLL LPQATSYKLS ALAARLGIPH PDAHRALNDA EVAAQLFAVL SERMLQLDLA TLGEMVRLMS KIGLPLRDLF EEALRQKARN AFLEPIAAPP EPADALTPEP TPLRPTGDQR PLDLDAIGAF FSPDGPFGRT FPGYEARPPQ VEMARAVAAA FNCSEPLMVE AGTGTGKSMA YLVPAALYAT QRGERVVIST NTINLQDQLY NKDIPDLQRI FAAAGLPPFR AALLKGRSNY LCLKRYHELR RSENLTVEEV RALLKIQLWL PSTTSGDRTE LPLIDREQAA WNKLNVTVET CTGSRCPHFR ECYFFRARRA AEASHLVVVN HALLIADLAA ASQVLPPYDH LIIDEAHNLE DVATDQLSFN LDQAGLLKFL DDLFQTGGVQ VVSGLLSELP TVLNEIGGGG AAGERINAAI ERMRPTLIRA RTAIYDCFNL LTRFVQLDTE AGQYDPRLRL TKSVRQRPEW QAITQAWQNL NDILAAIGNE LAVIEEQVRE LNETTGALND VLVRTEVLRR FATDVRVRSG HIIFGDDDSI CWLTYDRQRD TLTLTAAPLS VAEILQSQLF AQKQTSILAS ATLSITGDFS FVKSRIGLDE CTELMLDSPF DYAQQALVYI PNDIPEPNQR GYQQMIEQAI VDLAVAAEGR MLALFTASNA LRQTYTAIQE PLEDHRIGVM AQGIDGSRRA LVDRLKEFPR SVLLGTNSFW EGVDVVGDAL SVLVITKLPF AVPTDPVVAA RSEQFADPFN EYSVPQSILR FKQGFGRLIR SRDDRGIVVV LDRRLLTKKY GQQFLDSLPH TRVRTGPLAQ LPGLVARFLR GGRS
|
| |