Gene Cagg_2156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2156 
Symbol 
ID7267664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2647569 
End bp2650373 
Gene Length2805 bp 
Protein Length934 aa 
Translation table11 
GC content55% 
IMG OID643566988 
ProductDNA polymerase III, epsilon subunit 
Protein accessionYP_002463476 
Protein GI219849043 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases
[COG2176] DNA polymerase III, alpha subunit (gram-positive type) 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
[TIGR01407] DnaQ family exonuclease/DinG family helicase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.814989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00934468 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACAACC GTATTTACGT TGCGATTGAT GTTGAAACGA CCGGTCTACA AAGCGGTCTT 
GATGAAATTA TCGAAGTAGC GGCCGTTACT TTTCGTGGAC GCGACATTCT CGACCGCTTT
GAGCGACTGG TACGTCCACG ACAATCGGTA CCACTCAAAA TTACGCGCTT GACCGGCATT
GACCCGGCGG CGCTGGCGCA GGCTCCTCGT TTCAACGAAA TTGGCGCCGA TCTGGCCCGC
TTTATCGGTA ATCGCCCGAT TGTTGGCCAT TCAATTGGGT TTGATCTGAT GATGCTGCGG
GCGCAGGGCA TGAACTTCAA CCAGCCTATC TATGACACGT TTGAGCTGGC GACCCTCCTT
TTACCGCAAG CAACAAGCTA TAAGTTATCG GCATTGGCCG CCCGACTCGG TATTCCTCAT
CCTGATGCCC ATCGTGCCCT TAATGATGCT GAAGTCGCGG CCCAGCTCTT TGCGGTGCTG
AGCGAGCGGA TGCTCCAACT CGATCTGGCG ACGTTGGGCG AGATGGTGCG ATTGATGAGC
AAGATCGGGT TGCCATTACG CGATCTCTTT GAAGAGGCCT TGCGCCAAAA GGCGCGGAAT
GCTTTTCTCG AACCAATCGC TGCGCCACCG GAACCAGCCG ATGCGCTGAC TCCTGAACCA
ACGCCTTTAC GGCCAACCGG TGATCAGCGT CCGCTCGATC TCGACGCCAT CGGTGCATTC
TTCAGCCCCG ATGGCCCGTT CGGTCGGACC TTCCCCGGCT ACGAAGCGCG TCCACCACAA
GTCGAGATGG CCCGTGCAGT AGCCGCAGCA TTCAACTGCA GTGAGCCATT AATGGTGGAA
GCAGGTACCG GTACCGGTAA AAGTATGGCC TATCTTGTGC CGGCAGCGTT GTACGCAACC
CAACGAGGCG AGCGGGTTGT GATTTCGACC AACACGATCA ATCTGCAAGA TCAACTTTAC
AATAAAGACA TTCCCGACTT GCAACGAATC TTTGCCGCTG CCGGTTTGCC ACCGTTTCGG
GCTGCGCTGC TCAAGGGGCG GAGCAATTAC CTCTGCCTCA AACGCTACCA TGAATTACGT
CGTAGTGAGA ATCTTACGGT CGAAGAGGTT CGCGCGCTGC TCAAGATTCA ACTCTGGCTC
CCCTCTACCA CCAGTGGCGA TCGTACCGAA TTACCGTTGA TCGATCGTGA ACAGGCGGCT
TGGAATAAGC TGAACGTGAC GGTGGAGACA TGCACCGGTA GTCGTTGTCC GCATTTTCGT
GAGTGCTATT TTTTCCGCGC CCGCCGAGCC GCCGAGGCAT CCCACTTGGT GGTGGTGAAT
CATGCTCTTC TGATTGCGGA TTTGGCAGCG GCGTCGCAAG TACTGCCACC TTACGATCAT
CTGATTATCG ATGAAGCGCA TAACTTGGAG GATGTGGCAA CCGATCAGCT CAGCTTCAAT
CTCGACCAAG CCGGCTTGCT GAAGTTTCTC GATGATCTGT TCCAGACCGG TGGCGTGCAG
GTGGTAAGTG GCTTGCTTAG CGAGCTGCCG ACGGTGTTGA ACGAAATCGG TGGCGGTGGT
GCGGCTGGTG AACGGATCAA TGCAGCGATT GAACGGATGC GTCCGACGTT AATTCGGGCC
CGTACTGCGA TCTACGATTG TTTTAACCTG CTTACCCGCT TCGTCCAGCT TGATACCGAG
GCTGGTCAAT ACGATCCGCG CTTGCGGTTG ACGAAGAGTG TACGGCAGCG GCCAGAATGG
CAAGCGATTA CGCAGGCCTG GCAGAATTTG AACGATATAC TGGCCGCAAT TGGCAACGAG
CTGGCCGTGA TCGAAGAGCA GGTGCGTGAG TTGAACGAGA CAACCGGCGC ACTCAACGAT
GTGCTGGTAC GCACCGAGGT GCTACGCCGG TTTGCGACCG ATGTTCGGGT ACGTAGCGGT
CACATCATCT TCGGCGATGA TGATAGTATC TGCTGGTTGA CGTATGATCG TCAACGCGAT
ACGCTTACGC TAACGGCAGC CCCGCTAAGT GTAGCAGAGA TTTTGCAGAG CCAACTGTTT
GCACAAAAGC AGACGAGTAT TCTCGCTTCG GCTACCCTGA GCATTACCGG CGATTTCAGC
TTCGTCAAGA GTCGGATCGG GCTTGACGAA TGCACCGAGC TGATGCTAGA TTCGCCGTTC
GATTATGCTC AGCAAGCGCT GGTCTATATT CCCAATGATA TTCCTGAACC GAATCAGCGT
GGGTATCAGC AGATGATCGA GCAGGCCATC GTCGATCTCG CCGTGGCTGC CGAGGGGCGG
ATGTTGGCGT TGTTTACTGC ATCAAATGCA TTACGCCAAA CTTACACAGC TATTCAAGAG
CCGCTTGAAG ACCACAGGAT CGGGGTTATG GCCCAAGGGA TTGACGGCTC GCGGCGGGCA
TTGGTTGATC GCCTGAAGGA GTTTCCCCGT TCGGTGCTGC TCGGTACGAA TAGCTTCTGG
GAAGGGGTGG ATGTTGTTGG CGACGCGCTA TCGGTGCTTG TGATTACGAA GCTCCCCTTC
GCTGTACCGA CCGATCCGGT GGTGGCAGCA CGGAGTGAGC AATTTGCCGA TCCGTTTAAC
GAATACAGTG TACCGCAGAG TATTCTGCGG TTTAAGCAGG GTTTTGGGCG TTTGATCCGC
TCGCGTGACG ATCGCGGGAT TGTCGTAGTA CTTGACCGAC GCCTGCTTAC GAAGAAGTAT
GGGCAGCAAT TCCTCGATTC ATTGCCGCAT ACGCGGGTAC GGACCGGGCC GCTGGCACAG
TTGCCGGGAT TGGTAGCGCG ATTTTTGCGC GGTGGGCGCT CGTAG
 
Protein sequence
MNNRIYVAID VETTGLQSGL DEIIEVAAVT FRGRDILDRF ERLVRPRQSV PLKITRLTGI 
DPAALAQAPR FNEIGADLAR FIGNRPIVGH SIGFDLMMLR AQGMNFNQPI YDTFELATLL
LPQATSYKLS ALAARLGIPH PDAHRALNDA EVAAQLFAVL SERMLQLDLA TLGEMVRLMS
KIGLPLRDLF EEALRQKARN AFLEPIAAPP EPADALTPEP TPLRPTGDQR PLDLDAIGAF
FSPDGPFGRT FPGYEARPPQ VEMARAVAAA FNCSEPLMVE AGTGTGKSMA YLVPAALYAT
QRGERVVIST NTINLQDQLY NKDIPDLQRI FAAAGLPPFR AALLKGRSNY LCLKRYHELR
RSENLTVEEV RALLKIQLWL PSTTSGDRTE LPLIDREQAA WNKLNVTVET CTGSRCPHFR
ECYFFRARRA AEASHLVVVN HALLIADLAA ASQVLPPYDH LIIDEAHNLE DVATDQLSFN
LDQAGLLKFL DDLFQTGGVQ VVSGLLSELP TVLNEIGGGG AAGERINAAI ERMRPTLIRA
RTAIYDCFNL LTRFVQLDTE AGQYDPRLRL TKSVRQRPEW QAITQAWQNL NDILAAIGNE
LAVIEEQVRE LNETTGALND VLVRTEVLRR FATDVRVRSG HIIFGDDDSI CWLTYDRQRD
TLTLTAAPLS VAEILQSQLF AQKQTSILAS ATLSITGDFS FVKSRIGLDE CTELMLDSPF
DYAQQALVYI PNDIPEPNQR GYQQMIEQAI VDLAVAAEGR MLALFTASNA LRQTYTAIQE
PLEDHRIGVM AQGIDGSRRA LVDRLKEFPR SVLLGTNSFW EGVDVVGDAL SVLVITKLPF
AVPTDPVVAA RSEQFADPFN EYSVPQSILR FKQGFGRLIR SRDDRGIVVV LDRRLLTKKY
GQQFLDSLPH TRVRTGPLAQ LPGLVARFLR GGRS