Gene Cagg_2499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2499 
Symbol 
ID7269345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3039659 
End bp3043441 
Gene Length3783 bp 
Protein Length1260 aa 
Translation table11 
GC content58% 
IMG OID643567325 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_002463806 
Protein GI219849373 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGACT TCGTTCACCT GCATGTCCAC TCCGAATACA GCCTCCTCGA CGGCTATGCC 
ACGACCAAAG GTATCGTTCA GCGCGCCGCC GATTTGGGCA TGGACAGTAT CGCCCTCACC
GATCACGGTG TTCTCTACGG CGCAATGGAG TTTTACGAGG CAGCCAAAAA AGCCGGTATC
CGCCCGATCA TCGGCGTCGA AGCGTACATA GCCCCCGGCT CATTGTCCGA CCCGATGACA
AAAGGGGCGA AAAACTATTT TCACCTCCTC CTGTTGGCGC AAAACGAGAC CGGCTACCGC
AATTTGGTCA AACTCACGAC CCGCGCCCAT CTCGATGGCA TGGGGAAAGG GGTGTTTGCG
CGCCCCCGCA TCGACCGTCA CCTGCTCGAA ACGTATCACG AGGGCCTGAT CGTCACGTCG
GCATGCGTAG CCGGCGAGCT CTTGCATCAC CTCAAACACG GTGACCGTCA CACTGCCGTC
GAGACCGCAG CGTACTACCG TGATCTGCTC GGTCCCGACC GCTACTACAT CGAATTACAA
CTCCACGACA ATACACCCGA ACTCGAGCCA CTCAATGATG AATTGGTGCG CATCGCCCGC
GAACTCGGCA TCCCCCTCGT TGCGACCAGC GATGCCCACT TCATTCACCC TGAGGATAAA
GAGACCCAGC ATAAGGTGAT GGCAATGGGC ATGAACATGA CCTACGCCGA GTTCTGTACC
CGCGGATATG CGATGGACGA AAGCTATCAC ATTATGTCAG GCGAAGCGAT GTGGGCGCGA
TTTAAACGCT ATGGCACCGA GCCACTCGAA AATACCCGCC GAATTTCCGA TATGTGCCGG
CTCAAGCTCG AGTTTGGGCG GGTGCAGTTA CCGGTGTTCG AGCTGCCTGA AGGTCACGAT
GCCGCATCAT ATCTGCGCCT CGTCTGCGAA GAAGGCCTGA TGCGTCGCTT CAACGGTCAG
CCGCCCGAAA CATATCTGCG CCGGTTGGAG TATGAGCTTG ATGTGATCAA TCAGACCGGC
TTCCCCGACT ATATGTTGAT CGTCTGGGAC TACGTGAAAT TCGCCCGCTC GCGCGGTATT
CCCTGTCTCC CGCGTGGTTC GGCCGGTGCA TCGCTCGTCC TCTACGCCCT CGGCATTACC
GATGTCGATC CGGTCAAGAA TAAACTGCTC TTCGAGCGCT TCCTCTCCCC CGAACGACTG
GAGATGCCCG ACATCGACAC CGACTTTGCC GACTCACGTC GCCAAGAGAT TCTCGATTAC
ATTGCCAATA AATACGGGCG CGAGAATGTG GCCCAGATCA TTACCTACGG CACCCTCGGC
GCCAAGGCCG CCATCCGCGA TATGGGTCGC GTCCTCGGTC TCGATCCCGG CGAGGTTGAC
CGCGTGGCGA AACTGATTCC GTCGCTGCCG GTGGGCACGA CGATCGCGCA GGCGATCGAG
CGGGTGCCTG AATTGAAGCA GATCTACGAG ACTCAGCCCC ACTTGCGCGA GTTGCTGATC
GAAGCGCAGA AGGTCGAGGG CCGGATGCGC TCGGTTGGCA CCCATGCCTG TGGCGTGGTG
GTCAGCCGCA CCCCGCTCGA AGAGCTTGTA CCGCTCCAAC GCACCACCAA AGACGAGCAT
GCGCTGATGG CCGCGTTTGA AGGGGCAACC CTGGCCAAGA TGGGTCTGCT CAAAATGGAC
ATTCTCGGTC TCACCAACTT GTCGGTGGTG GCCGAGGCGC TGAAGTATAT TGAGCAGACG
ACCGGTCGCC GCATGTGGCT CGATGAGATT CCGCTCGATG ATCCGAAGGT GTTTGCGGCG
TTGGGCCGCG GCGAGACCAA GAATGTGTTC CAGCTTGAAT CGGCAGGGAT GACCCGCTAC
CTCATGCAGT TGCAACCGAC CCGCGTTGAA GACCTCTACG CGATGGTCGC GCTCTACCGG
CCCGGCCCGC TCGAGCAGAT TCCGGTCTAT ATCCAGAATA AGCATAATCC CTCCCAAATC
CGCTATCTCC ACCCGGTGCT CGAGCCGATT TTGTCCGATA CCTACGGTGT GATCGTCTAC
CAAGAGCAGA TTATGCAATT GCTCCAGACC GTCGCCGATT ATACGCTCGG TCAAGCGTAT
ATCGTGATCA AGGCCATTAG CAAGAAGAAC AAGGAGCTAA TGGCCGAGAA CGAGGCGATC
TTTAAACAGG GGTGCCAGCG CAAAGGCATT AGCAAAGAGG TGGCCGACCA GCTTTGGGAG
CTGATCTTGC CCTTCGCCGG CTATTCGTTC AACCGGCCTC ACGCTACCCT CTACGGCCTG
CTCAGCTATC AGACGGCGTG GCTCAAGGTG AATTATCCGG TTGAGTATAT GACCGCAGTG
CTCACCGGTG CCGGAGGGGT GATTGAAGAT GTGACCAAGG CCGCCCTCGA GGCGCGCCGA
TTGGGCGTCG CCGTATTGCC ACCCGATGTC AACCGCTCAC ACAAGGGCTT TACCATCGAG
CCGTTATCGC CACCACTTCC CGAAGGAGTG AAGTACGACC GCGGGATTCG CTTTGGCCTG
ATGGCAATTA AGAATGTTGG TGAAGGACCG GTTGAGGCAA TCATCGCTGC CCGCGAGGCC
GGTGGCCCCT TCCGGTCACT GGAAGACCTC TGCGCCCGCG TCGATCGCCA CGCCCTCAAC
AAGCGAACGC TTGAGAGTCT GATTAAGGCC GGTGCGCTCG ACTCACTGCC CGGGAGCCGT
CGCCAAAAGC TTGCAATTCT CGATCAGGCG ATAAACGCAG GCGTTGAAGC GCAGAAGGCG
CGCGACATCG GCCAGAGCAG TCTGTTCGAT ATCTTCGGGG AAAGCAACAC CGGCAGCCCG
AATGTCGCTC GTATCCCTCT GCCAATCATC ACCGAGACGC CCGCCGAGCA GAAAGAGGTG
CTGCGATGGG AGAAAGAACT GCTCGGTCTG AACCTCTCCG ATGACCCTAT CGCGAAAGCA
CTTGAAGGGA TCGACCTCAC CGGCGTCACC GACCTCGGAT CGCTTGAAGA GGAGCATGTT
GGCGAGACGC TGACCTTCGT CGGTGTCCTG AGCGGAGTGC GCCGGATTGC CACCAAGAAG
GGCGATTCGA TGGTGGTGGC AACCCTCGAA GATATGACCG GGAGCATCGA GATCGTGGTG
TTTCCGAAGG TGTTGGCAAA AGCCGCCGAT CTGCTCCAAA ATGATGCGGT AGTACGGGTG
ACGGCAAAAG TTGATAACCG GCGCGATACT CCGCAACTCG TCGTAGAGAG CGTTGAAACG
CCATCTACCA TCGACCAAAC GACCAGCCCT CAACCGGTTG AGATGGATCT CGAAGGGATG
AGTGATGAGC TGAGCGCAGA ATTGGCGACC GATGTACCCA TCTCCCTCCC GGATCCATCA
CCCCAACCAC CATCGGTTAC CACGGTGTCG TCACCCCCGG CTCCATCAGT GACCACCACC
CCGCCAACGG CCAAACCACC AACAGTACGC CCGCGCCAAC CGGTCAAACT CGCCAACGGG
AACGGCCAAG GAAATAGCAA CGGACAGAGC AAGAGCGGCG AACCACCCCG CACCGACGGG
CCGCCGTCAC GTAATTTGCG CATCTACCTG CCGCGTACCA ATAACTTCGA CGCCGATGTC
GCCCTGATGC AGCATATTCA CAGCCTGCTC AGCGCCAGCC AAGGCAATGA TCACGTAACG
CTATACCTGC CCAATGGGGT TGGGATGGTC GTGCTGCAAT CGCAACAAAC CATCGAACTG
TCAGACGCGC TGCTCAACGA ATTGCGCCAG CTCCTCGGCC ACGAGCGGGT CTTGGCGGCA
TAA
 
Protein sequence
MHDFVHLHVH SEYSLLDGYA TTKGIVQRAA DLGMDSIALT DHGVLYGAME FYEAAKKAGI 
RPIIGVEAYI APGSLSDPMT KGAKNYFHLL LLAQNETGYR NLVKLTTRAH LDGMGKGVFA
RPRIDRHLLE TYHEGLIVTS ACVAGELLHH LKHGDRHTAV ETAAYYRDLL GPDRYYIELQ
LHDNTPELEP LNDELVRIAR ELGIPLVATS DAHFIHPEDK ETQHKVMAMG MNMTYAEFCT
RGYAMDESYH IMSGEAMWAR FKRYGTEPLE NTRRISDMCR LKLEFGRVQL PVFELPEGHD
AASYLRLVCE EGLMRRFNGQ PPETYLRRLE YELDVINQTG FPDYMLIVWD YVKFARSRGI
PCLPRGSAGA SLVLYALGIT DVDPVKNKLL FERFLSPERL EMPDIDTDFA DSRRQEILDY
IANKYGRENV AQIITYGTLG AKAAIRDMGR VLGLDPGEVD RVAKLIPSLP VGTTIAQAIE
RVPELKQIYE TQPHLRELLI EAQKVEGRMR SVGTHACGVV VSRTPLEELV PLQRTTKDEH
ALMAAFEGAT LAKMGLLKMD ILGLTNLSVV AEALKYIEQT TGRRMWLDEI PLDDPKVFAA
LGRGETKNVF QLESAGMTRY LMQLQPTRVE DLYAMVALYR PGPLEQIPVY IQNKHNPSQI
RYLHPVLEPI LSDTYGVIVY QEQIMQLLQT VADYTLGQAY IVIKAISKKN KELMAENEAI
FKQGCQRKGI SKEVADQLWE LILPFAGYSF NRPHATLYGL LSYQTAWLKV NYPVEYMTAV
LTGAGGVIED VTKAALEARR LGVAVLPPDV NRSHKGFTIE PLSPPLPEGV KYDRGIRFGL
MAIKNVGEGP VEAIIAAREA GGPFRSLEDL CARVDRHALN KRTLESLIKA GALDSLPGSR
RQKLAILDQA INAGVEAQKA RDIGQSSLFD IFGESNTGSP NVARIPLPII TETPAEQKEV
LRWEKELLGL NLSDDPIAKA LEGIDLTGVT DLGSLEEEHV GETLTFVGVL SGVRRIATKK
GDSMVVATLE DMTGSIEIVV FPKVLAKAAD LLQNDAVVRV TAKVDNRRDT PQLVVESVET
PSTIDQTTSP QPVEMDLEGM SDELSAELAT DVPISLPDPS PQPPSVTTVS SPPAPSVTTT
PPTAKPPTVR PRQPVKLANG NGQGNSNGQS KSGEPPRTDG PPSRNLRIYL PRTNNFDADV
ALMQHIHSLL SASQGNDHVT LYLPNGVGMV VLQSQQTIEL SDALLNELRQ LLGHERVLAA