Gene Cagg_1500 details


Gene Information

Locus tag: Cagg_1500
Symbol:
ID: 7267277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1839615
End bp: 1841534
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 62%
IMG OID: 643566344
Product: AAA ATPase central domain protein
Protein accession: YP_002462840
Protein GI: 219848407
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0464] ATPases of the AAA+ class
TIGRFAM ID:
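
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, although both are consistent with its numbers. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1839615
END_BP = 1841534
GENE_LENGTH_BP = 1920
PROTEIN_LENGTH_AA = 639


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))      # compare with Gene Length
print(residues_from_cds(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.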


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGG CCACGGCAAT CGATTGGGTG ACAGCTAACC AGCGTTATCT GAGTGCGGCG 
CTTGACGTGA TCAAGCGGCG CTTGGCCGGC GAGCGCGACG AAGCAGGCTT GCGCGAGGCC
GGGCAGGCCC TTGCTGCTGC CCGCGCTGCG TTGCCCGCAC TGCCGGCGAT CGAGCGGTTG
CGCATCATCT TTGGCTTAAC CGATTTCGAG ACTGAACTGC TGCTGCTCTG TGCGGGGGTG
GAACTCGATA GCGGTTTGGC AACGCTCTGT GCCACAGCGC AGGGTGATCC GCACCGCTCT
CGCCCAACCA TCGGGTTGGC GCTGACAGTG CTGCCCGCTG CCCACTGGAG TGCGGTGACA
CCGACGGCGC CGCTTCGCCA CTGGCGACTG ATCGAGCTGG TCGAGGGCGA GCCATTGGTG
ACTGCGCCAT TGCGCATCGA CGAGCGGGTG CTGCACTACC TGCTCGGTGT GGCGACGCTT
GACCGGCGCT TGCAACCGCT GGTGCGCCTG CTCGAGCCAG CGGTGGCGTT GCCGGCCTCG
CATCGGCAAA TCGTGGAGCG GATTGCAGCG CTGTGGGAAG AGCAACCTGC GCCGCCAGTG
CAGATCTGCG GCAGCGATTC GTATAGTCGG CAGGCGCTTG CCGCTGCGAT TGGCGATCGA
TTGGGGCGGG CAGTGTACCT GCTGCGGGCC GAAGATGTGC CCGCTGGCCC TGCCGAGCAA
GAGTTGTTGG CCCGCCTGTG GGAGCGAGAA GCAGCGTTGA GTGGTGCGCT GGCACTGATC
GCGGTGGACG ATGGCGACAC GTCGCGAGCT TGGCTTGGGT GGTTGGAACG GGTACAGGGC
ATGGTGTTGT TGGCGAGCGC CGATCCGCTG CCGTCCGGCG ACCGGCTGAT CGTGCGGGTT
GATGTGCCGC AACCCGATCG TACCGAGCAA CGATCGCTCT GGCAACAAAT ATTGGGTGAA
CGCAGTCTGA CACTCAACGG CCAACTCGAA CCGCTGCTGG TCCAGTTCTC CCTCAATACC
AATACCATCC GCGCTGTGAC TTTAGCAACC ACGAACGATC AGGAACCGGA CTGGTGGGAA
GTCTGCCGCG CGCAGGCTCG TCTCCGGCTG GATGGGTTGG CCCAGCGGAT CGAGACAGCG
GCGGGTTGGG ACGATCTGGT GTTACCAGAC GAGCATATGG CAACGCTGCG CCAAATGGTA
GCGCATGTGC GCCAAAGAGC ACGAGTCTAC GATCAGTGGG GGTTTGGTCG GCGCGGCGAA
CGCGGGTTGG GGATTAGCGC TCTCTTTGCC GGGCCAAGTG GTACCGGCAA GACGCTGGCT
GCCGAGGTAT TGGCGAATGA ACTACGGCTC GATCTCTACC GTATCGACCT AAGTGCCGTA
GTAAGTAAAT ACATCGGTGA GACCGAAAAG AATCTGCGGC GTATCTTCGA CGCTGCGGAA
GGCGGGGGCG CGATACTGTT GTTCGACGAA GCCGATGGGC TGTTTGGCCG GCGCAGTGAA
GTGAAAGATA GTCACGATCG GTACGCCAAC CTCGAAGTGA GTTATCTCTT GCAACGAATG
GAAGCGTACC GCGGTTTGGT CATTCTCACC ACCAACATGA AGCAAGCGAT TGACACTGCC
TTCCTGCGCC GTCTTCGCTT TATTGTCAAC TTTCCCTTCC CCGACGCGGC GCAACGCCGG
CGCATCTGGC AGCGGGTCTT CCCGCCCGCC ACGCCGCTAG CCGGGTTGGA TTGGGTACGG
CTGGCGCAGT TGAATCTGGC CGGCGGCAAC ATTCGCAGCA TTGCCTTGAA TGCCGCTTTT
CTTGCGGCTG ATGCCGGCGA ACCGGTCGGG ATGCACCACA TCTTGTCTGC GGCGCACAGC
GAATACGCCA AATTGGACAA ACCGCTGACC GAGACGGAAT TGCGGGGGTG GGGATGTTAG
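
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. The following is a minimal sketch of how a GC fraction could be computed from such text. The helper names are hypothetical, and only the first row of the sequence is used here for brevity, so the printed figure describes that row and not the whole gene.

```python
def clean_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from this record.
FIRST_ROW = "ATGAGCACGG CCACGGCAAT CGATTGGGTG ACAGCTAACC AGCGTTATCT GAGTGCGGCG"

seq = clean_blocks(FIRST_ROW)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing all rows of the gene sequence to `clean_blocks` would give a value to compare with the GC content field. How the database rounds that field is not stated in the record.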
 
Protein sequence
MSTATAIDWV TANQRYLSAA LDVIKRRLAG ERDEAGLREA GQALAAARAA LPALPAIERL 
RIIFGLTDFE TELLLLCAGV ELDSGLATLC ATAQGDPHRS RPTIGLALTV LPAAHWSAVT
PTAPLRHWRL IELVEGEPLV TAPLRIDERV LHYLLGVATL DRRLQPLVRL LEPAVALPAS
HRQIVERIAA LWEEQPAPPV QICGSDSYSR QALAAAIGDR LGRAVYLLRA EDVPAGPAEQ
ELLARLWERE AALSGALALI AVDDGDTSRA WLGWLERVQG MVLLASADPL PSGDRLIVRV
DVPQPDRTEQ RSLWQQILGE RSLTLNGQLE PLLVQFSLNT NTIRAVTLAT TNDQEPDWWE
VCRAQARLRL DGLAQRIETA AGWDDLVLPD EHMATLRQMV AHVRQRARVY DQWGFGRRGE
RGLGISALFA GPSGTGKTLA AEVLANELRL DLYRIDLSAV VSKYIGETEK NLRRIFDAAE
GGGAILLFDE ADGLFGRRSE VKDSHDRYAN LEVSYLLQRM EAYRGLVILT TNMKQAIDTA
FLRRLRFIVN FPFPDAAQRR RIWQRVFPPA TPLAGLDWVR LAQLNLAGGN IRSIALNAAF
LAADAGEPVG MHHILSAAHS EYAKLDKPLT ETELRGWGC
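
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. Translation table 11 assigns the same amino acid to each codon as the standard code and differs in which start codons it permits, which does not matter here because the gene starts with ATG. The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace allowed) until the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and first two protein blocks from this record.
FIRST_GENE_ROW = "ATGAGCACGG CCACGGCAAT CGATTGGGTG ACAGCTAACC AGCGTTATCT GAGTGCGGCG"
FIRST_PROTEIN_BLOCKS = "MSTATAIDWV TANQRYLSAA"

print(translate(FIRST_GENE_ROW) == "".join(FIRST_PROTEIN_BLOCKS.split()))
```

The same function applied to the full gene sequence should yield a string to compare with the full protein sequence above.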