Gene Cagg_0514 details

Gene Information

Locus tag: Cagg_0514
Symbol:
ID: 7267011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 635362
End bp: 637335
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 55%
IMG OID: 643565377
Product: AAA ATPase central domain protein
Protein accession: YP_002461889
Protein GI: 219847456
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
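
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The Python sketch below does so under two assumptions that the record itself does not state: that Start bp and End bp are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 635362
end_bp = 637335
gene_length_bp = 1974
protein_length_aa = 657

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: the span is 1974 bp, which is 658 codons, or 657 residues plus one stop codon.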


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.33501
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATTG AGCCTTACCG TCCAACCCCT GAACCAACCT CCAACCAACG AGAGCAAGAG 
CTTGAGAAGA AAATTAGCAA ACTTTTCGAG CGGTGGAACT GGAAGTTATT CCGGTTTCTC
CGATTCATGC TCTTCCTCGG TATTGGTATT TGGCTGGCGG TTAATCTGCT ACCCGGCATT
TTCGAGATCA TTATCACCGG CGAAATCGGT AACCTGATTC TACAAATCAT TCCGGTTGCC
GTCTACCTCT TCTTCTTCAT TGGGTTCCAG TTCTGGCTGA TGTACTTTTT CATGGCCCGC
ACCCGAATCT ACTGGGTCAA GCCGGGTGAA ACAGGCGTCA GCTTTAAAGA CTATCGCGGT
AACCCTGAAG TGCTCGAAGC TGCACGTCGC GTCGTTACCC TGCTCAAGGG TGCGAAAGAG
TTCAAGCAAA TGGGTGGTGA AGTCACCCGC GGCATTTTAT TGATTGGCCC ACCCGGTACC
GGGAAAAGCT ACCTTGCCCA AGCCATCTCG ACGGAAGCCG GTGTACCGTT TGGCTATCTG
AGCGCACCTT CACTCCTCTC GGCATGGATG GGAATGGGCA ATATCAAGGT CATGAACCTT
TACCGCAAAG CTCGCCGTCT GGCTCGTGAG TACGGTGCCT GCATCTTGTT CATTGATGAA
ATTGATGCGA TTGGGGCCGC CCGTAGCCCA AATCTGATGG GCACAGGCGC AGCCGGCGCC
GGCATGAATA ATCGGGAGAA TGTGGTGATG GGTGTTGGCG GGATGATGGG TGGTGGAAGC
AGCTTGCTGA ATGAACTATT GCTTCAGATG GACCCACCGC CGCAAGAGCA GACATGGTGG
GGCAAGCTCC TGCGCCTCGT CGGCCTCCGC CGTGGAAAAG CCGATATGCC TCCCGTGCTG
ACGATGGCTG CAACTAACCT CGCCGAGACG CTTGATGCTG CGCTGTTGCG TCCCGGTCGC
TTCGACCGCA AGATCGCCGT CGAGCCACCC GATGCCGATG GGCGTCGCGA GGTAATCGAA
TACTACCTGA GTAAGGTCAA GCACGAACCG ATGCCGATTG ACCGTATGGT TGCCGATACA
ATTGGGTATA CACCGGTGGC GATCAAGTAT GTGATAAACG AAGCGACGAT TCACGCTCAC
TTCGATGGCC GCTCGGCGAT TAACTATTGG GACTTCACAC AGGCTCGCGA GATCCATGAA
TGGGGATTGA AGCAGCCGAT CCGTAGTATG TCGTATGAAG AGCGCCGACG TATTGCCTAC
CATGAGGCCG GTCACGCCTA CGCAGCAGTT AAGCTGTTGA AGAAAGAACG TCTCACCAAA
GTGACGATCG TTCGCCACGG CAATGCGCTT GGTTTTGCAG CGTGGAAGCC AGAAGAGGAG
ATCCATACCC GTACCCGTGA AGAGCTGCTT GACCGGATCA AGATCGCACT GGCGAGCCGT
GCTGCCGAAG AGCTATTCCT CGGCACACAA ATGAGTGGAG TAACCGGTGA TCTGCAGAGC
GCAACCGGCA TTGCTGCAAT GATGGTCGGT GCGTATGGCA TGGATAATAG TTTCTTCTCG
TACCTGCTCT TCGGTATGCA GGGCCTTTCG GCGCCAGATG TGAAGCCACG GGTTGAAGCG
ATCTTGCAAG AGCAGTACCG AATCGTCAAG CAAATGCTTG AACGGAACCG AATGGCGGTG
ATTGCAATTG CTGAAGCGCT GATCTTACGC AACGAGCTGA CCGACATTGA TGTGAAAGAG
ATCTTGGCGC GCGTTGAGGC CGAACATCCC TACTTACCAC CGAACGAGAA GCCTGAGCGA
CCGGCCTTTG GCTTTGCGGC TGCATTAACG CCATCGAACA CGACCCTGGT TCGCCGACGG
CGCGAGCAGG TGGTGTTGCC GCCACCCAAA CAACCTGCAC CAGAAATTAC GATCATCGAT
GCACAACGGC GTGACGCCAA CGATCCCCAA CCTGATCCTA ATACTCAGGC ATAG
 
Protein sequence
MAIEPYRPTP EPTSNQREQE LEKKISKLFE RWNWKLFRFL RFMLFLGIGI WLAVNLLPGI 
FEIIITGEIG NLILQIIPVA VYLFFFIGFQ FWLMYFFMAR TRIYWVKPGE TGVSFKDYRG
NPEVLEAARR VVTLLKGAKE FKQMGGEVTR GILLIGPPGT GKSYLAQAIS TEAGVPFGYL
SAPSLLSAWM GMGNIKVMNL YRKARRLARE YGACILFIDE IDAIGAARSP NLMGTGAAGA
GMNNRENVVM GVGGMMGGGS SLLNELLLQM DPPPQEQTWW GKLLRLVGLR RGKADMPPVL
TMAATNLAET LDAALLRPGR FDRKIAVEPP DADGRREVIE YYLSKVKHEP MPIDRMVADT
IGYTPVAIKY VINEATIHAH FDGRSAINYW DFTQAREIHE WGLKQPIRSM SYEERRRIAY
HEAGHAYAAV KLLKKERLTK VTIVRHGNAL GFAAWKPEEE IHTRTREELL DRIKIALASR
AAEELFLGTQ MSGVTGDLQS ATGIAAMMVG AYGMDNSFFS YLLFGMQGLS APDVKPRVEA
ILQEQYRIVK QMLERNRMAV IAIAEALILR NELTDIDVKE ILARVEAEHP YLPPNEKPER
PAFGFAAALT PSNTTLVRRR REQVVLPPPK QPAPEITIID AQRRDANDPQ PDPNTQA
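
As a spot check that the two sequences correspond, the Python sketch below translates the first 30 bases and the last 6 bases of the gene sequence with a codon table built from the NCBI ordering of bases (T, C, A, G). It assumes the codon-to-residue assignments of translation table 11, which match the standard code for these codons. The helper name `translate` is hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses these same codon-to-residue assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and newlines."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups and the final six bases of the gene sequence.
first_30 = "ATGGCAATTG AGCCTTACCG TCCAACCCCT"
last_6 = "GCATAG"

print(translate(first_30))
print(translate(last_6))
```

The first call yields the ten residues that open the protein sequence (MAIEPYRPTP), and the second yields the final alanine followed by the stop symbol. Passing the full gene sequence to the same function should reproduce the whole protein, though that has not been verified here.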