Gene Cagg_2664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2664 
Symbol 
ID7269571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3260896 
End bp3262866 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content57% 
IMG OID643567490 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_002463968 
Protein GI219849535 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00646659 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGTGATA ACCGTTGGTT GAAAAATAGT TTTGTCTATC TGATCATTCT GGTCGCGGCT 
CTTGCGCTCT TTTTTCAATA TCTAGGTCCA GGGGCCAGCC AGACGGAGGA GAAAGGCATC
GCCGATGTGA TCGCCGATGC CCAAGCCGGC TTAGTGCGCG AGATTCAGGC GCAAGCCGGT
GATGAGCAGA TTATCGTTAC CTACAATGAC GGCAAGAAAT ATCGCTCCCG GCTTGAGTCG
GCCGATAGCG TGATGCGGTT GTTGGCCGAT TACGGCGTGC CGTTGCGCAA CGAACAAGGG
CAGCGCACCA TTAACGTTAT TGTGCAGCCG GCGCCGGCGT GGGGCGGTTT GCTGAGCATC
TTTACGATCC TCTTGCCAAC CCTGTTACTG ATCGGCTTTT TTGTCTTCTT TATGCGCCAA
GCGCAAGGAA GCAACAATCA GGCGATGTCG TTCGGTAAAA GCCGCGCACG CATGTTCGCC
GGCGATAAGC CGACAATTAC GTTTGCCGAT GTTGCCGGTC AGGAAGAGGC GAAGCAAGAC
TTGGCCGAGA TCGTTGAGTT TCTCAAGTTC CCGGATAAGT TCGCCGCTCT TGGTGCCCGT
ATTCCTCGCG GTGTGCTGAT GGTCGGCCCG CCCGGTACCG GTAAAACACT GCTCTCGCGC
GCCGTTGCCG GTGAGGCCGG TGTGCCTTTC TTCAGCATCT CCGGCTCCGA GTTTGTCGAG
ATGTTTGTCG GTGTCGGTGC CTCCCGCGTC CGCGATTTGT TCGATCAGGC CAAGCGTAAT
GCGCCGTGTA TCGTCTTCAT CGATGAGATC GATGCTGTTG GTCGTCAGCG TGGTGCCGGT
CTCGGTGGGT CGCACGACGA ACGTGAGCAG ACGCTCAACC AGATTCTGGT TGAGATGGAT
GGGTTCGATA CCAATACCAA CGTGATCGTG ATCGCCGCCA CGAACCGACC TGATGTGCTC
GATCCGGCGC TCGTTCGCCC CGGCCGCTTT GACCGCCAAG TAGTGCTCGA TGCGCCCGAT
GTGCGTGGGC GGATCGAGAT TCTGAAGGTT CACGTTAAGG GTAAGCCACT GGCCGAGGAT
GTGAATCTGG AGATTCTCGC CCGCCAGACC CCCGGTTTCT CCGGTGCTGA TCTAATGAAT
GTGGTGAATG AAGCGGCAAT TCTGGCGGCA CGTCGCTCGA AGCGCAAGAT TAGCATGGCC
GAGTTTCAAG ATGCGGTCGA GCGGGTGGCT ATCGGTGGTC CTGAGCGTCG CTCGCGGGTG
ATGACCGATC GCCAGAAGCT GGTGGTGGCG TACCACGAGG CTGGCCACGC AATTGTCGGT
GCCGCTTTGC CCAAGGCCGA CAAGGTGCAA AAAGTGACGA TTATCCCGCG TGGGCAGGCA
GGTGGCTATA CGCTCTTCTT GCCTGACGAG GATAGCCTCA ATTTGCGCAC TGTTTCGCAA
TTTAAAGCGC GACTGGCCGT TTCATTGGGC GGACGAGTTG CTGAAGAGAT TGTTTTCGGT
AACGAAGAGG TGACGACCGG TGCCTCCGGT GATCTGGTTC AGGTAACCCG TATCGCTCGT
GCGATGGTGA CTCGCTACGG TATGAGCCAG CGTCTCGGCC CAATCGTCTT TGGTGAGAAG
GAAGAGCTGA TCTTCCTTGG TCGCGAGATT AGCGAGCAGC GCAACTATGG TGATGAGGTT
GCCCGCCAAA TCGATGAAGA AGTGCATGCA ATCGTGAGCG AAGCCTACGA AACGGCGCAG
CAGATCCTGC TCCAGAACCG GGCAGTACTC GATGATATGG CGAATGCCTT GATTGAGTAC
GAGACGCTTG ACGGTGAGCA GCTCGAAGAG TTGATCCGGC GGGTGAAGCC GCTGACCCTC
GATTTTAGCA AGAGTGGTAG CACGACGCCA AATGGTCGCA CCGAGGATCG ACCGGCACAG
CCGGACGCTC CGCAGATGGG TTTGGGCGGT CCAAGCCCGT TGCCGGCGTA A
 
Protein sequence
MGDNRWLKNS FVYLIILVAA LALFFQYLGP GASQTEEKGI ADVIADAQAG LVREIQAQAG 
DEQIIVTYND GKKYRSRLES ADSVMRLLAD YGVPLRNEQG QRTINVIVQP APAWGGLLSI
FTILLPTLLL IGFFVFFMRQ AQGSNNQAMS FGKSRARMFA GDKPTITFAD VAGQEEAKQD
LAEIVEFLKF PDKFAALGAR IPRGVLMVGP PGTGKTLLSR AVAGEAGVPF FSISGSEFVE
MFVGVGASRV RDLFDQAKRN APCIVFIDEI DAVGRQRGAG LGGSHDEREQ TLNQILVEMD
GFDTNTNVIV IAATNRPDVL DPALVRPGRF DRQVVLDAPD VRGRIEILKV HVKGKPLAED
VNLEILARQT PGFSGADLMN VVNEAAILAA RRSKRKISMA EFQDAVERVA IGGPERRSRV
MTDRQKLVVA YHEAGHAIVG AALPKADKVQ KVTIIPRGQA GGYTLFLPDE DSLNLRTVSQ
FKARLAVSLG GRVAEEIVFG NEEVTTGASG DLVQVTRIAR AMVTRYGMSQ RLGPIVFGEK
EELIFLGREI SEQRNYGDEV ARQIDEEVHA IVSEAYETAQ QILLQNRAVL DDMANALIEY
ETLDGEQLEE LIRRVKPLTL DFSKSGSTTP NGRTEDRPAQ PDAPQMGLGG PSPLPA