Gene Cagg_1866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1866 
Symbol 
ID7266357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2286692 
End bp2287756 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content59% 
IMG OID643566703 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_002463197 
Protein GI219848764 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0589162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGC TTCCTGATGA TTTTACAATC CTGGCGCTTG AAACGTCGTG TGATGAGACG 
GCAGCCGCGG TCGTGCGTGG TGGGCGCACC GTGATCAGTA ATATCGTCGC TTCACAAATG
GCGACACATG AGCGGTATGG GGGTGTAGTT CCCGAAATTG CCTCTCGCCA ACATATTCTT
AGTCTGGCGC CGGTAGTACG GGCAGCGTTG GCCGCGCTTC CTAACGGCTG GAATGATGTC
CACGCCGTGG CGGCTACTCA CGGTCCCGGT CTGAGCGGTG CGTTGTTGAC CGGTCTCAAT
GCGGCCAAAG CTATGGCGTG GCAGCGTGGC TTGCCGTTTG TCGCGGTGAA CCATCTAGAG
GCCCATCTGT ACGCCGGTTG GTTAGGCAGT GAGCCGCTGC CGCCATTCCC GCTCGTTGCT
TTGTTGGTCA GTGGTGGGCA TACCTTACTG GCCTTGATCC ACGATCATGG TCAATACGAA
TTGCTCGGTC AAACTCGTGA TGATGCAGCC GGTGAAGCGT TCGATAAGGT AGCGCGAATT
CTAGGCTTGG GATATCCGGG CGGGCCGGCG ATCCAGGCTG CCGCGGCGCA GGCAACCCCA
GGCGGGGTCT TGCCACGGGC ATGGCTCCGC GATAGCTACG ATTTCTCATT CAGTGGACTC
AAAACTGCCG TCCTGCATCG CGTCCAGGAT CGGTTGGCGC AGGCAGCCCG TCTCAGTGGA
CGAAAGGGTT CAAACGAAAC ACCGCAACTC GATGCGCCAT TTGTTGCGCA GATGGCGTAT
GCGTTTCAAG AGTCGGTTGT TGATGTGTTG GTAACGAAGA CGGTTGATGC AGCACGTCGC
TATAAGGCGC AGGCGATCCT ATTGGCCGGC GGCGTGGCGG CGAACCGACG GTTGCGTGAA
GAATTGAAGC GGCGCGCCGG CGTACCGGTA CATCTACCGG CACTTGACCT CTGCACCGAT
AATGCAGCGA TGGTGGCAGC GGCTGCATTC TACCGCTTCC ACGCCGGAGT ACAACACGGT
TGGGATGTGG ATGTAACGGC GAATTTGCCG TTAGGGGCGT CGTAG
 
Protein sequence
MKRLPDDFTI LALETSCDET AAAVVRGGRT VISNIVASQM ATHERYGGVV PEIASRQHIL 
SLAPVVRAAL AALPNGWNDV HAVAATHGPG LSGALLTGLN AAKAMAWQRG LPFVAVNHLE
AHLYAGWLGS EPLPPFPLVA LLVSGGHTLL ALIHDHGQYE LLGQTRDDAA GEAFDKVARI
LGLGYPGGPA IQAAAAQATP GGVLPRAWLR DSYDFSFSGL KTAVLHRVQD RLAQAARLSG
RKGSNETPQL DAPFVAQMAY AFQESVVDVL VTKTVDAARR YKAQAILLAG GVAANRRLRE
ELKRRAGVPV HLPALDLCTD NAAMVAAAAF YRFHAGVQHG WDVDVTANLP LGAS