Gene Cagg_0309 details


Gene Information

Locus tag: Cagg_0309
Symbol:
ID: 7267490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 385603
End bp: 387360
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 55%
IMG OID: 643565177
Product: peptidase M61 domain protein
Protein accession: YP_002461691
Protein GI: 219847258
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
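
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(385603, 387360)
print(length_bp)                           # 1758
print(expected_protein_length(length_bp))  # 585
```

Both results agree with the Gene Length (1758 bp) and Protein Length (585 aa) fields of this record.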


Plasmid Coverage information

Number of covering plasmid clones: 11
Plasmid unclonability p-value: 0.829558
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAACCT ATACCATTGC AATGCCGGTG CCTCATTCCC ATCTCTTCCA TGTTCACATA 
ACTATTCAAC GGACTGAGAC CGGTGATGTT CAGTTGTCGT TACCGGCCTG GACGCCCGGT
TCGTATATGA TCCGTGAGTT TGCCCGCCAT GTGCAAGAGT TTCAGGCGAC TGACGGGGCA
GGCGTCCGTT TGCCGTGGCA CAAGGTACGC AAAGATAGTT GGATCGTGCA GGCCGGGGCA
GCAACCCACG TCATTGTTTC GTATAAAGTC TATGCGTTTG ATTTGACGGT ACGCACCAGT
CATCTCGATG GCACGCACGG CTATTTCAAC GGCGCGAATG TGTTTATGTA CGTTCACGAC
CATACGCGCG AGCCGTTGAT CCTTCACGTC GATCCACAGC CCGGCTGGCA AGTAACGACC
GGTCTCACTC CGTTACCGCC TGATCCGGCG TATCCGCAGC GGGCATCGTT TCAGGCAGCC
GATTACGATG AGTTGGTCGA TTGCCCGGTT GAATGTGGCA CGCATCGCCT CTTAACCTTT
ACCGTCGATG ATGTACCGCA CCGCATTGCC ATTTGGGGCC ATGGGAATGA AGATGAAGCG
CGTTTATTGG CCGACACCAA ACGTATTGTT GAAGTACAAC GGGCGCTGTT TGGTAGCTTG
CCGTACCACG ACTATACGTT CATTCTGCAT CTGACCGATG GCCGCGGTGG GGGCCTTGAA
CATCGCAATA GCGCTACCAA TATGGTCGAT CGCTGGACGT TTACCAACCA GTATGAGCGC
TATCTGAGCC TTACCTCGCA CGAGCTGTTT CATGCCTGGA ATGTGAAACG GCTGCGCCCT
GCAGTGCTTG GGCCGTTCGA TTACCAGCAA GAGAATTACA CGCGCCTGCT GTGGTTGATG
GAAGGGGCAA CGAGCTACTA CGATGAGCTA CTGTTGGTGC GGGCCGGTCT GATGAGCGAA
GAGCGCTATT TGCAAAAATT GGCCGATAAG ATCGTCCAAC TTCAACAGCA ACCCGGTCGT
CGGTTGCAGA GTCTCGAACA GAGCAGCTTC GACGCCTGGA TTAAGTTCTA CCGTCCTGAT
GAGAATAGCA TCAATTCCAG TATTTCGTAC TATCTCAAAG GCGCGTTGGT GTGCTGGATG
TTTGATATGG CGATCCGGGC ACAAACGGAC GGCGAACGTA GCTTTGATGA TGTGATGCGT
TATCTTTACC AACGCTATCC GGTCGAAGGG CCGGGTATTC CCGAAGAGGG TGCGGTGCTG
GCTGCAATCG AGGCGGTTGG TGGTCCGCAG AAGGAGTTTC GTGAGTTATA CGAACAGTAT
GTAGCAGGAG TTGACGAGCT CGATTATCAT GCTGCCTTGG CCGTCGTTGG GCTGGAACCA
CGCTGGCACT ACCGACGGCC ACGGCCCGAT GGGCAGCCAC CGGTATGGTT GGGGATCAAT
TGGCGTCAAC AGGGTGAACG AACTATCGTA GCTTCGGTGC GCAGTGATGG TCCTGCTTAC
GAAGCGGGTG TCTATGCCGG TGATGAGTTG GTCGCGCTTG ATGGCTGGCG GGTGAATGAG
GAACGTCTCA ACCAACGCCT GCTGGAACGC CGGCCCGGTG ATTCGGTGCG GTTGACCCTC
TTCCGTGGTG ATGCATTGAT CGATGTCGTT GTGCCATTGG CGGTTGCGCC GTATGATGCG
CTATCGCTGG TACCGGTGGC TATTCCAACT GCCGCCCAAC TGCGTATGCG GGCAGCATGG
CTAGAGCGAA TGGTGTGA
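
A GC-content figure such as the 55% listed under Gene Information can be recomputed from a sequence printed in this layout. The sketch below assumes GC content is simply the fraction of G and C bases over all bases, and that the spaces and line breaks between the 10-base blocks are layout only. How the database itself derives or rounds the value is not stated in the record, and the function name is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a sequence, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()  # strip spaces and newlines
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence above: 1 G and 2 C out of 10 bases.
print(gc_content("ATGATAACCT"))  # 0.3
```

Passing the complete gene sequence, pasted as one string, gives the gene-wide value to compare against the GC content field.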
 
Protein sequence
MITYTIAMPV PHSHLFHVHI TIQRTETGDV QLSLPAWTPG SYMIREFARH VQEFQATDGA 
GVRLPWHKVR KDSWIVQAGA ATHVIVSYKV YAFDLTVRTS HLDGTHGYFN GANVFMYVHD
HTREPLILHV DPQPGWQVTT GLTPLPPDPA YPQRASFQAA DYDELVDCPV ECGTHRLLTF
TVDDVPHRIA IWGHGNEDEA RLLADTKRIV EVQRALFGSL PYHDYTFILH LTDGRGGGLE
HRNSATNMVD RWTFTNQYER YLSLTSHELF HAWNVKRLRP AVLGPFDYQQ ENYTRLLWLM
EGATSYYDEL LLVRAGLMSE ERYLQKLADK IVQLQQQPGR RLQSLEQSSF DAWIKFYRPD
ENSINSSISY YLKGALVCWM FDMAIRAQTD GERSFDDVMR YLYQRYPVEG PGIPEEGAVL
AAIEAVGGPQ KEFRELYEQY VAGVDELDYH AALAVVGLEP RWHYRRPRPD GQPPVWLGIN
WRQQGERTIV ASVRSDGPAY EAGVYAGDEL VALDGWRVNE ERLNQRLLER RPGDSVRLTL
FRGDALIDVV VPLAVAPYDA LSLVPVAIPT AAQLRMRAAW LERMV