Gene Cagg_0872 details

Gene Information

Locus tag: Cagg_0872
Symbol:
ID: 7268323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1081758
End bp: 1083455
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 59%
IMG OID: 643565718
Product: peptidase M9A collagenase domain-containing protein
Protein accession: YP_002462227
Protein GI: 219847794
COG category:
COG ID:
TIGRFAM ID:
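
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is an illustration only, not part of the original record. It assumes that the start and end positions are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1081758
end_bp = 1083455
reported_gene_length = 1698      # bp
reported_protein_length = 565    # aa

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop
# codon, which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length:", gene_length, "bp")
print("leftover bases after splitting into codons:", remainder)
print("protein length:", protein_length, "aa")
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.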


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0507103
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGTACC TGATCGTCGG GTTCGTTGTT CTGATCCTGC TTATTCCGTC CGGTTCGCGG 
CTTCCACTGG CGGTTACCCC GGCTTATCCA CCACCGATCC GCCTGCCGGC GTGCCCACAG
CCCACTGTGG CCGATGTCGA CGCAATGCTC GCGTTGTTGC CACAGGCTGG CTATGATTGC
ACTGAGCAGA TCGCGGTAGC ACTTCGCCCC CGCATCGAGC CGGTATACGT GCAGGAGTTA
CTCGCGATTG CTGTTGAGCC GACCTTCGAT ACCCGTACCC GCCGCAATGC ACTACGCATT
TTGGGCCGCT TGGCCGAAAG TGGGCCGGTT ACACGTGCTC GTGAGTTGAT GGTGCAGCAG
CAGTCGATGG TGCAGATGAC GGCGCTCACC CTCCTTGAAC GCGAACGCGA CAATTTTCTG
TTGCAAGATG CGGTCTGGCT GCTCGACAGC CACTACTATC CAAGCTGGGT AGCAGCACCG
GCACTCGAAC AGATTGCTCT CGGTGGCGAG TATGCGCCGG CGTTGCGTTA CCGCGCCGCC
CGTGCTCGTG CTCGGCTCAT CGCTGCTGAA TACGGACCAT TGCGTGATGA TTCGCAGCGT
TTCATCGTGG CGGCACTGCG CAGCGCCGAT CCCGGTGTAC GCACCGCTGC CGCTGAAGCG
ATTAGTTTCT TGCGCGACGA CCAATTGACA GCACGGACAG ATTGGTTACA GCTTGTGGAG
GAAGCGCTGG TCCACGAGCC GCCGTTGCAC GTAGCTACCG ATAGCGGCGA TCCACGTGGA
GCCGCACTGT TAACCTTCCT TGAGAGTACG CCAACGACGC TGACGGCACG GGCTGCATTA
GCACGCGCTG CCGACCGATT GGCCGGCGAA ACTGCGACGG TGCCACGGCT CAATGCCTTG
CGCATAGCAT ACGAACACCT TGCCCTACCG TTGCAACACG AAAGTGCAAC AGTCGTGCTG
CGCACCGGTC CTGCCGAGAC GAGCGATGGC GATGAACTGT TAGCGATCGT AGCTACAACG
TATGCACAGG CTCGTCGCTT CCTGGGTTCG GTAGGAGAAA CGCCAATCCC CGGCGAAGAG
CATCTCCCAT TACAGGTGCT AATCTTCCCC GGCCAGGCTG CATATCGTGA CTATATGCGC
GCCTTTACCC CCTTCACCGT TGATGTCGAT GGCATTTACG ACGTGCAGCA GAACACGCTC
TACAGCTACC GGCGTCGTGA TGATCAGACT GCGAACACGC TCGCCGAGAC GCTCCGCCAC
GAAGTAGCCC ATGCAGTCAC AGCGGCCTAT CTCTTTCCCG GCCAGTGGCA CACGCCCGGC
TACCACGCCG AACCGAAGGG CTGGTTTGAT GAAGGATTTG CTGAAGTATT GGCCGCACAA
ACCAAACCCG ACGCGCCACT TCAGCCGCAT CCGCGTCATT TGGCGACTAT CTGTGCACAA
CCGCTCAAGC CGGCTCTCGC CGATCTGGTG GCGTTACGTA CCGGGTATGA CCAGTATGGG
ACGTTCGATT ACCCGGCAGC TTGGGCATTG ATGCATTTCT TGCTCGCCGA ACGACCCGCA
GCAGCGGCAG CCTTGATCTC GGCATGGCGC AACCAGACGT ATCATCTAGC TAACTGGCCA
ACGTTGGGTG GTTGGTCAGA TTGGACCAGC GCCGAATCCG ATTGGCACTT TGCGATTGAG
CGTTGGTGTA GGCTATAA
 
Protein sequence
MRYLIVGFVV LILLIPSGSR LPLAVTPAYP PPIRLPACPQ PTVADVDAML ALLPQAGYDC 
TEQIAVALRP RIEPVYVQEL LAIAVEPTFD TRTRRNALRI LGRLAESGPV TRARELMVQQ
QSMVQMTALT LLERERDNFL LQDAVWLLDS HYYPSWVAAP ALEQIALGGE YAPALRYRAA
RARARLIAAE YGPLRDDSQR FIVAALRSAD PGVRTAAAEA ISFLRDDQLT ARTDWLQLVE
EALVHEPPLH VATDSGDPRG AALLTFLEST PTTLTARAAL ARAADRLAGE TATVPRLNAL
RIAYEHLALP LQHESATVVL RTGPAETSDG DELLAIVATT YAQARRFLGS VGETPIPGEE
HLPLQVLIFP GQAAYRDYMR AFTPFTVDVD GIYDVQQNTL YSYRRRDDQT ANTLAETLRH
EVAHAVTAAY LFPGQWHTPG YHAEPKGWFD EGFAEVLAAQ TKPDAPLQPH PRHLATICAQ
PLKPALADLV ALRTGYDQYG TFDYPAAWAL MHFLLAERPA AAAALISAWR NQTYHLANWP
TLGGWSDWTS AESDWHFAIE RWCRL