Gene Cagg_1655 details

Gene Information

Locus tag: Cagg_1655
Symbol:
ID: 7268957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2018392
End bp: 2020893
Gene Length: 2502 bp
Protein Length: 833 aa
Translation table: 11
GC content: 59%
IMG OID: 643566497
Product: peptidase U32
Protein accession: YP_002462992
Protein GI: 219848559
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
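
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2018392, 2020893)
print(length, protein_length(length))  # 2502 833
```

With the values from this record, 2020893 - 2018392 + 1 gives 2502 bp, and 2502 / 3 - 1 gives 833 aa.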


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.296071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAAAC CCGAAATTAT GAGTCCGGCC GGCTACTGGC CGCAACTCCA TGCCGCGATT 
GAGGCCGGCG CCGATGCCGT CTATTTTGGT TTGACGCATT TCACGGCACG CGCGAAGGTC
GGTTTTACGC TCGATGAGTT GCCGGAGGTG ATGAGGACAC TCCATCGCCG CGGCGTTAAA
GGGTACGTTA CGTTCAATAC CTTAGTGTTC GACCACGAGC TACGCACGGC GGCGAGGACA
CTTGCAGCGA TCGCCGCAGC CGGCGCCGAT GCGATCATCG TGCAAGATCT CGGTATTGCA
GCACTGGCCC ATCAGATTGT GCCCGATCTC CCTATTCACG GGAGCACGCA GATGAGCATC
ACCAGTGCGG AAGGAGTAGC TTTTGCCGCC CGCTACGGCG TCTCGCGGGT GGTCTTGGCC
CGCGAGTTGT CGCTGGCCGA GGTTGCCGCC ATTCGTTCAC GTAGCCCTAT CGAACTCGAA
ATCTTTGTCC ATGGCGCCCT CTGCGTCTCA TATTCGGGAC AGTGTTTTTC GTCAGAGGCA
TGGGGCGGAC GCAGTGCGAA CCGTGGTCAA TGCGCTCAAG CATGTCGCTT ACCCTACGAG
TTGATCGTCG ACGGAAAACC ACGACCCCTC GGCGCGGCGC GCTATTTGCT CTCACCCGGC
GATCTGGCCG CTATTGATGA TATGACAACT ATCGCTCGCC TGGGCGTGAG TGCGCTCAAG
ATCGAAGGGC GGTATAAAGA TGCTGAATAC GTTGCGATCA CAACCAACGC ATATCGCCGC
GCACTCGATG CGGTATGGGC CGGGCTTCCT TCCGATTTGA CCGTTGCCGA CCGGCTGTAC
CTCGAACAAG TGTACTCTCG TGGGCTGGGA CCGCACTTTT TGCGCGGCAC GAATCATCAG
GCTGTCGTCG AAGGGCGCGC CCCTCGCCAC CGCGGTTTGT TGATGGGACG GGTGGTGCGG
GTGCGTGCAG ACGCGATCAT TATCATACCT GAACGGGGCC GCGAGGCGGC ACCGCTCAAG
CCTGGCGATG GTGTTGTCTT TGATGCCGCC GATTGGCGCA GCCCGGAAGA GCCAGAAGAG
GGTGGTCGCA TCTTTACGGT TGAACCGGTC GGTGATGGAT TGCTGGCAAT TCGTTTCGCA
AAGGGGGCAA TTAATCCACG CCGTATTCGC GCCGGAGATC TGCTGTGGCG CACAAGTGAT
CCGCAAACCG AGCGAATAGC CCGCCCCTTC GTGCAAGCGG CTGCACCGGC ACGCCGCCAA
CCGGTGCGCG TCACAGCGCT GGTACGCGCC GGTGCGCCGC TTGAACTACA CTGGTCGCTT
ATTGCCCAAC CGAGCTTGAC CGTAACGGTA CAGAGTCCGA CACCTCTGAC CACAGCCCAG
AATCGGCCAC TCGATGAAAC GACGCTGCGT GAGCAATTGG GGCGGTTAGG CGATACTCCT
TACCAACTGA CCGAACTCAA CGCGGTCATT GAAGGCAACC CGTTTGTGCC GGTATCGCTG
CTCAACCAAT TGCGCCGGCA AGCCACCGCT GCGCTCGCCG AGTTGCAAGG ACGGCCACCG
GCAATGAACA TCCTCGATCC GGAGCAGGTG TTAGATACCA TGCTCGCTGC CGTTGCCCCA
TCCGCACCAC CTGACACTGC TCAGATTCAC CTCCTGATCC GTTCACCCGA TCAACTCGAA
GCGGCGATTG CGCTACAGCC GGCCAGTATT ACGCTCGATT ATCTCGACCT TGAAGGATTG
AAGCCGGCCG TGACCCGCGT GCGCGCTGCC GGGATCGCAG TCCGTGTTGC CGCACCGCGG
GTGCTCAAAC CGGAAGATGA GCGAGTAGCC CGCTTTTTAC GTAAACTCAA TGTACCGCTG
CTCGTGCGCT CGACCGGCTT GCTCGACAGG CTGCGCGACG ATCCGACGGT TGAGTTGACC
GGTGATTTTA GCCTCAACGC AGCCAATATC CTCACCGCCG ATCTGCTGCT GCGGTCGGGA
TTACAACGGC TCACGCTGAC CCACGATCTC AACGCCGAGC AGATCGCGCA CTTAGCCGAA
CGGATCGGCG GTAGTCGGCT AGAAGCCATC GTCTATCACC ATCTCCCTGT CTTTCATACC
GAGCACTGTG TCTTTTGTCG TTTCTTATCA ACTGGAACCA GTTACAAAGA CTGCGGTCGC
CCGTGTGAAC GTCACCACGT TGCGCTGCGC GACACTCACG GTCGGGCACA CCCGGTCATT
GCTGATGTTG GTTGCCGCAA CACGGTCTTT GGGGCTGAGG CGCAAGAAGC GAGCAAATAT
CTCGACCGCT GGCGCGCTGC CGGGATTGCT CACTACCGGC TTGAATTTGT CCATGAAACA
GCAGCGCAGA TTACTGCGGT GACAGAAGCC TTCCGTGCGT ATTTGCAGGA AGAGATTGAT
GCTGCTGAGC TAGGGCGACG TTGGCGACAA AGCGCACCAC AAGGCGTCAC CGAAGGTAGT
TTCTTCGTAC CGGCAAATTA TCAGTACATT CCACTGATGT GA
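
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any computation such as GC content. The following is a minimal Python sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the way the database rounds its reported GC content is not stated in this record, so the result may differ slightly from the listed figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGCATAAAC CCGAAATTAT GAGTCCGGCC GGCTACTGGC CGCAACTCCA TGCCGCGATT"
print(len(clean_sequence(first_row)), round(gc_percent(first_row), 1))
```

Passing all of the gene sequence lines as one string gives the GC content of the whole gene.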
 
Protein sequence
MHKPEIMSPA GYWPQLHAAI EAGADAVYFG LTHFTARAKV GFTLDELPEV MRTLHRRGVK 
GYVTFNTLVF DHELRTAART LAAIAAAGAD AIIVQDLGIA ALAHQIVPDL PIHGSTQMSI
TSAEGVAFAA RYGVSRVVLA RELSLAEVAA IRSRSPIELE IFVHGALCVS YSGQCFSSEA
WGGRSANRGQ CAQACRLPYE LIVDGKPRPL GAARYLLSPG DLAAIDDMTT IARLGVSALK
IEGRYKDAEY VAITTNAYRR ALDAVWAGLP SDLTVADRLY LEQVYSRGLG PHFLRGTNHQ
AVVEGRAPRH RGLLMGRVVR VRADAIIIIP ERGREAAPLK PGDGVVFDAA DWRSPEEPEE
GGRIFTVEPV GDGLLAIRFA KGAINPRRIR AGDLLWRTSD PQTERIARPF VQAAAPARRQ
PVRVTALVRA GAPLELHWSL IAQPSLTVTV QSPTPLTTAQ NRPLDETTLR EQLGRLGDTP
YQLTELNAVI EGNPFVPVSL LNQLRRQATA ALAELQGRPP AMNILDPEQV LDTMLAAVAP
SAPPDTAQIH LLIRSPDQLE AAIALQPASI TLDYLDLEGL KPAVTRVRAA GIAVRVAAPR
VLKPEDERVA RFLRKLNVPL LVRSTGLLDR LRDDPTVELT GDFSLNAANI LTADLLLRSG
LQRLTLTHDL NAEQIAHLAE RIGGSRLEAI VYHHLPVFHT EHCVFCRFLS TGTSYKDCGR
PCERHHVALR DTHGRAHPVI ADVGCRNTVF GAEAQEASKY LDRWRAAGIA HYRLEFVHET
AAQITAVTEA FRAYLQEEID AAELGRRWRQ SAPQGVTEGS FFVPANYQYI PLM
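
The protein sequence follows from the gene sequence by translation with table 11 (bacterial, archaeal and plant plastid code). The following is a minimal Python sketch, assuming the sequence is read in frame from its first base. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. This gene begins with ATG, so no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCATAAAC CCGAAATTAT GAGTCCGGCC"))  # MHKPEIMSPA
```

The output matches the first ten residues of the protein sequence listed in this record.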