Gene Cagg_1721 details

Gene Information

Locus tag: Cagg_1721
Symbol:
ID: 7269427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2105163
End bp: 2106503
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 56%
IMG OID: 643566563
Product: carboxyl-terminal protease
Protein accession: YP_002463058
Protein GI: 219848625
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
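
The start, end, gene length and protein length listed above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check, with illustrative variable names:

```python
# Values copied from the record above. Treating Start/End as 1-based,
# inclusive coordinates is an assumption; the record does not say so.
start_bp = 2105163
end_bp = 2106503

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # count includes the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # prints: 1341 446
```

Under that reading, 1341 bp is exactly 447 codons, and dropping the stop codon gives the 446 aa listed.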


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACGGG TGTGGGCAAT TGTTGGTGTC ATCGTTCGGT GGGTCGCCGT GATCGCTGTG 
GCTTTTCTTG GTGGTTGGAT CACCGGGCGG ATTGTTGGCG TCTCTCCGAT CGATGTCTTG
ATCACCGGTG TCTCTAACGT CGATACGCGA TTGCTCACCC CCGGTGACCG CCGACAACAG
TTTGCCGTCT TCTGGGACGT GTGGGATTTG GTTGAAGGCA ACTTCTACCA GCCGCAAGCT
ATCGATCGGC AGAAGATGGT ATACGGTGCG ATCCGTGGTA TGCTGGCAAC GCTCAACGAT
CCGTATACCT TCTTTCAAGA GCCAGAAGAA GCGCAACAAA ATCGGGAGTC GATGGAGGGC
CGCTTTGAAG GCATTGGTGC TTATCTGCGG GTTGAAAATG GCCAAATCAT CATCGACCGT
CCAATCCGGA ATTCGCCTGC CGAACAAGCC GGTATTCAAG CGGGTGACAT CATTCTGGCA
GTAGATGATC AACCACTGGC CGAGTTAATA GCCGGCTTGA GCGACCAAGA AGCAAGCGCT
CGTGCAGTAA GCCTTATTCG TGGTCCGGCC GGAACGGTCG TTCGCTTAAC CATTCACCGA
CCTGCCGAAG ATCGTGTCTT TACCGTTGCC ATCACGCGCG CGGCCATTCC GCTCATCACC
GTCAATAGCA CGCTCTTACC CGACCGCATT GCCTATATTC AGATTACCGA ATTTAAGGCC
ACAACCACTG AGTTGCTCGA CCAGGCGATT GCCGAGTTGC TCCCACAACA ACCCCGGGCA
ATTGTGCTCG ATCTGCGTAA TAATTCGGGC GGTTTTCTGA CTACTGCGCA AGAGGTGCTC
GGTCGGTTTT ACGACGGCGT GGCGCTCTAT GAAGAAGAGC GTAGTGGCGT CAACAAAGAG
TTACGCACCA TCACGGCACC GGCTAACCGG CGACTATACG GTATTCCAAT GGTTGTCTTG
GTGAACGGCG GGTCGGCTAG TGCCGCTGAG GTGGTTGCCG GCGCGCTGCG TGATGTCCGT
CCGAATACGG TCTTGCTCGG TGAGAAGACC TTTGGCAAGG GGTCGGTTCA GAACATCTAT
CCCCTGCGTG ACGGGAGCAG TGTGCGCATC ACTATCGCAC GTTGGCTGAC GCCGTCCGGT
GAAGCGATTA ACGGCGTTGG GATTACACCG GAGCACGTCG TACCGGCCGC GAACGATCCG
ATCTATCAGG TACCGTGTGT GCCGGACCGA CCCAACGATA CTGGTTGTGC GGATGCGCAA
TTGTACTGGG CGCTCAAACT GTTACGTGAT GGGACGCCGC CACCACTGCC GGTACCTGTT
GAAACAGTTA CGGCCCCTTG A
 
Protein sequence
MERVWAIVGV IVRWVAVIAV AFLGGWITGR IVGVSPIDVL ITGVSNVDTR LLTPGDRRQQ 
FAVFWDVWDL VEGNFYQPQA IDRQKMVYGA IRGMLATLND PYTFFQEPEE AQQNRESMEG
RFEGIGAYLR VENGQIIIDR PIRNSPAEQA GIQAGDIILA VDDQPLAELI AGLSDQEASA
RAVSLIRGPA GTVVRLTIHR PAEDRVFTVA ITRAAIPLIT VNSTLLPDRI AYIQITEFKA
TTTELLDQAI AELLPQQPRA IVLDLRNNSG GFLTTAQEVL GRFYDGVALY EEERSGVNKE
LRTITAPANR RLYGIPMVVL VNGGSASAAE VVAGALRDVR PNTVLLGEKT FGKGSVQNIY
PLRDGSSVRI TIARWLTPSG EAINGVGITP EHVVPAANDP IYQVPCVPDR PNDTGCADAQ
LYWALKLLRD GTPPPLPVPV ETVTAP
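
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as starts. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the code assumes the input contains only A, C, G and T.

```python
# Codon table built in the conventional T, C, A, G order.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display above."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

# First 30 bases and final 21 bases of the gene sequence above.
print(translate("ATGGAACGGG TGTGGGCAAT TGTTGGTGTC"))  # MERVWAIVGV
print(translate("GAAACAGTTA CGGCCCCTTG A"))           # ETVTAP
```

Both fragments reproduce the matching ends of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.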