Gene Acel_1729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1729 
Symbol 
ID4484850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1949446 
End bp1950636 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content67% 
IMG OID639730519 
Productcarboxyl-terminal protease 
Protein accessionYP_873487 
Protein GI117928936 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGATCT CGCGTCGTTC ACGATGGGTT CGTATCGGGG GCGCGGTTCT CGCGCTTGCC 
TCGATCTATA GCGCCGGTGT GGTGACCGGC GTTCTCGGGA GCAGCGGTTC CGCCCCGCAG
CGACCGGCCG CCGCGCCTAG TTCCCCCGGC TTTCTGGACC AGGTGGAACA GACAATTCTG
CGCAACGCCG CGAAACCGGT CACGGCGGAC GAGCTCGATC GCTCCGCTAT CCGCGGAATG
CTCGACGCGC TGGACGACAA ATGGTCCAGC TATTATTCCG CGGCGGATTT TGCCTCGTTC
GAAAATGTCA TGAATGGTCA ATACACCGGT GTCGGTTTGT GGGTGCACCG TGATGCGTCC
GGTGCGGTGA CCGTTCTCAA CGTGCAGGCG GGTTCACCGG CGGATCGAGC CGGTGTGCGC
AGCGGGGACG TCGTTCTTGC CGTCGGCGGC GTCCCGGTTG CCGGGCGGTC GATTGCCGAT
GTCGTGACCG CGCTCCGTGG CGATGCCGGG ACGACGGTCA CCCTCACCTA CCGGCGTGGT
GACGTCGTCC GCACGGTGAC GATGCGGCGG AGCGCGGTGG CGAGTGAGGA TGTCACGGCT
GCCACACAGA ACGGAGTCAT GATCATCAAG GTGAGTGCGT TCAGTCGTGG CGTCGCCAAC
CGGGTTCGCG CGTTGGATTC GCTGGCGCGG ACCCAACGGG ACCGCGGGAT TGTGCTGGAT
TTGCGGGGGA ATCCCGGCGG CCTGCTCGAA GAGGGGGTCC AGACGGCATC GGTGTTTCTT
GACGGCGGCC TGGTGGCCAC GTTCGTACGA CGCGGCGCTC AGCCGGTCGC GCTCAAGGCT
GCCCCAGGGG GCGACATCGC GACGCCGCTG GCTGTTCTCG TGGATGGGGG GACGGCGAGC
GCCGCGGAGA TCGTCGCCGG TGCGCTGCAG GACCGGCAAC GGGCGGTGGT GGTCGGCAGC
CCGACCTTCG GCAAGGGGTC GGTGCAGCAG CCGATTCCGT TGGCCGACGG CTCGGCGATC
GAGTTCACCG TCGGCACGTA TCTCACGCCG GCGGGACGTT CCCTCGACGG GGTTGGGGTG
CAGCCGGATG TCCCGGTCGC GGCGAATGCT CCGCCGTCGC TGGCGCTCGA GGAGGCTGTC
GACGTCATAT CCGGGTTGCT CGCCAACGCG GGTACGAGTG GACACGGTTG A
 
Protein sequence
MRISRRSRWV RIGGAVLALA SIYSAGVVTG VLGSSGSAPQ RPAAAPSSPG FLDQVEQTIL 
RNAAKPVTAD ELDRSAIRGM LDALDDKWSS YYSAADFASF ENVMNGQYTG VGLWVHRDAS
GAVTVLNVQA GSPADRAGVR SGDVVLAVGG VPVAGRSIAD VVTALRGDAG TTVTLTYRRG
DVVRTVTMRR SAVASEDVTA ATQNGVMIIK VSAFSRGVAN RVRALDSLAR TQRDRGIVLD
LRGNPGGLLE EGVQTASVFL DGGLVATFVR RGAQPVALKA APGGDIATPL AVLVDGGTAS
AAEIVAGALQ DRQRAVVVGS PTFGKGSVQQ PIPLADGSAI EFTVGTYLTP AGRSLDGVGV
QPDVPVAANA PPSLALEEAV DVISGLLANA GTSGHG