Gene Acel_1593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1593 
Symbol 
ID4484646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1791774 
End bp1793312 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content69% 
IMG OID639730377 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_873351 
Protein GI117928800 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.421913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.822638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAGC ATCAGCAGCA CTGGCCGGGT CGACCGCAGT CCGAATCACC GCAGCCGTCG 
GAGAACGCGG CCCGGGAGCC GGCAGGCACG CCGGTTGATA CTCCGGCGAC GGCGAACGCC
TGGACGGCCG GCGGTGCACC CGCGGGTGCG GGGAACCAGG CGGAGGGCGC TAGTGGAGGC
GCTGCGTGGT GGGCGCCGCC TCCGCCGTCC GGCGCGTCGT GGCCCTCCCC TGCCGGATCG
CGGCCGTCGT CCGGCGAAGT GCAGCCGCCC GCCGGTGAGG GTACCGATCC GCTCACGACC
ACGATTCCAT TGGACGTGCC GGCCCCGACT CCACCGGCCG CGGGATTCCC TCCTGCTGGG
GACGTTGCGC CGGCCGGAGG ATTTCCGCCT CCTGAGTCGA GTGGACGGCG CCGGATCTGG
CCCATCGTTG TCGTGGCCGT TCTGCTCGCC CTTGTCGCTG GCGGCGGGTC GGGAGCCGTC
GTCGGCCGCT ACGTGGCTCG GCACACGACG ACAGCGTCGC CGTCGTCGAC GGTGTCGAGT
ACAACGGGGA ATTCGGCGAC ATCGAGTACA GCGCCGACGG TGCCCCCTGC GCCGGCCGGG
TCGATTGCCG CCGTCGCGAA ACAACTCCTT CCCTCGGTTG TCTCCATCAT CGTGACGACC
GACCAGGGCG GCGACGAAGG CACGGGCATC GTGCTGAGCA GCGACGGCTA CATTCTCACG
AACAATCACG TCGTCGAGGC GGCGGCCGGC GGGGGCATCG TTGCGGTCAC AACAGCGGAC
GGCCGATCGA TGCGGGCGCG GATCGTCGGG CGGGATCCCA CATCGGACCT CGCGGTGATT
CAGGCTGTCG GCGTGACGAA TCTGCAACCG GCGGCCATCG GGGACGACCG CACCCTTCAG
GTCGGCCAAC AGGTCATTGC CATCGGATCG CCCCTCGGCC TGTCGAATAC TGTTACGGCA
GGAATTGTGA GCGCGTTAAA CCGCCCGGTC TGTACGCAAA ATTGCTCCGG CGGGGGGACG
ACGCCCACCG TGCTCGACGC CATTCAAACC GACGCGGCCA TCAATCCCGG CAATTCCGGC
GGCCCGTTGG TTGACATGGC CGGCCGGGTG GTCGGCGTCA ACACAGCGAT TGCGACGCTT
GACCAGCAGC CGTTCGGCGG CGGGCAATCG GGGAACATCG GGGTCGGCTT TGCGATTCCG
ATTTCGGAAG CGATGCGGGT CGTCAAGGAG TTGGAGGCGA CTGGGCATGC GACGCACGCT
GTGCTCGGCG TCGGCGTCCG GGATTCGGTC GACCCGACGC TGCAGACGCC GAACGGCTGC
CTGGTCGTCA GCGTGACCGC GGGCGGGCCG GCGGATCGGG CGGGAGTTCG CGTCGGAGAC
GTCATCGTGC AATTCGGTGA TCGCACCATC CGGGATTCGG ATTCGCTCAT TGCGGCGACC
CACGCCGCGG TGCCGAATTC GACGGTGACT ATTGGATTTG TCCGCAACGG CAGCCGGCAC
ACCACGCAAG TCACCCTCGG CTCAGCGAGC TCCGGCTGA
 
Protein sequence
MTEHQQHWPG RPQSESPQPS ENAAREPAGT PVDTPATANA WTAGGAPAGA GNQAEGASGG 
AAWWAPPPPS GASWPSPAGS RPSSGEVQPP AGEGTDPLTT TIPLDVPAPT PPAAGFPPAG
DVAPAGGFPP PESSGRRRIW PIVVVAVLLA LVAGGGSGAV VGRYVARHTT TASPSSTVSS
TTGNSATSST APTVPPAPAG SIAAVAKQLL PSVVSIIVTT DQGGDEGTGI VLSSDGYILT
NNHVVEAAAG GGIVAVTTAD GRSMRARIVG RDPTSDLAVI QAVGVTNLQP AAIGDDRTLQ
VGQQVIAIGS PLGLSNTVTA GIVSALNRPV CTQNCSGGGT TPTVLDAIQT DAAINPGNSG
GPLVDMAGRV VGVNTAIATL DQQPFGGGQS GNIGVGFAIP ISEAMRVVKE LEATGHATHA
VLGVGVRDSV DPTLQTPNGC LVVSVTAGGP ADRAGVRVGD VIVQFGDRTI RDSDSLIAAT
HAAVPNSTVT IGFVRNGSRH TTQVTLGSAS SG