Gene Acel_1148 details


Gene Information

Locus tag: Acel_1148
Symbol:
ID: 4484636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1275163
End bp: 1276581
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 69%
IMG OID: 639729923
Product: hypothetical protein
Protein accession: YP_872906
Protein GI: 117928355
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
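
The Start bp, End bp, Gene Length, and Protein Length fields are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of this length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop else codons


length = gene_length_bp(1275163, 1276581)
print(length)                           # 1419
print(expected_protein_length(length))  # 472
```

Both values agree with the 1419 bp and 472 aa listed in the record.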


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0910313
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCAT TCTCGCCGCA GGACGCCGCC GAGACGATTC TCGAGCGTGC GCGGGGCACC 
TGCGACGTCG ACGGGTGCAT TGTGCTGGTC GACGAGGACA GTTCCGCGAA TCTGCGCTGG
GCGAATAACA CGCTGACGAC GAACGGCGTC ACCCGGCAAC GGCGCATTAC GGTCATTGCG
ACGGTGCCGC AGCACGACGG CGCCGGCACC GGTATCGCCG CGGGTGTCGT TTCCCGCAGC
GTCACCGCCG ACACCGATGT CACCGAATTG GTGGACGCCG CGGTCGGCGC CGCCCGGTCG
AGCACTCCAG CCGAGGACGC CGCGCCGCTC GTCGAACCCG GCGACCGGAA TTCCCCCGCC
TGGTCCGAAC CTCCGCATGA GACGTCCGGC GGTGTCTTCA CCGACCTGGT TGCCGGGCTC
GGCGCGGAGC TGAGCCGGGC GGACAAGGAG GGCCGGCTGC TGTTCGGCTT CGCCGAGCAT
CAGATGCGCA CGACGTACCT GGCGACGTCA ACGGGTGTGC GGCTCCGGCA CGACCAGCCG
ACCGGAAAGA TCGAGCTGAC CGGCCGGTCC ACCGACCACC GGCGGTCGGC CTGGGTCGCG
GCCGGGACCC GTACCTTCCA GGACGTCGAC ATTGCCGCTC TGCACGCCGA CATCGCGCAG
CGGCTCGGCT GGGCTGAGCG GTCGATCGAG CTTCCGCCCG GCCGGTACGA GACGCTGCTC
CCCCCGACGG CTGTCGCGGA CTTGATGATT TACCTGTACT GGTCGGCCGG TGCGCGGGAC
GCCCATGACG GTCGTACGGT CTTCAGCAAG CCGGGCGGCG GCACCCGTGT CGGCGACCAG
CTCACCGGCG TACCGCTTAC CCTGCGGAGC GATCCGTTCG AGCCTGGTCT GGAATGCGAG
CCGTTTGTCA TCGCGCACAC GTCCACCGGG GAGAGCTCGG TTTTCGACAA CGGCCTTCCG
CTCGGCCCGA CCGCATGGAT CGACCGTGGC AGGCTCGCGG CGCTGCTGCA GACCCGCTAT
TCGGCCCGGC TCACCGGGCT GCCGGTGACG CCGGCGATCG ACAACCTCAT CCTCACGCAC
GCCGACGGGC AAGCGACGTT GTCGGAAATG ATCGCATCGA GCCGGCGCAG TCTGCTCGTC
ACGTCGCTGT GGTACATCCG CGAGGTCGAC CCGCAGACCC TCCTCCTCAC CGGGTTGACC
CGCGACGGTC TGTATCTGGT CGAGGACGGC GAGGTGGTCG CCGCGGTGAA CAATTTCCGG
TTCAACGAGA GCCCGGTCGA CCTGCTGGCA CGCGTGACGG AAGGCGGAGG GACGGTGCCG
TGTCTCCCCC GCGAGTGGAA CGACTACTTC ACCCGTACCG CGATGCCGCC GCTGCGGGTC
GCTGACTTCA ACATGAGCAC GGTCTCGCAG GCGTCCTGA
 
Protein sequence
MAAFSPQDAA ETILERARGT CDVDGCIVLV DEDSSANLRW ANNTLTTNGV TRQRRITVIA 
TVPQHDGAGT GIAAGVVSRS VTADTDVTEL VDAAVGAARS STPAEDAAPL VEPGDRNSPA
WSEPPHETSG GVFTDLVAGL GAELSRADKE GRLLFGFAEH QMRTTYLATS TGVRLRHDQP
TGKIELTGRS TDHRRSAWVA AGTRTFQDVD IAALHADIAQ RLGWAERSIE LPPGRYETLL
PPTAVADLMI YLYWSAGARD AHDGRTVFSK PGGGTRVGDQ LTGVPLTLRS DPFEPGLECE
PFVIAHTSTG ESSVFDNGLP LGPTAWIDRG RLAALLQTRY SARLTGLPVT PAIDNLILTH
ADGQATLSEM IASSRRSLLV TSLWYIREVD PQTLLLTGLT RDGLYLVEDG EVVAAVNNFR
FNESPVDLLA RVTEGGGTVP CLPREWNDYF TRTAMPPLRV ADFNMSTVSQ AS
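
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the tool used to produce this record: it strips the 10-character block spacing, computes GC content, and translates up to the first stop codon. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). All names in the code are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# assumed to apply to translation table 11 as well).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGGCCGCAT TCTCGCCGCA GGACGCCGCC GAGACGATTC TCGAGCGTGC GCGGGGCACC"
print(translate(first_line))  # MAAFSPQDAAETILERARGT
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.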