Gene Acel_0470 details

Gene Information

Locus tag: Acel_0470
Symbol:
ID: 4484791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 504502
End bp: 505737
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 71%
IMG OID: 639729237
Product: hypothetical protein
Protein accession: YP_872230
Protein GI: 117927679
COG category: [S] Function unknown
COG ID: [COG5282] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03624] putative hydrolase
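
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes one stop codon (neither is stated in the record). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(504502, 505737)
print(length)                  # 1236
print(protein_length(length))  # 411
```

Under these assumptions, 1236 bp corresponds to 412 codons, of which the last is the stop codon, leaving 411 residues.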


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.634716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.62944
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCAAT TCCCGTTCGG CTTCGGTAAA CCGGACGACG ACCAGCCGGG GTTCGATTTC 
GGTTCCCCCG CCGACCTGAG TGCCGCCCTG CACCGATTCG CCGATCTGCT CTCTTGGTCC
GGCGGGACGG TCAATTGGGA TCTCGCGCGG GATGTCGCCC GGCAATCTAT CGCCGCCGAC
GACGTCTCGG TCACCGAAGC CGACCAGCGT GACGTCGCCG AGAGCCTGCG GCTCGCCGAC
CTGTGGCTGG AGCCGCACAC CGAACTGCCC GGGGCGATAC GCGCCGCGCG GGCCTGGAGC
CGCGCCGAAT GGATCGAAGC AACCCTGCCG ACGTGGCGGG AACTCGTCGA ACCCGTCGCA
GCCCGGGTCG TTGAGGCGTT CGGCAGCGAG CTGAGCGCGG GGCTCGCCGG AGAGGCGGCC
GGTCCCGACG CGGAGCTCCC GGCTCAGCTT CAACAGCTCA CCGGGCCGCT CGCGGCAATG
ATGCGTCAGG TCGGCGGGGT GATGTACGGC GGTCAGGTCG GCCAGGCGCT CGGCGCCCTG
GCCCGCGACG TCGTCAGTTC CACGGACGTC GGTCTCCCTC TCGCCGGAGC CGGACAGGCC
GCCCTCCTCC CCGCCGGGGT CGCCGCGTTC GCCGCCGGTC TGGACGTCCC ACTCACCGAA
GTGCGGATTT TCCTCGCCCT ACGGGAGGCC GCGTATCACC GGCTGTACGC CGGCACGCCA
TGGCTGCGGT CCCGGTTGAT CGGTGTCCTC GAGGAGTACG CCCGCGGGAT CACGATTGAC
ACCGAACGGA TCCGCCAGGC GATGGAGTCC ATCGATCCGA CGCATCCGGA GACCCTGCAG
GATGCCCTGA TCGGCGGCCT TTTCGAGCCG CAGCGCACCC CCGCGCAGCA GGCGACCCTC
GACCGATTGG AAACGCTGCT TGCGCTGATC GAAGGCTGGG TGGACGAAGT CACCGACCAG
GCGGCCCGCG AGCATCTGCC AGCGGCGGCT GGTCTCATCG AAATGGTGCG CCGGCGGCGG
GCGACCGGCG GCCCCGCGGA GCAGACTTTT GCCGCCCTGG TCGGCCTGGA ACTTCGGCCG
CGCCGGCTCA GGGACGCCGC AGCGCTCTGG GCTGCGGTGC GTCACGCCCG TTCAGTCGCC
GGCCGGGACG CGCTGTGGCG CCATCCCGAC CTGTTGCCCA CCGCCGAGGA TCTCGCCGAT
CCGCTCGGGT TCGTCGAAGG GCTGGACGAG CAGTGA
 
Protein sequence
MAQFPFGFGK PDDDQPGFDF GSPADLSAAL HRFADLLSWS GGTVNWDLAR DVARQSIAAD 
DVSVTEADQR DVAESLRLAD LWLEPHTELP GAIRAARAWS RAEWIEATLP TWRELVEPVA
ARVVEAFGSE LSAGLAGEAA GPDAELPAQL QQLTGPLAAM MRQVGGVMYG GQVGQALGAL
ARDVVSSTDV GLPLAGAGQA ALLPAGVAAF AAGLDVPLTE VRIFLALREA AYHRLYAGTP
WLRSRLIGVL EEYARGITID TERIRQAMES IDPTHPETLQ DALIGGLFEP QRTPAQQATL
DRLETLLALI EGWVDEVTDQ AAREHLPAAA GLIEMVRRRR ATGGPAEQTF AALVGLELRP
RRLRDAAALW AAVRHARSVA GRDALWRHPD LLPTAEDLAD PLGFVEGLDE Q
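
The gene and protein sequences can be cross-checked by translating the first codons of the gene. The following is a minimal sketch in plain Python, assuming the codon assignments of the standard genetic code (translation table 11 uses the same assignments and differs only in the permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are illustrative and not part of any library.

```python
# Standard codon assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence as printed in the record.
first_row = "ATGGCCCAAT TCCCGTTCGG CTTCGGTAAA CCGGACGACG ACCAGCCGGG GTTCGATTTC"
print(translate(first_row))  # MAQFPFGFGKPDDDQPGFDF
```

The translated prefix matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.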