Gene Acel_0890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0890 
Symbol 
ID4485722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp981548 
End bp982621 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content64% 
IMG OID639729665 
Productprotein of unknown function DUF1100, hydrolase family protein 
Protein accessionYP_872649 
Protein GI117928098 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.712139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGATG AGCGCGTAAG AATCGCGATT GAGAATTGGG GCCCCCGCTT CACGACGAAC 
GGGGTGACGT ACAGCGACTT CCACGAGGTG CTCAGTCGCA TCACCACGTG GGAGGAGTGG
TGCTCGGCGT GGTGCACCGC CGCGGAGCCG TACGTGGAAC TGGGCAGAGC GGCGCTTGAC
GAGGGGCGTA CGTTGTCCGC AGGAGATCAC TTCTCGCAAG CAGCCGTGTA TTATCACTTT
GCGAAGTTTC TCTTCGTCGA TGACATGGAA CAAATGCGCG CTGCGCACCA GAAGGCCGTG
GAGTGTCTCA CCACCGCCCT GCCCTATCTC GATCCGCCGG GTCGGCGCAT CACCGTTCCG
TTTGAAGGCG CGCGGATGGT CGGCATCCTG CGCCTGCCGC GCGGCGAGCC GCCTTTCCCC
GCGGTCATGC TCATTCCGGG GCTTGACTCA ACCAAGGAGG AGTTCCGTTC CACCGAGCAA
CTCTTCCTGC AGCGTGGATT GGCGACATTC TCTGTCGACG GACCCGGCCA AGGCGAGGCC
GAGTACGACT TACCGATCCG GCCGGACTGG GAGGTTCCAG GCGCGGCGTT GCTCGATGCC
CTGGCTTCTC AACCTGAGAT TGATCCAGCC CGGCTGGGGA TCTGGGGCGT TAGTCTCGGA
GGTTATTACG CTCCGCGGCT GGCTAGCGGT GATCAACGTG TCAAGGCGTG TATCGCCCTT
GCCGGACCCT GGAACTTTGG TGCGTGTTGG GACGGACTCA ACGAGTTGAC CCGGGCGGCG
TTCCGGGTCC GATCGCGGAG TCGTTCCGAC GAGGAGGCAC GCGCAAAAGC CGCCCAACTG
ACGCTCGACG GCCGCGCGGA AAGGATTCGC TGCCCCTTAC TCGTCGTTGC CGGAAAGCGG
GATCGACTGA TTCCTTGGCA GGATGCGGTC AGACTCGCCG AGGCGGCAGG GTCGCAGGCG
GAATTGCTCC TGCTGGAGAA CGGAAATCAC GGCGGCATGA ACGTTGCTGC GCAGCATCGG
CAGCGATCGG CGGATTGGAT GGCCCGCATT CTCGGCGGGC GAGTGGCCGG ATGA
 
Protein sequence
MVDERVRIAI ENWGPRFTTN GVTYSDFHEV LSRITTWEEW CSAWCTAAEP YVELGRAALD 
EGRTLSAGDH FSQAAVYYHF AKFLFVDDME QMRAAHQKAV ECLTTALPYL DPPGRRITVP
FEGARMVGIL RLPRGEPPFP AVMLIPGLDS TKEEFRSTEQ LFLQRGLATF SVDGPGQGEA
EYDLPIRPDW EVPGAALLDA LASQPEIDPA RLGIWGVSLG GYYAPRLASG DQRVKACIAL
AGPWNFGACW DGLNELTRAA FRVRSRSRSD EEARAKAAQL TLDGRAERIR CPLLVVAGKR
DRLIPWQDAV RLAEAAGSQA ELLLLENGNH GGMNVAAQHR QRSADWMARI LGGRVAG