Gene Acel_0259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0259 
Symbol 
ID4486330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp277523 
End bp278677 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content64% 
IMG OID639729022 
Producthypothetical protein 
Protein accessionYP_872019 
Protein GI117927468 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.341096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCCCG GGCTCCAGCG TGAGATCGAA GCCAAGGTTC TGGCAGGGGA GCGCCTGAGC 
TTCGCGGACG GCGTCGCCCT CTACGAGTGC GACGACTTGG CCTGGCTCGG TGAGCTCGCC
CACGCCGTGC GGACCAGGCT CAATGGGGAT TACGTCTACT TCAATGTGAA CCGGCACCTG
AACCTGACCA ACGTCTGCGC GGCGTCCTGC GCATACTGCA GTTTTCAACG CAAGCCGGGC
GAGCCGGACG CGTACACGAT GCGGGTCGAG CAGGCGGTCG AGCTCGCCCG GCAGATGGAA
CCGGAAGGCA TCACCGAGCT GCACATCGTC AACGGTTTGC ATCCCACCCT GCCCTGGCGC
TATTACCCCC GCATGCTGCG GGAACTGAAG AAGGCGCTGC CGAACGTGGC GCTGAAAGCG
TTCACCGCGA CGGAAATTCA TTGGTTCGAG AAGATTTCCG GACTCCCGGC CGATGAGATT
CTCGACGAGC TCATCGACGC CGGACTGGAA TCACTGACCG GCGGCGGCGC GGAAATCTTT
GATTGGGAAA TCCGGCGGCG CATCGTCGAC CATGACACCC ACTGGGAGGA CTGGTCACGG
ATTCACCGCC TCGCGCACGC CAAGGGACTC CGGACGCCGT GCACGATGCT CTACGGTCAC
ATCGAAGAAC CGCGGCACCG GGTCGACCAC GTCCTGCGGC TGCGTGAGCT GCAGGACGAG
ACCGGCGGTT TCGTCGTCTT CATCCCGCTG CGTTTCCAGC ACGACCCCAA CGGCGATCCG
CGGAATCGGC TCGCCACTCA GCCGATGGCG ACCGGGGCCG AGGCGCTGAA GACCTTCGCG
GTCTCCCGTC TGCTCTTCGA CAATGTGCCG CACGTCAAGG CGTTCTGGGT GATGCACGGT
TTGACGACGG CTCAGCTGGC GTTGTCGTAC GGCGCGGATG ATCTCGACGG TTCGGTGGTG
GAGTACAAAA TCACCCATGA CGCGGACCAC TACGGAACGC CGAATGTGCT GCACCGGGAA
GACCTTCTCG AACTGATCCG GGACGCTGGT TTCGTCCCCG TGGAACGCGA CACCCGCTAC
AACGTCCTCC GCGTCTATCC CGGTCCGGAT CCGCACCGCC GCGACGTGCC GCAACCGATG
CCGACCGCCG TATGA
 
Protein sequence
MDPGLQREIE AKVLAGERLS FADGVALYEC DDLAWLGELA HAVRTRLNGD YVYFNVNRHL 
NLTNVCAASC AYCSFQRKPG EPDAYTMRVE QAVELARQME PEGITELHIV NGLHPTLPWR
YYPRMLRELK KALPNVALKA FTATEIHWFE KISGLPADEI LDELIDAGLE SLTGGGAEIF
DWEIRRRIVD HDTHWEDWSR IHRLAHAKGL RTPCTMLYGH IEEPRHRVDH VLRLRELQDE
TGGFVVFIPL RFQHDPNGDP RNRLATQPMA TGAEALKTFA VSRLLFDNVP HVKAFWVMHG
LTTAQLALSY GADDLDGSVV EYKITHDADH YGTPNVLHRE DLLELIRDAG FVPVERDTRY
NVLRVYPGPD PHRRDVPQPM PTAV