Gene Acel_0052 details

Gene Information

Locus tag: Acel_0052
Symbol:
ID: 4484748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 57212
End bp: 58969
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 66%
IMG OID: 639728814
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_871814
Protein GI: 117927263
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
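
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based, inclusive coordinates and that the CDS ends with a single stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 57212
end_bp = 58969
gene_length_bp = 1758
protein_length_aa = 585

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein length.
coding_codons = codons - 1

print(span_bp, remainder, coding_codons)  # 1758 0 585
```

Under these assumptions the span equals the listed gene length (1758 bp), and 586 codons minus one stop codon gives the listed protein length (585 aa).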


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.187685
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCAACCA GCGGCGTCGC ATTCCCCGGA TCACGGAAGA CCTACCTCGT GGGGTCGCGG 
CCCGACATCC GCGTTCCGAT GCGGGAAGTC CCGCTGAGCA CCGGGGACCG GGTGGTCCTC
TACGACACCT CAGGGCCCTA CACGGATCCC GATGTGGTGA TCGACGTCCG GCGTGGCCTT
CCGCCGCTGC GCGCCCGGTG GATCGACGAG CGCGGCGACA CGGAGACCTA CCAGGGCCGG
CCGGTCCGTC CGATCGACAA CGGGTATCGG GACGGCGATC CGCGGGCGAC GCGAAATCTC
GACAGCGTTT TCACCGCCAC GCGGCCGCCC CGGCGGGCCA AGCCGGGCCG GCGGGTCACC
CAGATGCACT ACGCCCGCCA AGGAATCATC ACTCCGGAGA TGGAATTCGT CGCGCTCCGT
GAAGGCGTGC CGCCGGAATT CGTCCGCGAC GAAATCGCGC GCGGACGGGC GGTTCTGCCG
GCGAACGTCA ACCACCCGGA GAGCGAGCCG ATGATTATCG GCCGGAATTT CTTGGTCAAG
GTGAACGCCA ACATCGGCAA CAGCGCCGTC GCGTCCTCCA TTGAGGACGA GGTCGAGAAG
ATGACCTGGG CGACCCGCTG GGGTGCGGAC ACCGTCATGG ATCTCTCCAC CGGGCGGAAC
ATCCACACCA CCCGGGAGTG GATCATTCGC AACAGCCCGG TGCCGGTGGG CACCGTGCCG
ATTTACCAGG CGTTGGAGAA GGTCGGCGGC AAGGCCGAGG ATCTCTCCTG GGAGGTGTAC
CGGGACACGC TGATCGAGCA GTTCGAGCAG GGTGTCGACT ACATCACGGT GCACGCGGGC
GTGCTGCTCC GGTACGTGCC GCTCACCGCA CGCCGGAAGA CGGGTATCGT GTCCCGCGGC
GGCTCAATCA TGGCGGCCTG GTGCCTCGCC CACCACGAGG AGAATTTCCT CTACACGCAC
TTCCGCGAGA TGTGCGAACT CATGGCGCAG TACGACGTCA CCTTCTCCCT CGGCGACGGT
CTGCGTCCCG GTTCCATCGC CGACGCCAAC GACGAGGCGC AGTTCGCCGA GCTTCGCACC
CTCGGCGAAT TGACGAAAAT CGCGTGGGAG TACGACGTCC AGGTGATGAT CGAGGGTCCG
GGTCATGTTC CGATGCACAA AATCAAGGAG AACATGGACC TGCAGCTCGA GCTCTGCTCC
GAGGCGCCGT TCTACACCCT CGGCCCGCTC ACCACGGACA TCGCACCCGG CTACGACCAC
ATCACCTCCG CCATCGGCGC TGCGATGATC GGGTGGTACG GAACCGCGAT GCTCTGCTAC
GTCACGCCGA AGGAGCACCT CGGCCTGCCG AACAAGGACG ATGTCAAGGC CGGGGTCATC
GCGTACAAAA TCGCGGCGCA CGCGGCGGAC CTCGCCAAGG GGCATCCCGG CGCCCAGGCC
TGGGACGATG CGCTGTCGGA CGCGCGGTTC GAATTCCGGT GGGAGGACCA GTTCAACCTC
TCACTGGATC CGGAGACGGC GCGCGATTTC CACGACGAGA CACTGCCTGC TGCGCCGGCG
AAGACCGCGC ATTTCTGTTC GATGTGCGGA CCACACTTCT GCTCGATGAA AATCAGCCAA
GATGTCCGCA AGTACGCCGA GGAGAAGGGC GTCGACGAGC AGACCGCAAT CGAGCTGGGG
ATGCGGGAGA AGGCGGCGGA ATTCACCGCG CACGGCAGCC GGGTCTACGT ACCGGTCGAG
GAATTGTCCA CCCGGTGA
 
Protein sequence
MPTSGVAFPG SRKTYLVGSR PDIRVPMREV PLSTGDRVVL YDTSGPYTDP DVVIDVRRGL 
PPLRARWIDE RGDTETYQGR PVRPIDNGYR DGDPRATRNL DSVFTATRPP RRAKPGRRVT
QMHYARQGII TPEMEFVALR EGVPPEFVRD EIARGRAVLP ANVNHPESEP MIIGRNFLVK
VNANIGNSAV ASSIEDEVEK MTWATRWGAD TVMDLSTGRN IHTTREWIIR NSPVPVGTVP
IYQALEKVGG KAEDLSWEVY RDTLIEQFEQ GVDYITVHAG VLLRYVPLTA RRKTGIVSRG
GSIMAAWCLA HHEENFLYTH FREMCELMAQ YDVTFSLGDG LRPGSIADAN DEAQFAELRT
LGELTKIAWE YDVQVMIEGP GHVPMHKIKE NMDLQLELCS EAPFYTLGPL TTDIAPGYDH
ITSAIGAAMI GWYGTAMLCY VTPKEHLGLP NKDDVKAGVI AYKIAAHAAD LAKGHPGAQA
WDDALSDARF EFRWEDQFNL SLDPETARDF HDETLPAAPA KTAHFCSMCG PHFCSMKISQ
DVRKYAEEKG VDEQTAIELG MREKAAEFTA HGSRVYVPVE ELSTR
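
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of the standard codon table (translation table 11 uses the same assignments and differs in which codons may start translation) and assumes the first codon of the input is the initiator codon, which is why the leading GTG is reported as M. The function names `clean_sequence` and `translate_cds` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block):
    """Remove the spaces and line breaks used to display the sequence in groups of ten."""
    return "".join(block.split()).upper()


def translate_cds(block):
    """Translate a coding sequence that begins at its start codon.

    Assumption: the first codon is the initiator, so it is always written as M
    even when it is an alternative start codon such as GTG.
    """
    dna = clean_sequence(block)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if protein:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()  # drop the trailing stop codon
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_line = "GTGCCAACCA GCGGCGTCGC ATTCCCCGGA TCACGGAAGA CCTACCTCGT GGGGTCGCGG"
print(translate_cds(first_line))  # MPTSGVAFPGSRKTYLVGSR
```

The printed residues correspond to the first two groups of the protein sequence above (MPTSGVAFPG SRKTYLVGSR). Passing the whole gene sequence block to the same function would be the natural next step for comparing against the full 585 aa listing.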