Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0052 |
Symbol | |
ID | 4484748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 57212 |
End bp | 58969 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639728814 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_871814 |
Protein GI | 117927263 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.187685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCAACCA GCGGCGTCGC ATTCCCCGGA TCACGGAAGA CCTACCTCGT GGGGTCGCGG CCCGACATCC GCGTTCCGAT GCGGGAAGTC CCGCTGAGCA CCGGGGACCG GGTGGTCCTC TACGACACCT CAGGGCCCTA CACGGATCCC GATGTGGTGA TCGACGTCCG GCGTGGCCTT CCGCCGCTGC GCGCCCGGTG GATCGACGAG CGCGGCGACA CGGAGACCTA CCAGGGCCGG CCGGTCCGTC CGATCGACAA CGGGTATCGG GACGGCGATC CGCGGGCGAC GCGAAATCTC GACAGCGTTT TCACCGCCAC GCGGCCGCCC CGGCGGGCCA AGCCGGGCCG GCGGGTCACC CAGATGCACT ACGCCCGCCA AGGAATCATC ACTCCGGAGA TGGAATTCGT CGCGCTCCGT GAAGGCGTGC CGCCGGAATT CGTCCGCGAC GAAATCGCGC GCGGACGGGC GGTTCTGCCG GCGAACGTCA ACCACCCGGA GAGCGAGCCG ATGATTATCG GCCGGAATTT CTTGGTCAAG GTGAACGCCA ACATCGGCAA CAGCGCCGTC GCGTCCTCCA TTGAGGACGA GGTCGAGAAG ATGACCTGGG CGACCCGCTG GGGTGCGGAC ACCGTCATGG ATCTCTCCAC CGGGCGGAAC ATCCACACCA CCCGGGAGTG GATCATTCGC AACAGCCCGG TGCCGGTGGG CACCGTGCCG ATTTACCAGG CGTTGGAGAA GGTCGGCGGC AAGGCCGAGG ATCTCTCCTG GGAGGTGTAC CGGGACACGC TGATCGAGCA GTTCGAGCAG GGTGTCGACT ACATCACGGT GCACGCGGGC GTGCTGCTCC GGTACGTGCC GCTCACCGCA CGCCGGAAGA CGGGTATCGT GTCCCGCGGC GGCTCAATCA TGGCGGCCTG GTGCCTCGCC CACCACGAGG AGAATTTCCT CTACACGCAC TTCCGCGAGA TGTGCGAACT CATGGCGCAG TACGACGTCA CCTTCTCCCT CGGCGACGGT CTGCGTCCCG GTTCCATCGC CGACGCCAAC GACGAGGCGC AGTTCGCCGA GCTTCGCACC CTCGGCGAAT TGACGAAAAT CGCGTGGGAG TACGACGTCC AGGTGATGAT CGAGGGTCCG GGTCATGTTC CGATGCACAA AATCAAGGAG AACATGGACC TGCAGCTCGA GCTCTGCTCC GAGGCGCCGT TCTACACCCT CGGCCCGCTC ACCACGGACA TCGCACCCGG CTACGACCAC ATCACCTCCG CCATCGGCGC TGCGATGATC GGGTGGTACG GAACCGCGAT GCTCTGCTAC GTCACGCCGA AGGAGCACCT CGGCCTGCCG AACAAGGACG ATGTCAAGGC CGGGGTCATC GCGTACAAAA TCGCGGCGCA CGCGGCGGAC CTCGCCAAGG GGCATCCCGG CGCCCAGGCC TGGGACGATG CGCTGTCGGA CGCGCGGTTC GAATTCCGGT GGGAGGACCA GTTCAACCTC TCACTGGATC CGGAGACGGC GCGCGATTTC CACGACGAGA CACTGCCTGC TGCGCCGGCG AAGACCGCGC ATTTCTGTTC GATGTGCGGA CCACACTTCT GCTCGATGAA AATCAGCCAA GATGTCCGCA AGTACGCCGA GGAGAAGGGC GTCGACGAGC AGACCGCAAT CGAGCTGGGG ATGCGGGAGA AGGCGGCGGA ATTCACCGCG CACGGCAGCC GGGTCTACGT ACCGGTCGAG GAATTGTCCA CCCGGTGA
|
Protein sequence | MPTSGVAFPG SRKTYLVGSR PDIRVPMREV PLSTGDRVVL YDTSGPYTDP DVVIDVRRGL PPLRARWIDE RGDTETYQGR PVRPIDNGYR DGDPRATRNL DSVFTATRPP RRAKPGRRVT QMHYARQGII TPEMEFVALR EGVPPEFVRD EIARGRAVLP ANVNHPESEP MIIGRNFLVK VNANIGNSAV ASSIEDEVEK MTWATRWGAD TVMDLSTGRN IHTTREWIIR NSPVPVGTVP IYQALEKVGG KAEDLSWEVY RDTLIEQFEQ GVDYITVHAG VLLRYVPLTA RRKTGIVSRG GSIMAAWCLA HHEENFLYTH FREMCELMAQ YDVTFSLGDG LRPGSIADAN DEAQFAELRT LGELTKIAWE YDVQVMIEGP GHVPMHKIKE NMDLQLELCS EAPFYTLGPL TTDIAPGYDH ITSAIGAAMI GWYGTAMLCY VTPKEHLGLP NKDDVKAGVI AYKIAAHAAD LAKGHPGAQA WDDALSDARF EFRWEDQFNL SLDPETARDF HDETLPAAPA KTAHFCSMCG PHFCSMKISQ DVRKYAEEKG VDEQTAIELG MREKAAEFTA HGSRVYVPVE ELSTR
|
| |