Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0024 |
Symbol | |
ID | 4808789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 30415 |
End bp | 31266 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105434 |
Product | biotin biosynthesis protein BioC |
Protein accession | YP_001036459 |
Protein GI | 125972549 |
COG category | [R] General function prediction only |
COG ID | [COG4106] Trans-aconitate methyltransferase |
TIGRFAM ID | [TIGR02072] biotin biosynthesis protein BioC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00323673 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATAGATA AAAAGATTCT TCAAATGCAC TTTAGCCGTA ATGCTAAAAA CTATGACGCA TACGCGAAAG TTCAAAAGAA AATGGCAAAT ACTCTGCTGG ATATGCTGGA CTTGGATAGT AAGAGCAGAT TGGATATTCT GGATGTGGGC TGTGGCACAG GGTATCTTAC GAAGCTTTTG CTTGACCGTT GGCCGGATGC CCGAATAACG GCAATAGATA TTGCTCCGGG AATGATTGAA TATGCACGGG ACAGATTTAA CGAGAGCAAT GTGGAGTTTG CCTGCCTTGA TATTGAAGAA GCGGAACTTA ACCAAAAATA TGACCTTGTT ATTTCAAATG CCACTTTTCA ATGGTTTAAC GACTTAGGAG GCACTGTCAA TAAGCTTGTT CAAAGCCTTA AAAGTGACGG AGTACTTGCC TTTTCCACTT TTGGGCATAT GACTTTTAGC GAACTTCATT TTTCTTATGA AACTGCCCGC AGAAAACTTA AAATAGACGA AGAATTTCCG CCAGGCCAAA AATTCTGTAA CGCAAAAGAG ATACTTAAGA TTTGCTGTGA GACATTTGAA GGGCTTGAGG GCTTTGAATT TGATACCGTC AAAAAAGAAA GTCTCGAATA TGAGTATTTT TACACGGTAA GAGAATTTTT AGATTCCGTG AAAAAAATCG GAGCGAACAA CAGTAACAAA CAAAGAAAGG TAAACACCGC TCTTACGAAA GAAATGATAC GGATTTACGA GGAGATGTTT AAAGTAAATG GATTGGTGAG GGCAACCTAT CATTGCATCT TCATAACATC AAGAAAGAAA CTTGCGGCAA ATACAAGAAG GCTGGTGAAC GCAGTTGTAT GA
|
Protein sequence | MIDKKILQMH FSRNAKNYDA YAKVQKKMAN TLLDMLDLDS KSRLDILDVG CGTGYLTKLL LDRWPDARIT AIDIAPGMIE YARDRFNESN VEFACLDIEE AELNQKYDLV ISNATFQWFN DLGGTVNKLV QSLKSDGVLA FSTFGHMTFS ELHFSYETAR RKLKIDEEFP PGQKFCNAKE ILKICCETFE GLEGFEFDTV KKESLEYEYF YTVREFLDSV KKIGANNSNK QRKVNTALTK EMIRIYEEMF KVNGLVRATY HCIFITSRKK LAANTRRLVN AVV
|
| |