Gene Information |
Locus tag | Cthe_0020 |
Symbol | |
ID | 4808785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 26797 |
End bp | 27759 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105430 |
Product | biotin synthase |
Protein accession | YP_001036455 |
Protein GI | 125972545 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | [TIGR00433] biotin synthetase |
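
The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon which is not counted in the protein length (the function names are illustrative, not part of any database API):

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 26797
END_BP = 27759


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 963 320
```

Both values agree with the Gene Length (963 bp) and Protein Length (320 aa) rows.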
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000521068 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTTT TAACGCTGAT GGAAAATAAA TTGGACGAAG GCCGGATGAT TACCTTTGAA GAGGCAGTAG AGCTTGCAAA AGGGAAAATT GAGGATGAAA AACTGTTTTT ACTGGCAGAT AAGCTTTGCA AAAAGAATAT GGGCCGAAGA GTGGACCTTT GCTCGATAAT TAACGCGAAA TCGGGAGGAT GTTCGGAGAA CTGCAAGTTC TGTGCCCAGT CGGGGCATTA CGATACAGGA GTTAAAATTT ACCCTTTGCT TGACGTTGAC GATGTGTTAA AAGCGGCAAA GGAAAATGAA AAGGAAGGTG TGCACCGATT TTCGCTTGTA ACCAGCGGGA AAAGCGTTTC AGATGAGGAA TTTGAAAAGA TTCTTGGAAT ATATTCGGTA TTAAGAAAAG AGACAAATTT AAAGCTTTGT GCTTCATTGG GAAGTCTAAG TTATGAGCGG GCGGTAAGGC TTAAAGAAGC GGGGGTATCC ATGTACCACC ATAATATTGA AACTTGCAGG GAGTATTACC CTAAAATTTG TGATACCCAT ACATATGACG ACAGAATAAA CACCGTAAAG AATGCGGCAA AGGCAGGTCT GGAAATTTGC TGCGGCGGAA TAATCGGCAT GGGGGAGAGC ATGGAACAAA GAATCAAGAT GGCGTTTGAA ATAAGGGAGC TTGGCGTAAA ATCGGTTCCA ATCAATGTAT TAAACCCCAT AAAGGGTACA CCTTTCGAGA ATGTCAGAAG TCTTTCTCCC GATGAGATAC TCAGAACTAT AGCCCTTTTC AGATTGATAA TGCCCTATGC CCACATTAGA TATGCCGGAG GGAGGATGTG TTTGGGAGAA CATCAGTCCA AAGGCTTTAA AGCCGGCGTC AGCGCAATGC TGGTGGGAAA CTATCTTACA ACGGTGGGCA ATAAAATTTC GGACGACCTT GAAATGATTC AAAGAATGGG GCTTGAAATA TGA
|
Protein sequence | MDFLTLMENK LDEGRMITFE EAVELAKGKI EDEKLFLLAD KLCKKNMGRR VDLCSIINAK SGGCSENCKF CAQSGHYDTG VKIYPLLDVD DVLKAAKENE KEGVHRFSLV TSGKSVSDEE FEKILGIYSV LRKETNLKLC ASLGSLSYER AVRLKEAGVS MYHHNIETCR EYYPKICDTH TYDDRINTVK NAAKAGLEIC CGGIIGMGES MEQRIKMAFE IRELGVKSVP INVLNPIKGT PFENVRSLSP DEILRTIALF RLIMPYAHIR YAGGRMCLGE HQSKGFKAGV SAMLVGNYLT TVGNKISDDL EMIQRMGLEI
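
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the coding sequence. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not handle). The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input, so the GC value printed is for that prefix and not for the whole gene.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above (a prefix, not the full gene).
prefix = "ATGGATTTTT TAACGCTGAT GGAAAATAAA"
print(translate(prefix))              # MDFLTLMENK
print(round(gc_fraction(prefix), 3))  # 0.267 (prefix only)
```

The translated prefix matches the first block of the protein sequence listed above. Running the same functions on the full gene sequence would be the way to check the 43% GC content and the 320 aa protein length reported in the record.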