Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1839 |
Symbol | |
ID | 4809385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2183456 |
End bp | 2184517 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107253 |
Product | biotin synthase |
Protein accession | YP_001038253 |
Protein GI | 125974343 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0814889 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACA TGACAAATAT GATAAATTTA ATTGACAAAC TTTCAACAAC ACATACTTTG TCGTATGATG AAATGTATCA GCTTATTGAG CATAGAAACG AGGAACTTGC CAATTATCTG TTTGAAAAGG CAAGGCAGGT GCGTATCCTC TACTACGGCC ACGATGTCTA TATGCGCGGT CTCATTGAAT TCACCAATTA CTGTCGAAAT GACTGCTACT ATTGCGGAAT AAGGAAAAGC AACTGCAATG CCGAAAGATA CCGTCTTACA AAAGAGCAAA TACTTGAATG CTGTGACGTG GGATATGAGC TGGGTTTTCG CACCTTCGTG CTTCAAGGGG GCGAAGACGG TTATTATACC GACAAAATTT TGGCGGACAT AGTAAGCAGC ATCAAGGCAA AATATCCCGA TTGTGCGATT ACTCTCTCTT TGGGTGAAAA AAGTTATGAA AGCTATAAAT TGCTTTATGA GGCTGGAGCG GACAGATACC TTCTTCGCCA TGAAACAGCA AATGCCCAGC ACTACTCAAA GCTTCATCCG CCTGTTATGT CCCTTAAAAA CAGAAAACAA TGTCTTTACA ATCTCAAAGA AATAGGATAC CAGGTAGGTT GCGGTTTTAT GGTCGGTTCA CCGTTTCAGA CCACGGAATG TCTCGTTGAT GACTTAATGT TTATAAAAGA ATTGCAGCCC CACATGGTGG GAATAGGTCC GTTTATCCCG CACAAGGATA CGCCTTTTGC CGGCAAACCC GCCGGTACCC TGGAGCTGAC ATTGTTCCTT CTCGGCATCA TACGGCTAAT GCTTCCCTAC GTTCTGCTTC CGGCCACCAC AGCCCTTGGC ACAATCCATC CCAAAGGCAG GGAACTGGGT ATTCTTGCAG GCGCAAACGT GGTAATGCCA AACCTTTCGC CGAAAGAAGT AAGAAGCAAG TATCTTTTAT ATGACAATAA AATCTGTACC GGGGATGAAG CCGCAGAATG CAGAATGTGC CTAACCCACC GTATTGAAAG CATCGGATAC AAACTGGTTG TGTCAAGAGG CGACTGCAAA AAGCCAAATT AA
|
Protein sequence | MTNMTNMINL IDKLSTTHTL SYDEMYQLIE HRNEELANYL FEKARQVRIL YYGHDVYMRG LIEFTNYCRN DCYYCGIRKS NCNAERYRLT KEQILECCDV GYELGFRTFV LQGGEDGYYT DKILADIVSS IKAKYPDCAI TLSLGEKSYE SYKLLYEAGA DRYLLRHETA NAQHYSKLHP PVMSLKNRKQ CLYNLKEIGY QVGCGFMVGS PFQTTECLVD DLMFIKELQP HMVGIGPFIP HKDTPFAGKP AGTLELTLFL LGIIRLMLPY VLLPATTALG TIHPKGRELG ILAGANVVMP NLSPKEVRSK YLLYDNKICT GDEAAECRMC LTHRIESIGY KLVVSRGDCK KPN
|
| |