Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0906 |
Symbol | |
ID | 4810527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1083026 |
End bp | 1084378 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106325 |
Product | radical SAM family protein |
Protein accession | YP_001037333 |
Protein GI | 125973423 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000066146 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATGA TTCACAAGTT TTCAATGATG GGAACCAACA TAGTTGTGGA TGTCAACAGC GGCGCCGTCC ATGTGGTGGA TGACATATCC TTCGACATAC TGGATTATTA TAAAAACTTT ACTGCTGGGG AGATTAAAAA CAAACTTGCT CACAAGTACA ATGCAGATGA AATCGATGAA GCACTGAGGG AAATTGAAAG TCTCGAAGCG GAAGGGCTGC TTTTTTCCGA AGACCCTTAC AAAGAGTATG TTTCTTCCAT GGACAGAAAG TCGGTGGTAA AGGCGCTGTG TCTTCATATA TCCCATGACT GCAATTTAAG ATGTAAATAC TGTTTTGCAT CCACCGGAAA TTTCGGCGGA CAGAGAAACA TGATGAGCCT GGAAGTTGGC AAAAAAGCCA TTGACTTTTT GATTTCAGAA TCGGGAAATC GGAAGAATCT GGAAATAGAT TTCTTTGGCG GAGAGCCAAT GATGAACTTT GACGTGGTAA AGGGGATTAT TGAATACGCC CGTCAAAAGG AAAAAGAACA CAATAAAAAT TTCAGATTTA CATTGACGAC AAACGGACTG CTTTTGAATG ATGAAAATAT AAAGTATATA AATGAAAATA TGCAAAATAT TGTGCTGAGC ATCGACGGGC GCAAGGAAGT AAACGACAGG ATGCGAATAA GAATTGACGG CAGCGGCTGT TATGATGATA TACTGCCGAA GTTTAAATAT GTCGCAGAGT CCAGAAATCA GGATAATTAC TATGTTAGAG GAACCTTTAC CAGGGAAAAT ATGGATTTTT CCAATGATGT GCTGCATCTG GCCGATGAAG GCTTCAGGCA GATTTCGGTG GAGCCTGTGG TTGCGGCAAA GGACAGCGGA TATGATTTGA GGGAGGAAGA TCTTCCAAGG CTTTTTGAAG AATATGAGAA GCTGGCATAT GAGTATGTGA AAAGAAGAAA AGAGGGAAAT TGGTTTAATT TCTTCCATTT TATGATTGAC CTGACTCAAG GCCCCTGCAT TGTCAAAAGA TTGACCGGTT GCGGCTCGGG ACATGAGTAT CTTGCGGTTA CTCCCGAAGG AGACATTTAC CCCTGCCATC AATTTGTAGG AAATGAAAAG TTTAAAATGG GTAATGTCAA AGAAGGAGTT TTGAACAGGG ACATTCAAAA CTATTTCAAA AACTCCAATG TATATACAAA GAAAGAATGC GACAGCTGCT GGGCAAAGTT TTATTGCAGC GGAGGATGTG CCGCCAATTC GTATAATTTT CATAAAGATA TCAATACTGT GTACAAAGTC GGATGCGAAT TGGAGAAAAA AAGAGTTGAA TGCGCATTGT GGATAAAGGC ACAGGAGATG TAA
|
Protein sequence | MAMIHKFSMM GTNIVVDVNS GAVHVVDDIS FDILDYYKNF TAGEIKNKLA HKYNADEIDE ALREIESLEA EGLLFSEDPY KEYVSSMDRK SVVKALCLHI SHDCNLRCKY CFASTGNFGG QRNMMSLEVG KKAIDFLISE SGNRKNLEID FFGGEPMMNF DVVKGIIEYA RQKEKEHNKN FRFTLTTNGL LLNDENIKYI NENMQNIVLS IDGRKEVNDR MRIRIDGSGC YDDILPKFKY VAESRNQDNY YVRGTFTREN MDFSNDVLHL ADEGFRQISV EPVVAAKDSG YDLREEDLPR LFEEYEKLAY EYVKRRKEGN WFNFFHFMID LTQGPCIVKR LTGCGSGHEY LAVTPEGDIY PCHQFVGNEK FKMGNVKEGV LNRDIQNYFK NSNVYTKKEC DSCWAKFYCS GGCAANSYNF HKDINTVYKV GCELEKKRVE CALWIKAQEM
|
| |