Gene Cthe_0906 details

Gene Information

Locus tag: Cthe_0906
Symbol:
ID: 4810527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1083026
End bp: 1084378
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 41%
IMG OID: 640106325
Product: radical SAM family protein
Protein accession: YP_001037333
Protein GI: 125973423
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
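
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon is a stop codon that adds no residue. The variable names are chosen here and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1083026
end_bp = 1084378

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1353 450
```

Both results agree with the listed Gene Length (1353 bp) and Protein Length (450 aa).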


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000066146
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATGA TTCACAAGTT TTCAATGATG GGAACCAACA TAGTTGTGGA TGTCAACAGC 
GGCGCCGTCC ATGTGGTGGA TGACATATCC TTCGACATAC TGGATTATTA TAAAAACTTT
ACTGCTGGGG AGATTAAAAA CAAACTTGCT CACAAGTACA ATGCAGATGA AATCGATGAA
GCACTGAGGG AAATTGAAAG TCTCGAAGCG GAAGGGCTGC TTTTTTCCGA AGACCCTTAC
AAAGAGTATG TTTCTTCCAT GGACAGAAAG TCGGTGGTAA AGGCGCTGTG TCTTCATATA
TCCCATGACT GCAATTTAAG ATGTAAATAC TGTTTTGCAT CCACCGGAAA TTTCGGCGGA
CAGAGAAACA TGATGAGCCT GGAAGTTGGC AAAAAAGCCA TTGACTTTTT GATTTCAGAA
TCGGGAAATC GGAAGAATCT GGAAATAGAT TTCTTTGGCG GAGAGCCAAT GATGAACTTT
GACGTGGTAA AGGGGATTAT TGAATACGCC CGTCAAAAGG AAAAAGAACA CAATAAAAAT
TTCAGATTTA CATTGACGAC AAACGGACTG CTTTTGAATG ATGAAAATAT AAAGTATATA
AATGAAAATA TGCAAAATAT TGTGCTGAGC ATCGACGGGC GCAAGGAAGT AAACGACAGG
ATGCGAATAA GAATTGACGG CAGCGGCTGT TATGATGATA TACTGCCGAA GTTTAAATAT
GTCGCAGAGT CCAGAAATCA GGATAATTAC TATGTTAGAG GAACCTTTAC CAGGGAAAAT
ATGGATTTTT CCAATGATGT GCTGCATCTG GCCGATGAAG GCTTCAGGCA GATTTCGGTG
GAGCCTGTGG TTGCGGCAAA GGACAGCGGA TATGATTTGA GGGAGGAAGA TCTTCCAAGG
CTTTTTGAAG AATATGAGAA GCTGGCATAT GAGTATGTGA AAAGAAGAAA AGAGGGAAAT
TGGTTTAATT TCTTCCATTT TATGATTGAC CTGACTCAAG GCCCCTGCAT TGTCAAAAGA
TTGACCGGTT GCGGCTCGGG ACATGAGTAT CTTGCGGTTA CTCCCGAAGG AGACATTTAC
CCCTGCCATC AATTTGTAGG AAATGAAAAG TTTAAAATGG GTAATGTCAA AGAAGGAGTT
TTGAACAGGG ACATTCAAAA CTATTTCAAA AACTCCAATG TATATACAAA GAAAGAATGC
GACAGCTGCT GGGCAAAGTT TTATTGCAGC GGAGGATGTG CCGCCAATTC GTATAATTTT
CATAAAGATA TCAATACTGT GTACAAAGTC GGATGCGAAT TGGAGAAAAA AAGAGTTGAA
TGCGCATTGT GGATAAAGGC ACAGGAGATG TAA
 
Protein sequence
MAMIHKFSMM GTNIVVDVNS GAVHVVDDIS FDILDYYKNF TAGEIKNKLA HKYNADEIDE 
ALREIESLEA EGLLFSEDPY KEYVSSMDRK SVVKALCLHI SHDCNLRCKY CFASTGNFGG
QRNMMSLEVG KKAIDFLISE SGNRKNLEID FFGGEPMMNF DVVKGIIEYA RQKEKEHNKN
FRFTLTTNGL LLNDENIKYI NENMQNIVLS IDGRKEVNDR MRIRIDGSGC YDDILPKFKY
VAESRNQDNY YVRGTFTREN MDFSNDVLHL ADEGFRQISV EPVVAAKDSG YDLREEDLPR
LFEEYEKLAY EYVKRRKEGN WFNFFHFMID LTQGPCIVKR LTGCGSGHEY LAVTPEGDIY
PCHQFVGNEK FKMGNVKEGV LNRDIQNYFK NSNVYTKKEC DSCWAKFYCS GGCAANSYNF
HKDINTVYKV GCELEKKRVE CALWIKAQEM
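
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how such a block-formatted sequence might be cleaned up, summarized as a GC fraction, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table uses the standard amino acid assignments, which translation table 11 shares; alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which NCBI translation table 11 also uses); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGGCAATGA TTCACAAGTT TTCAATGATG"
print(translate(first_line))  # MAMIHKFSMM
```

The translated residues match the first block of the protein sequence. Passing the full gene sequence to the same functions would allow the listed GC content and protein length to be checked in the same way.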