Gene Cthe_0533 details

Gene Information

Locus tag: Cthe_0533
Symbol:
ID: 4808282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 649946
End bp: 651478
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 42%
IMG OID: 640105947
Product: radical SAM family protein
Protein accession: YP_001036962
Protein GI: 125973052
COG category: [N] Cell motility
              [R] General function prediction only
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0535] Predicted Fe-S oxidoreductases
        [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
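
The Start bp, End bp, Gene Length, and Protein Length values above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive (which matches the reported 1533 bp) and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(649946, 651478)
print(length)                     # 1533
print(protein_length_aa(length))  # 510
```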


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAATC CAATTACTGC CGTCTGGGAA ATCACCATGG GGTGCAACAT GCGGTGCAAG 
CATTGCGGCT CCAGCTGTGA GAATGCCCTG GAGGGAGAGT TGACCACCGA GGAAGCTTTG
AAACTTTGCG ATGAATTGGG TGAGTTGGGT TTTAAGTGGA TTACTTTGTC CGGTGGTGAG
CCTACAACAA GAAAGGATTG GCATCTTATA GCCAAAAGGC TTAATGAAAA TGGGATAATC
CCCAATATGA TAACCAATGG ATGGCTGATG AATGAGGAGA TTGCCGATAA AGCGGTGGAA
GCGGGAATCA ACACAGTTGC CATTAGTGTG GACGGGCTTG AAGAAACTCA TGATTTTATT
CGGAAAAAAG GCTCTTTCCA AAGGATAATG CGGGCTTTCG ATATCCTTAG GACCAGGAAT
ATATTCTATT CAGCCATTAC CACAATAAAC AACATAAATA TAAAGGAGCT TCCAAGGCTT
AGAGATATTC TGATTGAAAA GGGAGTAAGA GGCTGGCAGC TTCAGCTTGG GCTTCCCATG
GGAAATATGG CTAAAAACAA TGAGCTTGTG GCTCAGCCTT ACCATATAGA CGAAGTGATA
GAATTTTCCT ATAAGACCGT AAAAGAGGGT TATAATATAG ATGTTCAGCT GGCTGACTGT
ATTGGCTATT TTAATTTAAA AGAAATAGAA GTGAGGAAAA ACAGCTACGG TGGAGGAAGG
GATGGTTACA ACTGGACCGG ATGCGGTGCC GGCAAATACA GTCTCGGCAT TCTGCATAAC
GGTGATATTT TAGGCTGTAC ATCCGTAAGA GACAGAAGCT TTATTGAAGG AAATATCAGA
CAGACCCCTA TTAAAGAAAT ATGGATGAAT CCTGACAGCT TTAGCTGGAA CAGAAAAATG
ACCAAGGACA AGTTATCCGG CTTGTGTAAA AAATGCTATT TCGGAGAACG TTGTCTCGGA
GGCTGCTCCA ATACAAGGCT GACAATGGAA GGAAGCGTCT ATTCTGAGAA CAGATACTGT
TCGTATAATG TGGCAATAAG CACTGCAAGA GAGCAGCTTG ACAGAATTGA AGACGTAGAT
GTAATGCTTT CAAAAGCAAA GAAATTTGCA GACAATGGCA ATTTTCAGCT GGCCGAAATA
CTGCTTTCCA GAGCCATTGA AAAAGGCTGT CGCGATATTG GGGTGTTTGA GTTTTACGGA
TATATAAGTT TTATGCTGGG AAACTACCTT GATGCAAAGA AAGCAAATGA AGAGGCTTTG
AAAATAAATC CGGACAGCGC TTATGCAAAC AAGGGAATGG GTCTGTCTTT AGGAAGACTG
GGAGAATTGG AAAAAGGAAT AGAGTATTTG AGAAAGGCAA TTGCTCTGTC CGATGAAAAC
TTCACAGACC CTTACTATGA CCTGGCTGTT TTGCTTTATG AGAACGGAAG GAAAGAAGAA
GCTCTGGAAG TTCTTGAAGA GGGAAGAAAA AAATACAAGA GTTTTATAGA CACAAGCAAG
GACCTTTATG ATATTCTTAC AAATGCTTCT TAG
 
Protein sequence
MNNPITAVWE ITMGCNMRCK HCGSSCENAL EGELTTEEAL KLCDELGELG FKWITLSGGE 
PTTRKDWHLI AKRLNENGII PNMITNGWLM NEEIADKAVE AGINTVAISV DGLEETHDFI
RKKGSFQRIM RAFDILRTRN IFYSAITTIN NINIKELPRL RDILIEKGVR GWQLQLGLPM
GNMAKNNELV AQPYHIDEVI EFSYKTVKEG YNIDVQLADC IGYFNLKEIE VRKNSYGGGR
DGYNWTGCGA GKYSLGILHN GDILGCTSVR DRSFIEGNIR QTPIKEIWMN PDSFSWNRKM
TKDKLSGLCK KCYFGERCLG GCSNTRLTME GSVYSENRYC SYNVAISTAR EQLDRIEDVD
VMLSKAKKFA DNGNFQLAEI LLSRAIEKGC RDIGVFEFYG YISFMLGNYL DAKKANEEAL
KINPDSAYAN KGMGLSLGRL GELEKGIEYL RKAIALSDEN FTDPYYDLAV LLYENGRKEE
ALEVLEEGRK KYKSFIDTSK DLYDILTNAS
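
The gene and protein sequences above can be cross-checked by translating the nucleotide blocks. The following is a minimal sketch, not the tool that produced this record: it strips the 10-character grouping spaces and translates with the standard codon assignments, which translation table 11 shares (table 11 differs mainly in permitting alternative start codons, which does not matter here because the gene starts with ATG). The names `clean_sequence` and `translate_cds` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last in-frame stretches of the gene sequence listed above.
print(translate_cds("ATGAACAATC CAATTACTGC CGTCTGGGAA"))      # MNNPITAVWE
print(translate_cds("GACCTTTATG ATATTCTTAC AAATGCTTCT TAG"))  # DLYDILTNAS
```

Both outputs match the first and last ten residues of the protein sequence listed above; passing the full gene sequence to `translate_cds` allows the same comparison over the whole protein.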