Gene Cthe_0845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0845 
Symbol 
ID4810463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1021420 
End bp1022430 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content40% 
IMG OID640106262 
Productstage III sporulation protein spoIIIAA 
Protein accessionYP_001037273 
Protein GI125973363 
COG category[S] Function unknown 
COG ID[COG3854] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR02858] stage III sporulation protein AA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000346466 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTACAA AAGAAGATTT TGTTTTCAGC GTTGAAAAAG TACAGAACTT TGAAAAGGAC 
ATTTTGAGAA TGATATCTCC ACAAGTGAGA GATGTTTTAA AGAAGATTCC GCTAAATGAA
CTTATAACCA CCGAAGAGAT AAGACTAAGA GCGAACAAGC CTTTAATGAT TCAAAATGAA
AAAGGAAGCT TTTTTGTCAA TTTGGAAGGA AGACTTACAG CAAACAGGAT GAATCTTTTT
TATGTAAGCC AGGAGCAGAT AGCAAAAACG CTGGAACTTA TAAGTGAAAA CTCCATTTAT
GCTTTTCAGG ATGAAATAAG AAACGGTTTT TTGACCATAA GGGGAGGCCA TAGGGTCGGT
ATTGTCGGAC GAGTTGTTTT AAACGGAGAT ACCGTAAAGA ACATCAAGGA TGTTTCCGGG
CTTAATATAA GAATATCCAG GGAAATAACC GGATGTTCCT CGAAAGTTTT AAAATATATT
ATCAGCAGTG AAAAGCAAGT TTACAACACT TTGATAGTAT CTCCTCCCCA ATGCGGGAAA
ACAACCTTGT TAAGAGACAT AACGAGGGCT ATCAGCGACG GTGTTGAAGA AATGGGCTTT
AAAGGAGTTA AAGTGGGAGT TGTAGATGAA CGTTCAGAAA TTGCAGCATG TTACAAGGGG
GTGCCCCAGA ACAGGGTAGG AACAAGGACC GATGTGCTTG ATGCGTGCCC CAAACAAATT
GGCATGATAA TGATGCTCAG ATCCATGTCG CCGGATGTGA TTGTTACGGA TGAAATAGGA
AACAAGGGAG ACAAAGATGC TTTGATTCAG GTGCTTAATG CAGGGGTGAA AGTGATATCC
ACGGCGCACG GGTACAATAT TTCGGAATTA AAAAGCCGCA AAGAAGTCTT GAGCCTGATA
GAAGAAAAGA TGTTTGAAAG GTATATTGTT TTGAGCGCGA GAAAAGGCCC CGGTACAGTG
GAAGAGATAA TTGACGGGAC CGATATGAGT ATTTTGTACA AAGGAGAATG A
 
Protein sequence
MVTKEDFVFS VEKVQNFEKD ILRMISPQVR DVLKKIPLNE LITTEEIRLR ANKPLMIQNE 
KGSFFVNLEG RLTANRMNLF YVSQEQIAKT LELISENSIY AFQDEIRNGF LTIRGGHRVG
IVGRVVLNGD TVKNIKDVSG LNIRISREIT GCSSKVLKYI ISSEKQVYNT LIVSPPQCGK
TTLLRDITRA ISDGVEEMGF KGVKVGVVDE RSEIAACYKG VPQNRVGTRT DVLDACPKQI
GMIMMLRSMS PDVIVTDEIG NKGDKDALIQ VLNAGVKVIS TAHGYNISEL KSRKEVLSLI
EEKMFERYIV LSARKGPGTV EEIIDGTDMS ILYKGE