Gene Cthe_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0944 
Symbol 
ID4811237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1130692 
End bp1132143 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content40% 
IMG OID640106363 
ProductSMC protein-like protein 
Protein accessionYP_001037371 
Protein GI125973461 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.665094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTACGA TAAAAAGGAT AAGAATTGAA AATTTCCAGT CACACAAGGA TACGGAACTT 
TCTTTTTCCG ACGGGCTCAA TGTCATCGTG GGACCGTCGG ACCAGGGTAA GTCTGCCATT
ATCCGGGCTA TTAAATGGGT ATTGTATAAT GAACCCAGAG GGACTGATTT TATAAGGCAG
GGAACAAATT CTGCAAGGGT TACTTTGGAG CTTAGCAACG GATATGTCAT AACAAGGGAA
AGGGCACCTA ACAAAAACAG ATATACGCTC AAAGACCCTG ACGGAAATGT CAGTGTGTTT
GAAGGGTTCG GAAATGAAGT GCCTCTGGAA ATTGTAAAGG CTCATGGGAT TCCAAAAGTG
GCTTTGGACA TGGATGTCCG TGCAAGTCTC AACATAGGAG AGCAGCTTGA AGGACCGTTT
CTTTTGTCGG AATCCGGTGC CACCCGGGCA AAAGCCATTG GGAGACTTAC CGGACTTCAT
ATCATTGACC AGGCGATAAA GGATTGTGCC ACGGATATAA GAAGGGAAAA CCAGACTTGT
GACAGGATTG AAAGAGAAAT TGAAGATATT GACAAAAAAC TTGAAGAGTA TAAAAATATA
GAAGAATTGG GAAGAAGACT CGAAGAGTCC GAGAAAGTAA TCGCCCGGAT GGAAGCTTTG
ACGGCAAAAG TTGATATGCT TGAGGAAAAG AAGAATTCCC TCAAAGACAT TGAGACTGAA
TATTTGGCTC AAACCAAAAT TCTTTCAAGG CTTGACAGGC TGGAAGAGTG CGGTGTGTAT
TTAAAAAGTG CGGAAGCCTG CTTTTTCAAG CTAAATCAGA TACTGGGAAT AAAAAAGAGG
TATTCAGAGG TTTTAACCGG AATGGAAGAA ATGGAAAAGG TATTGCAAAA AACAAGCTTT
GTGGATGAGG CAGTGGAGAT TTTGAAAAAG GCATCTGATA TTTTCTCAAA ATATGAAAAA
CTTGACAGAT TGCGTTCGGA GTTTAGTAAT GTTGGCAGGG AACTTAACCA GACAAAAAAT
ATCCTGGACA GGACGTCAAA TGTGAAAGAA CTTGATTTTA TGATAAAAAA TATTTCTGAC
AAGGTTTTGC TTGGCTCTGA AATTACTCAG TTGCGGGAAA AGCTTGTTTG CCTTGAACGA
GAGATATCCA GGGTCAAAAA GAACATATCC TCTTACGAAA ATATAAATTT AGTGCAGGAG
ATTGTGACTT CTGTTGATAA AAAGCTGGAA GTGTTAAACA AACTGGAAGC AGCCAAAAAG
GAGTACAGTG CTGTTTGTAC CAGTCTGAAT GACGGTTTGG AGTTTATGGA CAAAAATAAA
AAAGAAATAC AGGAGAATCT TAACATATAT ATCGATATTC TTAGAAAAAG CGGAGTGTGT
CCGCTTTGCA AGAGCAGTAT CGGAGATGAA AAGCTTGAAA ACATAATAAG GCATTATGAG
GAGGTACACT GA
 
Protein sequence
MITIKRIRIE NFQSHKDTEL SFSDGLNVIV GPSDQGKSAI IRAIKWVLYN EPRGTDFIRQ 
GTNSARVTLE LSNGYVITRE RAPNKNRYTL KDPDGNVSVF EGFGNEVPLE IVKAHGIPKV
ALDMDVRASL NIGEQLEGPF LLSESGATRA KAIGRLTGLH IIDQAIKDCA TDIRRENQTC
DRIEREIEDI DKKLEEYKNI EELGRRLEES EKVIARMEAL TAKVDMLEEK KNSLKDIETE
YLAQTKILSR LDRLEECGVY LKSAEACFFK LNQILGIKKR YSEVLTGMEE MEKVLQKTSF
VDEAVEILKK ASDIFSKYEK LDRLRSEFSN VGRELNQTKN ILDRTSNVKE LDFMIKNISD
KVLLGSEITQ LREKLVCLER EISRVKKNIS SYENINLVQE IVTSVDKKLE VLNKLEAAKK
EYSAVCTSLN DGLEFMDKNK KEIQENLNIY IDILRKSGVC PLCKSSIGDE KLENIIRHYE
EVH