Gene Cthe_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1785 
Symbol 
ID4810030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2105320 
End bp2106402 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content42% 
IMG OID640107199 
ProductDNA integrity scanning protein DisA 
Protein accessionYP_001038199 
Protein GI125974289 
COG category[R] General function prediction only 
COG ID[COG1623] Predicted nucleic-acid-binding protein (contains the HHH domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000327916 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGGCC TGGATAGAAA AAAAGATGAT GAAATAATTG AGGTACTGAG AATGATGGCT 
CCCGGAACTT CACTGCGGGA AGGATTGGAT AATATTTTGC TGGCCCGGAC GGGTGCCCTT
ATTGTAATAG GTGACTCTGA GAAGGTTTTA AGTCTGGTGG ACGGCGGATT TTACATCAAC
AAGGACTATA CTCCGGCCCA TATTTATGAG CTGGCTAAAA TGGACGGAGC TATAATTCTC
AGCAAGGACC TTAAAAAAAT ATTGTATGCC AACGCACTGC TTGTTCCCGA TACATCGATA
CCTACGGCGG AGACCGGTAC AAGACACAAA ACGGCTGACA GGGTTGCAAA GCAGACGGGA
GAAGTGGTGG TGAGCATATC TCAGAGGAGA AATATTATCA CCATTTACAT GGGGTCAAGA
AAATATATAT TAAGAGAAAC CCCTGTTATC CTTGCAGAGG CCAATCAGGC TCTCCAAACC
CTTGAAAAAT ACAAGGTTGC CTTGGTTGAG GCTATAAACA ACTTAAACAT ATTGGAAATA
GAAGATATTG TGACGTTGGA TGATGTGGCT TTTGTCTTGC AACGTACCGA AATGCTAATG
AGAGTTGCTG CGGAAATTGA AAGATACATA AGTGAACTGG GCAGTGAAGG AAGGCTTATT
AGCTTGCAGT TGGATGAGCT TCTGACAAAT GTTGATGCCG ATGAACTTTT TATAATTGAA
GATTATGCCA TACGTACAGA TCTTCGTTCC GATGAAATTC TGGAAAAGCT GAGGCAACTT
TCTTATGATG AGCTTATGAA TCTGGTCAAC ATATGCAGTA TTTTAGGCTA CAGTCCAAAT
GCGGATGCTT TTGAGATGGT TATAAGCCCG AGGGGATATC GGCTTTTAAG CAAGATTCCC
AGAGTTCCCG TAAACATAAT AAGAAATTTG GTGAAAAAGT TCTCAAACCT GCAGGGTATT
TTAAACGCTT CCATTGAAGA ACTTGATGAT GTTGCCGGAA TAGGAGAAGT AAGAGCAAGA
ATTATCTGGG ACGGTTTAAG AAGAGTTCAG GAGCAAATCT TTTTGGATTC CAGAAAGCTG
TAA
 
Protein sequence
MTGLDRKKDD EIIEVLRMMA PGTSLREGLD NILLARTGAL IVIGDSEKVL SLVDGGFYIN 
KDYTPAHIYE LAKMDGAIIL SKDLKKILYA NALLVPDTSI PTAETGTRHK TADRVAKQTG
EVVVSISQRR NIITIYMGSR KYILRETPVI LAEANQALQT LEKYKVALVE AINNLNILEI
EDIVTLDDVA FVLQRTEMLM RVAAEIERYI SELGSEGRLI SLQLDELLTN VDADELFIIE
DYAIRTDLRS DEILEKLRQL SYDELMNLVN ICSILGYSPN ADAFEMVISP RGYRLLSKIP
RVPVNIIRNL VKKFSNLQGI LNASIEELDD VAGIGEVRAR IIWDGLRRVQ EQIFLDSRKL