Gene Cthe_1321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1321 
Symbol 
ID4809461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1602930 
End bp1604090 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content45% 
IMG OID640106745 
Productchaperone protein DnaJ 
Protein accessionYP_001037746 
Protein GI125973836 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID[TIGR02349] chaperone protein DnaJ 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGCA AAAGGGATTA TTACGAGATC CTTGGAGTTG ACAGAGGTGC ATCCGATGCA 
GAAATAAAAA AAGCTTACAG AAAGCTTGCT AAACAGTATC ACCCTGATAT GAATCCCGGT
GATAAGGCTG CCGAAGCAAA ATTTAAGGAA ATTAATGAAG CCTATGAGGT ATTAAGTGAC
CCGCAGAAAA GAGCGCGTTA TGACCAATTC GGCCATTCCG CATTTGATCC CAACGGTTTT
GGCGGAGGAG GTTTTGGCGG GGGATTTACC GGTGGATTTG GCGATTTTGA TTTTGGCGGA
TTTGGAGATA TTTTTGAAGC GTTTTTCGGA AGTGGATTTG GAACCAGAAC TTCCAGTGCA
AGAAGAGGGC CTCAAAAGGG TGCGGATCTT AAGTATTCCA TGGAAGTCTC ATTTGAAGAG
GCAGCTTTCG GAACAGAGAA GGAAGTTACG GTCAGCAGGT TGGAAATATG TCCGACTTGC
AGCGGTTCCG GAACAAAGCC CGGTCATCAG CCTGTTACAT GCAGGCAGTG TAACGGAACT
GGCCAGGTGC AGTACAAGCA GAGAACACCT TTTGGACAGA TTGTCAATGT AAGAACATGT
GACGTATGCC ACGGTGAAGG CAAAATTATT ACAAATCCTT GTGAAACTTG TGGCGGCAAA
GGAAGGGTAA GAAAGCATAC CAAACTGAAG GTTAGGATAC CTGCCGGTAT TGACAACGGT
GAGACGATAT CATTAAGAGG TGAGGGCGAG CATGGAATTA AAGGCGGGCC GTCCGGTGAC
CTTTTCATAA CCATCAAGGT GAAACCACAT CCAATTTTCA AAAGACATGG CAACGACGTT
AACTGTGAGA TTCCCATAAC TTTTACCCAG GCGGCGCTGG GAGCTGAGAT TGAAGTCCCA
ACACTGGATG GAAAGGAAAA AATTGTTATT CCTGAAGGTA CTCAGACAGG CACTGTATTT
AAGCTTAAAG GGAAAGGAAT ACCTTTCTTA AGAAGCAGCG GCAGAGGAGA CCAGTATGTA
AAGGTAAATA TTGAAGTGCC GAGAAAACTT AATGAAAAAC AGAAAGAGGT TTTAAGACAG
TTTGCAGAAC TCGTGGGTGA TGAGGTACAC GAGCAGAGAA AAGGATTTTT TAATAAAATG
AAAGATGCTT TGGGCATGTA G
 
Protein sequence
MAGKRDYYEI LGVDRGASDA EIKKAYRKLA KQYHPDMNPG DKAAEAKFKE INEAYEVLSD 
PQKRARYDQF GHSAFDPNGF GGGGFGGGFT GGFGDFDFGG FGDIFEAFFG SGFGTRTSSA
RRGPQKGADL KYSMEVSFEE AAFGTEKEVT VSRLEICPTC SGSGTKPGHQ PVTCRQCNGT
GQVQYKQRTP FGQIVNVRTC DVCHGEGKII TNPCETCGGK GRVRKHTKLK VRIPAGIDNG
ETISLRGEGE HGIKGGPSGD LFITIKVKPH PIFKRHGNDV NCEIPITFTQ AALGAEIEVP
TLDGKEKIVI PEGTQTGTVF KLKGKGIPFL RSSGRGDQYV KVNIEVPRKL NEKQKEVLRQ
FAELVGDEVH EQRKGFFNKM KDALGM