Gene Cthe_1739 details

Gene Information

Locus tag: Cthe_1739
Symbol:
ID: 4810169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2060655
End bp: 2061782
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 31%
IMG OID: 640107152
Product: SNF2-related protein
Protein accession: YP_001038153
Protein GI: 125974243
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
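
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon (the gene sequence listed below ends in TGA). The function names span_length and coding_codons are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2060655
END_BP = 2061782
GENE_LENGTH_BP = 1128
PROTEIN_LENGTH_AA = 375


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, ASSUMING one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those two assumptions, both comparisons hold for the values listed in this record.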


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0287077
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAAT TAAATGCACC TTATAATCTT CTAATCTGCC AAAAGTCCAA AATTAAAGAT 
TGGGAAGAAC ATTTTAAAGA TTACTATAAC TTACAAATTA TAATTTTCAA GAATCAATCT
ATAGAGAGCA TACCGCAAAA TAGTGTAATT ATTATTAATT ATGATTTGGT ATGGCGAAGG
AAGCAATTAC AAAAGCTAAA GGATTTTACA TTAATACTTG ATGAATCGCA GTATATTAAA
AATGAAACAT CCAACAGGGC AAAATTTATT TTAAGCCTTA ATCCTGCAAA TATAATTCTT
CTCTCTGGTA CTCCTACAGG TGGAAAATAT GAGGAGCTGT GGTCACAGCT TAGACTTCTT
GGATGGAATA TTAGTAAAAG CTCCTTTTAT AATCACTACA CAATAACAGA AAAGATTGAT
GTTGGAGGAT TTAAAATTCC AGTTGTAAAA GGCTATAAAA ATGTTGATAG GTTAAAGGAA
AAACTAAAAT CCTATGGTGC AGTATTTATG AAAACAGAAG AAGTATTTGA CCTGCCGGAG
CAGATTGAAC AGGTTGTAAC AATCGAGAAC TCAAAGGAAT ATAAAAAGTT TAAAAAAGAC
AGAGTTATTA CAATTGATGG TGAAACTTTA GCAGGAGATA CAGCACTTAC AAAGTTATTG
TATCTAAGGC AGTTAACATC TATTTACAAC TCAAATAAAC ATCAGGTGCT AAAAGATATC
TTTGAATCCA GTAATGATAG ATTTGTTATC TTCTACAATT TTAAAAGAGA GTTTGAGATT
ATAAAAAACA TCTGTTTTAA GATAGATAGA CCCATTTCCT ATATTAATGG AGACGGAACA
GACCTTGAAA ATTATGAAAA TAAGTCAAAC AGTATTACAT TGGTTCAGTA CCAAGCAGGG
GCATCTGGAG TTAATTTGCA AAAGGCAAAT AGAATTATTT ATTTCAGCCT TCCACTATCT
AGTGAATTTT GGATGCAGTC AAAAAAGAGA ATACACAGGA TAGGGCAAAA TAGGACTTGC
TTTTACTACT ATCTTATTAC AGAAAACAGC ATAGAAGAAA AGATACTTGA AGTATTAAAA
CAAAGGCGGG ATTTTACTGT GGAGCTGTTT GAAAAGGAGA TGTTATGA
 
Protein sequence
MKQLNAPYNL LICQKSKIKD WEEHFKDYYN LQIIIFKNQS IESIPQNSVI IINYDLVWRR 
KQLQKLKDFT LILDESQYIK NETSNRAKFI LSLNPANIIL LSGTPTGGKY EELWSQLRLL
GWNISKSSFY NHYTITEKID VGGFKIPVVK GYKNVDRLKE KLKSYGAVFM KTEEVFDLPE
QIEQVVTIEN SKEYKKFKKD RVITIDGETL AGDTALTKLL YLRQLTSIYN SNKHQVLKDI
FESSNDRFVI FYNFKREFEI IKNICFKIDR PISYINGDGT DLENYENKSN SITLVQYQAG
ASGVNLQKAN RIIYFSLPLS SEFWMQSKKR IHRIGQNRTC FYYYLITENS IEEKILEVLK
QRRDFTVELF EKEML