Gene Cthe_1161 details


Gene Information

Locus tag: Cthe_1161
Symbol:
ID: 4810829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1378836
End bp: 1381238
Gene Length: 2403 bp
Protein Length: 800 aa
Translation table: 11
GC content: 27%
IMG OID: 640106583
Product: hypothetical protein
Protein accession: YP_001037586
Protein GI: 125973676
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID:
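
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1378836, 1381238)
print(length)                  # 2403
print(protein_length(length))  # 800
```

Both results agree with the Gene Length and Protein Length fields of this record.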


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAAAA GTATTAATAT TGAAAATCTA AGAGGAATTA GATTTAATAC GAACATTGAT 
TTAAATAAAA AATCTTTAAT TATTTTTGGG GAGAATGGAA AAGGAAAAAG TTCAATTGTT
GATGGAATTG AATATGCAAT TACTGGGGAT ATTAAGCATA TTTCTTCAAC ATGTAGAGAA
GTATCATTAA AAAAACACGC CCCTCATATA TACGCAGATT TTCAAGAGAT AAAAGTTGAA
GTGGAATTTA GTGATGGAAG CGTATTATCT AATTATAAAG AGCCTGAGAA AGGCACATTG
GCATATAGAA TAAGAAATAG TAAATTAGGA AACATAAATA TTTTAAGACG CTCTCAACTG
TTAGATGCTA TTTCTGCCCA ACCAAAAGAA AGATATGATT TGTTAAAGCA ATTTCTGCCA
CTTGCTGAAA TAACAAAATT TGAGAATGCT TTAAAAGGAG TAGTTGATAA GTTTCAAGGG
GAAGTAAATA ATCTTAAAAC AGAAATTGAA AATCATATGA GGAACATTCA GGCTGCTCTT
GACATAAAGG ATTTAGGTTC TGTTACTTCT GATAATATCT TCTCGATACT TGTAAATAGA
GGTAAACAGT TTAACATAGA AGATATAAAA GACTTTAAGG AAATACCGGA ATACATAAAA
AAAGTTGATA ATTATATTGA GTGTATGGGA AATATAAATC GAGATGTAAT TATTAGGAAT
TTTTTGAATT TGCTTAATGA ACTAATTGAT AAAAAATCAC CTGAACTGTT TGCAAAAGCG
ATTTATGATA ATATTAACAC AAAACTTGAT TTGGTTCAGA AAAAGAAAGT TGTTTTTTAC
GAGGAATTTC TCACTACAGG TATTCAATGG TTACAGGAAG AAAACAAATC ACTATGTCCA
TTTTGTGAAA GTCCGATAGA TGTACCAAGT GTTATAGAAA GAGTAAAGAA GAGAATAAAT
GAGAATTCGG AGTATTCAAC ACTTAAAAAG GAGTTTCAAA GAGATTATAG TATATTGCAG
TCAGAACTTA AATGGTGGGA AGACAATCTA GACAAGATTA GAAATATAAA CAAAAAAATT
AATGATGAAA ATATTGAAAA TTTATGTAAA ACAATTGAAG ATGATATCGA GAATTTCAAG
AAATTAGTTC CTAATAGCAT ACAAGAAATC ATTATTGTAA AGAATTTGCC AAATTGGAAT
GAATCTATTT ATAAAAATGC TATTCATCTA AAAAATGAGT ATAGTAACAA AGTACTTCCA
AAGGATGCCG TGTTATTACT TAACGAAGCA ATTAAATTTA AAAATGATTT AAAAATTGTA
TATGACAACA TTGTACATAT AAATAAGAAA ACTAAAGAAA ATACTCTTTT AAATAAAAAA
TATAAAATTG CTAAGCAATT TTATGAGGAA TTGGTTCGCC AACGTAAAAA TTCCGTACAA
GAAATTTATG ATGAAATTAA AAATGATATT AATAGTTACT ATAGTAGTAT GCATTTTGAA
GAGGATATAG GTGATATTGA TTTAAAAATA AAAGATTCTT CAAGCAAGGG AAGTGTCATA
ATAGAAGCAA GTTTCTATGA AAAGAGTGGT GAGGACCCGA GAGCATATTA TAGTGAATCA
CATCTTGATA CTTTAGGATT GGCAATTTTT CTAGCATTAT ATAAGCGAGA ATGTTCAAAA
AATAAAGATT TAAGATTATT AATATTAGAT GATGTGCTTA CATCTGTTGA TGCGGCACAT
AGAATCAATA TAATAAACCT TATTTTTTCT GAATTTAAAG AACACCAATT AATTATTACA
ACTCATGATA TTGTATTATA TAAGGAAATC CTTGAGTTAG AAAAATTATA TGGAGGAAAT
AACAAGTATA GAAATATTGA GATATGTGAA TGGACGAAAG ATAGGGGACC TATATTAGAT
GATACTAAGT CAGAAATTGA GAAGTTAAGG GAACATTTAA CTAATCCTCA TACAGATAAA
AATATTCTTG CTTCTGCTAC TGGTACTTTT CTGGAACTAG TTTTATGCAA ATTAAGGTAC
TCTTTGGAAT TATCAATCCC TGCAAAATAT CAAGATAAAT ATACCATTGT GGATATTTGG
GATAACTTAT ACTCTAAATT AAAGAAGAAT AAGGAATTTT ATAGGTTAAA TTCAAAGGTA
TTAGATTCCA TCAATGTTTC TAAGTTTATT AGAAATATAA GTGGGTGTCA TTATAATGAA
TGGGCTCAAG GAGTATCCAA GGATGAAATA AAACAATTCA CAAATAATGT TATACGGTTT
TATGAGATAG TATATTGTAG CATTTGTAAC TCCTTTATAA AAAAGAGTAA TGATAATGAA
GATTATCAAT GCAATTGTTC GAGATTGCAA TATAATAAAA AAGACCAATT ATTAATTTCA
TAA
 
Protein sequence
MLKSINIENL RGIRFNTNID LNKKSLIIFG ENGKGKSSIV DGIEYAITGD IKHISSTCRE 
VSLKKHAPHI YADFQEIKVE VEFSDGSVLS NYKEPEKGTL AYRIRNSKLG NINILRRSQL
LDAISAQPKE RYDLLKQFLP LAEITKFENA LKGVVDKFQG EVNNLKTEIE NHMRNIQAAL
DIKDLGSVTS DNIFSILVNR GKQFNIEDIK DFKEIPEYIK KVDNYIECMG NINRDVIIRN
FLNLLNELID KKSPELFAKA IYDNINTKLD LVQKKKVVFY EEFLTTGIQW LQEENKSLCP
FCESPIDVPS VIERVKKRIN ENSEYSTLKK EFQRDYSILQ SELKWWEDNL DKIRNINKKI
NDENIENLCK TIEDDIENFK KLVPNSIQEI IIVKNLPNWN ESIYKNAIHL KNEYSNKVLP
KDAVLLLNEA IKFKNDLKIV YDNIVHINKK TKENTLLNKK YKIAKQFYEE LVRQRKNSVQ
EIYDEIKNDI NSYYSSMHFE EDIGDIDLKI KDSSSKGSVI IEASFYEKSG EDPRAYYSES
HLDTLGLAIF LALYKRECSK NKDLRLLILD DVLTSVDAAH RINIINLIFS EFKEHQLIIT
THDIVLYKEI LELEKLYGGN NKYRNIEICE WTKDRGPILD DTKSEIEKLR EHLTNPHTDK
NILASATGTF LELVLCKLRY SLELSIPAKY QDKYTIVDIW DNLYSKLKKN KEFYRLNSKV
LDSINVSKFI RNISGCHYNE WAQGVSKDEI KQFTNNVIRF YEIVYCSICN SFIKKSNDNE
DYQCNCSRLQ YNKKDQLLIS
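
The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be cleaned before use in most tools. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation; the helper names are hypothetical, and the example uses only the first row of the gene sequence, so the printed value describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence only (60 bases), not the whole gene.
first_row = "ATGTTAAAAA GTATTAATAT TGAAAATCTA AGAGGAATTA GATTTAATAC GAACATTGAT"
seq = clean_sequence(first_row)
print(len(seq))          # 60
print(seq[:3])           # ATG
print(gc_fraction(seq))  # 0.2
```

Passing the full gene sequence through the same two helpers would give a value comparable to the GC content field of this record.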