Gene Cthe_1138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1138 
Symbol 
ID4810806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1351046 
End bp1352731 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content38% 
IMG OID640106560 
Producthypothetical protein 
Protein accessionYP_001037563 
Protein GI125973653 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATAAAT ACATAGGCAA ACTTATCGGC AATACGGGCA ATCCTAATGA TTTAAAGATT 
GCTCTCGAAA ACAGTTTTTC TGCTAAAAGA GGAGAGTTTG TAAAAATCAA GCATAGAGAA
TCGGAAGAAG ATGGAGATAC ATATGTTCTA GGGAGAATTG TATCCATATC CAGAAGCAAT
ATTCTATATA ACTCCAATAT GGGTGAGGGG CTGTCATCTC TTGAGATACT ACCAGGTGCC
CAAGTTACTG GGGAGACTTT ATTTGGGACC ATAGAACTGG TGGGGTACAG GGATAACTAT
GGACAGATAA AAATACCCAG GCGACCCCTT AACCCAGGGG AAAAGGTATA TGGTGTTGAC
TATGAGTTTT TGTCAAAGTT TTATAAGTTT GATGAGAATA CAAGCATTAA CATAGGTAAT
TTGATTGGAT ACGACAAGGG AAGTAATATA GTCCCTGTTT ATCTCGATGT AAACAAACTG
GTTACAGAAC ATTTGGCTGT GCTGGCAATG ACAGGTTCAG GGAAATCATA TACTGTGGGC
AGAATTATTG AGAGACTTGT AGCAGAAATG AATGGAACAG TAGTAGTTTT TGACGTTCAT
GGAGAATATG GAAAAGCATT TGAAAAGGGA GAGATACATT TTAATAATAA TCTTGATTTT
ATTGAGGATG AGAGGGAAAA GAAGAGTATT CAAAGGATTC AGGAAAATTT AATAAAAATG
CAGAATGCAG GTGGTGGAAT AAAAGTTTAT ACTCCCCAGA TTGATTCCTT TGATTATAAG
TATAGTGGAA AAAACCATCA CTTAGCCCTG CAATTCGATA GATTCGACAT GGATGATTTA
TCTTCCATTC TTCCCGGTTT GACAGAAGCC CAGGAAAGAG TATTGGATGT TGCAATCAGG
TATTGGAAAG CGAAATATAA TCATCCACCA AGAGATATTC AGGATTTAAC ATATCTACTT
TCTGATGAAC AGGGGCTTGA GGAACTAAAG AATTGGGACA ATTTAACTGA AGGTGAAGCC
AAAGCACTCA ATAATAGAAG TGCAGCAGTG GCTTCTATGA AATTAACCCG AGTAATAAAT
GAAGCAAAAA GTTTTTACAC AAGGGCTATA GGTGAGCCTA CAGATATTTA TGATATGATT
GGCGAAAAGG GAAATAGCGT GGGAAGGCTT GTAATAATAG ACTTACAAGG CCTATCCGAT
GATGCTAAAC AAATTATAAC AGCATTGATA TCCAGTGAAA TTATGAGGGC AGCATCAGAT
AAAAAAAGGC AAATAAGACC ATGTTTCCTT GTTTATGAAG AAGGACACAA TTTTGCACCG
GCAGGCATTC CGAGCATTTC TAAGAAAATT ATTAAGAAGA TTGCGGCGGA AGGAAGAAAG
TTTGGTGTTG GTTTTGCGAT TATTTCACAA AGACCGTCAA AACTTGACCC AGATGTAACC
TCACAGTGCA ATACAATTAT TACAATGCGG TTAAAGAATC CAGATGACCA GCGGTTTATA
GCAAAAACGT CAGATATGTT TTCATCATCT GATATTGAAG AATTGCCATC TTTATCAACG
GGAGAAGCAT TGATAAATGG CAGGTCAATT CCTGCACCAC TGTTAGTAAA AGTTGGAACA
AAGGCCTTAA TACATGGTGG AGAGTCTCCT GAAGTAATCA AGGAATGGGG CGTATTCAAT
GGATAA
 
Protein sequence
MDKYIGKLIG NTGNPNDLKI ALENSFSAKR GEFVKIKHRE SEEDGDTYVL GRIVSISRSN 
ILYNSNMGEG LSSLEILPGA QVTGETLFGT IELVGYRDNY GQIKIPRRPL NPGEKVYGVD
YEFLSKFYKF DENTSINIGN LIGYDKGSNI VPVYLDVNKL VTEHLAVLAM TGSGKSYTVG
RIIERLVAEM NGTVVVFDVH GEYGKAFEKG EIHFNNNLDF IEDEREKKSI QRIQENLIKM
QNAGGGIKVY TPQIDSFDYK YSGKNHHLAL QFDRFDMDDL SSILPGLTEA QERVLDVAIR
YWKAKYNHPP RDIQDLTYLL SDEQGLEELK NWDNLTEGEA KALNNRSAAV ASMKLTRVIN
EAKSFYTRAI GEPTDIYDMI GEKGNSVGRL VIIDLQGLSD DAKQIITALI SSEIMRAASD
KKRQIRPCFL VYEEGHNFAP AGIPSISKKI IKKIAAEGRK FGVGFAIISQ RPSKLDPDVT
SQCNTIITMR LKNPDDQRFI AKTSDMFSSS DIEELPSLST GEALINGRSI PAPLLVKVGT
KALIHGGESP EVIKEWGVFN G