Gene Cthe_3068 details


Gene Information

Locus tag: Cthe_3068
Symbol:
ID: 4809942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3609155
End bp: 3610594
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 38%
IMG OID: 640108492
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_001039457
Protein GI: 125975547
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID:
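
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3609155
END_BP = 3610594
GENE_LENGTH_BP = 1440
PROTEIN_LENGTH_AA = 479


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon, for an unspliced CDS (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields of this record.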


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000279575
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTAA AAAAACGCCT TATTTTTTCG AATGCTGCAA TTATTGTTAT TCCTCTTGGG 
ATAACATTTG TGGCATCTTT TATTTTTATG TTTGTTTTGG CGAGGATACA TGATGTCGAC
CTAAGTTACA ATAATGTAAA AAAGCTTACT CAGATACAAT ACGAGTTTTT TAAAGCAGAG
GGGGGATTGC TTAAAAATTC CCCGGAAATA ATTTTGGAAA AAGATTTTCA GCAGTATATT
ACCACAAGAC TTGAGAGCAT AGAAGCCGAC ATAGTGGTGC TAAAAGGCCA GGAACGGGTT
TTTGAAACTC GCAAGCTCAG TATTATTGAG TTGGAAAGAT GTCTTGAAAA AACCGGCGAC
AACTTGTTTA GAAACATTGT TGAGATTCAG GGCAAATCCC ATATGGTAAA AGTAATACCT
GTGATATTTA AAAGCGGTGA GGATGGGAAA ATTCTTTTGC TTGTGCCCGC CGTAAATGAC
TGGATGACAA CGGAAAAGCT TTTCATATTT TCCGGCGTGG TGTTTGTTCT CAGCTTTATA
ATAACAAATA TAGTCATCAT TACTGCTTTT TCAAAGAAAG TTATAACTCC TCTGGGGAAG
CTTCAGGCTG CTGCGGGCAA AATAAGCGAA GGCAATCTGG ATTTTGAGAT TATTGAGGAC
GGAGATACCC AAATTAGAGA ATTGTGCCGC TCCTTTGAGA AAATGAGGCT TAAGCTTGTG
GAGGCAAATT ATACGCAGAA AAAATATGAT GAGAGCAGAA AAATGCTTTT TTCAAGCATA
TCTCACGATC TTAAAACTCC TATAACTTCA ATAAAGGGAT ATGTTGAGGG GATATTGGAC
GGTGTGGCAA ATACCCCTCA GAAAGTGGAA AAATATTTAA GAACGGTTCA TTCCAAGGCT
GTTCACATGG ATAGAATGAT TGATGACCTT CTTTTGTATT CGAGACTGGA TATGCACAAG
GTTTCGTTTA ATTTTGAAAA GACGGATGTG CTAAAGTACT TTGAAGATTG CATGTATGAA
ATAGATATTG AACTTGAAAA GTCCAATATC AAGGTTGAGC TTCATAACAA CTTGAGAGGA
AAGCGTTATA TAATGATAGA CAGGGATCAG GTGCGAAGAG TTGTGATAAA CATAATTGAC
AACTCAAGAA AATATATGGA CAAGGAACAG GGGAAAATAG ATATTTTTTT GAGGGAAGCA
ACATCAAATG TGGTAATAGA GATAAAAGAC AACGGAGCCG GAATTAGTGA AAGTGATTTG
CCCTACATTT TTGACAGGTT CTATCGCGCC GATTCGGCGA GGGATACCAG GAAAGGAAGC
GGGCTTGGAC TTGCCATTGC CAAACAAATA ATAGAAGGAC ACGGAGGGAA AATTTGGGCG
GTCAGCCGTA TGGGCGAAGG CACGAGTGTG ATGATTTCTC TGAAAAAATA TGAAGGCTGA
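
A GC content figure such as the one reported in the Gene Information table can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming the sequence is pasted as plain text in space-separated 10-base blocks; `gc_fraction` is a hypothetical helper name, and how the database rounds its reported percentage is not stated in this record.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, used as a small example.
    print(gc_fraction("ATGGATTTAA AAAAACGCCT"))
```

Passing the full gene sequence as a single string yields the fraction for the whole gene, which can then be compared with the reported GC content.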
 
Protein sequence
MDLKKRLIFS NAAIIVIPLG ITFVASFIFM FVLARIHDVD LSYNNVKKLT QIQYEFFKAE 
GGLLKNSPEI ILEKDFQQYI TTRLESIEAD IVVLKGQERV FETRKLSIIE LERCLEKTGD
NLFRNIVEIQ GKSHMVKVIP VIFKSGEDGK ILLLVPAVND WMTTEKLFIF SGVVFVLSFI
ITNIVIITAF SKKVITPLGK LQAAAGKISE GNLDFEIIED GDTQIRELCR SFEKMRLKLV
EANYTQKKYD ESRKMLFSSI SHDLKTPITS IKGYVEGILD GVANTPQKVE KYLRTVHSKA
VHMDRMIDDL LLYSRLDMHK VSFNFEKTDV LKYFEDCMYE IDIELEKSNI KVELHNNLRG
KRYIMIDRDQ VRRVVINIID NSRKYMDKEQ GKIDIFLREA TSNVVIEIKD NGAGISESDL
PYIFDRFYRA DSARDTRKGS GLGLAIAKQI IEGHGGKIWA VSRMGEGTSV MISLKKYEG
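
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the compact NCBI listing (bases in TCAG order); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted. `translate` is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence.
    print(translate("ATGGATTTAA AAAAACGCCT TATTTTTTCG"))  # MDLKKRLIFS
```

The printed residues match the first block of the protein sequence, and the final codons of the gene (TAT GAA GGC TGA) translate to YEG followed by a stop, matching the end of the protein.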