Gene Cthe_1915 details


Gene Information

Locus tag: Cthe_1915
Symbol:
ID: 4810773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2281578
End bp: 2282969
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 40%
IMG OID: 640107332
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_001038327
Protein GI: 125974417
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
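
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2281578
end_bp = 2282969

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
remainder = gene_length % 3
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length)     # 1392, matching "Gene Length"
print(remainder)       # 0
print(protein_length)  # 463, matching "Protein Length"
```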


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000195525
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGGGGAA TCAGGGGCAG AGTTGTCGGA ACTTATCTGA TTATTACTTT TCTGTCGGTG 
GTGATATTTG AAATTATTCT CATTTACGGG CTTGTAGAGT TTTATTATTC AAGTATAGAG
GCTGAACTGA GTCGTTCCGC ACAGTCAATA ATCCATAATT ACAGCCAATA TTTCAATAAT
TATGATTTAC ACAGGGATGC AAAAGTATTG CTTGAATCCA TGCCCGGCAA CACCGTGGCC
CAAGTCCAGA TAATTGATGA CAAAGGGGTT TTGATTGCTG ATTCGATAGA GCCTTCCATG
GAAGGAAAAA AGCTTGATAA TTACGATGTC AATATGGCCT TGAATGGGAA AGAGGCTGCG
TGGAAAGGAA AAATACCCAT GACGGGTGAA CCGGTTTTTT CGGTTTCTTT TCCCATAATC
CGTCCTGACA GCGAAGAAGT GATTGGAATA GTAAGAGTAA TTACGACTCT TTCCGATGTA
AATGAAATAT TGAAAAATCA CATAATCATT CTCATTTCTT TGGGACTTTG CATAGTTTTT
CTGATTTTTC TTACCGGACT GGCTTTGACC AATACCATTA TAAAACCGGT CAAGGAAATA
ACATCTGCTG CGAAAGCCAT GGCCCAGGGC AGATTCGACG TACGGGTATC CAAAAGGTAT
GATGACGAAA TTGGAGAACT TGGAGATACC TTGAATTACA TGGCACAGGA AGTGGCCAAC
CAGCAAAAAA TGAAAAATGA CTTTATTGCG TCAATTTCCC ATGAACTTCG CACTCCTTTG
ACATCAATAA TGGGGTGGAT AATTACAATC AATTCCGGAG ACATTGACAG CAAAGAAGAA
CTGAAAGAAG GATTGGATAT AATTGAAAGA GAAAGCAAAA GACTGGCGGA GCTTGTGGAT
GAACTTTTGG ATTTTTCAAA ATTTGATGCC GGAATCATTA CGTTAAGGAA GAGTGTTGTA
AATTTGGGTG AGCTTTTGAA ATACATAAAA AGGCAAATGG AGCCTCGGGC TGAGCGAAAA
GGCATAACCA TGACGATAGA CGTGGATGAG CATCTGCCGT TGATTGAAGC TGACGAAAAC
AGGCTTAAAC AAGTGTTTAT AAATATTATT GATAATTCAT TTAAATTTAC GCAAAAAGGC
GGGTATATTG ATATTATAGG CAGAAAAAAT GAAAACGGTG TTTTGATAAG AATAGAGGAC
AGCGGATGTG GTATACCGGA GGAAGATTTG CCGAGAGTTA AGCAAAGGTT TTTCAAGGGA
AGCAATGTTG TTTCGGGAAG TGGTTTGGGA CTTGCCATCT GTGATGAGAT TGTAAGACTT
CACAATGGGA AAATAGACAT AGAAAGCACT GTCGGAAAAG GCACCAGAGT AGATGTGATA
CTTCCGGTTT GA
 
Protein sequence
MRGIRGRVVG TYLIITFLSV VIFEIILIYG LVEFYYSSIE AELSRSAQSI IHNYSQYFNN 
YDLHRDAKVL LESMPGNTVA QVQIIDDKGV LIADSIEPSM EGKKLDNYDV NMALNGKEAA
WKGKIPMTGE PVFSVSFPII RPDSEEVIGI VRVITTLSDV NEILKNHIII LISLGLCIVF
LIFLTGLALT NTIIKPVKEI TSAAKAMAQG RFDVRVSKRY DDEIGELGDT LNYMAQEVAN
QQKMKNDFIA SISHELRTPL TSIMGWIITI NSGDIDSKEE LKEGLDIIER ESKRLAELVD
ELLDFSKFDA GIITLRKSVV NLGELLKYIK RQMEPRAERK GITMTIDVDE HLPLIEADEN
RLKQVFINII DNSFKFTQKG GYIDIIGRKN ENGVLIRIED SGCGIPEEDL PRVKQRFFKG
SNVVSGSGLG LAICDEIVRL HNGKIDIEST VGKGTRVDVI LPV
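
The gene sequence starts with TTG while the protein starts with M, which follows from the record's translation table 11 (the bacterial, archaeal and plant plastid code), where TTG can act as an initiator codon. A minimal sketch of that relationship is given below. The function name `translate_cds`, its `has_start` parameter, and the listed set of initiator codons are assumptions made for illustration (the initiator set is recalled from the NCBI table 11 definition and should be checked against it); this is not the pipeline that produced the record.

```python
# Standard codon layout in NCBI order (bases T, C, A, G). Table 11 uses the
# same amino acid assignments as the standard code; it differs in which
# codons may serve as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: initiator codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str, has_start: bool = True) -> str:
    """Translate a coding sequence, ignoring the spaces and line breaks
    used in the record's 10-base column layout."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        # An initiator codon in first position is read as methionine.
        if pos == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
head = translate_cds("TTGCGGGGAA TCAGGGGCAG AGTTGTCGGA")
# Last 12 bases of the gene sequence above (not a start, so has_start=False).
tail = translate_cds("CTTCCGGTTT GA", has_start=False)

print(head)  # MRGIRGRVVG
print(tail)  # LPV
```

The two fragments reproduce the first ten residues (MRGIRGRVVG) and the last three residues (LPV) of the protein sequence listed above.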