Gene Ccel_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2107 
Symbol 
ID7310805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2464294 
End bp2466090 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content33% 
IMG OID643609041 
Producthistidine kinase 
Protein accessionYP_002506432 
Protein GI220929523 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA AAATAATTCA ATACTCTGTT ATGCTTGTAA TAATAGGTAT TACTATTGCA 
GGAATTTTTA CTTCTGCAAT GGCCAGATAT TTCTACAAGC AGGAAGTTCA AAATAATATT
GAAAGTATCG CTGTTTTATT AAAGAATGAT GTACTAGAAG AGATATCTGT CAAAAGTGAA
CTAGACTTTA ATAAGTTTGT AAAAGATTAT GCAAGTCTTC TTAATTCCAG ATCGAATAAA
GATAAGAATT TCCCTTTAGC TACCAGAATA ACAATTATAG ATTTTGAAGG AAAAGTACTC
GGTGACTCGG ACTCAAATAT AGATACAATG GAAAACCATT TAAACAGAAA GGAAATCCAT
GAAGCTATCC AGGGTCAGTC AGGTACAGAT CAACGATTTA GTGCCACGAT GCAAATGCCT
TATTTGTACG TCGCTCTGCC CATAAATGAG CATCGCATAA TTGTAAGAAT TTCTGTGCCT
TTATATCAGC TAAATGCTAT AAATAAAGCA TTTATATATT ATACTTTACT TGGAATTCTG
GCAGGCTTGC TTCTAACATT ACTCATTTCA TTTAAATTAT CCAGTTTTAT TACCAAGCCA
ATTCAACATT TAATTAATAC CTCTAAGGAA ATAGCAGGAG GTAACTACAA AAAGAGAATT
GATGTTAATT ATAAGGACGA AATTGGTCAG CTTGCTATAA CCTTCAATGA AATGGCTGAT
AAGCTTGATG AGACTCTTAG TGGTATAATG GATAAAAATG TAAAAATTGA TACCGTTATT
AATAGTATGA TGAACGGTAT AATTGCTATT GATAACAACT CGAAAATAAT TATTATAAAC
ACTACAGCCT GTGATATCTT TGGTGTACAG TATGGACCTG GAATAATAGG TAAAAATTTG
ATAGACATTA CAAGAAACTC CAAAGTCAAT TCTCTTTTGC GTCAAACAAT AAGAAATGAT
ATCTCTTTAA TTGATGAAAT TGTAATATTC TCACCTTCAT TGGGAATTGA CAAAATTTAC
AGAATTTATA CCAATACCAT TAAATCATCG GATACTAGAC AACAAACAGC GGGAGCCGTA
ATTACTCTAA ATGACATAAC ATCAGTAAGA AAATTGGAGC AAATACGTAC CGAGTTTGTA
TCGAATGTTA CTCATGAGCT GAAGACTCCT CTTACCTCAA TAAGAGGATT TGTGGAAACT
TTGAAAAACG GTGCTATTGA GGATACCACA GTAGCTGTTA AATTCCTTGA TATTATTGAT
ATCGAAGCCG AAAGACTATA TATGCTCATT AATGATACCC TGCAGTTGTC CGAAATAGAA
GCTATGCGAA AAGATGACAG TGTAATGAAA ATAGACCTTA TACAGATAAT TGATGAGGTT
GTTCTTATTT TAAAGTCTTC AGCTGATAAA AAAAATATTG CATTGGAGGT TCTTGCCAAT
TCTGCTGAAG TTCCTGTAAT AGCCAATAGA AACAGGATAA AGCAGATGCT CATCAATCTC
ATTGATAATG CAATTAAGTA TAATGTTGAA AATGGAAAAA TATATATAAA ATCGGAGAAA
GCTAACGGCA ATACAGTAAT TTCTGTAAAG GATACAGGCA TAGGTATACC TGAAAAGCAC
CACTCCAGAA TTTTTGAAAG ATTTTACAGA GTTGATAAGG GACGCTCAAG AAATATGGGT
GGTACCGGAC TCGGATTGTC TATAGTAAAG CATATAGTTA ATCTTTATAG TGGTGATATT
AATATTATCA GTGAACCAGG AAAGGGAACT GAATTTATTG TTAAGCTGCC ATTATAG
 
Protein sequence
MKKKIIQYSV MLVIIGITIA GIFTSAMARY FYKQEVQNNI ESIAVLLKND VLEEISVKSE 
LDFNKFVKDY ASLLNSRSNK DKNFPLATRI TIIDFEGKVL GDSDSNIDTM ENHLNRKEIH
EAIQGQSGTD QRFSATMQMP YLYVALPINE HRIIVRISVP LYQLNAINKA FIYYTLLGIL
AGLLLTLLIS FKLSSFITKP IQHLINTSKE IAGGNYKKRI DVNYKDEIGQ LAITFNEMAD
KLDETLSGIM DKNVKIDTVI NSMMNGIIAI DNNSKIIIIN TTACDIFGVQ YGPGIIGKNL
IDITRNSKVN SLLRQTIRND ISLIDEIVIF SPSLGIDKIY RIYTNTIKSS DTRQQTAGAV
ITLNDITSVR KLEQIRTEFV SNVTHELKTP LTSIRGFVET LKNGAIEDTT VAVKFLDIID
IEAERLYMLI NDTLQLSEIE AMRKDDSVMK IDLIQIIDEV VLILKSSADK KNIALEVLAN
SAEVPVIANR NRIKQMLINL IDNAIKYNVE NGKIYIKSEK ANGNTVISVK DTGIGIPEKH
HSRIFERFYR VDKGRSRNMG GTGLGLSIVK HIVNLYSGDI NIISEPGKGT EFIVKLPL