Gene Cthe_0582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0582 
Symbol 
ID4808257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp712662 
End bp714425 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content40% 
IMG OID640105996 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_001037011 
Protein GI125973101 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAA ATCTGCACAT TAATGAGAAT CTTCCGGAAG ACATATTGGA AAGCATCGTT 
ATTAGCTCAA CGGAATATGC CATTATCGCT ACTGACATCC ACAACAGGAT AATTATCTGG
AACAAGGGTG CTGAACTCAT TTACGGCTAC TCAAAGGAAG AAATGCTGGG AAAGCAGTTT
CCTCCCGACC TTCACAGAAA AGGTTTGCCA AACAATGAAT TTCTTTTCGT ACCCAACAAT
GAAACAAAAT CGCGCCTGAT AGACCAAACA ATGTATGCCA AAAGAAAAGA CGGAACTTTT
ATCCCCATTT CCATTACCTC AACCCCCAGA GTCAACAAAA ACAGCAAAAC ACTGGGCTTA
TTGATTCTTA CCAGAGATAT TACAAGATAC AAGCTTTTGG ACCAATTCAA CAGTGTACTG
ATTGAAATCA CACATCTTAT AAATTCATCA TCCTCCATAG ACCAGATGTG TTCCGAGGTA
TGCAATGCCA TAAGCAGCTT CCTCGGCTTG CCGGCGGTAT TTATTTGTCT TTTTGACCGT
CCTGGCAACT TTTTTTACAT AAATGCAATG ACTGGTTTGT GCAAAAACTG CACCGGACAC
ACCTGCGGCT ACCATACCTG CGCCAAAGAT GTGGCGGAAA ACATAAAAGG CTGCTTCAAA
ACTTACACAC AGCTTACAAT AAACCACTCT CCCCTTTCGG AACATGCCAT CTATGAATAC
ATGGAAGCAC ATTTACCCGG CAGAGGTGAA AACTTCATAA TTCACATTCC TCTGATATCG
GACGTTTCAA TCATGGGAAT ACTTCACATT GTTGTTTCTG AACCTCAGAA AACCTTTCTT
CTCAAAGAAA GCCAAATCTT AAGCCTTGTT GCCAATGAAA TAACAACAGG GATTCAGCGA
AAACGCTTGA TAGAAGAAAT AAAGGAATAT GCCGACAATC TTGAAAAAAT GGTAAAGGAG
AGGACCCGTG AGCTGCGCGA AAAAGACGCA CAGCTGGTAC AGTCGGAAAA ACTTGCCACT
CTGGGGGAAA TGGCCACCGG TATAGCTCAT GAAATAAATC AGCCTTTGGG AGGTATAAGT
CTGATTACCC AGGGGCTTAT TCTGGCAAAA AAGCGGAATA AACTGACCGA TGAAATGTTG
CTCGAAAAGC TCAACGCCAT TATAGAGCAG GTGGATCGTA TTGATAAAAT AATCGGACAC
CTTCGCACCT TTGCAAGGCA ATCCGACCAA ACTAAAACAC CGGTAAACAT TAAAATGCCT
CTCACGGACG TGTTCAAACT TATTGGCGAA CAGCTGAAAA GACAACAAAT CAGTGTTGAT
ATGGACATCG AAGAAAACCT TCCATATGTT CTTGCCGATC ATAACAGACT GGAACAGGTT
TTTTTAAATC TTATTATAAA TGCAAAAGAT GCCTTGAATG AACGTGAAAA AGTCCAACAA
CGCCTGGTAT CCGAAAATGA CGCTGACGGA CAAATTGTCC CCCCGGATAA AAAAATAACT
ATAAAAGCCT TTTCGGATAA TAAACATGTC ATTATTGAAA TTACTGACAA CGGAATTGGT
ATTTCCAAAT CCATTATCAA TAAAATCTTT GAGCCTTTCT TTACTACAAA GGAAGTTGGA
AAAGGTACCG GCATCGGGTT GTCAATAAGT TACGGAATTG TCAAAGAATT TAACGGTACA
ATAGAAGTAG AATCCGAAGA AATGAAAGGC AGCAAATTTA TAATAAAATT CCCCATATAT
AAAGAAAACA GCATGCAAAA ATAA
 
Protein sequence
MRKNLHINEN LPEDILESIV ISSTEYAIIA TDIHNRIIIW NKGAELIYGY SKEEMLGKQF 
PPDLHRKGLP NNEFLFVPNN ETKSRLIDQT MYAKRKDGTF IPISITSTPR VNKNSKTLGL
LILTRDITRY KLLDQFNSVL IEITHLINSS SSIDQMCSEV CNAISSFLGL PAVFICLFDR
PGNFFYINAM TGLCKNCTGH TCGYHTCAKD VAENIKGCFK TYTQLTINHS PLSEHAIYEY
MEAHLPGRGE NFIIHIPLIS DVSIMGILHI VVSEPQKTFL LKESQILSLV ANEITTGIQR
KRLIEEIKEY ADNLEKMVKE RTRELREKDA QLVQSEKLAT LGEMATGIAH EINQPLGGIS
LITQGLILAK KRNKLTDEML LEKLNAIIEQ VDRIDKIIGH LRTFARQSDQ TKTPVNIKMP
LTDVFKLIGE QLKRQQISVD MDIEENLPYV LADHNRLEQV FLNLIINAKD ALNEREKVQQ
RLVSENDADG QIVPPDKKIT IKAFSDNKHV IIEITDNGIG ISKSIINKIF EPFFTTKEVG
KGTGIGLSIS YGIVKEFNGT IEVESEEMKG SKFIIKFPIY KENSMQK