Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0582 |
Symbol | |
ID | 4808257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 712662 |
End bp | 714425 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105996 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_001037011 |
Protein GI | 125973101 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAA ATCTGCACAT TAATGAGAAT CTTCCGGAAG ACATATTGGA AAGCATCGTT ATTAGCTCAA CGGAATATGC CATTATCGCT ACTGACATCC ACAACAGGAT AATTATCTGG AACAAGGGTG CTGAACTCAT TTACGGCTAC TCAAAGGAAG AAATGCTGGG AAAGCAGTTT CCTCCCGACC TTCACAGAAA AGGTTTGCCA AACAATGAAT TTCTTTTCGT ACCCAACAAT GAAACAAAAT CGCGCCTGAT AGACCAAACA ATGTATGCCA AAAGAAAAGA CGGAACTTTT ATCCCCATTT CCATTACCTC AACCCCCAGA GTCAACAAAA ACAGCAAAAC ACTGGGCTTA TTGATTCTTA CCAGAGATAT TACAAGATAC AAGCTTTTGG ACCAATTCAA CAGTGTACTG ATTGAAATCA CACATCTTAT AAATTCATCA TCCTCCATAG ACCAGATGTG TTCCGAGGTA TGCAATGCCA TAAGCAGCTT CCTCGGCTTG CCGGCGGTAT TTATTTGTCT TTTTGACCGT CCTGGCAACT TTTTTTACAT AAATGCAATG ACTGGTTTGT GCAAAAACTG CACCGGACAC ACCTGCGGCT ACCATACCTG CGCCAAAGAT GTGGCGGAAA ACATAAAAGG CTGCTTCAAA ACTTACACAC AGCTTACAAT AAACCACTCT CCCCTTTCGG AACATGCCAT CTATGAATAC ATGGAAGCAC ATTTACCCGG CAGAGGTGAA AACTTCATAA TTCACATTCC TCTGATATCG GACGTTTCAA TCATGGGAAT ACTTCACATT GTTGTTTCTG AACCTCAGAA AACCTTTCTT CTCAAAGAAA GCCAAATCTT AAGCCTTGTT GCCAATGAAA TAACAACAGG GATTCAGCGA AAACGCTTGA TAGAAGAAAT AAAGGAATAT GCCGACAATC TTGAAAAAAT GGTAAAGGAG AGGACCCGTG AGCTGCGCGA AAAAGACGCA CAGCTGGTAC AGTCGGAAAA ACTTGCCACT CTGGGGGAAA TGGCCACCGG TATAGCTCAT GAAATAAATC AGCCTTTGGG AGGTATAAGT CTGATTACCC AGGGGCTTAT TCTGGCAAAA AAGCGGAATA AACTGACCGA TGAAATGTTG CTCGAAAAGC TCAACGCCAT TATAGAGCAG GTGGATCGTA TTGATAAAAT AATCGGACAC CTTCGCACCT TTGCAAGGCA ATCCGACCAA ACTAAAACAC CGGTAAACAT TAAAATGCCT CTCACGGACG TGTTCAAACT TATTGGCGAA CAGCTGAAAA GACAACAAAT CAGTGTTGAT ATGGACATCG AAGAAAACCT TCCATATGTT CTTGCCGATC ATAACAGACT GGAACAGGTT TTTTTAAATC TTATTATAAA TGCAAAAGAT GCCTTGAATG AACGTGAAAA AGTCCAACAA CGCCTGGTAT CCGAAAATGA CGCTGACGGA CAAATTGTCC CCCCGGATAA AAAAATAACT ATAAAAGCCT TTTCGGATAA TAAACATGTC ATTATTGAAA TTACTGACAA CGGAATTGGT ATTTCCAAAT CCATTATCAA TAAAATCTTT GAGCCTTTCT TTACTACAAA GGAAGTTGGA AAAGGTACCG GCATCGGGTT GTCAATAAGT TACGGAATTG TCAAAGAATT TAACGGTACA ATAGAAGTAG AATCCGAAGA AATGAAAGGC AGCAAATTTA TAATAAAATT CCCCATATAT AAAGAAAACA GCATGCAAAA ATAA
|
Protein sequence | MRKNLHINEN LPEDILESIV ISSTEYAIIA TDIHNRIIIW NKGAELIYGY SKEEMLGKQF PPDLHRKGLP NNEFLFVPNN ETKSRLIDQT MYAKRKDGTF IPISITSTPR VNKNSKTLGL LILTRDITRY KLLDQFNSVL IEITHLINSS SSIDQMCSEV CNAISSFLGL PAVFICLFDR PGNFFYINAM TGLCKNCTGH TCGYHTCAKD VAENIKGCFK TYTQLTINHS PLSEHAIYEY MEAHLPGRGE NFIIHIPLIS DVSIMGILHI VVSEPQKTFL LKESQILSLV ANEITTGIQR KRLIEEIKEY ADNLEKMVKE RTRELREKDA QLVQSEKLAT LGEMATGIAH EINQPLGGIS LITQGLILAK KRNKLTDEML LEKLNAIIEQ VDRIDKIIGH LRTFARQSDQ TKTPVNIKMP LTDVFKLIGE QLKRQQISVD MDIEENLPYV LADHNRLEQV FLNLIINAKD ALNEREKVQQ RLVSENDADG QIVPPDKKIT IKAFSDNKHV IIEITDNGIG ISKSIINKIF EPFFTTKEVG KGTGIGLSIS YGIVKEFNGT IEVESEEMKG SKFIIKFPIY KENSMQK
|
| |