Gene Information |
Locus tag | Cthe_2945 |
Symbol | |
ID | 4810228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3461018 |
End bp | 3462868 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108368 |
Product | histidine kinase |
Protein accession | YP_001039336 |
Protein GI | 125975426 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
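The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (consistent with the stated 1851 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information table above.
START_BP = 3461018
END_BP = 3462868
PROTEIN_LENGTH_AA = 616


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                           # 1851
print(expected_protein_length(length))  # 616
```

Both results agree with the Gene Length and Protein Length fields of the record.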
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.546794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAG AGATGAAAGC GCCGGAAAAA ATAAAAAGTG TTTTTAAAAG ATTTAATATA AAAACAATAA ACAAGCAAAT AAGGCTCAAT ATCGTTTTTG GTTTGATTGT ATTAATTTCC ACCATGTTGT TGGGATATCT GTCCTTTGAC ATGATATCGG ATGCCATGAT AAGGCATGCC GGTGCGGACA ACCTGGAGCT TGTAAAACAA ATAACCAAAA ATATTGAGAC AGTAATGACC GGATTTGACG ATATTTCAAA CGAAATTCTT ACCAATGAAA ACTTTGACAG GCTTGTAAAA ATGCATGTTA CATTGGATGA TGAACATAAA AAGGCAGCAA ACAGAAGAAG CATAGAGAGT ATACTAAACG GTTATACCAA CACGAGAACG GACATAGCGG ACATAGCAGT GGTTACAAAT ACCGGAGAAT ACATTACCTC GGGAGAAACA AGACCTTTGG TTACGGACAA TGCGCTTTCA TACTATGTAG TAAAAAGATT CAAGCAAAGC GGAAGAGACT CATTGTGGCT TGACACGTAT CAGACTGAGG TTGCATCCAC GGGAACACAT ACAGGGAACC AGCTGGTCAT ATCCAACATA AAAAGCATTA AAGGAGAAAA CAATGAAGAA ATTGGCATGC TTATCCTTAA TGTAAAAGAA TCCTACATAT ACAGTCTTAT ATCGGAAATA AAGCTTCCCG ATGAAGGGCA GCTGTATATT GTCGGAAAAG ACGGCAATTA TGTAATGAAT CCATTTAACA GGCTTCAAAA TGGGAAAGTG GATTATGTAA AATATGAGTT GTATATTGAA GAAATATTAA AGAAAAAAAA CGGAACATTT ATAAAAAAAA TAGATGGAAG GGATTACTTG CTGGCCTTCC AGACGATTGA CAGCATAAAC GGTATTGAAC TGGGATGGAC GGTATTCGGG ATGACACCGG TTGATATCAT AACGTCGGGT ATTGAAAGTA CCCAAGATAT TTTGTATGAG ATTGGGTTGA TATGCGTTAT CGCAGGATTT GTGATTTCTC TGCTGATTAC AAGGCTTTAC AATGCTCATC TGGAAAAGAG ATATGAAAGA AAGCACTCCA TTATTATGGA AAGGGAGAGG CTTGCATCTT TGGGACAGCT TATGGGCGGA ATAGCACAGA GTTTTAAAGC TCCAATTATG TCAATATCGG ATGGACTTGA TGAATTAAAC AGTCTTGTGG ATGAATATGA AAAATCCATA GAAAACGAAA ATGTGTCGGA TGAGACAAGG CATGAGATTG CTTCCAGGAT GAGAGAGTGC CTGGACAAGA TAAAACCGCA TTGTTCGTAT ATTTCCGATG AAATATCTGC CGTAAAGGGG CAGGCTGTCA ATTTCAACGA TTCGACAGAC GGAATTTTTA CTGTTGATGA ATTGATTAAA AATGTAAAAC TGCTCATGAG CCATGAGATT AAATTCTGGA ATTGTGAAAT GAACGTGGAA CTTAAAGTAA GCGGAGACAC CTCGATAAGA GGAGAAATAA ACAATATGAC TCAGGTAATG AATAATATAA TTACCAATGC CATTGAAGCC TATAACGGCA AAGGAGGAAA AATTGATTTA ATATTCAGCA AAAAAGGACA TAATTTGGAG ATAACCGTAA GAGATTATGG ATGCGGAATC CCCGAAAGCG TAAAAAGCAA ACTGTTTAAA GAAATGGTGA CGACCAAGGG TTCAAAAGGT ACGGGTATAG GCGTGTATAT GGCCTATTCC ACCATAAAAG GAAAATTCGG AGGAACCATG ACCATTGACA GCAAGGAAGG GAAGGGAACC 
TCCGTAAATA TCACCATACC CTTAAAGGAT AAAGATTTTA CTCCACCGTA A
|
Protein sequence | MSKEMKAPEK IKSVFKRFNI KTINKQIRLN IVFGLIVLIS TMLLGYLSFD MISDAMIRHA GADNLELVKQ ITKNIETVMT GFDDISNEIL TNENFDRLVK MHVTLDDEHK KAANRRSIES ILNGYTNTRT DIADIAVVTN TGEYITSGET RPLVTDNALS YYVVKRFKQS GRDSLWLDTY QTEVASTGTH TGNQLVISNI KSIKGENNEE IGMLILNVKE SYIYSLISEI KLPDEGQLYI VGKDGNYVMN PFNRLQNGKV DYVKYELYIE EILKKKNGTF IKKIDGRDYL LAFQTIDSIN GIELGWTVFG MTPVDIITSG IESTQDILYE IGLICVIAGF VISLLITRLY NAHLEKRYER KHSIIMERER LASLGQLMGG IAQSFKAPIM SISDGLDELN SLVDEYEKSI ENENVSDETR HEIASRMREC LDKIKPHCSY ISDEISAVKG QAVNFNDSTD GIFTVDELIK NVKLLMSHEI KFWNCEMNVE LKVSGDTSIR GEINNMTQVM NNIITNAIEA YNGKGGKIDL IFSKKGHNLE ITVRDYGCGI PESVKSKLFK EMVTTKGSKG TGIGVYMAYS TIKGKFGGTM TIDSKEGKGT SVNITIPLKD KDFTPP
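The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11 (which match the standard code; the table differs mainly in which codons may act as start codons) and does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The `translate` helper is hypothetical and written for illustration only; the two fragments are the first 30 and last 21 bases of the gene sequence above.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30_bases = "ATGTCAAAAG AGATGAAAGC GCCGGAAAAA"
last_21_bases = "AAAGATTTTA CTCCACCGTA A"

print(translate(first_30_bases))  # MSKEMKAPEK
print(translate(last_21_bases))   # KDFTPP
```

The two outputs match the first ten and last six residues of the protein sequence, and the final codon TAA is the stop codon.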