Gene Information |
Locus tag | Athe_2149 |
Symbol | |
ID | 7408342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2279681 |
End bp | 2281681 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716514 |
Product | CheA signal transduction histidine kinase |
Protein accession | YP_002573997 |
Protein GI | 222530115 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | |
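The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the source record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 2279681
end_bp = 2281681
gene_length_bp = 2001
protein_length_aa = 666


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Report whether the computed values agree with the listed fields.
print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the listed values: 2281681 - 2279681 + 1 = 2001 bp, and 2001 / 3 - 1 = 666 aa.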
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGATATGT CTCAGTATCT TGGAATGTTT ATAGAAGAGG CAAGAGACCA CATTCAAAGC CTTAACGACA ATATGCTAAA ACTTGAAGAA AACCCGGAGG ATTTGCAAAT TGTAAATGAG ATTTTCAGGT CAGCCCATAC TTTAAAAGGC ATGGCCGGCA CAATGGGATT TGTCAATATG CAAAAGCTTA CACATGCGAT GGAAAATGTT CTTGCTGCTG CACGCGATGG CAAGTTAAAA GTAAATCCTA ATATCATGGA TATTCTTTTC AAGACAGTTG ATGCGCTTGA ATCATACTTA GATGTTATAA TTGCAACAGG TACAGAAGGA CAAGAGACAA ATTTGCATCT TGTCAATGCT TTAAATGCTA TTTTAGGGAA ACCTGCCGAA GATGTGGCGG TGTCTTCAGC AACAAAAGCA GGTAAGAAAT ATGAATATGA TGAGTTTGTT GTAAGAGCAA TAGAACGTGC TTGGGACCAA GGGTTTAATG TTTACAGATT TGATGTTGAG CTTGACCAGA ACTGTCTTTT AAAATCTGCA CGTGCATACC TTGTTTTCAG AGCGGTTGAG GAACTTGGTG AAATTATTCA TTCTAAACCC TCGGTTCAGG ACATTGAGGA TGAAAAGTTT GATTTTGAGT TTTCTATTAC CGTTATAAGC AAGCAGCCGA TTGAAAAAAT AAGAGATAGA ATTCTTTCAA TTTCAGAGAT AAGAGAAGTA AAAGCGCTTG AGATAAAGTC TGGCGAAGTA AGTATGGCAG AAGAAAAAGA GGAGATTGAA GAGGTACAGC AAGAGACACA AGTACAGGAA ACTGTAAAGG TTGTAAGGCA GCAGAAACAA GAATCTTTGC AGAAGACAAG CAAAACAGTT AGAGTTGACA TTGAAAGATT AGATGTTCTT ATGAACTTGG TGAGCGAGCT TATTATAATC AAAAGCCGAA TAGAAGGACT TGCAAAAAAG TATAACGATA GACAATACGA AGAGTCTATT GAGTATTTGG AAAGAATTAC AACAAGTTTA CACGATGCTG TAATGAAGGT ACGAATGGTC CCGGTTGAAA GGGTATTTTC ACGTTTTCCA AGGATGATGA GAGATTTAGC AAGAGAACTT GGAAAGGAAT TTGAGCTTGT AATGTCTGGT GAGGATACTG AGGTTGACAG GACTATTGTG GACGAGCTTG GAGATCCTCT TATTCATCTT CTGAGAAATG CTGCCGACCA TGGAATAGAA GACCCTGATG AGAGGGTCAA AAACGGCAAA CCAAGAAGTG GGCTTATTAA ACTTTCGGCT TATCATGACG GGAACAATGT TGTCATTGAG GTTGAAGATG ATGGCAAAGG AATTGATTTA GAAAAGGTAA AGCAAAAAGC TATAGAAAAG GGGCTTTTGA AAGAGGACCA AATAGAATTG ACAGAGCAGG AAATAATAGA TTTTCTGTTT ATGCCAAGCT TTTCAACAAA AGACAAGGTT ACAAACCTTT CTGGACGTGG TGTTGGACTT GATGTTGTAA AGACCAAGAT TGAACAGCTT GGTGGAATGG TTGAAGTGAA GACACAGAAA GGAAAAGGAA CAAAGTTTGT TATACGACTT CCGTTAACTC TTGCTATCAT TCAGGCATTG CTTGTCACTG TACATGATGA GATATATGCA ATTCCTGTTG CATCAATCAG AGAGATTGTG GATGTTGCAA AAGAGGATAT AAAGGTTGTT CAAAAAGGAA AGGAAATAAT CATGCTGAGA AATCAGGTGA TTCCAATAAA ACATTTACAT TCTATTGTGG GGTTAGAGCC AGTTCTTGAC 
AAAAAGAAAT TTACAGTTGT GATAGTAAGA CGTGGCGAAA AACTGACAGG AATCATTGTT GACAAACTTT TAGGACAACA GGATATTGTT ATAAAATCGC TTGGAAAGTA TTTAGAGGGA GTTAGACTCA TATCAGGTGC AACAATCCTT GGCGATGGGT CTGTTGCAAT GATACTTGAT CCTAACATGC TTACAGTGTA A
Protein sequence | MDMSQYLGMF IEEARDHIQS LNDNMLKLEE NPEDLQIVNE IFRSAHTLKG MAGTMGFVNM QKLTHAMENV LAAARDGKLK VNPNIMDILF KTVDALESYL DVIIATGTEG QETNLHLVNA LNAILGKPAE DVAVSSATKA GKKYEYDEFV VRAIERAWDQ GFNVYRFDVE LDQNCLLKSA RAYLVFRAVE ELGEIIHSKP SVQDIEDEKF DFEFSITVIS KQPIEKIRDR ILSISEIREV KALEIKSGEV SMAEEKEEIE EVQQETQVQE TVKVVRQQKQ ESLQKTSKTV RVDIERLDVL MNLVSELIII KSRIEGLAKK YNDRQYEESI EYLERITTSL HDAVMKVRMV PVERVFSRFP RMMRDLAREL GKEFELVMSG EDTEVDRTIV DELGDPLIHL LRNAADHGIE DPDERVKNGK PRSGLIKLSA YHDGNNVVIE VEDDGKGIDL EKVKQKAIEK GLLKEDQIEL TEQEIIDFLF MPSFSTKDKV TNLSGRGVGL DVVKTKIEQL GGMVEVKTQK GKGTKFVIRL PLTLAIIQAL LVTVHDEIYA IPVASIREIV DVAKEDIKVV QKGKEIIMLR NQVIPIKHLH SIVGLEPVLD KKKFTVVIVR RGEKLTGIIV DKLLGQQDIV IKSLGKYLEG VRLISGATIL GDGSVAMILD PNMLTV