Gene Information |
Locus tag | Tmz1t_3221 |
Symbol | |
ID | 7874442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3523214 |
End bp | 3525121 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700155 |
Product | CheA signal transduction histidine kinase |
Protein accession | YP_002890193 |
Protein GI | 237653879 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | |
|
|
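The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tmz1t_3221.
length = gene_length(3523214, 3525121)
print(length)                  # 1908
print(protein_length(length))  # 635
```

Both results agree with the Gene Length (1908 bp) and Protein Length (635 aa) fields in the record.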
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTGATT TTGCCGGGAT GGAGGATCTG CTGCAGGACT TCCTCGTCGA AGCCGGCGAC CTGCTGTCCG ACGTCGACAA CAAGCTCGTC GAACTCGAGC ACAGCCCGGA TGACCCCGGC CTGCTCAACG ACATCTTCCG TGGCTTCCAC ACCATCAAGG GCGGCGCGGG CTTCCTCGGC GCGGCCGAGC TGGTGACCCT GTGCCACCTC ACCGAGAGCC TATTCGATCG CCTGCGCAAG CACGAGCTCG AGGTCACCCC CGAGCGCATG GACGTGATCC TGTCGGCGAC CGGCGCGGTG CGCGACATGT TTGTCGATCT GGGCCGCGGC AACCGTCCGG GGCCGGCTGC GCCCGACCTG CTCGAGGCGC TGCGCGCCGC GTGCGCCGGG CGACCGGCGG CAGCCGCCGC GGGGGCACCC GTCGCGCCCG CCGCCGAGGT GCGGCCCGCC GGGGTGGTCC TGCCGCGGCC GGGCGGGCCC GACTGGGATG CCTTGTATGC TGCGGTCACC GGCGCGACCG CGCCGGCCGG AGCGCCCGTC GCGGCGCCCG CGATCCCGCG GCCCGCGCCG CTTCCCGCGC TCGTGCCCGC CGCCGCCGCG GCGGCTGCCG ACCCGGTGGT CGCCGCCGCC TTCGGCCGCC GCGCCAGCGA CGTGCCCGGT GCCGGCGCGC CGGTCGGTCG GCGCGAGGGC GAGCGTCAGC GCGACAACTC GATCCGGGTG GACACCGCCC GCCTCGATCA GGTGCTCAAC CTCTCGGGCG AGATCGGCCT CACCAAGAAC CGGCTCAACG CGCTGCGCAG CGACATCCTT GCCGGCCGCA CCGACACCGA GACCCTGCAG GCGCTCGACC TGGCGGTGAG CCAGCTCGAC CTGCTCGTCT CCGATCTGCA GAACGCGGTG ATGAAGACGC GCATGCAGCC GATCGGCCGG CTCTTCCACA AGTACCCGCG CATCGCCCGC GATCTCGCGC GCAACCTCGG CAAGGAGGTC GAGCTGGCGT TGGTCGGTGA GGACACCGAG ATCGACAAGA CCATGGTCGA GGATCTCTCC GACCCGATCA TCCACCTCAT CCGCAACGCG GTGGATCACG GCATCGAGGA TCCCGCCCGG CGCGCCGCCG CCGGCAAGCC GGCCAAGTCG GTGCTGCGCC TGGAGGCGCG CCAGGAGGGC GACCACATCG TGATCACCGT GGCCGACGAC GGCCGCGGCA TGGATCCCGA GAGGCTGCGC GCGAAGGCGC TCGCGCAAGG CGTGATCACC GCCGAGGAGG CGAGCACGCT CGATGAGCGC CAGAGCTACG ACCTCATCTT CCTGCCCGGG TTTTCCACCG CCGAGAAGGT GTCGGACGTG TCCGGCCGCG GCGTCGGCAT GGACGTGGTG CGCACCAACA TCCAGAAACT CAACGGCAGC ATCGAGATCC AGTCCCAGCT CGGCAAGGGC ACCACGCTGC TGATCCACCT GCCGCTGACC CTGGCGATCC TGCCGGTGCT GCTGGTGCGC CTCGGCGATC AGCCCTTCGC CGTGCCGCTG TCGATGGTGC GCGAGATCCT GCCGATCGAG CCGGGCGGCA TCCAGGACGT CGGTGGCAAG GCCACCATGG TGGTGCGCGG CGAGGTGCTG CCGATCGTGC CGCTGTCCGC GCTGCTCGGC TGGCCGCGCG AGGCGGTGCC GCAGTATGGC GTGGTGATGC AGTCGGGCGC CGCGGTGTTC ATTCTCGCGA TCGACAGCTT CGCCGGGCGC GAGGACGCGG TGATCAAGTC GCTCGAGGTC TTCCGACCGA AGGGCGTGGC CGGGGTGACC 
ACGCTCGCCA ACGGCCAGAT CGTGCTGATC CTCGACATGA AGGAGCTGCT CGAGTCGGCC GGCGATCGGC GCAGCGTGTC GCGCTCGACG CTGTTCGAGG CGGCGTAG
|
Protein sequence | MSDFAGMEDL LQDFLVEAGD LLSDVDNKLV ELEHSPDDPG LLNDIFRGFH TIKGGAGFLG AAELVTLCHL TESLFDRLRK HELEVTPERM DVILSATGAV RDMFVDLGRG NRPGPAAPDL LEALRAACAG RPAAAAAGAP VAPAAEVRPA GVVLPRPGGP DWDALYAAVT GATAPAGAPV AAPAIPRPAP LPALVPAAAA AAADPVVAAA FGRRASDVPG AGAPVGRREG ERQRDNSIRV DTARLDQVLN LSGEIGLTKN RLNALRSDIL AGRTDTETLQ ALDLAVSQLD LLVSDLQNAV MKTRMQPIGR LFHKYPRIAR DLARNLGKEV ELALVGEDTE IDKTMVEDLS DPIIHLIRNA VDHGIEDPAR RAAAGKPAKS VLRLEARQEG DHIVITVADD GRGMDPERLR AKALAQGVIT AEEASTLDER QSYDLIFLPG FSTAEKVSDV SGRGVGMDVV RTNIQKLNGS IEIQSQLGKG TTLLIHLPLT LAILPVLLVR LGDQPFAVPL SMVREILPIE PGGIQDVGGK ATMVVRGEVL PIVPLSALLG WPREAVPQYG VVMQSGAAVF ILAIDSFAGR EDAVIKSLEV FRPKGVAGVT TLANGQIVLI LDMKELLESA GDRRSVSRST LFEAA
|
| |
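The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (its first codons translate to the start of the listed protein, even though the gene lies on the minus strand), and it relies on the fact that translation table 11 uses the standard codon assignments while allowing alternative start codons such as GTG, which are read as methionine at the first position. The names `CODON_TABLE` and `translate_cds` are hypothetical, and the function does not validate that the first codon is a permitted start.

```python
from itertools import product

# Standard codon assignments in NCBI base order (T, C, A, G).
# Translation table 11 shares these; it differs in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a CDS given in coding orientation, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    The first codon is always emitted as M, as for a bacterial start codon.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("GTGAGTGATT TTGCCGGGAT GGAGGATCTG"))  # MSDFAGMEDL
```

The output matches the first ten residues of the listed protein sequence, with the GTG start codon read as M.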