Gene Information |
Locus tag | Tmz1t_3726 |
Symbol | |
ID | 7873725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4093642 |
End bp | 4095606 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643700672 |
Product | histidine kinase |
Protein accession | YP_002890696 |
Protein GI | 237654382 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
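The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates (which is what makes 4093642 to 4095606 come out to 1965 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that does not encode a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


length = gene_length(4093642, 4095606)
print(length)                  # 1965
print(protein_length(length))  # 654
```

With the values in this record, 1965 bp is 655 codons, and dropping the terminal stop codon gives the listed 654 aa.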
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGCCG CCAATTTCAG CGCCTTCGAG CTCCAGCCCC TGCCCGACGG CGCGTCGACG ATCCTGCGCA TGCTCGCCGC GCAGACGCCG AACTGGGCGG AGATCGCGCT CGTGGCCGCA CGCGACCCCG CGCTGAGCCT GGCGCTGCTG CTGGCCGATC CGCTGCGCGC GGGCGAACTC CAGGACGGCC TCAATGCCGC CCTGCGGCGC CGCCTCGAGC GCATCGGCAC CGACCTGCTG CGCGCCTGGC TGCTCGGTCT GGGCCACGTC GGCAACCAAC CGGACGAAAC CGCCGACGCC GCACTGCTGC GTGCCGAATG CGCGCTCCAC CTCGCGATCG AGACCGAATA TCCGCGCCCC GACGAGGCCT ACCTCGCCGG CCTGTGGTTC GGCCTGCCGC GCTCGGCGGG GTGGGTTTCC GGGCGCCGGC CGACGCTGCC GGCCCTTGTC GCCGACTGCG GCCTGCCCGC GAGCCTGGCC GACGCGCTCG AGTTCGACGG CATGGATCCG GCCTGCGCGG GCGCGATGCA TCCGCTGCTC GGCCTGCTCG CCGCCGCACG CCAGCTCGCC GGTGGCAACT GGCAGGAGCG CATCACGGAA GTCGCCGAGC TGACCGGACT CGAGAGCGCA AGCGTGGTGG CGCTGCGCAC CGACGTCGGC TACATCCTGT CCGGGCATGC GTCCTACCCC GCCCTGCCCG GCCTGCCCAG CGCGGGCACG GGCGCGGCCT GCCCGACCCT GCGCCTCACG GACGACCCCT ACCGCTACGC CGGCATGCTC GGCCTGCTCA CCGCCGCCTT CGTCGACCTG GACGCGACGA CCATCCGCGA CCGCCTGGCG ATCGCCTGCC CGCTGTTCGG CCTGCACACC CCGCCGGTGC TGCTCGGCAG CGGCGAGGAC GGTCGCCTGC GCCCGCTGCT CGGCGCCGCG GCGGGCAGCC CCGACCTGCT GATCGGCGAG CTGCGCCTGC GCCTGGACGA CGAGACCAGC TGCATCGCCC TCGGTGCGCG CAGCGAGCAA CCCTGCTCGC ATTTCGGTGA CGGCGGCCCG CCCGGGCGCA GCGTCGCCGA CTGGCAGGTC GCACGCTGGC TCGGCCAGCC GGGCTTCCAC GTCATCCCTC TGGGCGGCGC GGGCGACACC AGCGTGGCCC TGGTCGCCAC CGCCTCCGCG CAGGCGCTCG ACAGCGAGCT GCGCTGGCGC TACGCCGCCC TGCTCGGTGC CGCGGCGCGC GCGCTGCGCA GCCACGCCCG CCAGCGCGAT GCGATCGCCG AGCGCGAGGC CGCGCTGCAC CAGCGCTTCC GCGACCATGT GCGCCGCATC GCCCACGAGG CCAGCAACCC GCTCACGGTG CTCAAGACCC GCATCGACAT GCTCGCCCAG GAGCGCTCCG GCGACAGCAC GCTGCAAGAC GAGATGAGCC TGCTCAACGC CGAACTCGAC CGCATCGACA AACTCCTGCG CAGCGCCGCC GAACTGCCTG CCGAGACGGC CGAGATGCGC ACCTGCCGGG TGCCCGAGCT GCTGCTGGAC ATGCGCACGC TGTACGGCGA GCCGCTCTTC GGCAGCCGCC AGATCCAGCT CGAGTTGCGC GCCGCGCGCG ACGTGCCGCC GGCCGCGATC CCCGCATCGG CGCTCAAGCA GGTGCTGCTC AACCTGCTGC GCAACGCCTC CGAAGCGCTC CAGCCCGGCC AGCGCCTGGT CGTCTCGGTG CTGCCGCTGG TCAATGTCGA CGGCCGCAAC TGCCTGGAGA TCCGCTTCGT CGACAACGGC CCCGGCCTGC CGGTGGAACG CGCCCAGGAT 
CCGCTCAGCC CGCGCCCGAG CGCAAAGGGG GCGGAGCACC AGGGCCTCGG CCTCTCGGTG GTGCGCGAGA TCCTCGCACA ATGGGGCGGC ACCCTGCTGT GCCGCACCCA GGCGGGCGCC GGCACGAGCT TCCAGATCTT CGTTCCGCTG GAACAAAGCG CCTGA
|
Protein sequence | MFAANFSAFE LQPLPDGAST ILRMLAAQTP NWAEIALVAA RDPALSLALL LADPLRAGEL QDGLNAALRR RLERIGTDLL RAWLLGLGHV GNQPDETADA ALLRAECALH LAIETEYPRP DEAYLAGLWF GLPRSAGWVS GRRPTLPALV ADCGLPASLA DALEFDGMDP ACAGAMHPLL GLLAAARQLA GGNWQERITE VAELTGLESA SVVALRTDVG YILSGHASYP ALPGLPSAGT GAACPTLRLT DDPYRYAGML GLLTAAFVDL DATTIRDRLA IACPLFGLHT PPVLLGSGED GRLRPLLGAA AGSPDLLIGE LRLRLDDETS CIALGARSEQ PCSHFGDGGP PGRSVADWQV ARWLGQPGFH VIPLGGAGDT SVALVATASA QALDSELRWR YAALLGAAAR ALRSHARQRD AIAEREAALH QRFRDHVRRI AHEASNPLTV LKTRIDMLAQ ERSGDSTLQD EMSLLNAELD RIDKLLRSAA ELPAETAEMR TCRVPELLLD MRTLYGEPLF GSRQIQLELR AARDVPPAAI PASALKQVLL NLLRNASEAL QPGQRLVVSV LPLVNVDGRN CLEIRFVDNG PGLPVERAQD PLSPRPSAKG AEHQGLGLSV VREILAQWGG TLLCRTQAGA GTSFQIFVPL EQSA
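The sequence fields can be cross-checked against the GC content and protein sequence listed in this record. The following is a minimal sketch using only the Python standard library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code during elongation (the two tables differ in their permitted start codons), and that the whitespace separating the 10-character blocks is formatting only. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 matches the standard code for elongation codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
fragment = "ATGTTCGCCG CCAATTTCAG CGCCTTCGAG"
print(translate(fragment))  # MFAANFSAFE
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared with the protein sequence and the 74% GC content reported above.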
|
| |