Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_4013 |
Symbol | |
ID | 7873659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4410186 |
End bp | 4412099 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700950 |
Product | histidine kinase |
Protein accession | YP_002890973 |
Protein GI | 237654659 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCTGG CGGTCCCCCG CCTGCTCGTG AGGCGCTCGA TCATGCTGCT GGTGTCGGTC GCCCTCGTGG CGATGACGCT GATCGGGCTG GCGGGCATGA GCGCATCCTT CGTCGTGGTG GATCGCAGCC GCGACAGCGT GCAGGCGATC GCCGTGGCCA GCACCCTGCG CGGCCATACC CAGCGGATCG CCAACCTGAT CGCGATCGAT GCGCTCAAGG GCCGCATCGG TGTGTCCGAG CGCACGCGCG CGGCGATGGC CGACATCGAG CGCGAGCTGC AGCAGGCCGC GCTGCGCCGC TTCGTGGACG AGGCCCCGGG AGAGCTCTTC GCGGCGACCT ACCGCGGCGT GCAGGCGGGT TGGGAGCGCT CGGTGCGGCC GCGGATCGAG GCGCTCGCCG GGCCTGTGCC GCCGGATGCG CGCGAGGTCG AGACGCTGCT CGCCGGGGTC GACGACTTCG TCGGTCAGGT CGATAGCCTG GTCGCGGTGC TCGGCCAGGA GAACGAGCGC CGTATCGACG AGCTGCGCCG CATCCTCGCG CTCGCCGCGG CGGTGACGCT GGGGGTGGTG CTGGTGGTGA TCGTCTTGCT GCAGCGGGCG CTGCTGCGCC CGCTCGGTGG CCTGCTCGAC GCCGCACGCC GCATCGCCGG CGGTGATTTC GGCGTGCGCG TGCGCTACAC CGGGGAGGAC GAACTCGGCC AGGTCGGCAG CGCCTTCAAC CTGATGGCCG ACGAACTCGC CCGCCACTAT CACCTGCTCG AGCTGCGCGT CACCGAGAAG ACCGCCGAGC TGCAGCGCAG CAACCGTTCG CTCGAGCTGC TCTACCACGC CATCGCCCGG CTCTACCAGG CGCCGAGCGC GCCCGACGCC TACGAGGCGA CCCTGCGCGA CATCGACCGC GTCGCCGGGC TGGAGGGCTC CTTCGTCTGC ATCGAGCCGC GCCCCGGCGC CCCCGCGGCG GTGATCGCCT CGTCCATGGG GCCCTGTCCG GATCGCGCCG AGCGCGGCGA GGACGCCTGC GCGGCCTGCC GGGCCGGCTC GGAGGGGCAG GCGCTGGCGC TGCTGTCGCC GACCCTGCTG CGCTTTCCGC TGCGCGACCG CGAGCACCAT CACGGCATGC TGCGCCTGTC GCTGTCCGAC GGCGCGCGCC TCGAGGACTG GCAGCGCCAG CTCGTCGAGG CGCTGTCGCG CCACATCGGC ATGGCGCTCG GCGCGGCACG GCGCACCGAG CAGGAGCGCC TGCTCGCACT GCAGGAGGAG CGCTCGGTGA TCGCGCGCGA GCTGCACGAT TCGCTCGCGC AGGCGCTGTC CTACATGAAG ATCCAGGTCA GCCTGTTGCA GCGCGCACTC GCCGATCCGG CGCGCACGGC GGAGGCGGAG CCGATCCTGG CCGACCTGCG CGAGGGCATC AGCGCCGCCT ACCGTCAGCT GCGCGAGCTG CTGGTGTCCT TCCGCCTCGG CCTGTCGGCC GACCTCGCCA CCCTGATGGA AGATGCGGCG CGCGAATACG GCACGCGCGG CGGTCTGGAG GTCGAGCTCG CCGTGGAGCT CGGCGCCTGC CAGCTCAGCC CGAACCAGGA GGTGCATGTG CTGCAGATCG TGCGCGAGGC CTTGTCGAAC ATGGTGCGCC ACGCTTCGGC ACGCCATGCC TGGGTGGCGC TGCGTGGCGG CGCGGATGGC GAGGTGCTGC TGGAGGTGCG CGACGACGGC TGCGGCATCG GCGCGCCGCC GGCGGATGCC CGCAACCACC ACGGGCTGGC GATCATGCGC GAGCGCGCGC GCAGCATGGG CGGGGAAATC GACATCGGGC CGGCGCTGCC GAGCGGTACC CGGGTGTGCG TGCGCTTCCG ATCCGCCAAC GCGAGCATGG CGGCGATGGC GCAGGGTGAT GCGAAGAAGG AAACGGAGAC ATGA
|
Protein sequence | MYLAVPRLLV RRSIMLLVSV ALVAMTLIGL AGMSASFVVV DRSRDSVQAI AVASTLRGHT QRIANLIAID ALKGRIGVSE RTRAAMADIE RELQQAALRR FVDEAPGELF AATYRGVQAG WERSVRPRIE ALAGPVPPDA REVETLLAGV DDFVGQVDSL VAVLGQENER RIDELRRILA LAAAVTLGVV LVVIVLLQRA LLRPLGGLLD AARRIAGGDF GVRVRYTGED ELGQVGSAFN LMADELARHY HLLELRVTEK TAELQRSNRS LELLYHAIAR LYQAPSAPDA YEATLRDIDR VAGLEGSFVC IEPRPGAPAA VIASSMGPCP DRAERGEDAC AACRAGSEGQ ALALLSPTLL RFPLRDREHH HGMLRLSLSD GARLEDWQRQ LVEALSRHIG MALGAARRTE QERLLALQEE RSVIARELHD SLAQALSYMK IQVSLLQRAL ADPARTAEAE PILADLREGI SAAYRQLREL LVSFRLGLSA DLATLMEDAA REYGTRGGLE VELAVELGAC QLSPNQEVHV LQIVREALSN MVRHASARHA WVALRGGADG EVLLEVRDDG CGIGAPPADA RNHHGLAIMR ERARSMGGEI DIGPALPSGT RVCVRFRSAN ASMAAMAQGD AKKETET
|
| |