Gene Information |
Locus tag | Tmz1t_2042 |
Symbol | |
ID | 7083802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2306180 |
End bp | 2307823 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699069 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_002355686 |
Protein GI | 217970452 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0504248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGGTCGC GCCGGCGCCC GTCGCCGCAC CCGCTTCGCG CACGGCCCGC CGCGGGCAGC TTCTGTGCTA TACAGGACGA GTCCGACCGC CACGGAAACC ATGCGGCATG CGGGACCGCC GCCACTGACC AGGTGAGCCC CATCGACGCG TCCAGACCGC ACCCACCGCC TCGTGCCGGG CATGTGCTCG TGGTGGCGCT CGCCCTCGGC GCGGGCTCGG CGCAGGCCGC GGGGATCGAT GCCGGCGGGC TGGCCTGGAA TGGCGGTCTG GTTCTGCTCG GCGGCGGGCT GGGTTATCTG CTCGGCCGCC GCAGCCTGCG CCGACGCACC CTCGCCGGTG CGGCAGAGGG CTGCGCTGCG GACGGCCCGC TTCACGACGA CGCCCCGGCG AGCGGCGGCG CGGTTTCGAG GCGGGAGTGG GCAGGCACGC CTGCAGACGC CGACGATTGC GCGCCGGCGA GCGCGCTCCC CCATGCCCTC CTCCCCGCCG CGCACGCCGC TCCGCCCGCG GGCGCAGGCG CGGCGATCGG GGTGATCGCG ACCGAATCCT ATCGCGAGGT GGTCGATTCG CTCAGCGAGG TGCTCTTCCG CACCGACGAG CTTGGCCGCC TGGTTTTTCT CAACGACGCC TGGAAGGATC TCTCCGGTTT CGATCTGGAC GCCACGCGCC AGCATGCGCT GACCGATTTC CTCCATCCAG ACGACCGCTT GCGCGCACGC GACGCAATCC GCGCCCTGCT GTGCGGAGAC GGCCAGGAAT GGACCGACGA GTTGCGGCTG CGCACGCTCT CCGGCGAGAT CCGCTGGGTG GCGATCGACT GCCGTGCCTT GCGCGACGGC AGCGGGCAGG AAAGCGGGAT CGCCGGCACC ATCGACGACA TCTCGGCGCG CAAGATCGCC GAGTTCAGCC TGCGCAACCT GAACCAGGAG CTCGAGTCGC GGGTGCGCTC GCGCACCGCC GAACTCGAGA CCGCGGTGCG CGAGCTCGAG GCCTTCTCGT ATTCCGTCTC GCACGACCTG CGTGCGCCGC TGCGCGCCAT CGACGGCTTC GCGCGCATCC TCGTCGAGGA GGCCGGGCCG CGGCTCGACG AACGCCAACG CGAGCAGCTC GTGCGCATCC GCGCCGGCGC CGAGCGCATG GCGATCCTGA TCGATGCGCT GATCGACCTC GCCAGCGTCT CCCGGCAACC GTTGCGCAGG AGGCCGGTCG ACCTGTCGCG CATCGCCGAT GCGGTGATCC GCGACCTGCA GGCGGAGTCG CCCGGGCGCG TGGTCGCGGT CGAGATCACC AGCGACATGA CGGTGGTCGC CGACCCGGTG CTGATGCACG TGCTGCTCGA CAACCTGCTG CGCAATGCAT GGAAGTTCAC AAGCCAGTGC GAACATCCGC GTATCGTCTT CGGCGCGGAA CGCGACGGCG AGCGCACGGT GTTCCACGTC GAGGACAACG GCGCCGGATT CGAGATGAAC TACGCGGGCA AGCTCTTCCA GCCCTTCCAG CGCCTGCACG CGCAGCACGA GTTTCCCGGC ACCGGGATCG GGCTCGCCAC CGTACAACGC ATCGTCGCCC GCCATGAGGG CCGGGTGTGG GCCAGCGCGG AACCGGGCAA GGGCGCGCGC TTCTGCTTCA TGCTGGGGCA TTGA
Protein sequence | MRSRRRPSPH PLRARPAAGS FCAIQDESDR HGNHAACGTA ATDQVSPIDA SRPHPPPRAG HVLVVALALG AGSAQAAGID AGGLAWNGGL VLLGGGLGYL LGRRSLRRRT LAGAAEGCAA DGPLHDDAPA SGGAVSRREW AGTPADADDC APASALPHAL LPAAHAAPPA GAGAAIGVIA TESYREVVDS LSEVLFRTDE LGRLVFLNDA WKDLSGFDLD ATRQHALTDF LHPDDRLRAR DAIRALLCGD GQEWTDELRL RTLSGEIRWV AIDCRALRDG SGQESGIAGT IDDISARKIA EFSLRNLNQE LESRVRSRTA ELETAVRELE AFSYSVSHDL RAPLRAIDGF ARILVEEAGP RLDERQREQL VRIRAGAERM AILIDALIDL ASVSRQPLRR RPVDLSRIAD AVIRDLQAES PGRVVAVEIT SDMTVVADPV LMHVLLDNLL RNAWKFTSQC EHPRIVFGAE RDGERTVFHV EDNGAGFEMN YAGKLFQPFQ RLHAQHEFPG TGIGLATVQR IVARHEGRVW ASAEPGKGAR FCFMLGH
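The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the database itself. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information table of this record.
START_BP = 2306180
END_BP = 2307823
GENE_LENGTH_BP = 1644
PROTEIN_LENGTH_AA = 547


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the lengths listed in the record.
    print("span length matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codon count matches protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for the values listed here: the coordinate span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.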