Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3650 |
Symbol | |
ID | 7873155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4007015 |
End bp | 4008091 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700591 |
Product | signal transduction histidine kinase, nitrogen specific, NtrB |
Protein accession | YP_002890620 |
Protein GI | 237654306 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3852] Signal transduction histidine kinase, nitrogen specific |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.051036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCGC ACCCCACCCA CGCCGCCAGC AGCCCCAGCA GCCCGTTCGC CGGACTCGAC CTGCTGTCTT CGGCGGTCGT GCTCGTCGAC GCCGGGCTGG TGATCCGCTA CCTCAACGCG GGCGCGGAGA ACCTGTTCGC GATCAGCCGG CGCAAGCTGC TCGGCCAACC GCTCGAACGC CTGCTCGGCA GCCCGCCCGG CCTGGCCGCG GCGCTCGACA ACGCGCTGCG CACCAACTGG AGCTACACCG GCCAGGACAT CAGCGTGCAG CGCGGCGACG CCGAGCCGCT GCGGCTCGAC TGCACGGTGA CGCCGGTCGA CACCGCAAGC GTGCGCCTGC TGCTCGAATT CCGCCCGATC GACGCGCAGC TGCGCGTCGC GCGCGAGGAG CAGCTGCTGC ACCAGCAGCA GGCCAACCGC GAGCTCATCC GCAACCTCGC GCACGAGATC AAGAACCCGC TCGGCGGCAT CCGCGGCTCG GCCCAGCTGC TGCAGCACGA GCTCGACGAC CCGCAGCTGC GCGAATTCAC CGACGTCATC ATCGCCGAGG CCGACCGCCT GCAGGACCTG ATGAACCGCC TGCTGAGTTC GCATTGCATG ATGCGCCCGG CCTCGATCAA CCTCCACGAC GTGCTCGAAC GCGTGCGCCG CCTGATCCTC GCCGAGTTTC CTTCGATCGG CATCGTCCCC GACTACGACC TCAGCCTGCC CGAGCTCACC GCCGATCGCG AGCAGCTCAT CCAGGCCGTG CTCAACATCG TGCGCAACGC CGCACAGGCG CTGGGCGGCC ACGGCGAGAT CCTGCTGCGC ACCCGCATCG CGCGCCAGGT CACGCTCGCC AAGCGCCGTC ACAAACTGGC ACTCAAATTG CAAGTAATCG ACGACGGCCC CGGCATCCCC GAGGAGATCC GCGATCGCAT CTTCTATCCG CTGGTTTCGG GGCGGGAGGG CGGCAGTGGT CTGGGCCTGT CGCTCGCACA GAGCTTCATC GAGCAACACC AGGGCATGAT CGAGGTGGAT AGCCGTCCCG GGCGCACCTG CTTCACGATC CTGCTGCCGA TTACCGAGCG TGCCTGA
|
Protein sequence | MPSHPTHAAS SPSSPFAGLD LLSSAVVLVD AGLVIRYLNA GAENLFAISR RKLLGQPLER LLGSPPGLAA ALDNALRTNW SYTGQDISVQ RGDAEPLRLD CTVTPVDTAS VRLLLEFRPI DAQLRVAREE QLLHQQQANR ELIRNLAHEI KNPLGGIRGS AQLLQHELDD PQLREFTDVI IAEADRLQDL MNRLLSSHCM MRPASINLHD VLERVRRLIL AEFPSIGIVP DYDLSLPELT ADREQLIQAV LNIVRNAAQA LGGHGEILLR TRIARQVTLA KRRHKLALKL QVIDDGPGIP EEIRDRIFYP LVSGREGGSG LGLSLAQSFI EQHQGMIEVD SRPGRTCFTI LLPITERA
|
| |