Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3651 |
Symbol | |
ID | 7873156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4008118 |
End bp | 4009542 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700592 |
Product | nitrogen metabolism transcriptional regulator, NtrC, Fis Family |
Protein accession | YP_002890621 |
Protein GI | 237654307 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR01818] nitrogen regulation protein NR(I) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.071917 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCG TCTGGATCAT CGACGACGAC CGCTCGATCC GCTGGGTGCT GGAAAAGGCG CTCGGCCGCG AGGGCATCGA CCACGCCAGC TTCAGCTCCG CCGGCGACGC GCTCGCCGAG CTCGAGCGCG CACCGCAGCC GCCGGCCGCG CTGCTATCCG ACATCCGCAT GCCCGGCGAG TCCGGGCTGG ACCTGCTGCA GAAGGTGAAG GAGCGCCACC CGCAGCTGCC GGTCATCATC ATGACCGCCT ACTCCGACCT CGACAGCGCA GTCGCGGCCT TCCAGGGCGG CGCCTTCGAG TATCTGCCCA AGCCCTTCGA CGTCGACCAG GCGGTCGCGC TCGTCCGTCG TGCGCTGGAG CAGACCGCGC ACCAGAACGG TGCGAGCGAA GAGGCCACCC TGGCCCCCGA GATCCTCGGC CAGGCCCCGG CGATGCAGGA GGTTTTCCGC GCCATCGGGC GGCTCGCCCA CTCGCACGCC ACCGTGCTGA TCAACGGCGA GTCGGGCAGC GGCAAGGAAC TGGTCGCGCG CGCCCTGCAC CGCCACAGCC CGCGCCGCGA CGCGCCCTTC ATCGCCATCA ACACCGCGGC CATCCCGCGC GACCTGCTCG AATCCGAACT CTTCGGCCAC GAGCGCGGCG CCTTCACCGG CGCCGCCACC CAGCGCCGCG GCCGCTTCGA GCAGGCCGAC GGCGGCACGC TGTTCCTCGA CGAGATCGGC GACATGCCGG CCGAGCTGCA GACGCGGCTG CTGCGCGTGC TCTCCGACGG CCACTTCTAC CGCGTCGGCG GCCAGCAACC GATCCGCGCC AACGTGCGCG TGATCGCCGC CACCCACCAG GACCTGGAGG AGCGCGTGCG CCAGGGCCTG TTCCGCGAGG ACCTCTTCCA CCGCCTAAAC GTCATCCGCC TGCGCCTGCC GCCGCTGCGC GAGCGCCATG AGGACATCCC CTTGCTGGTG CGGCACTTCC TGCAGAAGAG CGCGCAGGAG CTCGGCGTCG AGCGCAAGCG CATCTCGGAG GCCACGCTCG AATACCTGCA GGCTCAGCCT TTCCCGGGCA ACGTGCGCCA GCTCGAGAAC CTGTGCCACT GGCTCACCGT GATGGCGCCG GCGCAGGTGG TGGAGGTCGC CGACCTGCCA CCCGAGATGC GCGAGCAACC CGGCCGCGAG TCGCCATCGA ACTGGATGGA GGGCCTGGGC AGCGAGGCCG ACCGCCTGAT CGCCTCGCGC CCCGGAGAGG TGTTCGACCG CCTCACGCGC GATTTCGAGC GCACCCTGAT CCGTCGTGCG CTGGCCGCCA CCGGCGGCCG CCGCATCGAG GCCGCGCAGC TGCTCGGCAT CGGCCGCAAC ACCATCAGCC GCAAGATCCA GGAGCTGGGC ATGGACGAGG AGCGCGCGCC GGAGACGGAG GAAAGCGGGC GCTGA
|
Protein sequence | MNTVWIIDDD RSIRWVLEKA LGREGIDHAS FSSAGDALAE LERAPQPPAA LLSDIRMPGE SGLDLLQKVK ERHPQLPVII MTAYSDLDSA VAAFQGGAFE YLPKPFDVDQ AVALVRRALE QTAHQNGASE EATLAPEILG QAPAMQEVFR AIGRLAHSHA TVLINGESGS GKELVARALH RHSPRRDAPF IAINTAAIPR DLLESELFGH ERGAFTGAAT QRRGRFEQAD GGTLFLDEIG DMPAELQTRL LRVLSDGHFY RVGGQQPIRA NVRVIAATHQ DLEERVRQGL FREDLFHRLN VIRLRLPPLR ERHEDIPLLV RHFLQKSAQE LGVERKRISE ATLEYLQAQP FPGNVRQLEN LCHWLTVMAP AQVVEVADLP PEMREQPGRE SPSNWMEGLG SEADRLIASR PGEVFDRLTR DFERTLIRRA LAATGGRRIE AAQLLGIGRN TISRKIQELG MDEERAPETE ESGR
|
| |