Gene Tmz1t_3651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3651 
Symbol 
ID7873156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4008118 
End bp4009542 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content71% 
IMG OID643700592 
Productnitrogen metabolism transcriptional regulator, NtrC, Fis Family 
Protein accessionYP_002890621 
Protein GI237654307 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR01818] nitrogen regulation protein NR(I) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.071917 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCG TCTGGATCAT CGACGACGAC CGCTCGATCC GCTGGGTGCT GGAAAAGGCG 
CTCGGCCGCG AGGGCATCGA CCACGCCAGC TTCAGCTCCG CCGGCGACGC GCTCGCCGAG
CTCGAGCGCG CACCGCAGCC GCCGGCCGCG CTGCTATCCG ACATCCGCAT GCCCGGCGAG
TCCGGGCTGG ACCTGCTGCA GAAGGTGAAG GAGCGCCACC CGCAGCTGCC GGTCATCATC
ATGACCGCCT ACTCCGACCT CGACAGCGCA GTCGCGGCCT TCCAGGGCGG CGCCTTCGAG
TATCTGCCCA AGCCCTTCGA CGTCGACCAG GCGGTCGCGC TCGTCCGTCG TGCGCTGGAG
CAGACCGCGC ACCAGAACGG TGCGAGCGAA GAGGCCACCC TGGCCCCCGA GATCCTCGGC
CAGGCCCCGG CGATGCAGGA GGTTTTCCGC GCCATCGGGC GGCTCGCCCA CTCGCACGCC
ACCGTGCTGA TCAACGGCGA GTCGGGCAGC GGCAAGGAAC TGGTCGCGCG CGCCCTGCAC
CGCCACAGCC CGCGCCGCGA CGCGCCCTTC ATCGCCATCA ACACCGCGGC CATCCCGCGC
GACCTGCTCG AATCCGAACT CTTCGGCCAC GAGCGCGGCG CCTTCACCGG CGCCGCCACC
CAGCGCCGCG GCCGCTTCGA GCAGGCCGAC GGCGGCACGC TGTTCCTCGA CGAGATCGGC
GACATGCCGG CCGAGCTGCA GACGCGGCTG CTGCGCGTGC TCTCCGACGG CCACTTCTAC
CGCGTCGGCG GCCAGCAACC GATCCGCGCC AACGTGCGCG TGATCGCCGC CACCCACCAG
GACCTGGAGG AGCGCGTGCG CCAGGGCCTG TTCCGCGAGG ACCTCTTCCA CCGCCTAAAC
GTCATCCGCC TGCGCCTGCC GCCGCTGCGC GAGCGCCATG AGGACATCCC CTTGCTGGTG
CGGCACTTCC TGCAGAAGAG CGCGCAGGAG CTCGGCGTCG AGCGCAAGCG CATCTCGGAG
GCCACGCTCG AATACCTGCA GGCTCAGCCT TTCCCGGGCA ACGTGCGCCA GCTCGAGAAC
CTGTGCCACT GGCTCACCGT GATGGCGCCG GCGCAGGTGG TGGAGGTCGC CGACCTGCCA
CCCGAGATGC GCGAGCAACC CGGCCGCGAG TCGCCATCGA ACTGGATGGA GGGCCTGGGC
AGCGAGGCCG ACCGCCTGAT CGCCTCGCGC CCCGGAGAGG TGTTCGACCG CCTCACGCGC
GATTTCGAGC GCACCCTGAT CCGTCGTGCG CTGGCCGCCA CCGGCGGCCG CCGCATCGAG
GCCGCGCAGC TGCTCGGCAT CGGCCGCAAC ACCATCAGCC GCAAGATCCA GGAGCTGGGC
ATGGACGAGG AGCGCGCGCC GGAGACGGAG GAAAGCGGGC GCTGA
 
Protein sequence
MNTVWIIDDD RSIRWVLEKA LGREGIDHAS FSSAGDALAE LERAPQPPAA LLSDIRMPGE 
SGLDLLQKVK ERHPQLPVII MTAYSDLDSA VAAFQGGAFE YLPKPFDVDQ AVALVRRALE
QTAHQNGASE EATLAPEILG QAPAMQEVFR AIGRLAHSHA TVLINGESGS GKELVARALH
RHSPRRDAPF IAINTAAIPR DLLESELFGH ERGAFTGAAT QRRGRFEQAD GGTLFLDEIG
DMPAELQTRL LRVLSDGHFY RVGGQQPIRA NVRVIAATHQ DLEERVRQGL FREDLFHRLN
VIRLRLPPLR ERHEDIPLLV RHFLQKSAQE LGVERKRISE ATLEYLQAQP FPGNVRQLEN
LCHWLTVMAP AQVVEVADLP PEMREQPGRE SPSNWMEGLG SEADRLIASR PGEVFDRLTR
DFERTLIRRA LAATGGRRIE AAQLLGIGRN TISRKIQELG MDEERAPETE ESGR