Gene Information |
Locus tag | Tmz1t_1783 |
Symbol | |
ID | 7085753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2005772 |
End bp | 2007169 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643698805 |
Product | N-acetylmuramoyl-L-alanine amidase |
Protein accession | YP_002355431 |
Protein GI | 217970197 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0860] N-acetylmuramoyl-L-alanine amidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
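The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The variable and function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2005772
end_bp = 2007169
reported_gene_length = 1398    # bp
reported_protein_length = 465  # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


length_bp = gene_length(start_bp, end_bp)

# A complete CDS should contain a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
protein_aa = codons - 1

print(length_bp, codons, remainder, protein_aa)  # 1398 466 0 465
```

Under these assumptions the computed values match the reported Gene Length (1398 bp) and Protein Length (465 aa).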
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.193017 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGATC GGACTGGTGA AAGCTTTGCG GGGCGCGGAC CCGGCGCCCG GGACGACCTG AGTCTCGGCC AGACTCCGGG GCTTGCCGCC GCGGCCGAGG CCTTCGCCGC GGGGCGCATC GATCGCCGCG GCCTGCTCAA GTTCGCCGGC GCCTCGCTCG CCATGCTGGT GAGCCCGGTC GGGCTGGCGA GTTCGGCAAG CCTGCTCGCG GTGCGCGTAT GGCCCTCGGC CGAATACACC CGCATCACCC TCGAGGGCTC CTCGCGCCTG CGCCACAGCC ACATGCTGGT CGAGGACCCG CAGCGCCTGG TGGTCGACCT CGAGGGCGTG CAGCTCGACA GCGTGCTGCA GTCCCTGCCC TCGAAGGTGC TCGACTCCGA CCCCTACATC CGCCTGATCC GTGCCGGGCA GAACCGGCCG GGCGTGGTGC GTGTGGTGAT CGAGCTCAAG GCGGCGATCA ACCCCCAGGT GTTCACCCTC GACCCGGTCG GCAGCTACGG CCACCGCCTC GTGCTCGACC TGCACCCGAC CGAGGCCCAC GATCCGCTGA TGGCGCTGAT CATGAAGGAC TCGCCGATGG ACGCCGCGAT GGGCGACGCC GGCGGCAACA CCGCCGCGGT GGCGCGCGAG GAGCCGCGCG AGCCGGTGCG CCGCGGCAAG CGTAACGAGC CGGCGGTCGA TCGCCTGTAC ACCGTGGTGC TGGACGCCGG CCATGGTGGC GAGGATCCGG GCGCGATCGG TCGTGGCGGC AGCTACGAGA AGGACGTCAC GCTGTCGATC GCCCAACGCC TCAAGCGCAA GATCGACGCC ATGCCCGGCA TGCGCGCGGT GCTCACGCGC GACGGCGACT ACTTCGTGCC GCTGCACCAG CGCGTGGCGC GCGCCCGCCG GGTGCGCGCC GATCTCTTCG TGTCGATCCA CGCCGACGCC TTCGTTCGCC CCGAGGCCAA CGGCAGCTCG GTCTATGTGC TCTCCGAGCG CGGCGCCTCG AGCTCGGCGG CAAGCTGGCT GGCGCAGAAG GAGAACGATG CCGACCTCGT GGGCGGCGTC AACCTCGCCC GCCAGGACGG CCACATCGCC CGCACCCTGC TCGACCTGTC GCAGACCGCC ACGATCAACG ACAGCTTCAA GCTCGGGCGC GCCATGCTCG GCGAGCTCGG CACCATCAAC CGGCTGCACA AGCCCGAGGT GGAACAGGCC GGCTTCGCGG TGCTGCGCGC ACCCGACATC CCCTCGGTGC TGGTCGAAAC CGCCTTCATC AGCAACCCGC AGGAAGAGCG CCGTCTCAAC GACGAGGCCT ACCAGGACAA GATGGCGATG GCGCTGATGC GCGGCGTCAA GCGCTATTTC GAGGAGCACG CCCCGAGCGG GCCGACCCGC GTGGCGCGGC TCGACTGA
|
Protein sequence | MRDRTGESFA GRGPGARDDL SLGQTPGLAA AAEAFAAGRI DRRGLLKFAG ASLAMLVSPV GLASSASLLA VRVWPSAEYT RITLEGSSRL RHSHMLVEDP QRLVVDLEGV QLDSVLQSLP SKVLDSDPYI RLIRAGQNRP GVVRVVIELK AAINPQVFTL DPVGSYGHRL VLDLHPTEAH DPLMALIMKD SPMDAAMGDA GGNTAAVARE EPREPVRRGK RNEPAVDRLY TVVLDAGHGG EDPGAIGRGG SYEKDVTLSI AQRLKRKIDA MPGMRAVLTR DGDYFVPLHQ RVARARRVRA DLFVSIHADA FVRPEANGSS VYVLSERGAS SSAASWLAQK ENDADLVGGV NLARQDGHIA RTLLDLSQTA TINDSFKLGR AMLGELGTIN RLHKPEVEQA GFAVLRAPDI PSVLVETAFI SNPQEERRLN DEAYQDKMAM ALMRGVKRYF EEHAPSGPTR VARLD
|
| |
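The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It uses only the first 30 bases of the gene sequence shown above, so the GC fraction it prints applies to that prefix and not to the whole gene. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon table is the standard amino acid assignment, which is assumed here to apply to translation table 11 as well (table 11 differs from the standard table in its start codons, not in its amino acid assignments).

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map each codon (in TCAG order) to its one-letter amino acid; '*' is stop.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
gene_prefix = "ATGCGTGATC GGACTGGTGA AAGCTTTGCG"

print(translate(gene_prefix))  # MRDRTGESFA
print(round(gc_content(gene_prefix), 3))
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions would allow the reported protein sequence and GC content to be checked in the same way.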