Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0702 |
Symbol | |
ID | 7083931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 785275 |
End bp | 787218 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643697728 |
Product | Lytic transglycosylase catalytic |
Protein accession | YP_002354370 |
Protein GI | 217969136 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGG CACTGGTTGC GGCGCTCACG GCCGCGACGT TCCTCGCCCC CGCCGGCGCG TTCGCACAGA TCGGCGGCGA GTCCGGGGAC GGGCGTATCG TCGCCGCTCG CGAGGCCCTG CGCAAGGGCG ACCGCAGCAC GCTCGAACGT CTCGCAGCCG TGCGCGAGGC CCACCCCCTC GACGCCTATC CGCGCTACTG GCTGCTGATC AACCGCCTCG CGCGTGCCGA AGAACCCGTG CCCGTCGCGG CCCTGCAAGC CTTCCTCGCC GAAGAAGCCG GCTCCAACCT CGCCGAACGC CTGCGCGCCG ACTGGCTGCG CCGCCTCGCC AGGGACGGCG ACTGGAACGG CTTCCTCGCG GTGTACGCGG ACCTGCGCAG CCCCGACGCC GAATTGCGCT GCAACGCCTG GAGCGCGCGC GTGCTGACCG GCGAGCGCAG CGTCTTCGCC GAGCTCGCGC GCGAATGGGA CACCCTCGCC GACGCCGATC CGACCTGCGA CACGCCGCTG CGCGCGGCGG TCGACTCCGG CGCGGTGGAC GAAGAACGCG TGTGGTGGCG CATCCGCCGA CAGATCGACA CCCGCAAGCC CGAGGCTGCG CTCGCCAGCC TGTCGCTGCT GCCCGCCGGC AACGCCCCGG CCCACGCCGA CCTGGCGCAG GCGATCCGCT CGCCCGCACC CTGGCTGGAT CGGCTGCAGC CCAACTTCGC CGTGAGCCGC GCCGGCCGCG AGCTCGCGCT CGCCGCACTG GTGCGCCTGG CGCGCGAGGA CGTCTCCGCC GCGCGCCTGC GCCTGCTGCG CATCCAGGAC CGCCTCGGCG CGGCCGAGCG CAACTACGCC CATCTGGCGC TCGGCATGCA CGCCGCGCTC GACCGCCTGC CCGAGGCGAG CGCGCTCTAC GCCGCCGCCG GCGACATCGA GACCACCGCC CAGCAACGTG CGTGGCGGGT GCGTGCGGCG CTGCGCATCG GCGACTGGCA CGCGGTGCGC GAGGCGATCG AAGCCATGCC CGCCGACGAA CGCGAAGCTG CCGACTGGAT CTACTGGCTG GGCCGCGCAC ACGCCGCGGC CGGGCGCCGC GACGCCGCCG AGTCGCTCTA CGTGCGTATC GCCGGCGAGC CGCACTTCTA CGGCATGCTC GCGCGCGAGG AGCTCGGCGA GGCCTTCCCG CCGCCCCCCG CCACCGCGCC GCTACCGCGC GCCAAGCTCG AGGAGGCCGA GCGCGACCCC GGCCTGCAGC GCGCACTCGC GCTCTACCGC CTGGAGCTGC GCACCGAGGC GCTGCGCGAA TGGGTCTGGG GCGTGCGCGA ACGCGACGAG CAGTTCCGCC TCGCCGCCGC CCACCTTGCG CTGCGCAACG AGCTCTACGA CCGCGCGATC AATACCGCCG AGCTGGCCAA CCCGCGCAGC AACTTCGAGC TGCGCTTCCT GACCCCCTAT CGCGACCTCA TCGAGCCGCA GGTGCGCGCG CAGGGGCTCG ACCTCGGCTG GGTGTACGGC CTCATGCGCC AGGAGAGCCG CTTCGTGGTT CCGGCACGCT CCAGCGTCGG CGCCCAGGGC CTCATGCAGG TGATGCCCGC CACCGGCAAG TGGGTGGCGG ATCGGATCGG CCTCGCCGGC TACAACCAGC GCCTGCTCAC CGACCCCGAG ACCAACGTGC TGCTCGGCAC CAGCTACATG CGCCTGATCA TGGAGGGGCT CGACGCCCAC CCCGTGCTCG CCAGCGCCGG CTACAACGCC GGCCCCGGGC GGGCGCGGCG CTGGCGCGAC GCGGCGCCGC TCGAGGGCGC GATCTATACC GAGACGATCC CGTTCGACGA GACCCGCGAC TACGTCAAGA AGGTGCTCGC CAACGCGGTG ATCTACGCGG CGATGCTCGA AAAACGGCCG CAATCGCTCA AGGCACGACT CGGCACGATC GCACCCGGGG CTGCCAGCGA GTAA
|
Protein sequence | MKKALVAALT AATFLAPAGA FAQIGGESGD GRIVAAREAL RKGDRSTLER LAAVREAHPL DAYPRYWLLI NRLARAEEPV PVAALQAFLA EEAGSNLAER LRADWLRRLA RDGDWNGFLA VYADLRSPDA ELRCNAWSAR VLTGERSVFA ELAREWDTLA DADPTCDTPL RAAVDSGAVD EERVWWRIRR QIDTRKPEAA LASLSLLPAG NAPAHADLAQ AIRSPAPWLD RLQPNFAVSR AGRELALAAL VRLAREDVSA ARLRLLRIQD RLGAAERNYA HLALGMHAAL DRLPEASALY AAAGDIETTA QQRAWRVRAA LRIGDWHAVR EAIEAMPADE REAADWIYWL GRAHAAAGRR DAAESLYVRI AGEPHFYGML AREELGEAFP PPPATAPLPR AKLEEAERDP GLQRALALYR LELRTEALRE WVWGVRERDE QFRLAAAHLA LRNELYDRAI NTAELANPRS NFELRFLTPY RDLIEPQVRA QGLDLGWVYG LMRQESRFVV PARSSVGAQG LMQVMPATGK WVADRIGLAG YNQRLLTDPE TNVLLGTSYM RLIMEGLDAH PVLASAGYNA GPGRARRWRD AAPLEGAIYT ETIPFDETRD YVKKVLANAV IYAAMLEKRP QSLKARLGTI APGAASE
|
| |