Gene Information |
Locus tag | Tmz1t_2792 |
Symbol | |
ID | 7873201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3022802 |
End bp | 3024241 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699714 |
Product | peptidase M48 Ste24p |
Protein accession | YP_002889769 |
Protein GI | 237653455 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
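
The coordinate and length fields above agree with each other if the start and end positions are read as 1-based and inclusive, and if the protein length excludes one terminal stop codon. Both conventions are assumptions here, not something the record states. A minimal sketch of that arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3022802, 3024241)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the record's values this gives the listed gene length of 1440 bp and protein length of 479 aa.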
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGCC GACCCCTCGC CCTGCTGCTC AGCCTCGCGC TGGCCCTGCC TGCGCCGGCC CCCGTGCTCG CCGCCGATCT GCCCGATCTC GGCGACGTGG CCGCGTCCGA ACTCTCTCCC GCGGCCGAAC GCAAGATCGG CGAACAGATC ATCCGCGAGA TCCGCTGGCG TGACGCGGCC TATCTCGACG ATGCGGAGGT CGAGGAGTAC GTGAACCGCC TCGGCCAGCG CCTCGCCGCG GTGAGCAACA ACCCGGGACT CGACTTCGAC TTCTTCGTCG TGCGCGACGC CACGCTCAAC GCCTTCGCGC TGCCGGGCGG CTTCATCGGC GTGCACACCG GGCTGATCCT GGCGGCGGAG GGCGAGTCCG AGTTCGCCTC GGTGCTCGGC CACGAGATCG CCCACGTCAC CCAGCGCCAC ATCGCGCAGA TCGTCGGCAA GCAGAGCCAG TCGGCGATGC TGATGATCGC CTCGATGCTG GTCGCGGTGC TGGCCGCGCG CAGCAACTCG GATGTCAGCA CCGCGGCGAT CGCTGCCGGC CAGGCCGGGG CGATCCAGTC CCAACTCGGC TACACCCGCG CCTTCGAGCG CGAGGCCGAC CGCGCCGGCC TCGAGACCCT GGACAAGGCC GGGCTCGACG TGCGCGGCAT GCCCGGCTTC TTCGAGCGCC TGCAACGCAA CACGCGTGTG TACGAGAACA ACGCGCCGGC TTACCTGCGC ACCCACCCGC TGACCACCGA GCGCATCGCC GACATGGAGA ACCGCGTCGC GGCGATGCGC TATCGCCAGG TGCCGGACTC GGCGGATTTC CGCTTCGCGC GTGCCAAGCT GCGCGCCAAC GCCGGCCAGC CCGCCGAGGC GGTGCAGGAG CTGGAGGACC GCCTCGCGCG CGAGCCCAAG GACGAGGCGC TCGCCTACGG TCTCGCCCGT GCGCTGATGC GCGCCGGTCG CCTCGACGAG GCCGAGGCCC GCCTGCAGCC GCTGCGCGCG AAGGCGGCGG GCCTGGCCTG GGTCGAGGGG CTGGCGGCGG AGATCCGTCT CGCGCGCAAC GACGCCGGGG GCGCGATCCG CATCCTCGAG GCCGCGCAGA AGCAGTTTCC GTCCAGCCGC AGCGTGAGCT ACGCGCTCGC CGATGCGCGC ATCCTCGGCG GACGGGCGGA CGTCGCCGCG ACCGAACTGC GCCGCCGCAT CGACAACCGC AGCGGTGATC CGCGCCTGTG GCAGCTGCTG TCGCGCGCCT ACGCGGCCCT CGGACAGCGC ACCGAGCAGC ACCGCACCCA GGCCGAGGTG TACTACCTGC GCGGCAGCCT GCCGGCCGCG ATCGAGCAGC TTGAGATCGC GCGCAGGGCC AGCGATGGCG ACTTCTATAC CCTGTCGGCG GTGGATGCGC GCCTGCGCGA GCTCAAGCAG CGCCTGCTCG AGGAAAAGCG CGAGCGCTGA
Protein sequence | MMRRPLALLL SLALALPAPA PVLAADLPDL GDVAASELSP AAERKIGEQI IREIRWRDAA YLDDAEVEEY VNRLGQRLAA VSNNPGLDFD FFVVRDATLN AFALPGGFIG VHTGLILAAE GESEFASVLG HEIAHVTQRH IAQIVGKQSQ SAMLMIASML VAVLAARSNS DVSTAAIAAG QAGAIQSQLG YTRAFEREAD RAGLETLDKA GLDVRGMPGF FERLQRNTRV YENNAPAYLR THPLTTERIA DMENRVAAMR YRQVPDSADF RFARAKLRAN AGQPAEAVQE LEDRLAREPK DEALAYGLAR ALMRAGRLDE AEARLQPLRA KAAGLAWVEG LAAEIRLARN DAGGAIRILE AAQKQFPSSR SVSYALADAR ILGGRADVAA TELRRRIDNR SGDPRLWQLL SRAYAALGQR TEQHRTQAEV YYLRGSLPAA IEQLEIARRA SDGDFYTLSA VDARLRELKQ RLLEEKRER
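
The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. The sketch below shows one way to do that, to compute a GC fraction comparable to the GC content field, and to reverse-complement the gene sequence (useful for locating a minus-strand gene on the replicon's forward strand). All function names are hypothetical, and only the first two blocks of the gene sequence are used as sample input:

```python
def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G, T."""
    complement = str.maketrans("ACGT", "TGCA")
    return clean(seq).translate(complement)[::-1]


# First two 10-base blocks of the gene sequence listed above.
first_blocks = "ATGATGCGCC GACCCCTCGC"
print(len(clean(first_blocks)), gc_fraction(first_blocks))
print(reverse_complement(first_blocks))
```

Passing the full gene sequence to `gc_fraction` should give a value that can be compared with the GC content field, allowing for rounding in the record.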