Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1993 |
Symbol | |
ID | 7083748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2255495 |
End bp | 2256781 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699018 |
Product | Ste24 endopeptidase |
Protein accession | YP_002355640 |
Protein GI | 217970406 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0368513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATC CCGCCCCATT CATGCTGTCG CCCTTCGGCC TGCCGCCTCT TTCCGCCCTG TTCCTGGCCT TCCTGGTCGC CGGTACCGTG CTCGGCCTCG GTCTGCTGCA TCGCCACGCC CACCACGTGC GCCGCCATCG CGACGCGGTG CCGCAGCCCT TCGCCGGCTC GATTCCCCTG CATTCGCACC AGCGTGCCGC CGACTACACC GTCGCGCGCG CGCGCCTGTC CGCCTTCCAC GCCGCGGCCA ACGCCGGCTT CGTGCTCGCG CTCACGCTCG GCGGCGGGCT GCAGGCGATG CACGACGCCT GGGCGGACGT GCTGCCCGCC GGCGGGCTCG CCCACGGCGT CGCGCTGCTC GCCAGCCTGG GCGTGCTCGG CTGGCTGTTC GAACTACCCT TCGCGCTGCT GCGCACCTTC GGCATCGAGA GGACCTTCGG CTTCAACCGC ATGACGCCGC GCCTCTACCT CGCCGACACC GTGCGCGAGG CCGCGCTCGC CGCGCTGATC GGGCTGCCGC TGCTCGCCGC GGTGCTGTGG CTGACGCTGG CGACGGGCGC GCTGTGGTGG GCCTGGGTGT GGGCGTTCTG GCTCGGCTTC AACCTGCTCG CGATGGTGAT CTGGCCGACC TTCATCGCGC CGCTGTTCAA CAAGTTCACC CCGCTCGCCG ACGCCACGCT GAAGGCGCGC GTCGAGGCCC TGCTCGCGCG CTGCGGCTTT CGCGCCAAGG GCCTGTTCGT GATGGACGGC TCGCGCCGCT CGGCACACGG CAACGCCTAC TTCACCGGGC TGGGCGCGGC CAAGCGCATC GTGTTCTTCG ACACCCTGCT CGACAAGCTC GATGCCGACG AGGTCGAGGC GGTGCTCGCG CACGAGCTCG GCCACTTCCA CCACCGCCAC CTGCTGCGCC GGCTGGCGGT GCTCGCCCCG GCCAGCCTGG GCGTGCTCGC CTTGCTCGGC TGGCTCGCCC AGCAGCCCTG GTTCTTCTCC GGGCTGGGCA TGCAAAGCGC CGACCTGGCG AGCGCGCTCG CCCTGTTCAC GCTGGTACTG CCGGTGTTCA GCTTCCCGCT CGCGCCGCTC GCGAGCCACT GGTCGCGCAA GCACGAGTTC GAGGCCGACG CCTACGCCGC CCGCCAGGCC GACGCCGGCA AGCTGGTGAG CGCGCTGGTC AAGCTCTACC GCGACAACGC CTCCACGCTG ACGCCCGACC CGCTGTACTC GCGCTTCCAC GACTCGCATC CGCCCGCCGC GCTGCGCATC GCGCGCCTGC AGGCGCTGCA ACGGTGA
|
Protein sequence | MTDPAPFMLS PFGLPPLSAL FLAFLVAGTV LGLGLLHRHA HHVRRHRDAV PQPFAGSIPL HSHQRAADYT VARARLSAFH AAANAGFVLA LTLGGGLQAM HDAWADVLPA GGLAHGVALL ASLGVLGWLF ELPFALLRTF GIERTFGFNR MTPRLYLADT VREAALAALI GLPLLAAVLW LTLATGALWW AWVWAFWLGF NLLAMVIWPT FIAPLFNKFT PLADATLKAR VEALLARCGF RAKGLFVMDG SRRSAHGNAY FTGLGAAKRI VFFDTLLDKL DADEVEAVLA HELGHFHHRH LLRRLAVLAP ASLGVLALLG WLAQQPWFFS GLGMQSADLA SALALFTLVL PVFSFPLAPL ASHWSRKHEF EADAYAARQA DAGKLVSALV KLYRDNASTL TPDPLYSRFH DSHPPAALRI ARLQALQR
|
| |