Gene Information |
Locus tag | Tmz1t_0768 |
Symbol | |
ID | 7084159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 853483 |
End bp | 854454 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697793 |
Product | peptidase U32 |
Protein accession | YP_002354435 |
Protein GI | 217969201 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
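
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive coordinates and that the reported gene length includes the stop codon. Neither is stated in the record, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 853483
end_bp = 854454
gene_length_bp = 972
protein_length_aa = 323

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which does not
# contribute a residue to the protein.
total_codons = span // 3
coding_codons = total_codons - 1

print(span, total_codons, coding_codons)
```

Under these assumptions the span matches the Gene Length field and the number of coding codons matches the Protein Length field.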
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCGCCG AGGCCGGCGC CACCGAGCTG TATTGCGGCC TGGTGCCGCC CGACTGGCGC GGACGCTTCC AGTCGATCAG CGCCAATCGC CGGCCGTCGG GCAATCTTGC GAGCTATGCA GACCTGGCGG AGGCGGTGCG GAGCGCGCAT GCCCACGCTG CCGCGCTGTC GCTGGTGCTC AATGCCCAGC ACTACGCTGC AGCGCAGATC GAGGCCGCGG AAGAGATCGC CGGGCGCTTT GCGGACCTCG GTGGCGATGC CCTGATCGTC AGCGACCTGG GCCTGGTGGA GCGCCTGGCG ACGCGTTTCC CGGCGCTGCG CGTGCACGTC AGTTCGGTGG CGACCTGCCG CAACGCGGCG GCGGCGCGGC TGTGCCAAGC CCTCGGTGCG CGCCGGCTGA TCCTGCCGCG CGACGTGACC CTCGCCGAAG CCACCGCCAT TGCCGAAGAG GTGCCGGACA TCGAGATCGA AGCCTTCGTG CTCAATGATG GCTGCATCTT CGAGGAGGGC GCCTGCCACA CGATCCACCT CCCGGGGCAA CTGGGCGGGC CGATCTGCCT CGACCGCTAC GGTTTCCGCC ATCGTCGGCG CGACGGCGGG GAGCTGTCGG CACGCCTCGC CGCACGCTTG CAGGAGAACG ACGAGGCCTA CGAGCGCTGG CTGTGGTACC GCTTTTCCTG CGGCTTCAGC ACCACGCCCG AAGGCTTGCC CTTCGGACCA TGCGGGCTGT GCGCGCTGCC CGCGCTCAGG CGCGGACGGG TGGCCGCGGT CAAGATCGCC GGGCGCGAGG CGCCGACGGC GCGCAAACTG GCGAGCGTGC GCATGGTGAG GTCCGTGCTC GACCGGGTCT GCGCAGGGGC GGATGCCGCC GCGGTGCGCG CGTTCGCCAC CCGCCTGCGG CCGTCCGAGG CGCATTGCGC CACCGGCCAC ATGTGCTACT ACCCCGAAGT GCTGCGGGCG GCGGAATGCT GA
|
Protein sequence | MLAEAGATEL YCGLVPPDWR GRFQSISANR RPSGNLASYA DLAEAVRSAH AHAAALSLVL NAQHYAAAQI EAAEEIAGRF ADLGGDALIV SDLGLVERLA TRFPALRVHV SSVATCRNAA AARLCQALGA RRLILPRDVT LAEATAIAEE VPDIEIEAFV LNDGCIFEEG ACHTIHLPGQ LGGPICLDRY GFRHRRRDGG ELSARLAARL QENDEAYERW LWYRFSCGFS TTPEGLPFGP CGLCALPALR RGRVAAVKIA GREAPTARKL ASVRMVRSVL DRVCAGADAA AVRAFATRLR PSEAHCATGH MCYYPEVLRA AEC
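
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG is an accepted alternative start codon and an initiating codon is read as methionine. The following is a minimal sketch of that rule, not the pipeline that produced this record: `translate_cds` is a hypothetical helper, and the codon table is built from the standard NCBI base ordering (table 11 shares its amino acid assignments with the standard code and differs in its start codons).

```python
from itertools import product

# NCBI ordering: first base varies slowest, each position cycles T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading a valid first codon as M.

    Hypothetical helper for illustration; spaces (as used in the record's
    10-base blocks) are ignored and translation stops at the first stop codon.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGCTCGCCG AGGCCGGCGC CACCGAGCTG"))
```

Passing the full gene sequence from the record to the same function should, under these assumptions, reproduce the listed protein sequence; the final codon TGA is a stop and contributes no residue.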
|
| |