Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1793 |
Symbol | |
ID | 7085763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2013382 |
End bp | 2015349 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698815 |
Product | peptidase U32 |
Protein accession | YP_002355441 |
Protein GI | 217970207 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCG ACCGCCACAC CCTCGAACTC CTCGCCCCCG CCAAGACCGC CGACTTCGGC ATCGAGGCCA TCGACCACGG CGCGGACGCG GTCTACATCG GCGGCCCGGC CTTCGGCGCG CGCTCCTCCG CGGACAACTC GGTGGAGGAC ATCGCGCGCC TCGTGCAGCA CGCCCACCGC TATCACGCCG AGGTCTTCGT CGCCACCAAC ACCATCCTCT TCGACCACGA GATCGAGCCG GCGCGCAAGC TGATCTGGCA ACTGTACGAT GCCGGCGTCG ACGCGCTCAT CGTGCAGGAC ATGGGCCTGC TCGAGCTCGA CCTGCCGCCG ATCCAGCTCC ACGCCAGCAC CCAGACCGAC ATCCGCGACG CGAGCAAGGC GCGCTTCCTG CAGGACGTGG GCTTCTCGCA GATCGTGCTG GCGCGCGAGC TGTCGTTGAA CGAGGTGAAG AAGATCGCCG CGGCCACCAC CTGCCAGCTC GAATACTTCG TGCACGGCGC ACTGTGCGTG GCCTTCTCCG GCCAGTGCTA CATCAGCCAC GCGCACACCG GGCGCAGCGC CAACCGCGGC GAGTGCTCGC AGGCCTGCCG GCTGCCCTAC GACCTCAAGG ACAAGGACGG CCACACCGTC GCCAGCAACC AGCACATGCT GTCGATGAAG GACAACAACC AGAGCGCCAA CTTGCGCGCG CTCGCCGCCG CGGGGGTAAG CTCGTTCAAG ATCGAGGGCC GCTACAAGGA TCTCTCCTAC GTCAAGAACA TCACCGCGCA CTACCGCACG CTGATCGACG AGATCCTCGA GCACCCGGAC ACCGACGGCG CCCGATATCG CCGCGCCTCC AGCGGACGCA CCACCTTCTT CTTCACCCCG CAGGCCGACA AGACCTTCAA CCGCGGCTAC ACGGACTACT TCACCAACGA TCGCCGCCAC GGCATCGAGG CGTTCGAATC GCCCAAGTTC GTCGGCGAGC CCATCGGCCG CGTGAAGAAG ATCGACACCA AGGGGCGCAC GTTCTTCGAC GTCGAGCGCG CGGCGCCGAT CCACAACGCC GACGGCCTGA CCTGGTACGA CCCCAAGGGC GAGCTCACCG GCCTGCGGGT GAACCGCGCC GAGGCGGACG GCGGCGGCGA AGGCATCGAC CGCATCTTCC CTGCCGACCC CCTGCCGACC GACCTGGTCC CCGGCACCTC GCTGTTCCGC AACCATGACC ACGAATTCGA GCGCGCGCTG GGGAAGAAGT CGGCCGAGCG CCGCATCCGT GTCGATGCAC GTTTCGCCGC CACCCACGAC GGCTTCGCGC TGACCCTGAC CGACGAGGAC GGCGTCGCCG TCACCGCGAC GCTTGCCGCC GCCTTCGAGC CGGCACAGAA CGCCGAGCGT GCACTCGCCA CCCTGCGCGA GCACCTCGGC AAGCTGGGCA ACACGATCTT CAGCGCAGGC GAGCTCGTGC TCGACCTGCC CGCCGCGCCC TTCCTGCCCG CCGGACAGCT CAATGCGCTG CGCCGCGATG CCGTGGAGCG GCTCGAGGCC GGCCGCCTCG CGGCCCACGC CCGCCCGCTG CGGGCCGCGC CGGTCGAGCC GCCGGTGCCC TACCCGCAGG ACGCGCTGAG CTACCTCGCC AACGTGTCGA ACGACAAGGC GCGCGCCTTT TACGCCCGCC ACGGCGTCAA GCTGATCGAC GCCGCATACG AGGCCAACGA GGAGCGCGAC GACGTCTCGC TGATGATCAC CAAGCACTGC CTGCGCTACA GCTTCAATCT GTGTCCGAAG GAGGTCAAGG GCATCCGCCC CGACCCGATG CAGTTGGTCA ATGGCGACGA GACGCTGACG CTGAAGTTCG ACTGCAAGCG CTGCGAGATG CACGTCATCG GCGCGCTGCG TCCCCACGTG GCGAAGATGC GCGACACCGT GGTGGCGCAC AAGGTGAGCT TCGTACCCCA GCGCAAGACC AACGCCTCCG TGCGCTGA
|
Protein sequence | MSIDRHTLEL LAPAKTADFG IEAIDHGADA VYIGGPAFGA RSSADNSVED IARLVQHAHR YHAEVFVATN TILFDHEIEP ARKLIWQLYD AGVDALIVQD MGLLELDLPP IQLHASTQTD IRDASKARFL QDVGFSQIVL ARELSLNEVK KIAAATTCQL EYFVHGALCV AFSGQCYISH AHTGRSANRG ECSQACRLPY DLKDKDGHTV ASNQHMLSMK DNNQSANLRA LAAAGVSSFK IEGRYKDLSY VKNITAHYRT LIDEILEHPD TDGARYRRAS SGRTTFFFTP QADKTFNRGY TDYFTNDRRH GIEAFESPKF VGEPIGRVKK IDTKGRTFFD VERAAPIHNA DGLTWYDPKG ELTGLRVNRA EADGGGEGID RIFPADPLPT DLVPGTSLFR NHDHEFERAL GKKSAERRIR VDARFAATHD GFALTLTDED GVAVTATLAA AFEPAQNAER ALATLREHLG KLGNTIFSAG ELVLDLPAAP FLPAGQLNAL RRDAVERLEA GRLAAHARPL RAAPVEPPVP YPQDALSYLA NVSNDKARAF YARHGVKLID AAYEANEERD DVSLMITKHC LRYSFNLCPK EVKGIRPDPM QLVNGDETLT LKFDCKRCEM HVIGALRPHV AKMRDTVVAH KVSFVPQRKT NASVR
|
| |