Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3962 |
Symbol | |
ID | 7873608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4360778 |
End bp | 4362196 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700899 |
Product | peptidase U32 |
Protein accession | YP_002890922 |
Protein GI | 237654608 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCCT CCCCCCCGGC CCCGCGCCGC CCCGAACTGC TCGCCCCCGC CGGCACCCTC GACATGATGC GCACCGCCTT CGCCTATGGC GCGGACGCGG TCTACGCCGG CCAGCCGCGC TATTCGCTGC GCGTGCGCAA CAACCACTTC GGCCAGCTCG ACACGCTCGC CGCCGGCATG GCCGAGGCCC GCGCCGCCGG CAAGCTGTTC TATCTCGTCG CCAACATCTA TCCGCACAAC GCCAAGCTGC GCACCTTCGA GGACGACATG GCGCCGGTGA TCGCGCTGCA GCCCGACGCG CTGATCATGG CCGACCCCGG GCTGATCCTG ATGGTGCGCG AGCGCTGGCC CGAGCTGCCG ATCCACCTCT CGGTGCAGGC CAACACTACC AACTACGCCT CGGTGCGCTT CTGGCAGTCG GTGGGGGTCA AGCGCATCAT CCTGTCGCGC GAGCTGTCGC TGGACGAGGT CGCCGAGATC CGCGACGCCT GCCCGGACAT GGAGCTGGAA GTCTTCGTGC ACGGTGCGCT GTGCATCGCC TACTCCGGGC GCTGCCTGCT GTCGGGCTAC TTCAACCACC GCGACCCCAA CCAGGGCAGC TGCACCAACT CCTGCCGCTG GGACTACAAG CTGCACGAGG CCGCCGAGGA CGCCGCCGGC GACGTGCAGG CCTGCGGCGG CGCGCCGATC GGCAACCCCA AGGACGCCGG CGCGGTCGGC ACCGCCACCC GCAGCGCGCT CGACACCGCG CAGGGCCTCG CACTCGGCGG CGGCCCGCGC CACCTCGGCG GCAGCAAGCT GTGGCTGCTG GAAGAAGGCA CCCGCCCGGG CGAGCGGATG CCGATCGAGG AAGACGAGCA CGGCACCTAC ATCCTCAACT CGAAGGATCT GCGCGCGATC GAGCACGTGC AGCGCCTGGT CGAGATCGGC GTCGATTCGC TCAAGATCGA AGGCCGCACC AAGAGCCCCT ACTACGTCGC CCGCGCCGCC CAGGGCTACC GGCGCGCGAT CGACGATGCG GTGGCCGGGC GGCCCTTCGA TGTACGCCTG CTCGGCGAAC TCGAGGGCCT GGCCAGCCGC GGCTACACCG ACGGCTTCTA TCAGCGCCAC TCCACCCCCG AGCAGCAGAA CTACCTGCGC GGCCATTCCG AATCCGGGCG CAGCCTGCTG GTGGGCGAGG TGGTCGGCTG GGATGCCGCG CGCGGCCTGG CCGAGGTCGA GGTCAAGAAC GGCTTCGGCG TCGGCGACCG GCTGGAGTTC GTACAGCCGG GCGGCAACAC CGAGGCGGTG CTGGAGCGGC TGTTCGGCGC CGATGGCGAG GCGATCCAGC GTGTGCCGGG CAGCGGCCGG CGCGTCTGGC TGGCGCTGCC GGCGGATGCG GACCCGGCGC GACCCTGCTT CATCGCGCGC TTTCTGTGA
|
Protein sequence | MPASPPAPRR PELLAPAGTL DMMRTAFAYG ADAVYAGQPR YSLRVRNNHF GQLDTLAAGM AEARAAGKLF YLVANIYPHN AKLRTFEDDM APVIALQPDA LIMADPGLIL MVRERWPELP IHLSVQANTT NYASVRFWQS VGVKRIILSR ELSLDEVAEI RDACPDMELE VFVHGALCIA YSGRCLLSGY FNHRDPNQGS CTNSCRWDYK LHEAAEDAAG DVQACGGAPI GNPKDAGAVG TATRSALDTA QGLALGGGPR HLGGSKLWLL EEGTRPGERM PIEEDEHGTY ILNSKDLRAI EHVQRLVEIG VDSLKIEGRT KSPYYVARAA QGYRRAIDDA VAGRPFDVRL LGELEGLASR GYTDGFYQRH STPEQQNYLR GHSESGRSLL VGEVVGWDAA RGLAEVEVKN GFGVGDRLEF VQPGGNTEAV LERLFGADGE AIQRVPGSGR RVWLALPADA DPARPCFIAR FL
|
| |