Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1481 |
Symbol | |
ID | 7083564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1652606 |
End bp | 1654750 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698499 |
Product | Oligopeptidase A |
Protein accession | YP_002355136 |
Protein GI | 217969902 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGACA GCACGTCCAT TTCCACGTCC ATTTCCACGT CCATTTCCAC GTCCATTTCC ACGTCCACTC CCAACGCCAA TCCCCTGCTC GACTTCGCCG GCATGCCGCG CTTCGCCGAC ATCCGGCCCG AGCACGTCGA GCCCGCGATC CGCGGCCTCA TCGACGAGAA TCGTGCGCTC ATCGAACGCC TCACCGCCGA CCCCGCCACC CCGAGCTGGG ACGGCTTCGT CGCTCCGATG GAAGAGGCTG GCGAGCGCCT CGGCCGCGCC TGGGGCGTGG TCGGCCACCT GCACAGCGTG TTCGACGTGC CCGAGTGGCG CGAGGCCTAC AACGCGATGC TGCCCGAGAT CTCGCGCTTC TATGCCGAGG TCGGGCAAGA CCTCGCGCTG TTCGAGAAGT ACAAGGCCCT GCACGACAGC CCCGAGTTCG TCACCCTGTC TCCGGTGCGC CAGCGCATCC TCGAGCACGC GCTGCGCGAC TTCCGCCTCT CCGGCGCCGA GCTGCCCGAC GCGCAGAAGC CGCGCTTCCA GGAGATCCAG GAAGAGCAAT CCCAGCTCGC CGCCAAGTTC TCCGAGAACC TGCTCGACGC CACCAACGCC CACGCCGAGT GGATCACCGA TGAATCCGGC CTATCCGGCA TCCCCGACGA CGCACGCCAG GCCGCCCGCG CCGCCGCCGA GGCCGAGGGC AAGGAAGGCT GGAAGTTCAC CCTGCAGATG CCCTCCTACC TGCCGGTGAT GCAGTACGCC GACGATCGCG AGCTGCGCGC GCGCATGTAC CGCGCCTACG CCACCCGCGC CTCCGAGCTC GGCAACCCGG AACTCGACAA CGGCCCGCTG ATCGGCCGCA TCCTCGCGCT GCGCGACGAA GAAGCGCGCA TGCTCGGCTA TCGCAACTTC GCCGAGGTGT CGCTGGTGCC CAAGATGGCC GACACGCCGG AGCAGGTGCT CGGCTTCCTG CGCGACCTCG CCGCCAAGGC CAAGCCCTTC GCCGAGAAGG ACCTGGAAGA ACTCAAGGCC TTCGCCAAGG CCGAGCTGGG CCTCGACACC CTCGAACCCT GGGACGTGGC CTACGCCTCC GAGAAATTGC GCGAGAAGCG CTACGCCTAC TCCGACCAGG AGGTGAAGCA GTACTTCCCC GAGCCCAAGG TGCTCGACGG CCTGTTCGGC GTGATCCGCG CGCTGTACCG CGTCGACATC CTGCCCGACG AGGCGCCGAG CTGGGACCCG GACGTGCGCT TCTTCCGCAT CGAGAAGGAA GGCGCGAACG GTCCGGAACT CGTCGGCCAC TTCTACCTCG ACCTGCATGC GCGCAGCACC AAGCGCGGCG GCGCGTGGAT GGATTCGGCA CGCAGCCGCC ACCGCAACAC CTGGGGCAGC GACACGCCGG TGGCCTACCT GGTGTGCAAC TTCCCCGGCC CGGTGGGTGG CAAACCGGCC ACCTTCACCC ACGACGACGT GCTCACCCTC TTCCACGAGT GTGGCCACGG CCTGCACCAC CTGCTCACCC AGGTGGACGA GCTCGCGGTG TCCGGCATTC ACGGCGTGGA GTGGGATGCG GTCGAGCTGC CCAGCCAGTT CATGGAGAAC TTCTGCTGGG AGTGGGACGT GCTGCAGGGC ATGACCGCGC ACGTCGATAC CGGCGAGCCG CTGCCGCGCG CGCTGTACGA CAAGATGATC GCCGCCAAGA ACTTCCAGAG CGGCATGCAG ACCGTGCGCC AGCTCGAGTT CTCGATGTTC GACCTGCGCC TGCACGGCGA GGTGGAGGCC TCTGCCGGCC CGGTCGCAAT CGAACGCGTG ATGGCCCTGC TCGACGAGGT GCGCCGCGAA GTGGCGGTGA TGATCCCGCC CGCCTGGCAC CGCTTCCCGC ACAGCTTCTC GCACATCTTC GCCGGCGGCT ACGCGGCGGG CTATTACAGC TACAAGTGGG CCGAGGTACT GTCGGCCGAC GCCTTCGCCG CCTTCGAGGA AGCCGGCGCC GGCAAGGGCA GCCTGCTCGA CCCCGAGACC GGCGAGCGCT TCTGGCGCGA GATCCTGGCG GTGGGCGGCA GCCGCCCGGC GCTGGAATCC TTCAAGGCCT TCCGCGGCCG CGAGCCCAGG GTCGACGCGC TGCTGCGCCA CAGCGGCATG GTGGCGCAGG CCTGA
|
Protein sequence | MHDSTSISTS ISTSISTSIS TSTPNANPLL DFAGMPRFAD IRPEHVEPAI RGLIDENRAL IERLTADPAT PSWDGFVAPM EEAGERLGRA WGVVGHLHSV FDVPEWREAY NAMLPEISRF YAEVGQDLAL FEKYKALHDS PEFVTLSPVR QRILEHALRD FRLSGAELPD AQKPRFQEIQ EEQSQLAAKF SENLLDATNA HAEWITDESG LSGIPDDARQ AARAAAEAEG KEGWKFTLQM PSYLPVMQYA DDRELRARMY RAYATRASEL GNPELDNGPL IGRILALRDE EARMLGYRNF AEVSLVPKMA DTPEQVLGFL RDLAAKAKPF AEKDLEELKA FAKAELGLDT LEPWDVAYAS EKLREKRYAY SDQEVKQYFP EPKVLDGLFG VIRALYRVDI LPDEAPSWDP DVRFFRIEKE GANGPELVGH FYLDLHARST KRGGAWMDSA RSRHRNTWGS DTPVAYLVCN FPGPVGGKPA TFTHDDVLTL FHECGHGLHH LLTQVDELAV SGIHGVEWDA VELPSQFMEN FCWEWDVLQG MTAHVDTGEP LPRALYDKMI AAKNFQSGMQ TVRQLEFSMF DLRLHGEVEA SAGPVAIERV MALLDEVRRE VAVMIPPAWH RFPHSFSHIF AGGYAAGYYS YKWAEVLSAD AFAAFEEAGA GKGSLLDPET GERFWREILA VGGSRPALES FKAFRGREPR VDALLRHSGM VAQA
|
| |