Gene Information |
Locus tag | Tmz1t_3880 |
Symbol | |
ID | 7873531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4282405 |
End bp | 4283532 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643700822 |
Product | VanZ family protein |
Protein accession | YP_002890845 |
Protein GI | 237654531 |
COG category | [S] Function unknown |
COG ID | [COG5652] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
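The length fields in the table above follow from the coordinates. The sketch below is a minimal check, assuming that Start bp and End bp are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information table.
START_BP = 4282405
END_BP = 4283532


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1128 375
```

Both results agree with the Gene Length (1128 bp) and Protein Length (375 aa) rows.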
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCGCA TGGCCGCACG GTCGCCCTCC TCCCTCGCTC CCGCGTTCGC GCTCGCCTAT GCGCTGCTGG TCGCCTATGC CTGCCTGCAT CCGCTGACCG GCTGGCGCAA CAGCGGCTTG CCGGCGTTCG ACTGGCTGTG GGCGCCATGG CCGAAGTACT TCATCCTCGA AGATCTCCTC TTCAACATCC TCGGCTACCT GCCGCTCGGG CTGCTGCTGG CGGCGGCGCT GCCGGCGCAG TGGCGCTTCG CGCGCAAGGC GCTCGTCGCC GCGCTGCTCG CCGGGCTGTT CAGCTTCGGC CTGGAGGCGC TGCAGAACTA CCTGCCCAGC CGGGTGGCGA GCAATCTCGA CCTCGGCGGC AACGCGGCCG GCGGCCTGCT CGGCGCACTC GCCGGCGCGC GCTGGGGCAG TGCGCTGCTC GGGCCGCAGG GGAGGCTGCA GCGCTGGCGC GCGCGCAGCG TGATCGGCGG GCGCAGCGGC GAGGCCGGAC TGGTGCTGAT CGGGCTGTGG CTGCTCGGCC AGCTCGGCGC CACCGACCTC GTCTTCGCCA GCGGCGACCT GCGCAGCCTG CTCGGCATCC CTGCGCCGCT GCCCTTCCGC CCCGAGCGCT TCATCGCCTT CGACACCGCG CTCACCGCAA GCGGGCTGCT CGCGATCGGA CTGTTCGTGC GCTGCATGAC GCGCGGGGCC AGCCCGCTGC CGGTGTTGGC GGTGATCGCC CTCGGCATCG GCGCGAAGTC GCTGGCGACG TGGATCTTCT TCGAGCCGGG GGCGCCGCTC GCCTGGCTCA CCCCGGGCGC AGAGCGCGGC CTGGTGATCG GCGGCGCGCT GCTGCTGCCC GCGCTGCTGC TGCCGCGCCT GGCCCAGCAC GCGATCGCCG GCACCAGTCT GCTGCTCGCC ACCGCGCTGG TGAACCTGAT CCCGGAGAAC CCCTACCTGC CCTTCGACCG GCGCCTGGCC GGGTTCAGCA ACGTGTTCAG CTTCCACGGA CTCACCGGGC TGGTCGACAG CCTGTGGCCC TACGCCGCGC TGGCCTACCT CTCGGCGCTC GGGCTCTGGC GGGGCGAGCA CCTCGCGGCC GCCCCTGCGA GCGCGCAGCC GGCTCCGCGC CGCCGAGCCC GCCGCTAG
|
Protein sequence | MRRMAARSPS SLAPAFALAY ALLVAYACLH PLTGWRNSGL PAFDWLWAPW PKYFILEDLL FNILGYLPLG LLLAAALPAQ WRFARKALVA ALLAGLFSFG LEALQNYLPS RVASNLDLGG NAAGGLLGAL AGARWGSALL GPQGRLQRWR ARSVIGGRSG EAGLVLIGLW LLGQLGATDL VFASGDLRSL LGIPAPLPFR PERFIAFDTA LTASGLLAIG LFVRCMTRGA SPLPVLAVIA LGIGAKSLAT WIFFEPGAPL AWLTPGAERG LVIGGALLLP ALLLPRLAQH AIAGTSLLLA TALVNLIPEN PYLPFDRRLA GFSNVFSFHG LTGLVDSLWP YAALAYLSAL GLWRGEHLAA APASAQPAPR RRARR
|
| |
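The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs only in its permitted start codons, which are not modelled here). The helper names `clean`, `gc_fraction`, and `translate` are illustrative and not part of the source record.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: alternative start codons of table 11 are not modelled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "ATGCGGCGCA TGGCCGCACG GTCGCCCTCC"
print(translate(prefix))  # MRRMAARSPS
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row.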