Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2144 |
Symbol | |
ID | 7085497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2420083 |
End bp | 2421309 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643699164 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002355780 |
Protein GI | 217970546 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGTTCG TGCTCGCGGC GGCCATCCTG GCGCTGCCGG TGCTCGCGCG CCTCGTGCTG CCCTTGCTCG CGCGCGGGCG CAGCGCGATC TGGCGGCTGG CGCACGCGCG CCTGGCGGCG GCGCCGGGGC AGGCGGTGGT GGCGGGGGCA GGGGTGATCG CCAGCGTCGC GCTCGCGGTG TCGATGGCGA TCATGGTGAA TTCCTTCCGC GTCTCGGTGG ATGAGTGGCT CGCCCAGGTC TTGCCGGCAG ACCTGTACGT GCGTGCCTCG GCGGCGAGTG GCAGCGGCTA CCTCGATCCG CCTGCGCTCG TGGCGATCGC GGCCACGCCC GGCGTGGCGC GAGTAGAGAC GACGCGGCTC GTGAACCTGC GCGTTTCGGA CACCGAGCCG CTGTTCAGCG TGCTGGCGCG CGCGACGCAG GGCAACCGGG GGCTGCCGCT GGTGAGCGGA TCGATGGAGC CGGTGGCCGG GGACGCGGAA GGCCTGCCTC CGGCCTGGGT GTCGGAGGCG GCTGCCGACC GCCACGATCT GGCGCGCGGC AGCCGCTTCG AGCTGCCGCT GGGAGGCCGG ATGCAGGCCT TCCGCGTCGC CGGGGTGTGG CGCGACTACG CCCGCCAGCA CGGCGCCGCG CTGATCGAGC GCCGCGACTA CGTCGCCCTC ACCGGCGACG ACCGGGTAAA CGACGCCGCG CTACTGCTCG AGCCGGGTGC CACTCCGTCA GCGGTGGCGC AGGCGCTGCG CGAGCGCTTC GGCGCCGAGC ACATCACCAT CGCGCTGCCG GGCGAGATCC GCGCGCTCAG CCTGCAGATA TTCGATCGTA CCTTCCTGAT CACCTACCTG ATGGAGGCGG TCGCGGTGGT GATCGGCCTG TTCGGCATCG CCACCACCTT CGCCGCGCTC GCCGCCACCC GCCAGGGCGA GTTCGGCATG CTGCGCCATC TCGGCTTCAC GCGCGCCGAG ATCGGTCGCC TGATCGCCAC CGAGGGCGCG CTGACCGCAG GGCTGGGCGT GATCGCCGGA CTCGCCGGCG GCGCGGCGAT CGCCTGGGTG CTGATCGAGA TCATCAACCG GCAGAGCTTC CACTGGAGCA TGGAGCTGGC CGTGCCGTGG AGCGGGCTGG CGATCTTCGC CGTCTGCCTG GTGCTGCTGG CGGCGCTGGT GGCCCGGCTC GCCGGAAGGC ACGCGATGCG GCGGTCGGCG GTGCTGGCGG TGAAGGCGGA TTGGTAG
|
Protein sequence | MVFVLAAAIL ALPVLARLVL PLLARGRSAI WRLAHARLAA APGQAVVAGA GVIASVALAV SMAIMVNSFR VSVDEWLAQV LPADLYVRAS AASGSGYLDP PALVAIAATP GVARVETTRL VNLRVSDTEP LFSVLARATQ GNRGLPLVSG SMEPVAGDAE GLPPAWVSEA AADRHDLARG SRFELPLGGR MQAFRVAGVW RDYARQHGAA LIERRDYVAL TGDDRVNDAA LLLEPGATPS AVAQALRERF GAEHITIALP GEIRALSLQI FDRTFLITYL MEAVAVVIGL FGIATTFAAL AATRQGEFGM LRHLGFTRAE IGRLIATEGA LTAGLGVIAG LAGGAAIAWV LIEIINRQSF HWSMELAVPW SGLAIFAVCL VLLAALVARL AGRHAMRRSA VLAVKADW
|
| |