Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0266 |
Symbol | |
ID | 7084388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 300959 |
End bp | 302293 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643697307 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002353955 |
Protein GI | 217968721 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATCG CCTTACTCAT CGCCCTCATC GTGCTCAACG GTGTCTTTGC AATGTCCGAG ATCGCGCTCG TCACCGCGCG CCGCGCGCGC CTCGCACGGC TTGCGGACGA CGGTGACGGC TCGGCCGCAG TCGCCATGAA GCTCGGCGAG GATCCCACGC GCTTCCTGTC CACGATCCAG ATCGGCATCA CCTCGATCGG CATCCTCAAC GGCATCGTCG GCGAGGCGGC CCTCGCGGGC CCGCTGGCGG AGTGGTTGCA GACGCTGGGC ATGGAGCAGC GCACCAGCGA GATCGGATCG ACCGTGCTGG TCGTCGTCGT CATCACTTAC GTCTCGATCG TGGTCGGCGA GCTCGTCCCC AAGCGCATCG GCCAGATCAA TCCGGAGGGC ATCGCCCGCC TCGTAGCCCG GCCCATGAAT GTGCTGGCCA TGGCCTCGCG CCCCTTCGTC TATCTGCTGG CCGGTTCGAC CGCGCTGCTG CTACGTCTGA TGGGACAACG CGAGACGACT GGCCCCAGCG TGACTGAGGA AGAAATCCAC GCGATGCTCA ATGAGGGCTC GGAAGCCGGC GTCATCGAGA AGAGCGAACA TGAGATGGTG CGCAACGTGT TCCGCCTCGA CGATCGTCAG ATCGGCTCGC TGATGGTGCC GCGTGCCGAC ATCGTCACCC TGGACGTGGA TCGCCCCCTC GATGAGAACC TCGCGCTGGT GGCCGAATCC GCGCACTCGA GTTTCCCGGT GTGCCGGGAT GGGCTGGATG AGATCCTCGG CATCGTCAGC GCCAAGCAGA TCTTCTCCCA GATGGTGCGT GGCGAGTCGG TCGACTTCAC ACAAAACCTG CAGGCGCCGG TCTACGTGCC CGAATCGCTC ACCGGCATGG AACTGCTCGA TCAGTTCCGG GCCTCCGGCA CGTACATCGT CTTCGTGATC GACGAGTACG GCGAGGTGCA AGGCATGGTC ACGCTGCACG ACGTCATCGA ATCCGTGACC GGCGAGTTCC TCCCGCACGA CACGAAGGAA TCGTGGGCCG TGCAGCGCGA GGACGGCTCC TGGCTGCTCG ATGGACTCAT CCCGATCGTC GAGTTCAAGG ATCGCCTGGG CATCAAGGCC GTGCCCGAAG AAGAAAAGGG GCGATACCAC ACGCTGTCGG GCATGGTGAT GTGGCTGCTC GGCCGCCTGC CCAACACCGG CGACATCGCC ACCTGGGAGA ACTGGCGTTT CGAGGTCATC GACCTCGACG GCAAGCGCAT CGACAAGGTA CTGGCGATGC AACGGCCGGA ACCGGCCCCT GAGACGATCG TCGAAAGCGA GTCTCAGGCG CCTTCGCAAG CCTGA
|
Protein sequence | MEIALLIALI VLNGVFAMSE IALVTARRAR LARLADDGDG SAAVAMKLGE DPTRFLSTIQ IGITSIGILN GIVGEAALAG PLAEWLQTLG MEQRTSEIGS TVLVVVVITY VSIVVGELVP KRIGQINPEG IARLVARPMN VLAMASRPFV YLLAGSTALL LRLMGQRETT GPSVTEEEIH AMLNEGSEAG VIEKSEHEMV RNVFRLDDRQ IGSLMVPRAD IVTLDVDRPL DENLALVAES AHSSFPVCRD GLDEILGIVS AKQIFSQMVR GESVDFTQNL QAPVYVPESL TGMELLDQFR ASGTYIVFVI DEYGEVQGMV TLHDVIESVT GEFLPHDTKE SWAVQREDGS WLLDGLIPIV EFKDRLGIKA VPEEEKGRYH TLSGMVMWLL GRLPNTGDIA TWENWRFEVI DLDGKRIDKV LAMQRPEPAP ETIVESESQA PSQA
|
| |