Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1577 |
Symbol | |
ID | 7084781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1753884 |
End bp | 1755227 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698594 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002355231 |
Protein GI | 217969997 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.569313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATCC TCTTTCTGAT CGGCCTCATC GTCCTCAACG GCGTCTTCGC CATGTCCGAG ATCGCGCTCG TCACCGCGCG TCGGGCACGG CTCGCGCGGC TCGCCGAGGA GGGCGACGGC GCGGCCGAGG TGGCGATCAA GCTCGGCGAG GACCCGACCC GCTTCCTGTC GACGATCCAG ATCGGCATCA CCTCGATCGG CGTGCTCAAC GGCATCGTCG GCGAGGCCGC GCTCGCCGGC CCGCTCTCCG ACTGGTTGCA GGTGCTCGGC ATGGAGCAGC GGGTGAGCGA GATCGGCTCC ACCGTGCTGG TCGTGGTGGT GATCACCTAT GTGTCGATCG TGGTCGGCGA GCTGGTGCCC AAGCGCATCG GGCAGATCGA CCCCGAGGGG ATCGCCCGCC TGGTGGCGCG GCCGATGAAT GTGCTGTCGA TCGCCTCGCG GCCCTTCGTC TACCTGCTCG CCGGCTCCAC CGCGCTGCTG CTGCGCATCA TGGGCCAGCG CGAGAGCACC GGGCCGAGCG TGACCGAGGA GGAGATCCAC GCCATGCTGG TCGAGGGCTC CGAGGCCGGC GTGATCGAGA AGAGCGAGCA CGACATGGTG CGCAACGTGT TCCGCCTCGA CGACCGCCAG ATCGGCTCGC TGATGGTGCC GCGCGCGGAC ATCGTGTCGC TCGACCTCGA GCGGCCGCTG GAGGAGAACC TCGAGCTGGT GGCGTCCTCG TCCTACTCCA GCTTCCCGGT GTGCCGCGGC GGGCTCGACG ACATCCTCGG CATTGCCAGC GCCAAGAAGC TCTTCAACCA GTCCCTGCGC GGCGAGCCGA TCGACCTCGC CAGGGAGCTG CAGCCTGCGG TGTACGTGCC CGAGTCGCTG ACCGGCATGG AGCTGCTCGA TCAGTTCCGC TCTTCCGGCA CCTACACCGT GTTCGTGATC GACGAGTACG GCGAGATCCA GGGCATGGTC ACGCTGCACG ACGTGATCGA GTCGGTGACC GGCGAGTTCC TGCCGCACGA CACCGAGGAG GCCTGGGCGG TGCAGCGCGA GGACGGCTCC TGGCTGCTCG ACGGCCTGAT CCCGATCGTC GAGCTCAAGG ACCGCCTGGG CATCAAGACC GTGCCGGAAG AGGAGAAGGG GCGCTACCAC ACGCTGTCGG GCATGGTGAT GTGGCTGCTC GGCCGCCTGC CGGGCACCGG CGACATCGCC ACCTGGGAGA ACTGGCGCTT CGAGGTGATC GACCTCGACG GCAAGCGTAT CGACAAGGTG CTGGCGAGCC GCCTGCCCGA GCCGGCCGAC GAGATCGGCG CCGAGCAGGA GGCCAGCCCG GAAGATCGGG GCGAGACCGC ATGA
|
Protein sequence | MEILFLIGLI VLNGVFAMSE IALVTARRAR LARLAEEGDG AAEVAIKLGE DPTRFLSTIQ IGITSIGVLN GIVGEAALAG PLSDWLQVLG MEQRVSEIGS TVLVVVVITY VSIVVGELVP KRIGQIDPEG IARLVARPMN VLSIASRPFV YLLAGSTALL LRIMGQREST GPSVTEEEIH AMLVEGSEAG VIEKSEHDMV RNVFRLDDRQ IGSLMVPRAD IVSLDLERPL EENLELVASS SYSSFPVCRG GLDDILGIAS AKKLFNQSLR GEPIDLAREL QPAVYVPESL TGMELLDQFR SSGTYTVFVI DEYGEIQGMV TLHDVIESVT GEFLPHDTEE AWAVQREDGS WLLDGLIPIV ELKDRLGIKT VPEEEKGRYH TLSGMVMWLL GRLPGTGDIA TWENWRFEVI DLDGKRIDKV LASRLPEPAD EIGAEQEASP EDRGETA
|
| |