Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0759 |
Symbol | |
ID | 7084150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 842012 |
End bp | 843355 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643697784 |
Product | hypothetical protein |
Protein accession | YP_002354426 |
Protein GI | 217969192 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCTG ACGGCATCGT CATCCTCGGC GCCGGCCCCG CGGGCGCGGC GGTGGCGATC GGACTGGCGC GGATGGGCGA GCCGGTGGTG CTGGTGGGCG AGCCGCGCCG TTTCGCCGCG GTCGAGGGCG TCTCGGCGCG CGTCATCGAC GCGCTGCGCG GCCTCGGCCT GCAGCGGGCG CTGACGGCGT TCGCGCCGCC TTCGCCGCGG CGGGCGGTGT GGAATGGCAC CTACACCGAG GCGAACGTGG AGAGCCTGGT CGACCGCCAG CGCTTCGACC AGGGGCTGCT CGAGGACTTG GAGCGCCTCG GCGTGCCCGT GGTGCGCGGC CGGGTGACGG CGCTGCACCC CGCGCCAGGG GGCCATGAGC TGGAGATCGA CACCGATGCC GGGCCGCGCC GCATGGCGGC GGGCTTCCTC GTCGAGGCGC GCGGCCGCGC CGCGCCCGGC GGGGGGGCGC CGCGGGTGCG CGGAGTGGAG ACGGTGTCGC TGCTGCAGTA CTGGCAGGGA CCGGCGGGCG AGGCGGGCTC GACCGTTCTC AGCGTCGAGG ATGGCTGGAT GTGGATGGCC GCGTTGGCCG ATGGACGGCG CTATCTGCAG CTGACGGTGG ACGTTGCCAG CGCCGACCTG CCGCCGAAAA GGGCCCTGGG CGACTACTGC AGCGCACGGT TTCACGCGGT CGAGGCGGCG GCGCCCTTCG TCCGCGATGC GCAGCCGGTC GGCGAACCGC ACGCGCGCAC CAGCACGGCG GTGCTCAACG AATCGGTGGC CGGCGACGAC TGGATCCGCG TCGGCGATGC GGCGATGGCG GTCGATCCGC TCTCGGGCAA CGGCATCTTC CAGGCGCTGT CGTCCGCGCT GCAGGCACCG GCGGTGGTGG CTACCCTCCG TCACGATCGG GGCCGCACCG CGCTCGCGCA GCACTTCCAT ACGCGGCGTA TCGAGCACCT GTTCCACCGC TTCGCCCGAA TCGGGCGCGA TTTCTACGCT CAGGAAGCGC GCTGGCCGCA GGCGCCGTTC TGGGCCGCGC GCCGCGGCTG GCCGGATGCC CTGCCGCTGC ACGCCAAGGT GAGGCCGGAG ACCGTGCGTG TTGCGCGCGG GCCGGTGGTG TGCGCGGGGC GGATCGTCGA GCAAGACGTC GTGGTGACGC CCGACCAGCC GCTCGGCGTT TGGCACCTCG ATGGTCTGGC GGTGGCGCCC ATGCTTGCCG CCTTGCGTCG AGAGGGAGGC GATGCGCTGG AAAAAGCTGC CGGCGAGGCG CTGCTGTGCG CGCGTTTCGG GCTCGAGCGT GCGCGTGCCG CCGCCTTGCT GGCCTGGATG CGGGCACAGA ACTGGCTGGA TTGA
|
Protein sequence | MAADGIVILG AGPAGAAVAI GLARMGEPVV LVGEPRRFAA VEGVSARVID ALRGLGLQRA LTAFAPPSPR RAVWNGTYTE ANVESLVDRQ RFDQGLLEDL ERLGVPVVRG RVTALHPAPG GHELEIDTDA GPRRMAAGFL VEARGRAAPG GGAPRVRGVE TVSLLQYWQG PAGEAGSTVL SVEDGWMWMA ALADGRRYLQ LTVDVASADL PPKRALGDYC SARFHAVEAA APFVRDAQPV GEPHARTSTA VLNESVAGDD WIRVGDAAMA VDPLSGNGIF QALSSALQAP AVVATLRHDR GRTALAQHFH TRRIEHLFHR FARIGRDFYA QEARWPQAPF WAARRGWPDA LPLHAKVRPE TVRVARGPVV CAGRIVEQDV VVTPDQPLGV WHLDGLAVAP MLAALRREGG DALEKAAGEA LLCARFGLER ARAAALLAWM RAQNWLD
|
| |