Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1020 |
Symbol | |
ID | 7084004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1116665 |
End bp | 1117702 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643698039 |
Product | hypothetical protein |
Protein accession | YP_002354679 |
Protein GI | 217969445 |
COG category | [V] Defense mechanisms |
COG ID | [COG1566] Multidrug resistance efflux pump |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACTC CAAACCTCCT CTCCAGAAAA ACCCGTTGGG TCGTGATCGC CGTCGTCTTG CTCGCTCTTG GCGGCGCACT GGCGTTCTGG CTGTCGCGCG ACCATGGACA ACAAGATGCA TTGCGCCTCT ACGGCAACGT CGATATCCGC GAGGTGCAAC TGGCCTTCCG TCAGCCCGGG CGCGTGATGC AAATGGCTTT CGACGAGGGC GATGCCGTCA GCGTGGGCGC GCGCCTGGCG GCGCTTGACG CGCAACCCTA TCGGGAAGCC CTTGCGGCAG CGCAGGCCCA GGTGCAGGTG GCGCAGGCGG AACTGGCCAA GCTGCGCCGC GGCCTGCGAC CGCAGGAAAT CACCCAGGCG CGCGAGGCCC TCAGGCAGGC ACAGGCGCTC GCCACCGAGA CCGAGCGGAA TTTCAAGCGC CAGAGTGGCC TGCTGACATC GGGTGCCAGC AGCCAGCGTA CGGTCGATGC CGCCCGCACG GCGCGAGACC AGGCAGCCGC TGGCGTCGAA GCAGCGAAGG CGGCCCTGTC GCAAGCATCC GAAGGCTTCC GCAAGGAAGA CATCGCCGCG GCAGAAGCTC GCCTTGCGGC CGCTCAGGCT GCCGCAGCGC AAGCCACGAC AGCCTTGGCG GACACCGAAT TGGTGGCGCC CAGTAGTGGC ACCGTCATCG CACGGGTGCG CGAGCCCGGC AGCATGGTTG CAAGCCAGAG CGCGGTCTAC AGCCTGAGCC TGGACAAGCC GGTTTACGTG CGCGCCTACG TGGGCGAGTC GGACTTGGGG CGCATCGCGC CCGGTACTGT GGTACGCGTC AAAAGCGATT CATCAGAGAA GGTCTATCGC GGCCAGATCG GCTTCATCTC GCCGCGCGCC GAGTTCACCC CCAAGACGGT GGAGACGACG GATTTGCGCA CGGATCTGGT GTACCGCCTG CGCATCGTCA TCGACGAAGC CGACAGCGAC AGTGCCTTGC GCCAGGGCAT GCCGGTGACG ATCGAGGTCG ATGCGAAAGC CGGCACCGGT ATCCCGGGGG AGCGGTGA
|
Protein sequence | MKTPNLLSRK TRWVVIAVVL LALGGALAFW LSRDHGQQDA LRLYGNVDIR EVQLAFRQPG RVMQMAFDEG DAVSVGARLA ALDAQPYREA LAAAQAQVQV AQAELAKLRR GLRPQEITQA REALRQAQAL ATETERNFKR QSGLLTSGAS SQRTVDAART ARDQAAAGVE AAKAALSQAS EGFRKEDIAA AEARLAAAQA AAAQATTALA DTELVAPSSG TVIARVREPG SMVASQSAVY SLSLDKPVYV RAYVGESDLG RIAPGTVVRV KSDSSEKVYR GQIGFISPRA EFTPKTVETT DLRTDLVYRL RIVIDEADSD SALRQGMPVT IEVDAKAGTG IPGER
|
| |