Gene Information |
Locus tag | Tmz1t_3369 |
Symbol | |
ID | 7873860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3680469 |
End bp | 3681593 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700306 |
Product | OmpA/MotB domain protein |
Protein accession | YP_002890340 |
Protein GI | 237654026 |
COG category | [N] Cell motility |
COG ID | [COG1360] Flagellar motor protein |
TIGRFAM ID | |
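The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3680469
end_bp = 3681593

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated, hence "- 1".
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1125 bp) and Protein Length (374 aa).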
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGAC TCGCCCGCCG CCGGCCGCTG GATTTCTGGC CCGGCTTCGT CGACGCCCTC GCCTCGCTGC TGATGGTGAT GGTGTTCGTG ATCCTGATCT TCGTCATCGG CCAGTTCGTG CTCGCCGACG CGGTGAGCGG CCGCGACCGC GCGCTCGCCC AGCTCGAGGC CGAGCTCGCC ACCCTCGCCA GGACACTGAG CCTGGAACAG AGCGCGCGCC AACGAGCGGA GGCCGAGGTC GGCGAGCTCT CGGCCTCGCT GAGCGCCGCC CGGCGCGAGA GCGAAGCGCG CGGCAACGAG CTGCTCGCCG CGCGCGACGA GCTCCACGCC GCCCAGGACG AGGCGCGCAG CCGCCGCGAG GAGGCCACCC GCATCGTTGC CGACATCGAG GCCCTGCAGC GCCTGAAGAC CGAGCTCGAG GCCGAGGCCG CCCGCCTGGC GAGCGCACTC GACACCTCGG AGCGCGGCCT GAAGGAACAC AAGGAGATGT CCGCCGCCGC GATCGCCCAG GTCGAGTTGC TCAATCGCCA GCTCGCCGCG GTGCGCGAGC AACTCGAGCA GCTCAATGCG GCACTCGACG CCGCCAAGCT CGCGGCCAAG GACAAGGATC TGAAGCTCGA GGAGCTCGGC CGCGAGCTCA ACCTGGCGCT TGCCGGCCGC GTCAAGGAAC TCGCACGTTA CCGCTCCGAG TTCTTCGGCC GCCTGCAGGA GGTGCTCGGC AAGCGCAAGG ACGTGCAGAT CGTCGGCGAC CGCTTCGTCT TCTCCAGCGA GGTGCTGTTC GCCTCCGCCT CGGACGAGGT CAGCGCCGAC GGCATGGTCC AGCTCACCCG CCTGGCCGAG ACGCTGAAGA CGCTGTCCGC CGACATGCCC AAGGACCTGC CCTGGGTCCT GCAGGTGGAC GGCCACACCG ACCGCCGCCC GATCGCCACC GCCCGCTTCC CGTCGAACTG GGAGCTATCG ACCGCCCGCG CCCTCGCCAT CGTCAAGTTC CTGCGCGGCC AGGGCATCCC GCCCGAGCGC CTGGCCGCGA CCGGCTACGG GGAGTTCCAC CCGCTCGATG CGCGCAGCAC CGAGGAAGCC TACACGCGCA ACCGCCGCAT CGAGCTCAAG CTGACAAGCC GGTGA
|
Protein sequence | MARLARRRPL DFWPGFVDAL ASLLMVMVFV ILIFVIGQFV LADAVSGRDR ALAQLEAELA TLARTLSLEQ SARQRAEAEV GELSASLSAA RRESEARGNE LLAARDELHA AQDEARSRRE EATRIVADIE ALQRLKTELE AEAARLASAL DTSERGLKEH KEMSAAAIAQ VELLNRQLAA VREQLEQLNA ALDAAKLAAK DKDLKLEELG RELNLALAGR VKELARYRSE FFGRLQEVLG KRKDVQIVGD RFVFSSEVLF ASASDEVSAD GMVQLTRLAE TLKTLSADMP KDLPWVLQVD GHTDRRPIAT ARFPSNWELS TARALAIVKF LRGQGIPPER LAATGYGEFH PLDARSTEEA YTRNRRIELK LTSR
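The relationship between the two sequences can be illustrated with a minimal translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, so a standard codon table is enough here. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, not part of any library, and only the first 30 and last 15 nucleotides of the gene sequence are used.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides, copied from the Gene sequence field.
head = "ATGGCCCGAC TCGCCCGCCG CCGGCCGCTG"
tail = "CTGACAAGCC GGTGA"

print(translate(head))  # MARLARRRPL
print(translate(tail))  # LTSR
```

The two outputs match the first ten and last four residues of the Protein sequence field; passing the full gene sequence to `translate` works the same way.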