Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3774 |
Symbol | |
ID | 7874018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4158393 |
End bp | 4159712 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643700718 |
Product | O-antigen polymerase |
Protein accession | YP_002890742 |
Protein GI | 237654428 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTCGT CCCCCTTCGC AGACCACGAC GCCGGCAGCC CGCTCGAGCG CACTCTCCAC ACGGTGAACT CGGCACTGGT GTTCTGGTTC TTCGCGTTCC TGATCACGGG ACCGAAGAAC TCCCACGCGA CGACCGCCCT GCTGGCCTTG AGCCTGGTGA CCCTTCCGGC GACGGTTCGC GTCGCCCCCA GGGTCTGGGC CCACGCAGCG CCCTGGCTGA TCGGCCTGGG AGCATACTGT TCGTATCAGA TCGCCTACCG ATTGATCGAC GGTGGGCTCG AGGCTCGCAT CGACCCTCCT GCACGCTACC TCGGTGCAAT CCCGATCCTT TTCTACCTCG CGCGCTACGG TTTCAACATC AAGGCACTGT GGGCGGGGAT GGCCGTCGGC AGCCTCATCG GTGGCGTGGC GGGCGCGCAG GAAGTGTGGA TCGAGGGGGC GCAACGCGCA GGAGCGGGAA TCCACCCGAT TGCATACGGC AGCATCCTGG CGCTGCTGTC GATGATCCTG CTGTACGGCG CCACGATTTT CCGCGAAACC ACTTGGCGGA TCTTTCTCTC GGCTGCATTC GCCGTCGGGC TGACGGGGGT GCTGCTATCC GGCACACGTG GGCTCTACGC GGCGTTGGCG GTCTGCATGG CGTTCATCGG CTATCGCGCA TTGAGGCAGG CGGGCGTTTC CAGCCGGGCC GTCTGGCTTA CAGCCGCGTT CAGCTTGATC CTCACCATCG CAGTTGCGTC CCAGATCCCG GCCGTCAACG AACGGTTGCA GGAAACGCAG CGCGAATACG CAGAAATTTA CGAGGGCAAC CTGGATACCT CGATCGGTCA TCGTCTACAA ATGTGGCATG CCGGGCTGTT CATCATTTCG GAACGACCGC TCTTCGGTCT CGGCCCCGAC GTAACCAAGA GGCAAACGGC TACACAGGCA TTCATGGAGG AACATCAATA CGGCCCCTGG GTCCTCCGCA TCTACGACCA CCTGCATAAT CTCTACATCA ATGAGGCTGC GACCTTTGGC CTCATCGGTC TAACCGCTCT AGCTGGACTG CTTTTCGGCG CGCTCAAGGG AACCTTCGGT CCCACGCGCA CGATGATCAA CCTCACCATC ATGATCATCC TCCTCGAAGG ACTGACCGAG ACAATCCTCA ACCATCACCG TCTGATGATG ACCTTCATGA TCCTCGTGAC CGTGCTGCGA GCTCGGCTCG CCACCGAAAC CATCGGCGCC CGGAATCTCA CTCTCTCCGG GCAGGCCAGC CCTTCGGCAT TCGATGGTGC AGAAGGCCGA ATACCGGGCG ATAGTAGACC GCATCCATGA
|
Protein sequence | MSSSPFADHD AGSPLERTLH TVNSALVFWF FAFLITGPKN SHATTALLAL SLVTLPATVR VAPRVWAHAA PWLIGLGAYC SYQIAYRLID GGLEARIDPP ARYLGAIPIL FYLARYGFNI KALWAGMAVG SLIGGVAGAQ EVWIEGAQRA GAGIHPIAYG SILALLSMIL LYGATIFRET TWRIFLSAAF AVGLTGVLLS GTRGLYAALA VCMAFIGYRA LRQAGVSSRA VWLTAAFSLI LTIAVASQIP AVNERLQETQ REYAEIYEGN LDTSIGHRLQ MWHAGLFIIS ERPLFGLGPD VTKRQTATQA FMEEHQYGPW VLRIYDHLHN LYINEAATFG LIGLTALAGL LFGALKGTFG PTRTMINLTI MIILLEGLTE TILNHHRLMM TFMILVTVLR ARLATETIGA RNLTLSGQAS PSAFDGAEGR IPGDSRPHP
|
| |