Gene Tmz1t_3774 details

Gene Information

Locus tag: Tmz1t_3774
Symbol:
ID: 7874018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4158393
End bp: 4159712
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 62%
IMG OID: 643700718
Product: O-antigen polymerase
Protein accession: YP_002890742
Protein GI: 237654428
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
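
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows that relationship. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon. The record does not state either assumption, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4158393, 4159712)
print(length)                     # 1320
print(protein_length_aa(length))  # 439
```

Both results agree with the Gene Length and Protein Length values listed in the record.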


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTCGT CCCCCTTCGC AGACCACGAC GCCGGCAGCC CGCTCGAGCG CACTCTCCAC 
ACGGTGAACT CGGCACTGGT GTTCTGGTTC TTCGCGTTCC TGATCACGGG ACCGAAGAAC
TCCCACGCGA CGACCGCCCT GCTGGCCTTG AGCCTGGTGA CCCTTCCGGC GACGGTTCGC
GTCGCCCCCA GGGTCTGGGC CCACGCAGCG CCCTGGCTGA TCGGCCTGGG AGCATACTGT
TCGTATCAGA TCGCCTACCG ATTGATCGAC GGTGGGCTCG AGGCTCGCAT CGACCCTCCT
GCACGCTACC TCGGTGCAAT CCCGATCCTT TTCTACCTCG CGCGCTACGG TTTCAACATC
AAGGCACTGT GGGCGGGGAT GGCCGTCGGC AGCCTCATCG GTGGCGTGGC GGGCGCGCAG
GAAGTGTGGA TCGAGGGGGC GCAACGCGCA GGAGCGGGAA TCCACCCGAT TGCATACGGC
AGCATCCTGG CGCTGCTGTC GATGATCCTG CTGTACGGCG CCACGATTTT CCGCGAAACC
ACTTGGCGGA TCTTTCTCTC GGCTGCATTC GCCGTCGGGC TGACGGGGGT GCTGCTATCC
GGCACACGTG GGCTCTACGC GGCGTTGGCG GTCTGCATGG CGTTCATCGG CTATCGCGCA
TTGAGGCAGG CGGGCGTTTC CAGCCGGGCC GTCTGGCTTA CAGCCGCGTT CAGCTTGATC
CTCACCATCG CAGTTGCGTC CCAGATCCCG GCCGTCAACG AACGGTTGCA GGAAACGCAG
CGCGAATACG CAGAAATTTA CGAGGGCAAC CTGGATACCT CGATCGGTCA TCGTCTACAA
ATGTGGCATG CCGGGCTGTT CATCATTTCG GAACGACCGC TCTTCGGTCT CGGCCCCGAC
GTAACCAAGA GGCAAACGGC TACACAGGCA TTCATGGAGG AACATCAATA CGGCCCCTGG
GTCCTCCGCA TCTACGACCA CCTGCATAAT CTCTACATCA ATGAGGCTGC GACCTTTGGC
CTCATCGGTC TAACCGCTCT AGCTGGACTG CTTTTCGGCG CGCTCAAGGG AACCTTCGGT
CCCACGCGCA CGATGATCAA CCTCACCATC ATGATCATCC TCCTCGAAGG ACTGACCGAG
ACAATCCTCA ACCATCACCG TCTGATGATG ACCTTCATGA TCCTCGTGAC CGTGCTGCGA
GCTCGGCTCG CCACCGAAAC CATCGGCGCC CGGAATCTCA CTCTCTCCGG GCAGGCCAGC
CCTTCGGCAT TCGATGGTGC AGAAGGCCGA ATACCGGGCG ATAGTAGACC GCATCCATGA
 
Protein sequence
MSSSPFADHD AGSPLERTLH TVNSALVFWF FAFLITGPKN SHATTALLAL SLVTLPATVR 
VAPRVWAHAA PWLIGLGAYC SYQIAYRLID GGLEARIDPP ARYLGAIPIL FYLARYGFNI
KALWAGMAVG SLIGGVAGAQ EVWIEGAQRA GAGIHPIAYG SILALLSMIL LYGATIFRET
TWRIFLSAAF AVGLTGVLLS GTRGLYAALA VCMAFIGYRA LRQAGVSSRA VWLTAAFSLI
LTIAVASQIP AVNERLQETQ REYAEIYEGN LDTSIGHRLQ MWHAGLFIIS ERPLFGLGPD
VTKRQTATQA FMEEHQYGPW VLRIYDHLHN LYINEAATFG LIGLTALAGL LFGALKGTFG
PTRTMINLTI MIILLEGLTE TILNHHRLMM TFMILVTVLR ARLATETIGA RNLTLSGQAS
PSAFDGAEGR IPGDSRPHP
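
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation is given below. It assumes the gene sequence is written 5' to 3' on the coding strand and starts in frame. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code. Table 11 differs in its permitted start codons, which this sketch does not handle. The names `CODON_TABLE` and `translate` are illustrative, not part of any library.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGTCCTCGT CCCCCTTCGC AGACCACGAC"))  # MSSSPFADHD
```

Applied to the final 60-base line of the gene sequence, the same function yields PSAFDGAEGRIPGDSRPHP, the last 19 residues of the protein, and stops at the terminal TGA codon.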