Gene Tmz1t_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1801 
Symbol 
ID7085771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2023415 
End bp2024641 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content74% 
IMG OID643698823 
Productglycosyl transferase group 1 
Protein accessionYP_002355449 
Protein GI217970215 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.115818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACGC TCGACTACAC CACCGAAGAC CTCTGCCGCG CGCCGCGCCT GCGCATCGCG 
CTTGTGACCG AGACCTGGGC GCCCGAGGTC AATGGCGTGG CCATGACCCT GGGGCGGATG
GTCGACGGCC TCATCCGCCG CGGCCACGGC GTACAGCTCA TCCGCCCGCG CCAGCGCCCC
GGGGAGACCG CGGCGCACGG CGAGGGCCTG GAAGAAGTGC TCGCCCGCGG CCTGCGCCTG
CCGCGCTACG ACGGCCTCAA GCTCGGGCTG CCGGCGCGGG TGCGTCTCGT GCGCGAGTGG
TCGCGCCAGC GCCCGGACCT GGTGCACGTG GCGACCGAGG GGCCGCTCGG GTGGACCGCG
GTCACCGCGG CCAACAAGCT GCGCATCCCG GTCAGCTCCG ACTTCCACAC CAATTTCGAC
CACTACAGCG GCCACTATGG CATGGGCTGG CTGCGCCAGC CGGTGGCGGC CTACCTGCGC
CGCTTCCACA ACCGCAGCGC GGCGACCTTC GTGCCAACCG CGGCGCTCGC GGCGCAACTC
TCGGCGCAGG GTTACCGCAG CGTGGAGGTG ATCTCGCGCG GGGTCGACAC CGCGCTGTAT
TCGCCGGCGC GCCGCGACGA GGCCTTGCGC CGTGCCTGGG GCCTACCCCC GGGCGGGCTG
GCGGTGATCA GCGTCGGCCG CCTGGCGCCG GAGAAGAACC TCGGCCTCGC GATGCGCGCC
TTCGCAGCGA TCCGCCGCCT GCGCCCGGAC GCGCGCATGG TCCTGGTCGG CGACGGCCCC
CAGCGTGCGG CGCTGGCGCG CGCCCACCCC GACGCCGTCT TCGTCGGCAT GCGTCACGGC
GAGGACCTCG CCGCGCACTA TGCGTCGGCC GATCTGTTCC TGTTCCCCAG TCTCACCGAG
ACCTTCGGCA ACGTCACCCT CGAGGCGATG GCGAGCGGCG TGTGCCCGGT GGCCTACGAC
TACGCCGCCG CCGCCGAGGT GATCCGCGAC CTCGGCAACG GTGCCAGCGT GGCCTGTGGC
GACGAAGAGG GCTTCATCGC GCGTGCCGTA CAGATGGCCG GGGCCGATGC GCTGCGCGCG
GAGCTCGCGC GCGCCGCGCG CCGCAGCGCC GAGGCGATCG ATTGGGAGCG GGTGAACGAT
CGCTTCGCCG CGGCCCTGCT GCGCGTGTGG CGGGCGGGCA GCGGCCGGCC CCTCGATTTG
TCCGAACCCC GCCCGGAGGA GACCTGA
 
Protein sequence
MRTLDYTTED LCRAPRLRIA LVTETWAPEV NGVAMTLGRM VDGLIRRGHG VQLIRPRQRP 
GETAAHGEGL EEVLARGLRL PRYDGLKLGL PARVRLVREW SRQRPDLVHV ATEGPLGWTA
VTAANKLRIP VSSDFHTNFD HYSGHYGMGW LRQPVAAYLR RFHNRSAATF VPTAALAAQL
SAQGYRSVEV ISRGVDTALY SPARRDEALR RAWGLPPGGL AVISVGRLAP EKNLGLAMRA
FAAIRRLRPD ARMVLVGDGP QRAALARAHP DAVFVGMRHG EDLAAHYASA DLFLFPSLTE
TFGNVTLEAM ASGVCPVAYD YAAAAEVIRD LGNGASVACG DEEGFIARAV QMAGADALRA
ELARAARRSA EAIDWERVND RFAAALLRVW RAGSGRPLDL SEPRPEET