Gene Tmz1t_3899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3899 
Symbol 
ID7873547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4295334 
End bp4296482 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content63% 
IMG OID643700838 
Productglycosyl transferase group 1 
Protein accessionYP_002890861 
Protein GI237654547 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTGGCAAG GCCGTTGGTC TTGTTGCCGC GGCGGGGCAA GACGATTGAC CGATCGATTC 
CGCGTTCTTG TGCTGTGTCC GCATGCTGAC GGCGCGGCCC CGGTGGAGAC GATGGATGGC
GTCGAGGTCA TTCGCTACCG CTACGCACCG GCAGCATTGG AAACGCTGGT CAACAATGGC
GGCATTGTCA CCAATCTGCG CAACAAGCCG TGGAAGCTCT TCCTGGTGCC AGGATTCGTC
TTGATGCAGG CGTGGTATGC CCTGCGCTTG TGCCGGCAGC GCGGGATTGA TCTGGTGCAT
GCACACTGGC TGATTCCTCA GGGGCTGATT GCCACGCTGC TGGGCAAGCC GTTTCTTGTG
ACTTCCCATG GAGCCGATCT GTACGCACTG CGCAGCAGGC CGTTCCGGGC GCTCAAGCGC
TTCGTGTTGC GCAAGGCACG AGCAACGACC GTTGTCAGTA GCGCCATGCG TGATGCGGTA
GGCGAGTTGG ACGTGGATGT CGCGCAAGTC GCGGTCGTCC CGATGGGCGT GGAGATGACC
CGGCTGTTTG TGCCGGGTGA CGCGACGCAG CGTTCCCGCG GCGAGTTGCT TTTCGTGGGC
CGTCTGGTGG AAAAGAAGGG GCTGCGCTAT CTGCTGCTTG CTTTGCCCTC CGTGCTGCGC
GAGCGCCCCG ACGTCACCTT GACCATCGCT GGCTTCGGCC CGGACAAGGA CCCACTCGAG
GCTCAGGTTC GCGAATTGGG CTTGCAGGAC GCAGTGCGCT TCCTGGGGGC GGTGGCGCAG
AAGGACCTAC CCGACCTGTA TCGGCGTGCG GCACTCTTTG TGGCGCCCTT CGTCAGGGCG
AAGTCTGGCG ATCAGGAGGG GCTTCCCGTG GCTTTGATGG AAGCCGTGGC TTGCGGCTGT
CCCGCCATTG CCGGCGATGT GGCAGGGTTG CGGGATATTT TTGGCGCGCA GGCTGACACC
TGCCTGGTCA CCCCGCAGGA CATCGACCAG CTGGCCGAAG CCATTCTCCG CCAATTGCGG
CAGCCCGAAG AGGCTGCACA GCGGAGTCTG GCCATGCGCA CGGCCTTGCG GGCGCATCTG
AGCTGGGAAC ATGTGAGTGC GCGCTATATG GAACTGCTGC AAGGCGCTCA CGAGAAAAGC
AATAATTGA
 
Protein sequence
MWQGRWSCCR GGARRLTDRF RVLVLCPHAD GAAPVETMDG VEVIRYRYAP AALETLVNNG 
GIVTNLRNKP WKLFLVPGFV LMQAWYALRL CRQRGIDLVH AHWLIPQGLI ATLLGKPFLV
TSHGADLYAL RSRPFRALKR FVLRKARATT VVSSAMRDAV GELDVDVAQV AVVPMGVEMT
RLFVPGDATQ RSRGELLFVG RLVEKKGLRY LLLALPSVLR ERPDVTLTIA GFGPDKDPLE
AQVRELGLQD AVRFLGAVAQ KDLPDLYRRA ALFVAPFVRA KSGDQEGLPV ALMEAVACGC
PAIAGDVAGL RDIFGAQADT CLVTPQDIDQ LAEAILRQLR QPEEAAQRSL AMRTALRAHL
SWEHVSARYM ELLQGAHEKS NN