Gene Tmz1t_3772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3772 
Symbol 
ID7874016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4156480 
End bp4157589 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content70% 
IMG OID643700716 
Productglycosyltransferase 
Protein accessionYP_002890740 
Protein GI237654426 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.987552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGG TGCTGATCTG GGGCCGCTAC GGCAACTACG GGCCCGACTA CCCGCGCAAC 
CGCGTCATCG AGTCGGTGCT GCGCAGCCTC GGCTGCGAGG TGAGCCGCTT CCTGCCCGCG
CTGTCGGCCA CCGCCGACCT CGAGTACGCC CTGCGGAACC TCCTGGAGCG CAGCCACCGC
CCCGACCTCG TCTGGGTGCC GTGCTTCCGC CAGCGCGACC TCGCCGCCGC CGCACGCTAC
GCCCGCCGCC AGCGCGTGCC GCTGGTCTTC GACCCGCTGA TCAGCGCCTA CGACAAGCAG
GTCAACGAAA AGCACAAGTT CGCCGCGGAC AGCGCAAAGG CGCGCAAGCT GCTGGAGTGG
GAATCGCGCC TCTTTCAGCT GCCCGACTGG CTGATCGCCG ACACCGAGGG CCACGCCGAC
TACTTCCACG CCACCCACGG CGTGGAGCGC GCGCGCATCC GCGTGATCCC GGTCGGCGCC
GAGGAGTCGC TGTTCACCCC GCAGCCCTGG CCGCACAAGC CCGCCGATGC GCCGCTGGAA
CTCGCCTTCT TCGGCACCTT CATCGGCCTG CAGGGGGTGG ATGTGCTGGC GCAGGCCATC
CTGCACTACG ACGGCCCGCC CACCCACTGG CGCCTGATCG GCGAAGGGCC GATGAAGGCG
GAGTGCGAAC GTCTCCTCGC GCCGCTTGCC GGCGCCACCG GCCCCAGCCG CGTCAGCGTC
GAAGGCTGGG GCCCGCTGCC CGAGCTCCCC GGCCGGCTCG CCAGCGCCGA CGCCATCCTC
GGCATCTTCG GCACCAGCGA CAAGGCGCTG CGGGTGATTC CGAACAAGGT GTATCAGGGG
CTGGCGATCG GGCGGGCGGT GCTCACCGCG GCAACGCCGG CCTTCACGCC CGAACTGCGG
GCGGACGAAA ATAACGGGCT GCTCTGGGCA ATACCGGGAA ACCCGGATAG CATTCGCACC
GCGGTGGAGC GCCTGCACCA ACGTCGCAGC GAAACGTGGG CGATCGGTGC CGCGGCGCGC
AGCACCTACG AACAGCACTT CTCCAACCGG GTCATCCGCG ACGTGCTGAG CATCCTGCTC
ACCGCGGACA CCCGGCCCAC CGCACGATGA
 
Protein sequence
MKKVLIWGRY GNYGPDYPRN RVIESVLRSL GCEVSRFLPA LSATADLEYA LRNLLERSHR 
PDLVWVPCFR QRDLAAAARY ARRQRVPLVF DPLISAYDKQ VNEKHKFAAD SAKARKLLEW
ESRLFQLPDW LIADTEGHAD YFHATHGVER ARIRVIPVGA EESLFTPQPW PHKPADAPLE
LAFFGTFIGL QGVDVLAQAI LHYDGPPTHW RLIGEGPMKA ECERLLAPLA GATGPSRVSV
EGWGPLPELP GRLASADAIL GIFGTSDKAL RVIPNKVYQG LAIGRAVLTA ATPAFTPELR
ADENNGLLWA IPGNPDSIRT AVERLHQRRS ETWAIGAAAR STYEQHFSNR VIRDVLSILL
TADTRPTAR