Gene Information |
Locus tag | Tmz1t_3772 |
Symbol | |
ID | 7874016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4156480 |
End bp | 4157589 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700716 |
Product | glycosyltransferase |
Protein accession | YP_002890740 |
Protein GI | 237654426 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
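The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is the reading that reproduces the listed lengths) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record; assumed 1-based and inclusive at both ends.
start_bp = 4156480
end_bp = 4157589

# With inclusive coordinates the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop
# codon, so it does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1110 0 369
```

The result matches the Gene Length (1110 bp) and Protein Length (369 aa) rows.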
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.987552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGG TGCTGATCTG GGGCCGCTAC GGCAACTACG GGCCCGACTA CCCGCGCAAC CGCGTCATCG AGTCGGTGCT GCGCAGCCTC GGCTGCGAGG TGAGCCGCTT CCTGCCCGCG CTGTCGGCCA CCGCCGACCT CGAGTACGCC CTGCGGAACC TCCTGGAGCG CAGCCACCGC CCCGACCTCG TCTGGGTGCC GTGCTTCCGC CAGCGCGACC TCGCCGCCGC CGCACGCTAC GCCCGCCGCC AGCGCGTGCC GCTGGTCTTC GACCCGCTGA TCAGCGCCTA CGACAAGCAG GTCAACGAAA AGCACAAGTT CGCCGCGGAC AGCGCAAAGG CGCGCAAGCT GCTGGAGTGG GAATCGCGCC TCTTTCAGCT GCCCGACTGG CTGATCGCCG ACACCGAGGG CCACGCCGAC TACTTCCACG CCACCCACGG CGTGGAGCGC GCGCGCATCC GCGTGATCCC GGTCGGCGCC GAGGAGTCGC TGTTCACCCC GCAGCCCTGG CCGCACAAGC CCGCCGATGC GCCGCTGGAA CTCGCCTTCT TCGGCACCTT CATCGGCCTG CAGGGGGTGG ATGTGCTGGC GCAGGCCATC CTGCACTACG ACGGCCCGCC CACCCACTGG CGCCTGATCG GCGAAGGGCC GATGAAGGCG GAGTGCGAAC GTCTCCTCGC GCCGCTTGCC GGCGCCACCG GCCCCAGCCG CGTCAGCGTC GAAGGCTGGG GCCCGCTGCC CGAGCTCCCC GGCCGGCTCG CCAGCGCCGA CGCCATCCTC GGCATCTTCG GCACCAGCGA CAAGGCGCTG CGGGTGATTC CGAACAAGGT GTATCAGGGG CTGGCGATCG GGCGGGCGGT GCTCACCGCG GCAACGCCGG CCTTCACGCC CGAACTGCGG GCGGACGAAA ATAACGGGCT GCTCTGGGCA ATACCGGGAA ACCCGGATAG CATTCGCACC GCGGTGGAGC GCCTGCACCA ACGTCGCAGC GAAACGTGGG CGATCGGTGC CGCGGCGCGC AGCACCTACG AACAGCACTT CTCCAACCGG GTCATCCGCG ACGTGCTGAG CATCCTGCTC ACCGCGGACA CCCGGCCCAC CGCACGATGA
|
Protein sequence | MKKVLIWGRY GNYGPDYPRN RVIESVLRSL GCEVSRFLPA LSATADLEYA LRNLLERSHR PDLVWVPCFR QRDLAAAARY ARRQRVPLVF DPLISAYDKQ VNEKHKFAAD SAKARKLLEW ESRLFQLPDW LIADTEGHAD YFHATHGVER ARIRVIPVGA EESLFTPQPW PHKPADAPLE LAFFGTFIGL QGVDVLAQAI LHYDGPPTHW RLIGEGPMKA ECERLLAPLA GATGPSRVSV EGWGPLPELP GRLASADAIL GIFGTSDKAL RVIPNKVYQG LAIGRAVLTA ATPAFTPELR ADENNGLLWA IPGNPDSIRT AVERLHQRRS ETWAIGAAAR STYEQHFSNR VIRDVLSILL TADTRPTAR
|
| |
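As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates DNA codon by codon. It is a minimal example, not the database's own pipeline: the `translate` helper and the table construction are written here for illustration. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are permitted), so a standard codon table is enough for a gene that starts with ATG, as this one does.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of ten bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAGAAGG TGCTGATCTG GGGCCGCTAC"
print(translate(prefix))  # MKKVLIWGRY
```

The printed peptide matches the first block of the Protein sequence row. Passing the full Gene sequence value to `translate` should reproduce the whole protein in the same way.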