Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3899 |
Symbol | |
ID | 7873547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4295334 |
End bp | 4296482 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643700838 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002890861 |
Protein GI | 237654547 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTGGCAAG GCCGTTGGTC TTGTTGCCGC GGCGGGGCAA GACGATTGAC CGATCGATTC CGCGTTCTTG TGCTGTGTCC GCATGCTGAC GGCGCGGCCC CGGTGGAGAC GATGGATGGC GTCGAGGTCA TTCGCTACCG CTACGCACCG GCAGCATTGG AAACGCTGGT CAACAATGGC GGCATTGTCA CCAATCTGCG CAACAAGCCG TGGAAGCTCT TCCTGGTGCC AGGATTCGTC TTGATGCAGG CGTGGTATGC CCTGCGCTTG TGCCGGCAGC GCGGGATTGA TCTGGTGCAT GCACACTGGC TGATTCCTCA GGGGCTGATT GCCACGCTGC TGGGCAAGCC GTTTCTTGTG ACTTCCCATG GAGCCGATCT GTACGCACTG CGCAGCAGGC CGTTCCGGGC GCTCAAGCGC TTCGTGTTGC GCAAGGCACG AGCAACGACC GTTGTCAGTA GCGCCATGCG TGATGCGGTA GGCGAGTTGG ACGTGGATGT CGCGCAAGTC GCGGTCGTCC CGATGGGCGT GGAGATGACC CGGCTGTTTG TGCCGGGTGA CGCGACGCAG CGTTCCCGCG GCGAGTTGCT TTTCGTGGGC CGTCTGGTGG AAAAGAAGGG GCTGCGCTAT CTGCTGCTTG CTTTGCCCTC CGTGCTGCGC GAGCGCCCCG ACGTCACCTT GACCATCGCT GGCTTCGGCC CGGACAAGGA CCCACTCGAG GCTCAGGTTC GCGAATTGGG CTTGCAGGAC GCAGTGCGCT TCCTGGGGGC GGTGGCGCAG AAGGACCTAC CCGACCTGTA TCGGCGTGCG GCACTCTTTG TGGCGCCCTT CGTCAGGGCG AAGTCTGGCG ATCAGGAGGG GCTTCCCGTG GCTTTGATGG AAGCCGTGGC TTGCGGCTGT CCCGCCATTG CCGGCGATGT GGCAGGGTTG CGGGATATTT TTGGCGCGCA GGCTGACACC TGCCTGGTCA CCCCGCAGGA CATCGACCAG CTGGCCGAAG CCATTCTCCG CCAATTGCGG CAGCCCGAAG AGGCTGCACA GCGGAGTCTG GCCATGCGCA CGGCCTTGCG GGCGCATCTG AGCTGGGAAC ATGTGAGTGC GCGCTATATG GAACTGCTGC AAGGCGCTCA CGAGAAAAGC AATAATTGA
|
Protein sequence | MWQGRWSCCR GGARRLTDRF RVLVLCPHAD GAAPVETMDG VEVIRYRYAP AALETLVNNG GIVTNLRNKP WKLFLVPGFV LMQAWYALRL CRQRGIDLVH AHWLIPQGLI ATLLGKPFLV TSHGADLYAL RSRPFRALKR FVLRKARATT VVSSAMRDAV GELDVDVAQV AVVPMGVEMT RLFVPGDATQ RSRGELLFVG RLVEKKGLRY LLLALPSVLR ERPDVTLTIA GFGPDKDPLE AQVRELGLQD AVRFLGAVAQ KDLPDLYRRA ALFVAPFVRA KSGDQEGLPV ALMEAVACGC PAIAGDVAGL RDIFGAQADT CLVTPQDIDQ LAEAILRQLR QPEEAAQRSL AMRTALRAHL SWEHVSARYM ELLQGAHEKS NN
|
| |