Gene Information |
Locus tag | Tmz1t_3637 |
Symbol | |
ID | 7873142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3994479 |
End bp | 3995489 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700578 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002890607 |
Protein GI | 237654293 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
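The coordinate and length fields in the table above agree with one another, and a few lines of Python can confirm it. This is a minimal sketch, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon. The helper names `gene_length` and `protein_length` are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record for Tmz1t_3637.
length_bp = gene_length(3994479, 3995489)
print(length_bp)                  # 1011
print(protein_length(length_bp))  # 336
```

Both results match the Gene Length (1011 bp) and Protein Length (336 aa) fields.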
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGCACT CCGACTCGCT CGATGCCCAC CCCCATGTGC CGCCCGCCTC CTCGGCACAC CCCCGACTCT CGGTCGTGGT CCCGCTCTAC AACGAGGGCG AAACGGTGGA GGCGCTGCAT CGGCGCCTGT CGCAGGTGCT TGCGGCGCTG GAGCTGTCCG ACTGGGAGAT CGTCTGCGTC GATGACGGCA GCCGCGACGA CACCTATGCC CGCGTGGCCG CCCTGTGCGC GGGTGACCCC CACCTCAAGG CGATCCGCTT CGCGCGCAAC TTCGGCAAGG AGGCGGCGAT GGCCGCCGGG CTGCAGGCGG CTGGGGGCGA CGTGATCGTG CTGATGGACG GCGACCTGCA GCATCCGCCC GAGCTGATCC CCGAGATGCT CGCGCGCTGG CGTGGCGGGG CGATGATGGT CACCGCAGTG CGGCGCTCGC GCGTCACCGA TCCCTGGCTG CGGCGCAAGC TGACTCGCGG CTTCTACGCC TTCTTCAGCA AGGTCTCCGA GGTCGAGCTC GCCGAGGGCG GGGGCGACTT CCGCGTCTTC GACCGCGCGG TGGTCGACGC GATCAACAGC CTGCCCGAGC GCACCCGCTT CATGAAGGGG ATCACGAGCT GGGTGGGCTT CCGCCAGGAG GTGGTCGAGT TCGAGCCCGC GCAGCGCGCC GGCGGCGTCT CCGGGTGGTC GATGCTGCGC CTGCTGCGCT ATGCGATCGA CGGCCTGTCC GCCTTCAGCA CCCTGCCGCT GCGGGTCTGG TCGGTGATCG GCCTGATGAT GGCGGGGCTC TCGGGCTTGT ACGGACTTTT CCTCGTGCTG CGCACGATGC TGTTCGGCAT CGACCTGCCC GGCTACGTGT CGCTGATGGT GTCGGGGCTG TTCATGTCCG GCATCCAGCT CATCAGCCTG GGGGTGCTCG GCGAGTACGT GGGGCGCATC TTCACCGAGG TCAAGGGCCG GCCGCTGTTC CTGGTGTCGG AACGGCTCGG CTTCGAACGC AGGCCGGAGC GCGATCGATG A
|
Protein sequence | MSHSDSLDAH PHVPPASSAH PRLSVVVPLY NEGETVEALH RRLSQVLAAL ELSDWEIVCV DDGSRDDTYA RVAALCAGDP HLKAIRFARN FGKEAAMAAG LQAAGGDVIV LMDGDLQHPP ELIPEMLARW RGGAMMVTAV RRSRVTDPWL RRKLTRGFYA FFSKVSEVEL AEGGGDFRVF DRAVVDAINS LPERTRFMKG ITSWVGFRQE VVEFEPAQRA GGVSGWSMLR LLRYAIDGLS AFSTLPLRVW SVIGLMMAGL SGLYGLFLVL RTMLFGIDLP GYVSLMVSGL FMSGIQLISL GVLGEYVGRI FTEVKGRPLF LVSERLGFER RPERDR
|
| |
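The Gene sequence and Protein sequence fields can be cross-checked by translating the nucleotide sequence with translation table 11. The following is a minimal sketch rather than a reference implementation. It assumes the gene sequence is already given in coding orientation (it begins with the start codon GTG even though the gene lies on the minus strand), and that an alternative start codon is read as methionine. The names `clean`, `gc_fraction`, and `translate_cds` are hypothetical helpers written for this illustration.

```python
# Amino acid assignments for codons ordered T, C, A, G at each position.
# Table 11 uses the same assignments as the standard code; it differs in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codons encode Met here
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the Gene sequence field.
first_blocks = "GTGTCGCACT CCGACTCGCT CGATGCCCAC"
print(translate_cds(first_blocks))             # MSHSDSLDAH
print(round(100 * gc_fraction(first_blocks)))  # 67 (these 30 bases only)
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence string to `gc_fraction` and `translate_cds` allows the same comparison against the reported GC content and the complete protein.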