Gene Information |
Locus tag | Tmz1t_2455 |
Symbol | |
ID | 7874139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2647828 |
End bp | 2648901 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 643699378 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002889435 |
Protein GI | 237653121 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
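The coordinate and length fields in the Gene Information table can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2647828, 2648901)
residues = protein_length_aa(length)
```

With the values in this record, the computed lengths agree with the listed Gene Length (1074 bp) and Protein Length (357 aa).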
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGCTC ACAGGGGGAG CGCAGACCTC GATTTCATCG TCCCCGGCGA CCCCGCCCAG CGCACCGGCG GCTATCTCTA CGATGCCCGC ATCGTCGAGG CGCTGCGCCA CCTCGGCTGG ACCGTCACCG TCCATGGCCT GCCCGGCCGC TTCCCCGACG CCGACCCCAC GGCGCGCGAC GCCCTGCACG ACACCCTCGC CACCCTGCCC GCCGGTCGGC GCGTGGTCAT CGACGGGCTC GCGCTCGGCG GCCTGCCCGA GGTCGCGATC GAGCACGCCG GACGGCTGGC ACTGGTCGCG CTGGTGCACC ACCCCCTGGC CGACGAGCGC GGCCTCGATC CCGTGCTGCG GCGCTGCCTG CTGGCGAGCG AACGGGCCGC GCTCGCCGCC GTGCGCCTCG CGATCACCAC CAGCGCCTTC ACCGCGCGCC GGCTGCTCGA CTTCGGACTG CGCGCCGAGC GCATCCGCTG GGTCGAGCCG GGCGTGGCGC CGCTCGCGCT GGCCGCCGCG GAGGGCGAGC CGCCGCAGCT GCTGTGCGTG GCCAGCCTGA CCCCGCGCAA GGGCCAGGAC GTGCTCGTGC GCGCGCTCGA GCGCGTACGA GCGCTGCCCT GGCGCTGCAC CCTGATCGGC AGCACGCATC GCGACCCCGG CTACGCCGGC GAGGTGGCCG AACTCGTCCG CAGCCTCGGC CTGCAGGACC GCATCCGGCT CTCCGGCGAA TGCGCGGACG CGGCCCTGCG CGACGCCTAC GCCGCCGCCG ACCTCTTCGT GCTGCCCTCC CACTACGAGG GCTACGGCAT GGTGGTCGCC GAGGCGATCG CCGCCGGGCT GCCGGTGCTC GCCACCACCG GCGGCGCGCT CGCGGGCACG CTGCCTCCCG GCGCCGGGCT GGCCGTGCCG CCCGGCGACG TCGATGCGCT CGCCGGCGCG CTTGGCGAGT TGATCGGCGA CCGCGCCCGG CGCCTGCGCC TGCGCGACGG CGCCCGCCGC GCACGCGCTG GACTGCGCGG CTGGCCGCAA GCGGGCGAGG CCTTCGCCGC CGCCCTCGCC GAACTCGCGC CGGCGTCCGC GTGA
|
Protein sequence | MSAHRGSADL DFIVPGDPAQ RTGGYLYDAR IVEALRHLGW TVTVHGLPGR FPDADPTARD ALHDTLATLP AGRRVVIDGL ALGGLPEVAI EHAGRLALVA LVHHPLADER GLDPVLRRCL LASERAALAA VRLAITTSAF TARRLLDFGL RAERIRWVEP GVAPLALAAA EGEPPQLLCV ASLTPRKGQD VLVRALERVR ALPWRCTLIG STHRDPGYAG EVAELVRSLG LQDRIRLSGE CADAALRDAY AAADLFVLPS HYEGYGMVVA EAIAAGLPVL ATTGGALAGT LPPGAGLAVP PGDVDALAGA LGELIGDRAR RLRLRDGARR ARAGLRGWPQ AGEAFAAALA ELAPASA
|
| |
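The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be normalized and its GC fraction computed is given below. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(field: str) -> str:
    """Remove the display whitespace and normalize to upper case."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, copied from the record above.
prefix = clean_sequence("GTGAGCGCTC ACAGGGGGAG")
prefix_gc = gc_fraction(prefix)

# GTG is an accepted alternative start codon in translation table 11,
# which is why the protein begins with M although the gene begins with GTG.
starts_with_gtg = prefix.startswith("GTG")
```

Passing the full Gene sequence field through the same two functions would give the whole-gene value to compare against the listed GC content.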