Gene Information |
Locus tag | Tmz1t_1796 |
Symbol | |
ID | 7085766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2016948 |
End bp | 2018126 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698818 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_002355444 |
Protein GI | 217970210 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.10437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGATG GCAAAGTTCG CGAACTTTCG AGCTCGCGCA CGCGCGCAGC GCTGCGCGTC GCGCTGCTCG CCGGCATGCT CGCCGCCGGC AGCATGCACG CCCCGCTCGC GGGCGCGACC GGCATGGTGT ATCCGAGCGA GACGGGGCCG CTGCGCGTGA CCGAGGTCGC GCGCGGTTTG GAGAGTCCGT GGGCGCTCGC CTTCCTCCCC GACGGTCGCA TGCTGGTGAC CGAGCGCCCG GGGCGGATGC GCATCGTCGA TACGGCAGGG CGGGTGTCGG AACCCATCGC CGGCGTGCCC GAGGTCCATG CGCGCGGGCA GGGCGGCCTG CTCGACGTTG CGCTCGCGCC CGACTTCGAC CGTGACCGCC TGATCGTGTT CTCCTACGCC GAGCCGACCG CGCGGGGAGC GCGCACCGCC GTGGCGCGCG CGCGGCTCGA CGTCGACGGC CTGCGCCTCG AGGACGTGCG CCGCATCTTC GCGCAGGACG AGGATCCCGC GGGCAATCAC CACTGGGGCT CGCGCCTGGT GTTCGGGCGC GACGGCCGCC TGTTCGTGAC CCTGGGCGAT CGGTTCCACC ACCGCGAGCG CGCGCAGGCG CTCGACAGCC ATCTCGGCAA GGTGGTGCGC ATCGATCCCG ACGGCGGCGT GCCCGCCGAC AACCCGCTCG TGGGGCGTGC AGGCGTGCGC GGCGAGATCT GGTCCTGGGG CCACCGCAAC GTGCAGGGTG CGGCCTTGCA CCCGCGCACC GGCGAGCTGT GGACCCACGA GCACGGCCCG CAGGGCGGCG ACGAGATCAA CCGCACCCTG GGCGGGCGCA ATTACGGCTG GCCGGAGATC ACCTACGGCC GCGAATACGT CACCGGTCGC AAGATCGGCG CCGGCAGCGA GCGCGAAGAC GTGGCGGCGC CGGTGCTGCA ATGGACACCC TCGATCGCCC CCTCGGGCAT GGCCTTTTAT ACCGGCGATG CCTTCCCGCA GTGGCAGGGC AACCTCTTCG TCGGCGCGCT CAAGTTCCAG CTGCTCGCCC GCCTGGTGCT CGATGGCGAG CGCGTGGTGC GCGAGGAGCG CCTGCTCGAA GGCCTGGGGC GGATCCGCGA CGTGCGCCAG GGGCCGGACG GCCGCCTGTG GCTGCTCGAC GAGAGCGCCG GGCGGGTGCT GCGGATCGAT CCGCAGTGA
Protein sequence | MQDGKVRELS SSRTRAALRV ALLAGMLAAG SMHAPLAGAT GMVYPSETGP LRVTEVARGL ESPWALAFLP DGRMLVTERP GRMRIVDTAG RVSEPIAGVP EVHARGQGGL LDVALAPDFD RDRLIVFSYA EPTARGARTA VARARLDVDG LRLEDVRRIF AQDEDPAGNH HWGSRLVFGR DGRLFVTLGD RFHHRERAQA LDSHLGKVVR IDPDGGVPAD NPLVGRAGVR GEIWSWGHRN VQGAALHPRT GELWTHEHGP QGGDEINRTL GGRNYGWPEI TYGREYVTGR KIGAGSERED VAAPVLQWTP SIAPSGMAFY TGDAFPQWQG NLFVGALKFQ LLARLVLDGE RVVREERLLE GLGRIRDVRQ GPDGRLWLLD ESAGRVLRID PQ
| |
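
The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked from the coordinates and the sequence. The following is a minimal sketch, not the database's own method. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces (such as the 10-base grouping used in this record) and any
    other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Coordinates taken from the Gene Information table above.
length_bp = gene_length(2016948, 2018126)
residues = protein_length(length_bp)
print(length_bp, residues)
```

With the coordinates listed above, this yields 1179 bp and 392 residues, matching the Gene Length and Protein Length fields. Passing the full Gene sequence string to `gc_percent` gives a value that can be compared against the reported GC content.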