Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2050 |
Symbol | |
ID | 7083810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2320098 |
End bp | 2321171 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643699077 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_002355694 |
Protein GI | 217970460 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR02955] TMAO reductase system periplasmic protein TorT |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.285756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTCGGGC TTGCGCTGGG CGCGCTGCCG AGCTGCGCCG CGGCGCTGGC CGGTGCGCAG CCGGCAGCCG AGTGGCCGCT GCAACGCTGG GCAGGCGATC ATCGCGCGGC AAACGGCGCG GCCGAGACCG GGCCGGACAT CGTTCCGCCC TCCCCCACGC GTGGCTGGCG CCTGTGCGCG ATCTACCCGC ACCTGAAGGA CAGCTACTGG CTGGCGGTGA ATTTCGGCAT GACCGAGGAG GCGCGCGCGC TCGGCCTCGG CCTGGGCGTA CGCGAGGCCG GCGGCTACGG GCAGCTCGCC CGCCAGCGTG AACGCGTGGC GGAGTGCCTG GAGGATGAGG CGGTCGATGC GCTGCTGATC GGCACCGTCA GCCGCGGCGG GCTCAACGAC CTGCTCGCGC CGCAACTCGA CCATCGGCTC GTGCTCGGGG TGGTGAACGA CATCGACCCG CAGGTGGTGC GCGCGCGCAT CGCCGTGCCC TGGTACCAGC TCGGCTGGAC GATCGGCCGC TGGCTGGCCG CACGCCATCC CGCGGGCGGT CCCCCCGCGC GGATCGCGTG GATTCCCGGA CCGGCGGACG CGGACTGGGT GGGCTTCATC GACCGCGGTT TCCGCACGGG CATCGCAGGC AGCGCGGTGA ACGTCGTCGC CGAGCGCCAC GGTGACACGG GGCGCGCGAT CCAGCGCCGG CTGGTGGAGG CTGTGCTCGA CGAGCAGCCG CAGCTCGATT ACCTGGTCGG CAACGCGCCC ATGGCCGAGG CCGCGATCGC CGCGCTGCGC CGGCGCGGAC GCGAGGGGGC GACGGCGATC GTGTCGTCCT ACCTGACTCC GGCCGTCCAT GGTGGCATCC TGCGCGGCCG CATCCTCGCG GCGGTGACGG ATTTCCCGGT GCTGCAGGGG CGGCTGGCGG TGCGTCAGGC CTTGCGGGCG CTGGAGGGGC GGCCGATCGA GCCCTACCTC GGCCCGGCGG TCGAACTCGT GGACAAGAGC TCGCTGGGAC GTTTCCCGGT GCAGTGGATG TTGCCGCCGG CGGGTTTCGT ACCGGTGTAT CAGATCGCAC CGGCACTGCC CTGA
|
Protein sequence | MLGLALGALP SCAAALAGAQ PAAEWPLQRW AGDHRAANGA AETGPDIVPP SPTRGWRLCA IYPHLKDSYW LAVNFGMTEE ARALGLGLGV REAGGYGQLA RQRERVAECL EDEAVDALLI GTVSRGGLND LLAPQLDHRL VLGVVNDIDP QVVRARIAVP WYQLGWTIGR WLAARHPAGG PPARIAWIPG PADADWVGFI DRGFRTGIAG SAVNVVAERH GDTGRAIQRR LVEAVLDEQP QLDYLVGNAP MAEAAIAALR RRGREGATAI VSSYLTPAVH GGILRGRILA AVTDFPVLQG RLAVRQALRA LEGRPIEPYL GPAVELVDKS SLGRFPVQWM LPPAGFVPVY QIAPALP
|
| |