Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_4030 |
Symbol | |
ID | 7873676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4426619 |
End bp | 4427986 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643700967 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_002890990 |
Protein GI | 237654676 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | [TIGR00895] benzoate transport |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCAGT TAGATGTTCA CAAGATCGCC GACGAGGCGA AGTTCAATGG TTTTCATGGC GTCATCCTGG CATGGTGCGC CCTGATCATC ATTTTTGACG GCTACGACCT TGCCGTCGCA GGCATTGCGA TGCCGGCAAT CATGCAGGAG ATGGGCGTCA GCGCCACCCA GGCCGGATTC ATGGCGAGCT CGGCGTTGTT CGGCATGATG TTCGGCGCCA TCTTCCTGGG CACGCTTGCC GACCGCATCG GCCGGCGCCG GGTGATCGCG ATCTGCATCG TGCTGTTCAG TGTCTTTACC GCCGCCGCGG GGCTGACCAC CGATCCCGTC ACATTCGGAG TGCTGCGCTT CATCGCCGGA CTCGGCATCG GTGGGGTCAT GCCGAACGTG GTTGCAGAGA TGAGCGAATA CGCCCCGCGC CGCATGCGGG CCACGCTCGT GGCGCTGATG TTCAGCGGCT ACGCGGTGGG CGGCATGCTG GCGGCCCTGC TGGGCAAGGC CATGATCGCC AGTTACGGCT GGCAGTCGGT CTTCTACGCG GCGCTGGCGC CGGTCGTCCT CATCCCCTTC ATCCTGCGCT CGCTGCCGGA GTCCATGCCC TTCCTGCTGC GCGTCGGTCG CTTCGAGGAA CTGAAGGCGA TCGTCGCTCG CATCGATCCG AGCTATCGAC CGCTCGCGAC CGACCGCTTC GCGCTCCCGG CCGAGGATCG CTCGGACAAG GCGCCCGTCC ATCATCTGTT CTCCGAGGGC CGTGGCTTCA GCACCGTGAT GCTTTGGCTG GCCTTCTTCA TGTGCCTGTT CATGATCTAC GCCCTGAGTA CCTGGCTCAC CAAGCTGATG GCCACGGCTG GCTACAGCCT GGGGTCGGCG ATGAGCTTCG TCTTGGTACT CAATTTCGGC GCGATCGTCG GCGCCCTGGG TGGAGGCTGG CTTGCCGACC GGTTCAGGAT CAAGCGGGTG CTGATCGGCA TGTATCTGCT CGCCGCAGCC TCGATCAGCC TGCTCGGCCA GCCGATGCCT GCGCTGTTGC TGTTCATCGT CGTCGGCCTG GCCGGGGCTA GCACCATCGG CACGCAAATC GTGACCAATG CCTTCACGGC ACAGTTCTAT CCGCTCGCGA TCCGCTCCAC CGGCTTGGGT TGGGCGCTCG GCATCGGGCG CACCGGTGCC ATCCTCGCGC CGATCCTGCT TGGCGTCCTC GTCGGCATGG CCCTGCCCCT GGAACTGAAC TTCGTGGCGA TCGCGCTACC TGCCCTGTTT GCGGCACTGG CCATCGCACT GGTCGATAAT AAGGTGTCCG CATACGAGCA CCACGAAGAC GTCTCAGCTA TTCTTCCGGA TAACGCGGAA GCGCTCAGGA AGGCTTGA
|
Protein sequence | MRQLDVHKIA DEAKFNGFHG VILAWCALII IFDGYDLAVA GIAMPAIMQE MGVSATQAGF MASSALFGMM FGAIFLGTLA DRIGRRRVIA ICIVLFSVFT AAAGLTTDPV TFGVLRFIAG LGIGGVMPNV VAEMSEYAPR RMRATLVALM FSGYAVGGML AALLGKAMIA SYGWQSVFYA ALAPVVLIPF ILRSLPESMP FLLRVGRFEE LKAIVARIDP SYRPLATDRF ALPAEDRSDK APVHHLFSEG RGFSTVMLWL AFFMCLFMIY ALSTWLTKLM ATAGYSLGSA MSFVLVLNFG AIVGALGGGW LADRFRIKRV LIGMYLLAAA SISLLGQPMP ALLLFIVVGL AGASTIGTQI VTNAFTAQFY PLAIRSTGLG WALGIGRTGA ILAPILLGVL VGMALPLELN FVAIALPALF AALAIALVDN KVSAYEHHED VSAILPDNAE ALRKA
|
| |