Gene Information |
Locus tag | Tmz1t_0070 |
Symbol | |
ID | 7083453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 78974 |
End bp | 79990 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697117 |
Product | hypothetical protein |
Protein accession | YP_002353766 |
Protein GI | 217968532 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
| |
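The length fields above can be cross-checked from the coordinates. The sketch below assumes 1-based, inclusive coordinates, which is the convention that reproduces the listed 1017 bp, and an unspliced CDS whose final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(78974, 79990)
print(length_bp)                  # 1017
print(protein_length(length_bp))  # 338
```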
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTCCA CCCTCACCGA CCTGCTCGCG GCCGCGCTTG CCGCCCGAGC CCCGCTGATC GCCCAGCTGG CTGCCGAAGA CACCGACGCC TGGCGCCTCT TCCACGGCAC CGTCGAGGGC GCGCCCGGCC TCACGGTCGA CCGCTACGGC AGCGTGCTGC TGGCGCAGAC CTTCCATCGC CCGCTCAGCG CCGAGCAGCT GGCCGAGCTC GAGGCCTTCT ACGCCGCCGC CCTGCCCGGC CTGGCGCTGG TGTGGAACGA CCGCAGCGGC AAGAACTCGC GCGTCGCCAA TCCCCTTCCG CCCGAACAAC AGGCCCTCGC CGAGCAGCCT TCGGAGTTTT CCGAACTCGG CGTGCGCTAC CGCTTCCAGG CCCGCCATGG CGGGCAGGAC CCCTGGCTGT TCCTCGACCT GCGCGCCGCG CGCCGCCGCG TCATGCAGGA GGCCGCGGGC AAGTCCCTGC TCAACGTGTT CGCGTACACC TGCGGCGTCG GCATCGCCGC GGCCAAGGCC GGCGCGCGCC ACGTGGTGAA TGTGGACTTC GCCGAGTCCG CCCTCGCGGT CGGCAAGGAC AACGCACGCC TCAACGAGCT GCCGATCCGG GTGCGCTTCA TCAAGTCCGA CGCCTTCGCC GCGCTGCGCC AGTACGCCGG CATCGGTCAG CCGAAGATGG TGCGCGGCAA GCACCTGCCG CCCTTCCCCG AGCTCGCGCC GCACCGCTTC GACCTCGTCT TCCTCGATCC CCCGCGCTAC GCCAAGAGCC CCTTCGGCGT GGTCGACCTG GTGCACGACT ACGCCGCGCT GTTCAAGCCC GCGCTGCTCG CCACGGAAGA GGGCGGCACC CTGATCTGCT GCAATAACGT CGCCCGCGTG GATCGCGAGG ACTGGCTCGA GCAGCTCGAG CGCAGCGCCC GCAAGGCGGG CCGTGCGGTG CGTGAGGCCG AATGGATCAG CCCCGAGGCG GACTTTCCCA GCCGCGACGA CAACCCGCCG CTGAAGGTGG TGCTGCTGCG CGTTTGA
|
Protein sequence | MSSTLTDLLA AALAARAPLI AQLAAEDTDA WRLFHGTVEG APGLTVDRYG SVLLAQTFHR PLSAEQLAEL EAFYAAALPG LALVWNDRSG KNSRVANPLP PEQQALAEQP SEFSELGVRY RFQARHGGQD PWLFLDLRAA RRRVMQEAAG KSLLNVFAYT CGVGIAAAKA GARHVVNVDF AESALAVGKD NARLNELPIR VRFIKSDAFA ALRQYAGIGQ PKMVRGKHLP PFPELAPHRF DLVFLDPPRY AKSPFGVVDL VHDYAALFKP ALLATEEGGT LICCNNVARV DREDWLEQLE RSARKAGRAV REAEWISPEA DFPSRDDNPP LKVVLLRV
|
| |
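The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, not the database's own pipeline. It uses the amino-acid assignments of translation table 11, which match the standard code (table 11 differs only in the start codons it permits). The names `CODON_TABLE` and `translate` are illustrative, and only the first and last blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Amino-acid assignments for translation table 11, with codons
# enumerated in the conventional T, C, A, G order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three and last three blocks of the gene sequence listed above.
head = "ATGTCCTCCA CCCTCACCGA CCTGCTCGCG"
tail = "CTGAAGGTGG TGCTGCTGCG CGTTTGA"
print(translate(head))  # MSSTLTDLLA
print(translate(tail))  # LKVVLLRV*
```

The trailing `*` marks the TGA stop codon, which is why the 339 codons of the gene correspond to a 338 aa protein.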