Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1479 |
Symbol | |
ID | 7083562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1648752 |
End bp | 1649996 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643698497 |
Product | Radical SAM domain protein |
Protein accession | YP_002355134 |
Protein GI | 217969900 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACGA ACCTGTCCCA ATATCACCTG CTGGCCAAGC CCGCGGGCGC CGCGTGCAAC CTGGGGTGCC AGTACTGCTT TTTCCTGTCG AAGGAAAACC TGTACCACGG TGATAGCCAC CTCATGGACG AGGCCACGCT CGACCGCTAT ATCCGGCAGT TGATGGAGTC CTCGCTCGGG CCGCAGGTCG ATGTCGCGTG GCAGGGCGGT GAGCCGATGC TGCGCGGGCT GGATTTCTAC AAGCGCTCGG TGGAGCTTGC CGCGCGTTAC CGGAAGCCTC ATCAGCAGAT CCTCCACACC ATCCAGACCA ACGGCACACT GATCGACGAT GCGTGGGCGC GCTTCTTCAA GCGGCACAAC TATCTGGTCG GAATCAGCAT CGACGGGCCG CGCGCCATGC ACGACGCCTA TCGTGTCACC AAGAAGGGCG AGGGCAGCTT CGACGAGGTG GTTCGCGGCT GGAATCTGCT GCGCAAGCAT GGGGTGGATG TGAACATCCT GTGCACCGTC CATGCGGCCA ACCAGGATCA TCCGCTCGAG GTCTACCGCT TCTTCCGTGA CGAGCTGCAG GCGGAGTACA TCCAGCTCAT TCCCATCGTC GAGCGCGCGA CACCCGACAC CCTCGCCGTC GCCAACCAGG GCTGGGGCGG GCTCAAGGGC ACGGATCGTC CGCTCTACCG GCAGGAAGGC AGCCTGGTCA CCGAGCGCAC CGTGAATGCG GAGAAGTTCG GCCAGTTCCT CAACGGCATC TTCGACGAAT GGGTCAAGCG CGATGTCGGC AAGGTCTATG TGACGACCTT CGACATCGCC CTCGGCAGCT GGCTCGGGCA CCACAACGCC TGCATCGTCT CGCCCACCTG CGGCGAAGCG CTGGCGCTGG AGCACAACGG CGACGTCTAT TCCTGCGACC ACTTCGTCGA GCCCGATCAC CTGCTCGGCA ACCTCAAGGA CACGCCGCTG GGCGGGCTGG TCCGGTCGGA GAAGCAGCGC CGCTTCGGCC AGGCCAAGTA CGACACCCTG CCCAGGTACT GCAGGGAATG CCCGGTGCTG TTCGCCTGCT ACGGCGAATG CCCGCGCAAC CGCTTCATCA GGACGCCCGA CGGCGAGCAG GGGCTGAACT ACCTGTGCGC GGGCTACAAG GGCTTCTTCA CCCACATCGA CCCGGCGATG AAGACCATGG CGAGCCTGCT CAGGCAGGGG CGCTACGCCG ACGAGATCAT GGACCTTGCG GGCGCCACGC CCTGA
|
Protein sequence | MQTNLSQYHL LAKPAGAACN LGCQYCFFLS KENLYHGDSH LMDEATLDRY IRQLMESSLG PQVDVAWQGG EPMLRGLDFY KRSVELAARY RKPHQQILHT IQTNGTLIDD AWARFFKRHN YLVGISIDGP RAMHDAYRVT KKGEGSFDEV VRGWNLLRKH GVDVNILCTV HAANQDHPLE VYRFFRDELQ AEYIQLIPIV ERATPDTLAV ANQGWGGLKG TDRPLYRQEG SLVTERTVNA EKFGQFLNGI FDEWVKRDVG KVYVTTFDIA LGSWLGHHNA CIVSPTCGEA LALEHNGDVY SCDHFVEPDH LLGNLKDTPL GGLVRSEKQR RFGQAKYDTL PRYCRECPVL FACYGECPRN RFIRTPDGEQ GLNYLCAGYK GFFTHIDPAM KTMASLLRQG RYADEIMDLA GATP
|
| |