Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1062 |
Symbol | |
ID | 7084046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1163250 |
End bp | 1164101 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698080 |
Product | RNA polymerase factor sigma-32 |
Protein accession | YP_002354720 |
Protein GI | 217969486 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.369095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAAG CCCTGACTTT CCCCGTTCCC GCCGCCGGCA GCCTGGACCA GTACATCCAG AGCGTGAATC GCATGCCCAT GCTGAGCGAG CAGCAGGAGG TGGAACTGGC GACGCGGCTG CGCGACGAGG GCGACGTGGA TGCGGCACGG CAACTGGTGA TGTCGCATCT GCGCGTCGTC GTGGCGATCG CGCGCGGATA CCTCGGCTAC GGCCTGCCCC ACGCCGACCT GATCCAGGAA GGCAATATCG GACTGATGAA GGCGGTCAAG CGCTTCGACC CCACGCGCGG TGTGCGTCTG GTGTCGTTCG CGATCCACTG GATCAAGGCC GAGATCCACG AATACATCCT GAAGAACTGG CGCCTGGTGA AGATCGCCAC CACCAAGGCG CAGCGCAAGC TCTTCTTCAA CCTGCGCGGC ATGAAGCAGG ACAGCAGCAC GCTGCAGCCC GCCGAGGTGC GCTCGATCGC GGCGCAGCTG GGCGTGAAGC CCGAGGAAGT GGTCGAGATG GAGACGCGGC TGAGCGGTCG CGACATCCCG CTCGACGCCG GCAGCGATGA CGAGGACGAG CGCTTCGCAC CCATCGCCTA CCTGCCCGAT CCGCACGCCG AGCCCTCCGA GCAGGTCGAG CAGGCTCAGC TCGCCCGCCT GCAGGACAGC GGCCTGCGCG ACGCGCTCGC CAGCCTGGAC GAGCGCAGCC GCGCGATCGT GCAGCGGCGC TGGCTCGCCG AGGGCGACAG CGCCACGCTG CACGAGCTGG CGGCGGAATA CGGCGTGTCG GCCGAGCGCA TCCGCCAGAT CGAGGCCAAG GCGATGCAGA AGATGCGCGG GATGCTGGCC GCGGTGGCGT GA
|
Protein sequence | MSQALTFPVP AAGSLDQYIQ SVNRMPMLSE QQEVELATRL RDEGDVDAAR QLVMSHLRVV VAIARGYLGY GLPHADLIQE GNIGLMKAVK RFDPTRGVRL VSFAIHWIKA EIHEYILKNW RLVKIATTKA QRKLFFNLRG MKQDSSTLQP AEVRSIAAQL GVKPEEVVEM ETRLSGRDIP LDAGSDDEDE RFAPIAYLPD PHAEPSEQVE QAQLARLQDS GLRDALASLD ERSRAIVQRR WLAEGDSATL HELAAEYGVS AERIRQIEAK AMQKMRGMLA AVA
|
| |