Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3014 |
Symbol | |
ID | 7874403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3266138 |
End bp | 3267076 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643699935 |
Product | RNA polymerase sigma factor RpoS |
Protein accession | YP_002889989 |
Protein GI | 237653675 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02394] RNA polymerase sigma factor RpoS [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.175723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAC CGGTCAATCT TGATGAACTG GAAAGTCAGC AGGAACCGGA TCTTCCCCCT GAGGTGGAGG TGTTCTCGTT CCAGGCGCCG CCGGTTGTCG AGAACGAGTT CTTCAGCGAC GTCACCCAGC TCTACCTCAA CGAGATCGGT GCCAATCCGC TGCTGACCGC CGAGGAGGAG CTGGTGATCG CCCGCCGCGT GCGGATGGGC GACTTCGATG CCCGGCAGAC GATGATCGAG CGCAACCTGC GCCTGGTCGT CAATATCGCC AAGCACTACC TCAATCGCGG GATTCCCTTG CTCGATCTGG TCGAGGAGGG CAACCTCGGC CTCATGCACG CGCTCGAGAA GTTCGACCCC GAGCGCGGCT TCCGCTTCTC GACTTACGCG ACGTGGTGGA TTCGGCAGAA CATCGAGCGT GCGATCATGA ACCAGTCGCG CACGATCCGC CTGCCCGTCC ACGTGGTGAA GGAACTCAAC CAGGTCCTGC GCGCGCAGCG CCACATCGAG GCCGATTGCA ACGGCGAGTC CTCGCTCGAG CAGATCGCCA ATCGGCTAGG CAAGACGATC GAGGAGGTGC GCAGCATCCT CGCGCTCGGC GAGCACACCG CCTCGCTCGA CGCACCCCTC GACATCGACC CGTCCTTGTC GATCGGCGAG TCGCTCGCGG ACGAGCAGCA CATCTCCGCC GACCTTCGCA TACAGTGCTC GGAAGTCGAG CAACTCGTGC GCGAATGGCT CGCGATCCTC AACGACAAGC AGCGCATGGT GATTCGTCAC CGCTATGGCA TCGACGAGTG CGAGTTGCTC ACGCTCGAAG AACTCGCCGA ACGCCTCGAA CTCACCCGCG AGCGCGTGCG CCAGATCCAG CTCGAGGCGC TGGGCCAACT GCGCCGGATC CTGCGCCGGC GCGGAATCTC GCGCGACGCG CTGCTCTAG
|
Protein sequence | MEEPVNLDEL ESQQEPDLPP EVEVFSFQAP PVVENEFFSD VTQLYLNEIG ANPLLTAEEE LVIARRVRMG DFDARQTMIE RNLRLVVNIA KHYLNRGIPL LDLVEEGNLG LMHALEKFDP ERGFRFSTYA TWWIRQNIER AIMNQSRTIR LPVHVVKELN QVLRAQRHIE ADCNGESSLE QIANRLGKTI EEVRSILALG EHTASLDAPL DIDPSLSIGE SLADEQHISA DLRIQCSEVE QLVREWLAIL NDKQRMVIRH RYGIDECELL TLEELAERLE LTRERVRQIQ LEALGQLRRI LRRRGISRDA LL
|
| |