Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0708 |
Symbol | |
ID | 7083937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 791368 |
End bp | 792822 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697734 |
Product | RNA polymerase factor sigma-54 |
Protein accession | YP_002354376 |
Protein GI | 217969142 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.725065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCCA GCCTCCAGCT CAAGCTCTCC CAGCACCTCA CGCTGACGCC CCAGCTGCAG CAGTCGATCA AGCTGCTGCA GCTGTCTACG ATCGAGCTGA ACCAGGAGAT CGAGCGCGTG CTGCTCGAGA ATCCCATGCT CGAGCGCGAG GATGGCGACG GCGGCGTCGA CTACGCCCCC GCGCCGCCGC CGGGCAACGC GCCCGAGCGC GAAGCCGCGC CGGAACCCGC CGCCGAACCG GCGGGCGAGT CGATCACGCA CGAGGGCGGC GAAGGCAGCA GCGAGGACGA GGGCATCGAC TGGTCCAACG TGGGTCAGGG CGCGTCCAGC CGCAGCGAGG ACGACGAGGA CGGCGACTAC CAGGACATCC AGGCCGCCGG CGTGTCGCTG CGCGAACACC TCGACCAGCA GGTCGCCCTC TCGCCCCTGT CCGACCGCGA CCGCGCGCTG GTGCGCTTCC TGATCGAGGC ACTCGACGAC GACGGCTACC TGCACCAGCC GCTGGAAGAT CTCCTCGAGC TGTTGCCGCA GGACGCCGAG GTCGATCTCG ACGAGCTCGC CATCGCCCTG CGCCACGTGC AGAGCCTCGA GCCGGCCGGG ATCGGCGCGC GCAGCCCGCA GGAATGTCTG GCCCTGCAGC TCCAGGCCTT GCCCGAGTGC CCGATACGCC CTCTCGCTCT CGAGATCGTG ACCAGCCACC TCGAACTGCT CGCGGAGCGC AACTTCGCCC GTATCCGCAA GCTCACCGGC TGTGACGACG AGGGTCTGCG CGAAGCGCAG GCCCTGATCT GCAGCCTCGA CCCCCACCCC GGCTCGCGCT ACTCCACCGC CGAGACGCGC TACGTGCTGC CCGACGTCGT CGTGCGCAAG CTGCGCGGCC AATGGACCGT GAGCCTGAAC CAGGAAGCGA TGCCCCGCCT GCGCATCAAC CGCCTGTACG CGAGCCTGTT GCAGCAGAAC CGCGCACAGG GCGGCGGCCT CGGCGGGCAG CTGCAGGAGG CGCGCTGGCT GATCAAGAAC GTCCAGCAGC GCTTCGACAC CATCCTGCGC GTGTCCCAGG CCATCGTTGA CCAGCAACGG CAGTTCTTCG ATCATGGGGA CGTGGCGATG CGGCCACTCA CCCTAAGGGA GATCGCAGAC CAGCTCGAGC TGCACGAATC CACCGTCTCG CGCGTCACCA CGCAGAAGTA CATGGCGACG CCGCGCGGAG TCTTCGAGCT CAAGTATTTT TTCGGCAGTC ATGTCGCGAC CGACACCGGC GGTGCGGCTT CCTCCACGGC GATTCGTGCG CTGATTCGCC AGCTGGTGGA CGCGGAGGAC AGCAAGAAGC CGCTGTCCGA CGCCAAGATC GCCGAAATTC TCGGCCAACA GGGAATCGTC GTGGCGAGGC GCACGATCGC CAAGTACCGC GAGTCCCTCA ACATCCCGCC GGTCAGCCTG CGCAAATCCC TCTGA
|
Protein sequence | MKPSLQLKLS QHLTLTPQLQ QSIKLLQLST IELNQEIERV LLENPMLERE DGDGGVDYAP APPPGNAPER EAAPEPAAEP AGESITHEGG EGSSEDEGID WSNVGQGASS RSEDDEDGDY QDIQAAGVSL REHLDQQVAL SPLSDRDRAL VRFLIEALDD DGYLHQPLED LLELLPQDAE VDLDELAIAL RHVQSLEPAG IGARSPQECL ALQLQALPEC PIRPLALEIV TSHLELLAER NFARIRKLTG CDDEGLREAQ ALICSLDPHP GSRYSTAETR YVLPDVVVRK LRGQWTVSLN QEAMPRLRIN RLYASLLQQN RAQGGGLGGQ LQEARWLIKN VQQRFDTILR VSQAIVDQQR QFFDHGDVAM RPLTLREIAD QLELHESTVS RVTTQKYMAT PRGVFELKYF FGSHVATDTG GAASSTAIRA LIRQLVDAED SKKPLSDAKI AEILGQQGIV VARRTIAKYR ESLNIPPVSL RKSL
|
| |