Gene Information |
Locus tag | Tmz1t_3312 |
Symbol | |
ID | 7874210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3630517 |
End bp | 3631497 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643700246 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_002890284 |
Protein GI | 237653970 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
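The length fields in this record can be cross-checked from the coordinates. The short sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3630517
end_bp = 3631497

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the terminal stop codon encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 981 bp
print(protein_length, "aa")  # 326 aa
```

Both values agree with the Gene Length and Protein Length fields listed above.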
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.713226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAGCA ATGCACTGCT GAAACCGCGC ATCATCGACG TCCAGAGCAC CTCGCCCGTT CAGGCGCGGG TGACGATGGA GCCGTTCGAG CGCGGTTTCG GTCACACGCT GGGGAATGCG CTTCGGCGCA TCCTCCTGTC GTCGCTTCCC GGCCATGCGC CGACCGAAGT GACGATCGAA GGGGTGCTTC ACGAGTACTC CACGCTCGAC GGCGTCCGCG AGGACATCGT CGATCTGCTG CTCAACTTGA AGGGCGTGGT GTTCAAGCTC AACGGGCGCA GCGAGGTCAC CGTCCGTCTC GCCAAGGCAG GCGAGGGTGT GGTGACTGCG GCCGACATCG AGGTCGGTCA CGACGTCGAG ATCATCAATC CCGACCACGT GATCGCCCAC CTCGCGCCGG GCGGCAAGCT CGACATGCAG ATCAAGGTCG AGGAAGGCCG CGGCTACGTG CCGGGCAATT CGCGTCCCGA GAACGCGGAG AGCAAGTCCA TCGGTCGCGT CGTGCTCGAT GCCTCCTTCG GTCCCGTGCG TCGCGTCAGC TACCTTGTCG AGAGCGCTCG CGTCGAGCAG CGTACCGACC TGGATCGCCT GATCATCGAC ATCGAGACCA ACGGTGCGGT CGACCCGGAA GAGGCGATTC GCTACGCCGC CCGGGTTCTC ATGGACCAGC TTTCCGTGTT CGCCGACCTC GAGGGGACGG CACCGGAGCG CGTCGAGGCG GCTTCGCCGA CCATCGACCC GGTGCTGCTG CGCCCGGTCG ACGATCTGGA GCTCACGGTG CGCTCGGCCA ACTGCCTGAA GGCCGAGAAC ATCTACTACA TCGGCGACCT GATCCAGCGT ACCGAGACCG AGCTGCTGAA GACCCCGAAC CTGGGCCGCA AGTCGCTCAA CGAGATCAAG GAAGTGTTGG CCTCCCGCGG GCTGACGCTT GGCATGAAAC TGGAAAACTG GCCGCCGGCC GGGCTTGAAA AGCTCGGTTG A
|
Protein sequence | MQSNALLKPR IIDVQSTSPV QARVTMEPFE RGFGHTLGNA LRRILLSSLP GHAPTEVTIE GVLHEYSTLD GVREDIVDLL LNLKGVVFKL NGRSEVTVRL AKAGEGVVTA ADIEVGHDVE IINPDHVIAH LAPGGKLDMQ IKVEEGRGYV PGNSRPENAE SKSIGRVVLD ASFGPVRRVS YLVESARVEQ RTDLDRLIID IETNGAVDPE EAIRYAARVL MDQLSVFADL EGTAPERVEA ASPTIDPVLL RPVDDLELTV RSANCLKAEN IYYIGDLIQR TETELLKTPN LGRKSLNEIK EVLASRGLTL GMKLENWPPA GLEKLG
|
| |
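As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates a DNA string codon by codon and computes its GC fraction. It is a minimal example, not the database's own pipeline. The codon-to-residue assignments of translation table 11 are the same as those of the standard genetic code (the tables differ in permitted start codons), so a standard codon table is used here. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The residue assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the whitespace that separates the 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGCAAAGCA ATGCACTGCT GAAACCGCGC"
print(translate(prefix))  # MQSNALLKPR
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein sequence and the GC content field to be checked in the same way.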