Gene Information |
Locus tag | Tmz1t_0338 |
Symbol | |
ID | 7085639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 382911 |
End bp | 384683 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643697373 |
Product | sigma-70 region 2 domain protein |
Protein accession | YP_002354021 |
Protein GI | 217968787 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | |
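The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the record lists the gene as not spliced). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information rows above.
START_BP = 382911
END_BP = 384683
GENE_LENGTH_BP = 1773
PROTEIN_LENGTH_AA = 590


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1773 bp, and 1773 bp corresponds to 591 codons, of which 590 encode amino acids.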
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGAACGATC CGGCCCCATT CCGGAAGAAG CCCTTGAATC CGCTTCTGAA AATGGCAGCT GTAGCTGGCG TGCAGACAGC CATCAGACTG CACATCCGCC GCGGTGACGA TCTGGATGCC GTCGAGGAGA ACGGTCGAAC GCCGCTCATG CTTGCCGCAG TCCGGGGGCA CGCCGACGTC TGCAAACTGC TTCTCGACGC GGGAGCAGAC CCGCTTCTCA CCGATCACGA AGGGCGAGAT GCCGTCGGTC TTGCCTTGGC AGCAGGGAAG ACCGGAGTTG TGGAAGTTCT AATGCAGTTC CGAGTCGAAA CCGCGGACGG TTCCGCGGTA TGCCCTCCAC CTACGGGCAG GCCGGTTTCC GCAGCTGGCT CATCACCCGA GTCGGGCGCG ACGACAGAGG ATGAATTCAG TCCAAATGGC TGGGAGGCGG AAGAAGAGTC GCTCACACCA CCCGATGACA CTCGACGCAC AAGTTCAGCA AGGACCATTC AGCAGGCGAT CTCCTCGCAT GCACCAACGA GCACCGACGA GGACTGGTCA GATATCGAAA TCGATCTTCC CGAGGTCCGC ACTGGCCCCG CGGGAGGAAG GCGTTTTGAT GACGAGGATC TCGCACGTCT CCGCCTTCTG CTCATGCAGG GCCTGTCAGT CGGGTTCGTG AACCGCTCGG ATATCGTCGA AGCGTGCGCC GATGCAACAG AATCACATGA GGCCGACCTT GAGGAGCGTA TCGAGCGGCT GATTGGCGAC CTGGGGCTCC ACATTGAGGA CGTTTCGCCT TCCTTTGGTC TTGATGAACA CGGCGCTGCC GAGTCGGAGG ACGCTTGTGA GGATGCCCTC GAGTATCTCC GCGACATCTG TTCTGGCGAC AACGATCCGT TGAAACTGTA TATCCGGGAG GTGGGGCCCC ATCAGGCACT CACACGTGAG GATGAAGCCT TCATCGCACG CGCGATGGAG GAGGGAGTAT CGCAAGCGAT ACGTACAATT GCCTCATGCA AGGCAGCACT CGACCAGATT ATCGACGCTG GAGATGCGGT CTGCCGCGGG GAGGCAGATG CCGGATCAAT GTTTGATCGG GCCGCAGGCA ATCCAGAGGA CGAACCGCAC GAATCCGCCT TTGGACTGTC ATCCACCATG CAGGATGACG AACAGGATCA AGCATCCGAC GATGCAGGTC GCCAGCAGAT GCAACTCCAG CCAGAACTCG GTGCAGCACT GGCCGATTTG CGCAGGCTTG TGTACTCGCT TCCCGCCAGT ACCGAAGTCC CTGTCGCCGC TCAGGGGCGG ATTGTCTCCC TACTGAATGC CCTTGGACTG AACTTCGATT TCCTGGAAGC AGTGTGCAAC ACGCTCCTCG CTTCGCCCGA TCATCGGGAC ATCGGAGCCG CGGTCGCTGT CGCCCTGAAT TCCGCTCTGG ATCACAGAAA TCGAATGATC AAGTCGAACC TTAGACTGGT CATTTCGATT GCAAAGAAGT ACACACATAC GGGCTTCCCG TTCCTCGATC TGATCCAGGA AGGCAACTTG GGCCTCATGA AGGCGGTCGA GAAATTCGAC TATCGTCGCG GCTTCAAGTT TTCGACTTAC GCAACCTGGT GGATCAGGCA GGCCATCACG CGCGGCATTG CCGATCAGCA ACGCCTTGTC CGGGTTCCCG TCCATATGGT CGAATCAATC AACAAGGTGT CGCGCGTCCT TCGCGAACTC GAGGGACGGG CGCCTCGAAA AACCTCCGCC TCGCCCGCCC TTGGTAAAAT TACGACGTCC TGA
Protein sequence | MNDPAPFRKK PLNPLLKMAA VAGVQTAIRL HIRRGDDLDA VEENGRTPLM LAAVRGHADV CKLLLDAGAD PLLTDHEGRD AVGLALAAGK TGVVEVLMQF RVETADGSAV CPPPTGRPVS AAGSSPESGA TTEDEFSPNG WEAEEESLTP PDDTRRTSSA RTIQQAISSH APTSTDEDWS DIEIDLPEVR TGPAGGRRFD DEDLARLRLL LMQGLSVGFV NRSDIVEACA DATESHEADL EERIERLIGD LGLHIEDVSP SFGLDEHGAA ESEDACEDAL EYLRDICSGD NDPLKLYIRE VGPHQALTRE DEAFIARAME EGVSQAIRTI ASCKAALDQI IDAGDAVCRG EADAGSMFDR AAGNPEDEPH ESAFGLSSTM QDDEQDQASD DAGRQQMQLQ PELGAALADL RRLVYSLPAS TEVPVAAQGR IVSLLNALGL NFDFLEAVCN TLLASPDHRD IGAAVAVALN SALDHRNRMI KSNLRLVISI AKKYTHTGFP FLDLIQEGNL GLMKAVEKFD YRRGFKFSTY ATWWIRQAIT RGIADQQRLV RVPVHMVESI NKVSRVLREL EGRAPRKTSA SPALGKITTS
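The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes NCBI translation table 11 (as listed in the record), in which the codon-to-amino-acid mapping matches the standard code and an alternative start codon such as TTG is read as methionine at the first position. The `translate` helper is hypothetical and written for illustration. It is not the tool used to produce this record.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acids as in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons permitted by table 11; at position 0 they are translated as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("TTGAACGATC CGGCCCCATT CCGGAAGAAG"))
```

Applied to the first 30 bases of the gene sequence, this yields the same ten residues that open the protein sequence (MNDPAPFRKK), which shows why the TTG start codon appears as M in the protein.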