Gene Tmz1t_0338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0338 
Symbol 
ID7085639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp382911 
End bp384683 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content59% 
IMG OID643697373 
Productsigma-70 region 2 domain protein 
Protein accessionYP_002354021 
Protein GI217968787 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACGATC CGGCCCCATT CCGGAAGAAG CCCTTGAATC CGCTTCTGAA AATGGCAGCT 
GTAGCTGGCG TGCAGACAGC CATCAGACTG CACATCCGCC GCGGTGACGA TCTGGATGCC
GTCGAGGAGA ACGGTCGAAC GCCGCTCATG CTTGCCGCAG TCCGGGGGCA CGCCGACGTC
TGCAAACTGC TTCTCGACGC GGGAGCAGAC CCGCTTCTCA CCGATCACGA AGGGCGAGAT
GCCGTCGGTC TTGCCTTGGC AGCAGGGAAG ACCGGAGTTG TGGAAGTTCT AATGCAGTTC
CGAGTCGAAA CCGCGGACGG TTCCGCGGTA TGCCCTCCAC CTACGGGCAG GCCGGTTTCC
GCAGCTGGCT CATCACCCGA GTCGGGCGCG ACGACAGAGG ATGAATTCAG TCCAAATGGC
TGGGAGGCGG AAGAAGAGTC GCTCACACCA CCCGATGACA CTCGACGCAC AAGTTCAGCA
AGGACCATTC AGCAGGCGAT CTCCTCGCAT GCACCAACGA GCACCGACGA GGACTGGTCA
GATATCGAAA TCGATCTTCC CGAGGTCCGC ACTGGCCCCG CGGGAGGAAG GCGTTTTGAT
GACGAGGATC TCGCACGTCT CCGCCTTCTG CTCATGCAGG GCCTGTCAGT CGGGTTCGTG
AACCGCTCGG ATATCGTCGA AGCGTGCGCC GATGCAACAG AATCACATGA GGCCGACCTT
GAGGAGCGTA TCGAGCGGCT GATTGGCGAC CTGGGGCTCC ACATTGAGGA CGTTTCGCCT
TCCTTTGGTC TTGATGAACA CGGCGCTGCC GAGTCGGAGG ACGCTTGTGA GGATGCCCTC
GAGTATCTCC GCGACATCTG TTCTGGCGAC AACGATCCGT TGAAACTGTA TATCCGGGAG
GTGGGGCCCC ATCAGGCACT CACACGTGAG GATGAAGCCT TCATCGCACG CGCGATGGAG
GAGGGAGTAT CGCAAGCGAT ACGTACAATT GCCTCATGCA AGGCAGCACT CGACCAGATT
ATCGACGCTG GAGATGCGGT CTGCCGCGGG GAGGCAGATG CCGGATCAAT GTTTGATCGG
GCCGCAGGCA ATCCAGAGGA CGAACCGCAC GAATCCGCCT TTGGACTGTC ATCCACCATG
CAGGATGACG AACAGGATCA AGCATCCGAC GATGCAGGTC GCCAGCAGAT GCAACTCCAG
CCAGAACTCG GTGCAGCACT GGCCGATTTG CGCAGGCTTG TGTACTCGCT TCCCGCCAGT
ACCGAAGTCC CTGTCGCCGC TCAGGGGCGG ATTGTCTCCC TACTGAATGC CCTTGGACTG
AACTTCGATT TCCTGGAAGC AGTGTGCAAC ACGCTCCTCG CTTCGCCCGA TCATCGGGAC
ATCGGAGCCG CGGTCGCTGT CGCCCTGAAT TCCGCTCTGG ATCACAGAAA TCGAATGATC
AAGTCGAACC TTAGACTGGT CATTTCGATT GCAAAGAAGT ACACACATAC GGGCTTCCCG
TTCCTCGATC TGATCCAGGA AGGCAACTTG GGCCTCATGA AGGCGGTCGA GAAATTCGAC
TATCGTCGCG GCTTCAAGTT TTCGACTTAC GCAACCTGGT GGATCAGGCA GGCCATCACG
CGCGGCATTG CCGATCAGCA ACGCCTTGTC CGGGTTCCCG TCCATATGGT CGAATCAATC
AACAAGGTGT CGCGCGTCCT TCGCGAACTC GAGGGACGGG CGCCTCGAAA AACCTCCGCC
TCGCCCGCCC TTGGTAAAAT TACGACGTCC TGA
 
Protein sequence
MNDPAPFRKK PLNPLLKMAA VAGVQTAIRL HIRRGDDLDA VEENGRTPLM LAAVRGHADV 
CKLLLDAGAD PLLTDHEGRD AVGLALAAGK TGVVEVLMQF RVETADGSAV CPPPTGRPVS
AAGSSPESGA TTEDEFSPNG WEAEEESLTP PDDTRRTSSA RTIQQAISSH APTSTDEDWS
DIEIDLPEVR TGPAGGRRFD DEDLARLRLL LMQGLSVGFV NRSDIVEACA DATESHEADL
EERIERLIGD LGLHIEDVSP SFGLDEHGAA ESEDACEDAL EYLRDICSGD NDPLKLYIRE
VGPHQALTRE DEAFIARAME EGVSQAIRTI ASCKAALDQI IDAGDAVCRG EADAGSMFDR
AAGNPEDEPH ESAFGLSSTM QDDEQDQASD DAGRQQMQLQ PELGAALADL RRLVYSLPAS
TEVPVAAQGR IVSLLNALGL NFDFLEAVCN TLLASPDHRD IGAAVAVALN SALDHRNRMI
KSNLRLVISI AKKYTHTGFP FLDLIQEGNL GLMKAVEKFD YRRGFKFSTY ATWWIRQAIT
RGIADQQRLV RVPVHMVESI NKVSRVLREL EGRAPRKTSA SPALGKITTS