Gene Tmz1t_3014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3014 
Symbol 
ID7874403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3266138 
End bp3267076 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content64% 
IMG OID643699935 
ProductRNA polymerase sigma factor RpoS 
Protein accessionYP_002889989 
Protein GI237653675 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02394] RNA polymerase sigma factor RpoS
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.175723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAAC CGGTCAATCT TGATGAACTG GAAAGTCAGC AGGAACCGGA TCTTCCCCCT 
GAGGTGGAGG TGTTCTCGTT CCAGGCGCCG CCGGTTGTCG AGAACGAGTT CTTCAGCGAC
GTCACCCAGC TCTACCTCAA CGAGATCGGT GCCAATCCGC TGCTGACCGC CGAGGAGGAG
CTGGTGATCG CCCGCCGCGT GCGGATGGGC GACTTCGATG CCCGGCAGAC GATGATCGAG
CGCAACCTGC GCCTGGTCGT CAATATCGCC AAGCACTACC TCAATCGCGG GATTCCCTTG
CTCGATCTGG TCGAGGAGGG CAACCTCGGC CTCATGCACG CGCTCGAGAA GTTCGACCCC
GAGCGCGGCT TCCGCTTCTC GACTTACGCG ACGTGGTGGA TTCGGCAGAA CATCGAGCGT
GCGATCATGA ACCAGTCGCG CACGATCCGC CTGCCCGTCC ACGTGGTGAA GGAACTCAAC
CAGGTCCTGC GCGCGCAGCG CCACATCGAG GCCGATTGCA ACGGCGAGTC CTCGCTCGAG
CAGATCGCCA ATCGGCTAGG CAAGACGATC GAGGAGGTGC GCAGCATCCT CGCGCTCGGC
GAGCACACCG CCTCGCTCGA CGCACCCCTC GACATCGACC CGTCCTTGTC GATCGGCGAG
TCGCTCGCGG ACGAGCAGCA CATCTCCGCC GACCTTCGCA TACAGTGCTC GGAAGTCGAG
CAACTCGTGC GCGAATGGCT CGCGATCCTC AACGACAAGC AGCGCATGGT GATTCGTCAC
CGCTATGGCA TCGACGAGTG CGAGTTGCTC ACGCTCGAAG AACTCGCCGA ACGCCTCGAA
CTCACCCGCG AGCGCGTGCG CCAGATCCAG CTCGAGGCGC TGGGCCAACT GCGCCGGATC
CTGCGCCGGC GCGGAATCTC GCGCGACGCG CTGCTCTAG
 
Protein sequence
MEEPVNLDEL ESQQEPDLPP EVEVFSFQAP PVVENEFFSD VTQLYLNEIG ANPLLTAEEE 
LVIARRVRMG DFDARQTMIE RNLRLVVNIA KHYLNRGIPL LDLVEEGNLG LMHALEKFDP
ERGFRFSTYA TWWIRQNIER AIMNQSRTIR LPVHVVKELN QVLRAQRHIE ADCNGESSLE
QIANRLGKTI EEVRSILALG EHTASLDAPL DIDPSLSIGE SLADEQHISA DLRIQCSEVE
QLVREWLAIL NDKQRMVIRH RYGIDECELL TLEELAERLE LTRERVRQIQ LEALGQLRRI
LRRRGISRDA LL