Gene Tmz1t_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1189 
Symbol 
ID7083849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1317171 
End bp1319171 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content66% 
IMG OID643698205 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_002354844 
Protein GI217969610 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.920411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACGCG AAAAAGCCAA GGACCCACGC AAGGACGCCC CCGCGAAGGG CCGCCCTGCC 
AAGGCCAAGG ACAAGGCAGC CCCGGTGGTC GAAAGCCCCG CCGCCACCCC GCTCGACGCG
GAGGTGCGCC GCACCCGGCT GAAGACGCTG ATCACGCTCG GCAAGGAACG CGGCTATCTC
ACCTACGCCG AGATCAGCGA CCACCTCCCC GACGATGTGG CCGATGCCGA GCAGATCGAA
GGCATCATCG CCACCTTCAA CAACATGGGC ATCCAGGTCT ATGACGAGGC GCCGGCCGCC
GAAGACCTGC TCATGTCGGA CAGCGTCGCC ACCAACGTGG ACGAGGACGT CGCCGAAGAG
GAAGCCGAAC AGGCGCTGTC CTCGGTCGAC TCCGAATTCG GCCGCACCAC CGACCCGGTG
CGCATGTACA TGCGCGAGAT GGGCACGGTC GAGCTGCTCA CCCGCGAGGG CGAGATCGAG
ATCGCCAAGC GCATCGAGGA CGGCCTCAAG CACATGGTGC AGGCCATCTC CGCCTGCCCG
ACCACGATCG CCGAGATGCT CGGCATCGTC GACCGCATCG CCGCCGACGA TGCCAAGATC
GACGAGCTGG TCGACGGCCT GATCGACCCC AACGCCGGCA GCGCCGAGGA AGCGGCCGCG
GAGGAGGAGT CCGCCGACGA GGACACCGAG GCCGAAGAGG ACGAGGGCGC GGAGGACGAG
GACGAGGGCG ACGACGAGGA AGACAGCGCC GAGAGCGCCG AGCGCGCCAA TGCCGCCTCG
CTGCTGCAAC TCAAGAACGA CGCGCTCGCC CGCTTCGACG TCATCCGCGG GCTGTACGAG
AAGAAGATGC AGGCGCTCGA GAAGCGTGGC TCGCAGGACC TCTCCTACCT CGCGCTGCAG
CAGCAGATCT CCGACGAGCT GCTCAACATC CGCTTCACCG CCAAGGCCAT CGAGAAGCTG
TGCGACTCGG TGCGCCACAT GGTCGAGCAG GTGCGCAGCC ACGAGCGCCA GATCCTGCAG
CTGTGCGTGG ATCGCGCCGG CATGCCGCGC CAGCACTTCA TCAAGGTGTT CCCCGGCCAG
GAAGTGAACC TCGACTGGCT CAAGGACGAG ATCGCCGCCG GCAAGAACTA CGCCGACGGC
CTGATGCGCA TCCACCCCGC GGTGCTCGAG GAGCAGCAGA AGCTCATCGA CCTGCAGGAC
CGCATCGGCA TCCCGCTCAA GGAACTCAAG GACATCAACC GCCAGATGTC CACCGGCGAA
GCCAAGATGC GTCGCGCCAA GCGCGAGATG ACCGAGGCCA ACCTGCGCCT GGTGATCTCG
ATCGCGAAGA AGTACACCAA CCGCGGCCTG CAGTTCCTCG ACCTCATCCA GGAAGGCAAT
ATCGGCCTGA TGAAGGCGGT GGACAAGTTC GAATACCGCC GCGGCTACAA GTTCTCGACC
TATGCCACGT GGTGGATCCG CCAGGCCATC ACGCGCTCGA TCGCCGACCA GGCGCGCACC
ATCCGCATCC CGGTGCACAT GATCGAGACG ATCAACAAGA TGAACCGCAT CAGCCGCCAG
ATCCTGCAGG AGACCGGTCA GGAGCCGGAT CCCGCGACGC TGGCCGAGAA GATGGAGATG
CCCGAGGAGA AGATCCGCAA GATCATGAAG ATCTCCAAGG AGCCGATCTC CATGGAGACG
CCGATCGGCG ACGACGACGA CTCCCACCTG GGCGACTTCA TCGAGGACAC CGCCACCCTG
GCCCCGGCCG AGGCGGCGAT GTACTCCGGC CTGCGCGACG CCACCTGCGA GGTGCTCGAC
TCGCTGACCC AGCGCGAGGC CAAGGTGCTG CGCATGCGCT TCGGCATCGA GATGAACACC
GACCACACGC TGGAAGAGGT CGGCAAGCAG TTCGACGTCA CCCGCGAGCG CATCCGCCAG
ATCGAAGCCA AGGCCCTGCG CAAGCTGCGC CACCCGAGCC GCTCCGAGAA GCTGCGCAGC
TTCCTCGACA GCGACGCGTA A
 
Protein sequence
MAREKAKDPR KDAPAKGRPA KAKDKAAPVV ESPAATPLDA EVRRTRLKTL ITLGKERGYL 
TYAEISDHLP DDVADAEQIE GIIATFNNMG IQVYDEAPAA EDLLMSDSVA TNVDEDVAEE
EAEQALSSVD SEFGRTTDPV RMYMREMGTV ELLTREGEIE IAKRIEDGLK HMVQAISACP
TTIAEMLGIV DRIAADDAKI DELVDGLIDP NAGSAEEAAA EEESADEDTE AEEDEGAEDE
DEGDDEEDSA ESAERANAAS LLQLKNDALA RFDVIRGLYE KKMQALEKRG SQDLSYLALQ
QQISDELLNI RFTAKAIEKL CDSVRHMVEQ VRSHERQILQ LCVDRAGMPR QHFIKVFPGQ
EVNLDWLKDE IAAGKNYADG LMRIHPAVLE EQQKLIDLQD RIGIPLKELK DINRQMSTGE
AKMRRAKREM TEANLRLVIS IAKKYTNRGL QFLDLIQEGN IGLMKAVDKF EYRRGYKFST
YATWWIRQAI TRSIADQART IRIPVHMIET INKMNRISRQ ILQETGQEPD PATLAEKMEM
PEEKIRKIMK ISKEPISMET PIGDDDDSHL GDFIEDTATL APAEAAMYSG LRDATCEVLD
SLTQREAKVL RMRFGIEMNT DHTLEEVGKQ FDVTRERIRQ IEAKALRKLR HPSRSEKLRS
FLDSDA