Gene AnaeK_2641 details


Gene Information

Locus tag: AnaeK_2641
Symbol:
ID: 6786172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 2947000
End bp: 2947968
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 71%
IMG OID: 642764107
Product: RNA polymerase, sigma 32 subunit, RpoH
Protein accession: YP_002134995
Protein GI: 197123044
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH; [TIGR02937] RNA polymerase sigma factor, sigma-70 family
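
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The sketch below is one way to check that relationship. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record: Start bp 2947000, End bp 2947968.
length = gene_length_bp(2947000, 2947968)
print(length, protein_length_aa(length))  # prints: 969 322
```

Under these assumptions the computed values agree with the reported Gene Length (969 bp) and Protein Length (322 aa).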


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0535725
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGACG AGACGCGTCA CCGCTCCACG AAGGGCCGCC CGGCACTCCC GGCCGCCGGG 
GAGGCGCGCG ACCGCTCCGA GGGCGGGCTG GTCCGCTACG ACCCGCTCCG CGCGTACATG
GCCGAGGTGG CGCGCCACCC GGTGCTCTCG CGGGACGAGG AGCACGCGCT CGCGGTCCAG
TACCGCGAGA CCGGCGACGT GGACGCCGCC TACCGGCTGG TCGCGTCGAA CCTGCGACTG
GTGGTGAAGA TCGCCCACGA GTACCGCCGC ACCGCGTTCC AGCTCCTGGA TCTCGTCCAG
GAGGGCAACC TGGGGCTCAT GCAGGCGGTG AAGAAGTACG ACCCGTGGAA GGGCGTGAAG
CTCTCGTCCT ACGCGGCCTG GTGGATCCGC GCGTACATCA TCCGCTTCAT CATGGAGAAC
TGGCGGCTGG TGAAGCTCGG GACCACCCAG GCGCAGCGCA AGCTGTTCTT CAACCTCTCC
AAGGAGCGCG AGAAGCTGCT CGCGCGCGGC ATCGAGCCGA CGCCGCGGCT GCTCGCGAAG
AACCTCCAGG TCGAGGAGAA GGACGTGGAG GAGATGAGCG CGCGCATGGC CGCGGACGAC
CTCTCGCTGG ACGCGCCGGT CGGCACCGAG GGCGACGACG GCCGCCAGAA CCGCCTCGAC
CGGCTGGCCG ACGACGGCGG CCCGTCCCCC GACGCCGCGC TCGGCGACGA GCAGCTCCGG
CGGATCTTCC GCGAGAAGCT GGACGCGTTC TCCGGGACGC TCACCGACGA GAAGGAGCGG
TACATCTTCG AGCACCGCCT CCTGCCGCCC GACGGCACGC CGCCGCTCAC GCTGCAGGAG
GTGGGGGACC ACTTCCGGCT CACCCGCGAG CGCGCCCGCC AGATCGAGGC CAAGCTGACC
GGGCGGCTGC GCGAGTTCCT CCGCGCCGAG ATCCCGGACT TCGAGCTGCT CGGGCCGCCC
GAGACCTGA
 
Protein sequence
MSDETRHRST KGRPALPAAG EARDRSEGGL VRYDPLRAYM AEVARHPVLS RDEEHALAVQ 
YRETGDVDAA YRLVASNLRL VVKIAHEYRR TAFQLLDLVQ EGNLGLMQAV KKYDPWKGVK
LSSYAAWWIR AYIIRFIMEN WRLVKLGTTQ AQRKLFFNLS KEREKLLARG IEPTPRLLAK
NLQVEEKDVE EMSARMAADD LSLDAPVGTE GDDGRQNRLD RLADDGGPSP DAALGDEQLR
RIFREKLDAF SGTLTDEKER YIFEHRLLPP DGTPPLTLQE VGDHFRLTRE RARQIEAKLT
GRLREFLRAE IPDFELLGPP ET
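
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before they can be analysed. The sketch below shows one way to clean a pasted sequence, compute its GC fraction, and translate it. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code, and it does not model alternative start codons. The helper names `clean`, `gc_fraction` and `translate` are hypothetical, not taken from any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace from a pasted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        # Codons with ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence in this record.
first_row = "ATGTCGGACG AGACGCGTCA CCGCTCCACG AAGGGCCGCC CGGCACTCCC GGCCGCCGGG"
print(translate(first_row))  # prints: MSDETRHRSTKGRPALPAAG
```

The translated first row matches the first 20 residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give a value to compare against the reported GC content.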