Gene Mmar10_2297 details

Gene Information

Locus tag: Mmar10_2297
Symbol:
ID: 4285880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 2502088
End bp: 2503005
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 61%
IMG OID: 638141799
Product: RNA polymerase, sigma 32 subunit, RpoH
Protein accession: YP_757527
Protein GI: 114570847
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
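
The gene and protein lengths in this record follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which is consistent with the listed 918 bp. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 2502088
end_bp = 2503005

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is the stop codon
# and does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under that assumption the computed values agree with the Gene Length and Protein Length fields.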


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000241193
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.392536
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCG CCAAGGCCGC CACTGCACGT TCTGCCCACC CGACTGCCGC CCGCGATGCC 
GGCGAACAGC GTTTTGTGAA ATCGGCCATG GCCCGGGATC TGCTGTCCCG CGAGGACGAG
GCCGATCTCG CCCGCCGCTG GAAAGATGAT CGCGACGAAA AGGCACTCCA CGAACTGACC
GAAGCCCATA TGCGTCTGGT CATCGCCGTG GCCGCCAAAT TCAAGCGTTA CGGCCTGCCG
TTTTCCGATC TGATCCAGGA AGGCAATATC GGCCTGATGA AAGCGGCCGA CCGTTTCGAC
CCCGAGCGCG ATGTACGCTT CTCGACCTAT GTGACCTGGT GGATCCGCTC CTGCATCCAG
GACTATGTGC TGCGAAACTG GTCGATCGTG CGAACCGGAA CAACATCCGC CCAGAAATCG
CTCTTTTTCA ACCTGCGCCG GATACGCGCG AATATCGGTG ATCTCGACGG CAGCTCTATC
ACGCCCGACA ATCGTCAGAA GATTGCCAAG GATTTGCGGG TACGCGAACG GGATGTCGAG
AACATGGCAC TTCGGTTGAG TGCGTCAGAC CGCTCTCTCA ATGCCCCGGT CGGCGACGCC
GAGGATTCCC AGTGGCAGGA CTTCCTGGTC GACGACACCG CGGCTCCGGA GACCGAAGTC
ATGAACCGGA CCGATAGCGA ACGCCGCAGT GCCTGGCTCG GACTGGCCCT TGATGGTCTC
AACTCGCGGG AACAATTCAT CATCCGGGAA CGGCGATTGC GTGAAGACGG GTCTACCCTG
GCAAGCCTCG GCGACAGCCT GGGCATTTCG AAGGAACGGG TCCGTCAAAT TGAAAATGCC
GCACTCGCCA AGTTGCGCGA CCATCTGACC GCAAACGTCG GCGACCCTCA TGAAGCCGGC
TTGCTTCCCG ATGCCTGA
 
Protein sequence
MTTAKAATAR SAHPTAARDA GEQRFVKSAM ARDLLSREDE ADLARRWKDD RDEKALHELT 
EAHMRLVIAV AAKFKRYGLP FSDLIQEGNI GLMKAADRFD PERDVRFSTY VTWWIRSCIQ
DYVLRNWSIV RTGTTSAQKS LFFNLRRIRA NIGDLDGSSI TPDNRQKIAK DLRVRERDVE
NMALRLSASD RSLNAPVGDA EDSQWQDFLV DDTAAPETEV MNRTDSERRS AWLGLALDGL
NSREQFIIRE RRLREDGSTL ASLGDSLGIS KERVRQIENA ALAKLRDHLT ANVGDPHEAG
LLPDA
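
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, self-contained illustration rather than the tool used to produce this record. It builds the codon-to-amino-acid mapping in the standard NCBI ordering (translation table 11 assigns the same amino acids as the standard code and differs only in which start codons it permits), strips the spacing between the 10-base blocks, and translates up to the first stop codon. The helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first and last rows of the gene sequence are pasted in, to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGACGACCG CCAAGGCCGC CACTGCACGT TCTGCCCACC CGACTGCCGC CCGCGATGCC"
last_row = "TTGCTTCCCG ATGCCTGA"

print(translate(first_row))  # MTTAKAATARSAHPTAARDA
print(translate(last_row))   # LLPDA
```

Passing the full 918 bp sequence to `translate` and `gc_fraction` in the same way allows a comparison with the listed protein sequence and the reported GC content.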