Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_2297 |
Symbol | |
ID | 4285880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2502088 |
End bp | 2503005 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638141799 |
Product | RNA polymerase, sigma 32 subunit, RpoH |
Protein accession | YP_757527 |
Protein GI | 114570847 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000241193 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.392536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCG CCAAGGCCGC CACTGCACGT TCTGCCCACC CGACTGCCGC CCGCGATGCC GGCGAACAGC GTTTTGTGAA ATCGGCCATG GCCCGGGATC TGCTGTCCCG CGAGGACGAG GCCGATCTCG CCCGCCGCTG GAAAGATGAT CGCGACGAAA AGGCACTCCA CGAACTGACC GAAGCCCATA TGCGTCTGGT CATCGCCGTG GCCGCCAAAT TCAAGCGTTA CGGCCTGCCG TTTTCCGATC TGATCCAGGA AGGCAATATC GGCCTGATGA AAGCGGCCGA CCGTTTCGAC CCCGAGCGCG ATGTACGCTT CTCGACCTAT GTGACCTGGT GGATCCGCTC CTGCATCCAG GACTATGTGC TGCGAAACTG GTCGATCGTG CGAACCGGAA CAACATCCGC CCAGAAATCG CTCTTTTTCA ACCTGCGCCG GATACGCGCG AATATCGGTG ATCTCGACGG CAGCTCTATC ACGCCCGACA ATCGTCAGAA GATTGCCAAG GATTTGCGGG TACGCGAACG GGATGTCGAG AACATGGCAC TTCGGTTGAG TGCGTCAGAC CGCTCTCTCA ATGCCCCGGT CGGCGACGCC GAGGATTCCC AGTGGCAGGA CTTCCTGGTC GACGACACCG CGGCTCCGGA GACCGAAGTC ATGAACCGGA CCGATAGCGA ACGCCGCAGT GCCTGGCTCG GACTGGCCCT TGATGGTCTC AACTCGCGGG AACAATTCAT CATCCGGGAA CGGCGATTGC GTGAAGACGG GTCTACCCTG GCAAGCCTCG GCGACAGCCT GGGCATTTCG AAGGAACGGG TCCGTCAAAT TGAAAATGCC GCACTCGCCA AGTTGCGCGA CCATCTGACC GCAAACGTCG GCGACCCTCA TGAAGCCGGC TTGCTTCCCG ATGCCTGA
|
Protein sequence | MTTAKAATAR SAHPTAARDA GEQRFVKSAM ARDLLSREDE ADLARRWKDD RDEKALHELT EAHMRLVIAV AAKFKRYGLP FSDLIQEGNI GLMKAADRFD PERDVRFSTY VTWWIRSCIQ DYVLRNWSIV RTGTTSAQKS LFFNLRRIRA NIGDLDGSSI TPDNRQKIAK DLRVRERDVE NMALRLSASD RSLNAPVGDA EDSQWQDFLV DDTAAPETEV MNRTDSERRS AWLGLALDGL NSREQFIIRE RRLREDGSTL ASLGDSLGIS KERVRQIENA ALAKLRDHLT ANVGDPHEAG LLPDA
|
| |