Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0015 |
Symbol | |
ID | 5320842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 13139 |
End bp | 14680 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640788946 |
Product | RNA polymerase factor sigma-54 |
Protein accession | YP_001325710 |
Protein GI | 150395243 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.973254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000243165 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCTTGT CCGCCAGCCT TCATTTAAGA CAGTCGCAGT CGCTGGTCAT GACGCCGCAA CTGATGCAGT CGATCCAGCT GCTGCAGATG AATCATCTGG AGCTCACCCA GTTCATCGCG CAAGAGATCG AAAAGAACCC GCTGCTGGAG GTCCAATCGC CGTCCGATGA GGCAAGCACG GCGGAGCGCG GGGATTCAGG ACCTCAGCCG GAGGAGGCCG GCAGCGAAAT CGACGAAGGG GCAGGCGAGG GCGATGTTTA CGACAGCGCC ACGTCGAGAT CCGGCGAGAG GCTCAGCGAT GGCCTCGACT CCGACTTCGC CAACGTCTTC CCGGACGACA CGACTCCGCA GCGGGCGGAT GCGCCCGAGC TGCTCGGCCA ATGGAAGTCT ATGCCGGGCG CGAGCGACGG CGAAAGTTAT GATCTCGACG ATTTCGTTGC CAGCCGCAAG ACATTGAGGG AGGCGCTCAT CGAGCAGCTT CCCTTTGCTC TTGGATCGGC CTCCGACCGC CTGATCGCCC AGTATCTCAT CGATCAGCTC GATGATGCCG GCTATCTGCA CGCCGATCTC GCCGAGACGG CGGAGCGGCT CGGCTCGGCA AGCGAGGACG TGACGCGCGT ACTCGACGTC CTGCAGCAGT TCGACCCGCC GGGCGTCTTT GCGCGCACCC TTGGGGAATG CCTTGGCCTT CAGTTGCGCG CCCGCAATCG CCTCGATCCG GCTATGGAGG CGCTCGTGGG CAACCTCGAT CTCCTGGCAA GGCGCGATTT CGCGAGCCTT AAAAAGATCT GCGGGGTCGA CGAGGAAGAC CTGATCGACA TGTTTGCCGA GATTCGCAAG CTCGATCCGA AACCTGGCAC CAGCTTCGAA ACCGGTTCGT TCGAGACGAT CATCCCCGAT GCCGTGGTTC GCACCGCACC GGATGGCGGC TGGCTCGTGG AGCTCAATCC GGACGCCCTG CCCCGCGTTC TCGTCAATCA CGAATATTTC GCAGAGATAT CCCGCTCGTG CCGAAAGAGC AGTGGCGAAC AGATCTTCCT CAATGAATGC CTGCAAAACG CCAACTGGCT GACGCGCAGC CTCGATCAGC GCGCCAGAAC GATCATGAAG GTGGCAAGCG AGATCGTCCG GCAGCAGGAC GCTTTTCTCA TGCACGGCGT CGACCATCTG CGCCCGCTAA ACCTCAGGAC CGTCGCGGAT GCGATCAAGA TGCATGAATC GACGGTGAGC CGGGTGACGT CCAACAAATA CATGCTGACC CCGCGCGGGC TCTACGAGCT GAAATATTTC TTTACTGTGT CGATCGGCTC GGCCGAAAAC GGCGATGCCC ACTCGGCCGA GTCCGTGCGC CATCGAATCC GGACGATGGT CAATCAGGAA AGCGCTGATG CCGTGCTATC GGACGACGAC ATCGTCGATA TCCTGAAGAA GGCGGGCGTA GACATCGCCA GACGCACGGT CGCAAAATAT CGCGAGGCGA TGCATATCCC CTCCTCTGTC CAACGCCGCC GGGAAAAGCG CGCACTGGCA AGAGTCGGAT GA
|
Protein sequence | MALSASLHLR QSQSLVMTPQ LMQSIQLLQM NHLELTQFIA QEIEKNPLLE VQSPSDEAST AERGDSGPQP EEAGSEIDEG AGEGDVYDSA TSRSGERLSD GLDSDFANVF PDDTTPQRAD APELLGQWKS MPGASDGESY DLDDFVASRK TLREALIEQL PFALGSASDR LIAQYLIDQL DDAGYLHADL AETAERLGSA SEDVTRVLDV LQQFDPPGVF ARTLGECLGL QLRARNRLDP AMEALVGNLD LLARRDFASL KKICGVDEED LIDMFAEIRK LDPKPGTSFE TGSFETIIPD AVVRTAPDGG WLVELNPDAL PRVLVNHEYF AEISRSCRKS SGEQIFLNEC LQNANWLTRS LDQRARTIMK VASEIVRQQD AFLMHGVDHL RPLNLRTVAD AIKMHESTVS RVTSNKYMLT PRGLYELKYF FTVSIGSAEN GDAHSAESVR HRIRTMVNQE SADAVLSDDD IVDILKKAGV DIARRTVAKY REAMHIPSSV QRRREKRALA RVG
|
| |