Gene Smed_0015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0015 
Symbol 
ID5320842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp13139 
End bp14680 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content61% 
IMG OID640788946 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_001325710 
Protein GI150395243 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.973254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000243165 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCTTGT CCGCCAGCCT TCATTTAAGA CAGTCGCAGT CGCTGGTCAT GACGCCGCAA 
CTGATGCAGT CGATCCAGCT GCTGCAGATG AATCATCTGG AGCTCACCCA GTTCATCGCG
CAAGAGATCG AAAAGAACCC GCTGCTGGAG GTCCAATCGC CGTCCGATGA GGCAAGCACG
GCGGAGCGCG GGGATTCAGG ACCTCAGCCG GAGGAGGCCG GCAGCGAAAT CGACGAAGGG
GCAGGCGAGG GCGATGTTTA CGACAGCGCC ACGTCGAGAT CCGGCGAGAG GCTCAGCGAT
GGCCTCGACT CCGACTTCGC CAACGTCTTC CCGGACGACA CGACTCCGCA GCGGGCGGAT
GCGCCCGAGC TGCTCGGCCA ATGGAAGTCT ATGCCGGGCG CGAGCGACGG CGAAAGTTAT
GATCTCGACG ATTTCGTTGC CAGCCGCAAG ACATTGAGGG AGGCGCTCAT CGAGCAGCTT
CCCTTTGCTC TTGGATCGGC CTCCGACCGC CTGATCGCCC AGTATCTCAT CGATCAGCTC
GATGATGCCG GCTATCTGCA CGCCGATCTC GCCGAGACGG CGGAGCGGCT CGGCTCGGCA
AGCGAGGACG TGACGCGCGT ACTCGACGTC CTGCAGCAGT TCGACCCGCC GGGCGTCTTT
GCGCGCACCC TTGGGGAATG CCTTGGCCTT CAGTTGCGCG CCCGCAATCG CCTCGATCCG
GCTATGGAGG CGCTCGTGGG CAACCTCGAT CTCCTGGCAA GGCGCGATTT CGCGAGCCTT
AAAAAGATCT GCGGGGTCGA CGAGGAAGAC CTGATCGACA TGTTTGCCGA GATTCGCAAG
CTCGATCCGA AACCTGGCAC CAGCTTCGAA ACCGGTTCGT TCGAGACGAT CATCCCCGAT
GCCGTGGTTC GCACCGCACC GGATGGCGGC TGGCTCGTGG AGCTCAATCC GGACGCCCTG
CCCCGCGTTC TCGTCAATCA CGAATATTTC GCAGAGATAT CCCGCTCGTG CCGAAAGAGC
AGTGGCGAAC AGATCTTCCT CAATGAATGC CTGCAAAACG CCAACTGGCT GACGCGCAGC
CTCGATCAGC GCGCCAGAAC GATCATGAAG GTGGCAAGCG AGATCGTCCG GCAGCAGGAC
GCTTTTCTCA TGCACGGCGT CGACCATCTG CGCCCGCTAA ACCTCAGGAC CGTCGCGGAT
GCGATCAAGA TGCATGAATC GACGGTGAGC CGGGTGACGT CCAACAAATA CATGCTGACC
CCGCGCGGGC TCTACGAGCT GAAATATTTC TTTACTGTGT CGATCGGCTC GGCCGAAAAC
GGCGATGCCC ACTCGGCCGA GTCCGTGCGC CATCGAATCC GGACGATGGT CAATCAGGAA
AGCGCTGATG CCGTGCTATC GGACGACGAC ATCGTCGATA TCCTGAAGAA GGCGGGCGTA
GACATCGCCA GACGCACGGT CGCAAAATAT CGCGAGGCGA TGCATATCCC CTCCTCTGTC
CAACGCCGCC GGGAAAAGCG CGCACTGGCA AGAGTCGGAT GA
 
Protein sequence
MALSASLHLR QSQSLVMTPQ LMQSIQLLQM NHLELTQFIA QEIEKNPLLE VQSPSDEAST 
AERGDSGPQP EEAGSEIDEG AGEGDVYDSA TSRSGERLSD GLDSDFANVF PDDTTPQRAD
APELLGQWKS MPGASDGESY DLDDFVASRK TLREALIEQL PFALGSASDR LIAQYLIDQL
DDAGYLHADL AETAERLGSA SEDVTRVLDV LQQFDPPGVF ARTLGECLGL QLRARNRLDP
AMEALVGNLD LLARRDFASL KKICGVDEED LIDMFAEIRK LDPKPGTSFE TGSFETIIPD
AVVRTAPDGG WLVELNPDAL PRVLVNHEYF AEISRSCRKS SGEQIFLNEC LQNANWLTRS
LDQRARTIMK VASEIVRQQD AFLMHGVDHL RPLNLRTVAD AIKMHESTVS RVTSNKYMLT
PRGLYELKYF FTVSIGSAEN GDAHSAESVR HRIRTMVNQE SADAVLSDDD IVDILKKAGV
DIARRTVAKY REAMHIPSSV QRRREKRALA RVG