Gene Rleg2_5028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5028 
Symbol 
ID6978122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp675154 
End bp676740 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content60% 
IMG OID643394172 
ProductRNA polymerase, sigma 54 subunit, RpoN 
Protein accessionYP_002278990 
Protein GI209547072 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTGT CCGCCAATCT ATCACTGCAT CAGACGCAAT CCCTTGTCAT GACACCGCGA 
CTAGTAGAGT CGATCAAGTT GCTGCGAATG AATCATCACG AACTCAGCCA GTTCATTTCT
CAGAAAGCGG AGGAGAATCC CCTCCTCGAA ATTTCGTCGG ATGAGGTCGA AGGCGACGAG
TGGGGGGCAT GCGGCGACGA CCCGCTCAAT CCCTCGCCGG ATGACGCCGC TGATAGCGGC
TCCCACAATC GTGAGGAGGC GCTGTCGAGC GACTGGTACG ATGATGTCGG CTGCGCCGGC
ACCACTCGGC TGAACGGTGA GCTCGACGAC AAGTATTCCA ACGTCTACGA GGATTGTGGC
TCAAGCGACC TTGATGCGCC TGGATCCGAC GACCAGCGGC AATCGGGTAG CAGTGGAAAG
GGCCCCGATT ACTATCTCAG TAGACATGTC GCTGGACCCA CGACGTTGGG CCATCATCTC
GCCCAGCAAA TCCCCTTCGT CCTGCCCGAC AAGGTCGATC GGCTGATTGC GCAGTACTTA
GTCGATCGGC TCGACGATGC CGGTTACCTG CAGGTCGAAC TCATCGAGGT CGCCGAGCGG
CTGGGCACGA GTCTGACTGC GGCCGAGCGT GTGCTTTCCG CCCTGCAGAC GCTCGATCCA
CCGGGAGTTT TCGCCCGCAG CCTTTCGGAG TGCCTAGAGC TCCAGCTCAG GCAGAAGGAT
CGGTGCGACC CCGCCATGCA GACGCTGATC GAAAACCTCG AACTGCTGGC ACGGCGCGAC
TTTGCTACAC TGAAGCGGCT CTGCGGCGTC GACGAGGAAG ACCTTCTCGA CATGCTTGGC
GATATCCAGG AGCTCAATCC CAAGCCCGGC CTCGGTTTCG GACGCAGGGC TGGTGACACA
TTGCCTGACG TGATTGTCCG ACGCTCTTTA GAAGGTGGCT GGCTGGTCAA TCTCAATCCG
AATACCCTGC CGCGCGTGTT GGTCAACCAG TCCTATTTTT CCCAGGTGAC CAAGGGCGGC
GAGGTTTACG CATTCCTTGC CGAGTGCCTT CGGGACGCCG AGTGGCTAAC GCGCAGCCTC
AACCAGCGGG CAAACACGAT CATGAAGGTG GCAAACGAGA TTGTCCGCCA GCAGGACGCC
TTCCTGCTGA ACGGTGTTGA CCACCTGCGC CCGTTGAATC TGAAAACGGT GGCACAGGCA
ATCGACAGGA GCGAATCCAC CGTCAGCCGC GTTACCTCGG ACAAATATAT GCTGACGCCG
CGCGGCCTCT TCGAGCTGAA ATATTTCTTC ACCGTCTCGA TCAGCGCGGT TGCTGGCGGC
GACAGCCATT CAGCCGAGGC GGTGCGTCAC AAGATCCGCG CGGTGATCAT GCAGGAGAGC
CCGGACGCGG TGCTCTCCGA CGACGACATC GTAGACATGC TGAAGAAGGG TGGCGTCGAT
CTCGCCCGCC GCACGGTGGC CAAATACCGC CAGGCTATGA ACATCGCCTC CTCGGTTCAG
CGCGGTCGCG AAAAACGGGC ACGCGCAAAC TCGCCGGTTC GTGAGGACTC TTCAATGGTA
TTCCCCTCGA AAACGCAAAG CGAGTGA
 
Protein sequence
MPLSANLSLH QTQSLVMTPR LVESIKLLRM NHHELSQFIS QKAEENPLLE ISSDEVEGDE 
WGACGDDPLN PSPDDAADSG SHNREEALSS DWYDDVGCAG TTRLNGELDD KYSNVYEDCG
SSDLDAPGSD DQRQSGSSGK GPDYYLSRHV AGPTTLGHHL AQQIPFVLPD KVDRLIAQYL
VDRLDDAGYL QVELIEVAER LGTSLTAAER VLSALQTLDP PGVFARSLSE CLELQLRQKD
RCDPAMQTLI ENLELLARRD FATLKRLCGV DEEDLLDMLG DIQELNPKPG LGFGRRAGDT
LPDVIVRRSL EGGWLVNLNP NTLPRVLVNQ SYFSQVTKGG EVYAFLAECL RDAEWLTRSL
NQRANTIMKV ANEIVRQQDA FLLNGVDHLR PLNLKTVAQA IDRSESTVSR VTSDKYMLTP
RGLFELKYFF TVSISAVAGG DSHSAEAVRH KIRAVIMQES PDAVLSDDDI VDMLKKGGVD
LARRTVAKYR QAMNIASSVQ RGREKRARAN SPVREDSSMV FPSKTQSE