Gene Rleg_1454 details


Gene Information

Locus tag: Rleg_1454
Symbol:
ID: 8012543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1442032
End bp: 1443042
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 58%
IMG OID: 644824043
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_002975285
Protein GI: 241204189
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
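
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1442032
end_bp = 1443042
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1011 0 336
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and gives the listed protein length.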


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.132743
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.477359
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCAGA AGAACTGGCA GGAACTGATC AAGCCGAACA AGGTGGAGTT CTCCTCGAGC 
TCGCGCACCA GGGCGACGCT TGTTGCCGAA CCGCTGGAGC GCGGCTTCGG CCTCACCCTC
GGCAACGCGC TTCGCCGCGT TCTGCTTTCC TCGCTGCGCG GTGCTGCCGT CACGGCGGTG
CAGATCGATG GCGTGCTGCA TGAATTCTCC TCGATTCCGG GCGTCCGCGA AGACGTCACG
GACATCGTGC TCAACATCAA GGAAATCGCC ATCAAGATGG ATGGCGACGA TGCAAAGCGC
ATGGTCGTGC GTAAGCAGGG CCCTGGCGTT GTCACGGCTG GCGACATTCA GACGGTCGGC
GATATCGAAA TCCTCAACCC CGAGCATGTC ATCTGCACGC TCGACGAGGG TGCCGAGATC
CGCATGGAAT TCACCGTCAA CAACGGCAAG GGCTATGTTC CGGCCGAACG CAATCGTGCG
GAAGATGCTC CGATCGGTCT CATCCCGGTC GACAGCCTCT ACTCGCCGGT CAAGAAGGTG
TCCTACAAGG TTGAAAATAC CCGCGAAGGA CAGGTTCTCG ATTACGACAA GCTGAACATG
ACCATCGAAA CCGATGGCTC GATCACCGGC GAAGACGCCG TCGCTTTTGC GGCGCGCATC
CTCCAGGATC AGCTTGGCGT CTTCGTCAAC TTCGACGAGC CGCAGAAGGA AACCGAAGAG
GAAGCAGTCA CCGAACTCGC TTTCAACCCG GCTCTCCTGA AGAAGGTGGA CGAACTCGAG
CTGTCGGTCC GTTCGGCAAA CTGCCTGAAG AACGACAACA TCGTCTACAT CGGCGACCTC
ATTCAGAAGA CCGAAGCAGA AATGCTCCGC ACACCGAATT TTGGTCGCAA GTCGCTGAAC
GAAATCAAGG AAGTTCTCGC TTCCATGGGC CTGCACCTCG GCATGGAAGT GCCGGCATGG
CCGCCCGAGA ACATCGAAGA TCTCGCCAAG CGTTACGAAG ATCAATACTG A
 
Protein sequence
MIQKNWQELI KPNKVEFSSS SRTRATLVAE PLERGFGLTL GNALRRVLLS SLRGAAVTAV 
QIDGVLHEFS SIPGVREDVT DIVLNIKEIA IKMDGDDAKR MVVRKQGPGV VTAGDIQTVG
DIEILNPEHV ICTLDEGAEI RMEFTVNNGK GYVPAERNRA EDAPIGLIPV DSLYSPVKKV
SYKVENTREG QVLDYDKLNM TIETDGSITG EDAVAFAARI LQDQLGVFVN FDEPQKETEE
EAVTELAFNP ALLKKVDELE LSVRSANCLK NDNIVYIGDL IQKTEAEMLR TPNFGRKSLN
EIKEVLASMG LHLGMEVPAW PPENIEDLAK RYEDQY
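
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch, not the database's own method: it assumes the 10-base and 10-residue blocks are separated only by whitespace, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First line of the gene sequence, copied from the record.
first_block = "ATGATTCAGA AGAACTGGCA GGAACTGATC AAGCCGAACA AGGTGGAGTT CTCCTCGAGC"
print(translate(first_block))  # MIQKNWQELIKPNKVEFSSS
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the listed protein sequence and GC content.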