Gene Rleg_3550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3550 
Symbol 
ID8015815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3584826 
End bp3586076 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content67% 
IMG OID644826115 
Productputative RNA polymerase, sigma 70 family subunit 
Protein accessionYP_002977335 
Protein GI241206239 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCG ATGCCAGACG CGCTGCCGAA AAGGTGGCGC GTCAGTCCTA CGGCAAGCTG 
ATCGCCTTTC TCGCCGCCCG CTCGCGCGAC GTGCCGGCGG CAGAAGACGC ATTGTCGGAA
GCGCTGGCTT CGGCGCTTCG CGTCTGGCCG GAGCGCGGCG TTCCCGGCAA TCCGGAAGCC
TGGCTGTTGG TGGCTGCCCG CCGCAACCTC ATGCAGGCTG CCCGCCACCG CACCGTCGAG
GCCAATGCGC AAACGACGAT ATCGGTCGCC TTCGAGGAGG CGGAGGAACG CATGAACGCG
GCCGGCAACG CCGTCTTTCC CGATGAGCGG CTGAAGCTGC TCTTTGCCTG CACTCACCCA
GCCATCGACA GCTCCGTGCA TACGGCGCTG ATGCTGCAGA CCGTTCTCGG CATCGAGGCG
AGAACCATCG CACGGACCTT CGTCGTTTCG CCAGAAACGA TGAGCCAGCG GCTGGTGCGG
GCCAAGGTGA AGATCCGCGA TGCCGGCATT CCCTTCGCCG TCCCGCCTCG CCCCGCCTTA
CCCGGGCGGC TGGCCGCGGT GCTTTCGGCG ATCTACGCGG CTTACGGTCT CGGCTGGGAC
GGTCTTGACG GAGAGAACGA ACGCCATTCG CTCGCCGGCG AGGCGATCTG GCTCGGCCGT
GCGCTTCTGG CAGTGCTGCC GGACGAACCG GAAGCGATCG GCCTGCTCTC GCTGATGCTT
CACTGCGAAG CCCGCCGCAG CGCCCGGCGC GACGATAGCG GCCGATATGT GCCGCTGGAC
GAGCAGGATA CGACCGCGTG GAACGCGATC ATGATTGCCG AGGCGGATGC GCTGCTGCGC
AAGGCCGGCA GCTTCGACCG TTTCGGTCCC TTCCAGTGCC AGGCAGCGAT CCAATCCGTG
CATGCGGCGC GCCGGCTGTC GGGAACGACC GACTGGCAGG CGCTGACGAC ACTCTACGCG
GCGCTGGTGA TGATGAAGCC GACGCTGGGT GCGCATGTCA GCCAGGCTGC GGTGATCGGC
AGGGCGTTCA GCGCCACTGC CGGCCTGGAG CGGCTCGACA AGCTCGATCC GCGAGACATC
GCAAGCTACC AGCCCTACTG GGCGGTGCGT GCCTTTCTCC TGGCTCAGGC CGGTGATCAT
ACCGCGGCAG CCGATAGCTA TATGACGGCG ATCGGCCTCA GCGACAGCGC CGCCGTGCGC
GATTTTCTCG CCGTCCGGCT AAGGGACGCG CGGCAGGCGA TATCAAGTTA A
 
Protein sequence
MPPDARRAAE KVARQSYGKL IAFLAARSRD VPAAEDALSE ALASALRVWP ERGVPGNPEA 
WLLVAARRNL MQAARHRTVE ANAQTTISVA FEEAEERMNA AGNAVFPDER LKLLFACTHP
AIDSSVHTAL MLQTVLGIEA RTIARTFVVS PETMSQRLVR AKVKIRDAGI PFAVPPRPAL
PGRLAAVLSA IYAAYGLGWD GLDGENERHS LAGEAIWLGR ALLAVLPDEP EAIGLLSLML
HCEARRSARR DDSGRYVPLD EQDTTAWNAI MIAEADALLR KAGSFDRFGP FQCQAAIQSV
HAARRLSGTT DWQALTTLYA ALVMMKPTLG AHVSQAAVIG RAFSATAGLE RLDKLDPRDI
ASYQPYWAVR AFLLAQAGDH TAAADSYMTA IGLSDSAAVR DFLAVRLRDA RQAISS