Gene Rleg_5451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5451 
Symbol 
ID8016760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp29692 
End bp30708 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content57% 
IMG OID644827624 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_002978824 
Protein GI241518196 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5640] Secreted trypsin-like serine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.648016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.239065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAGTG TTCGCGCGAT CCTGTTGGCT GCGTTCGCAT CCATGTTGCT CGAGACCACC 
GCGTCATCAC AGGAGGCCAA GTATGCCGAC ATCCTGGACA GGGAAGCCTT AAGACAAGGC
GTTCCCACGG TATCCTGCCC GGCCGAGACC GTGGACTGCA CCGGGTCGCT GGTCCCCAAA
ATCATAAATG GCAGGCGTGC CGACGCGGGA ATGTTTCCTT GGTCCGTCTC CGTAGGCAAA
GCCAATGCCA CAAATTTCCG CGGGCACCTG TGCGGCGGCA CGCTGATAAG CGATCGGTTT
ATTCTGTCGG CGGCGCATTG CTTTCCAGAT ACCGCAAGGC CCGAAGATTA CCGGATCCAA
ATGGGTACGG TCGAACTGGA GGGGTACACG GACAAGATAT CGATCCAGAG AATCCTCATT
CACAAACACT TCAACCGTGC CACCAACGAA GCCGATGTAA GTCTGCTGGA GCTCAACACT
CCTATTACGC CAAGCGCTTC GCTTAATTGG ATACCAATCC AGGATCAAGC TGGGTTTGAA
AGCAGCGGGC ACGAGAGCAG TGAGGCTCGA CTGCAATATA CGATTACGGG CTTCGGATAC
ATTGGGCCAG GTAAAAGCCC GGTCCGGCTG CAATTTTCCA ACGACATACC ATCACTCACC
ACCGCCGAGT GTCACGAGTT GAAGGTGTGG GACGAGTCGA TCTTCGGAGA TACCCTCAAG
CCTGGCATGA TATGCGCAGG CAATACCAAC AACGTCTACA AATCCGACGC ATGCAAAGGC
GACAGCGGTG GTGGCTTGAT CCTCCCCCAA CCCGATAACA CACAGGTGGT TGTCGGGATA
GTTTCGCGCG GTGCCCTGCC TGATGGATCT CTCGACTGCA CTCAGCAGCC GTTGCGGGTC
GGCGTGTACA CGCGCGTTTC CACCTACGCA TCGGAAATTT CGTCGTGCCT TGATCCCGGT
GGCACCGGAT GCGACTTCGT CGCGCCCGGC CAGCAGACCG CGGCAGTGCA GGATTGA
 
Protein sequence
MPSVRAILLA AFASMLLETT ASSQEAKYAD ILDREALRQG VPTVSCPAET VDCTGSLVPK 
IINGRRADAG MFPWSVSVGK ANATNFRGHL CGGTLISDRF ILSAAHCFPD TARPEDYRIQ
MGTVELEGYT DKISIQRILI HKHFNRATNE ADVSLLELNT PITPSASLNW IPIQDQAGFE
SSGHESSEAR LQYTITGFGY IGPGKSPVRL QFSNDIPSLT TAECHELKVW DESIFGDTLK
PGMICAGNTN NVYKSDACKG DSGGGLILPQ PDNTQVVVGI VSRGALPDGS LDCTQQPLRV
GVYTRVSTYA SEISSCLDPG GTGCDFVAPG QQTAAVQD