Gene Rleg2_2210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2210 
Symbol 
ID6980949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2269780 
End bp2270739 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content64% 
IMG OID643396928 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_002281716 
Protein GI209549799 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTACA TGGACAAGGC GGTAGCGCCT CAAAATGACG GGGCGCTGAT CGATGCCTAT 
TCACAATCGA TCGCAGCAGC AGTCGACACG GTCGGGCCGG CCGTCAGCCG GATCGAGAGG
GTTGGTGGCC GGCAGGGGCA TGGTTCGGGC TTCGCCGTCT CGCCAGACGG CCTGATCATC
ACCAACAGTC ATGTCGTCGA CGACGCCAAA GTCGTTCGCA TTACCACGCC CGACGGCTTC
GTTACGGAAG GCCGGGTGTT GGGTCGCGAT GTCGATACCG ATATCGCCCT CATTCGCGCC
AATACAAGCA CCGGCGCTTG GGCGAGGCTC GGAGATTCCC AGCGCTTGCG CCGAGGCCAT
ATCGCTATCG CGATCGGAAA CCCGCTCGGT TTCGAATGGA CGGTTACTGC CGGCATCGTC
TCGGCGCTCG GCCGGTCGAT GCGGGCAGCC AGCGGCCGCC TGATGGAGGA TGTCATCCAG
ACAGATGCGG CGCTCAATCC CGGCAACTCA GGTGGACCGC TGGTGTCTTC CAGCGGCGAG
GTCATCGGCG TCAATACCGC CGTCATCCAG GGCGCGCAGA GCATCGCTTT TGCCGTAGCG
TCGAATACCG CCAATTTCGT CGTTTCGGAA ATACTCCGCT ACGGCCAGGT CAGGCGCGCC
TTCATCGGTA TCGCGGGCGA TACGATCGTG CTGCCGCGCC GGGTGGCGCT TGCGGCGGGG
ACGGTGCAGA CGACCTCCGT TCGCATCCGA CGCGTCGAGC CCGACGGGCC GGCGGCAAAG
GGAGGGCTGC AGGAGGGCGA TTACATCCTC GCCATCGACG GCAGTCCGGT CGGCGGTGTC
GACGATATCG TCAGGCTGAT GGATGGCAGC AGGATCGACA GGGATACGGA GATACTGGTG
TTCTCGGTGG CGGGTCGGAT CGAAAAGAAG ACCTTGCTGC CGATGGCGCG GACGTCCTGA
 
Protein sequence
MGYMDKAVAP QNDGALIDAY SQSIAAAVDT VGPAVSRIER VGGRQGHGSG FAVSPDGLII 
TNSHVVDDAK VVRITTPDGF VTEGRVLGRD VDTDIALIRA NTSTGAWARL GDSQRLRRGH
IAIAIGNPLG FEWTVTAGIV SALGRSMRAA SGRLMEDVIQ TDAALNPGNS GGPLVSSSGE
VIGVNTAVIQ GAQSIAFAVA SNTANFVVSE ILRYGQVRRA FIGIAGDTIV LPRRVALAAG
TVQTTSVRIR RVEPDGPAAK GGLQEGDYIL AIDGSPVGGV DDIVRLMDGS RIDRDTEILV
FSVAGRIEKK TLLPMARTS