Gene Rleg_4619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4619 
Symbol 
ID8015365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4740968 
End bp4742503 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content65% 
IMG OID644827194 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_002978394 
Protein GI241207298 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.221687 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCATAC GAACCCTGTT TCTGCTCACC GCATTTATGG CGTCGCTGGC GCCTGCCCTT 
TCCCATGCGC AGGAAAGCGA TCCACCGAGG GCGAATGTCC AGAGTGGCAG CACGCGCGAC
GGCGTGCTGA AGCTGCTGCC GACCGACTCC GTTACAGAGC ACGCGCTGAC GATCGGCGGC
CGGAAGCTCG CCTATACCGC CACCGCCGGC ACGCTGGATC TCTTCGGCCA GGACGGGGCG
CAGACCGGCG CGATCTTCTA CACCGCCTAT GTCGCAAGGG ATAGCGGGGC GAACCGGCCT
CTGACCTTTG CGTTCAACGG CGGGCCGGGT GCTGCTTCCG CCTTCCTGCA TCTCGGGCTG
GTCGGGCCGA AGGTGCTCGA TTTCGGGCCG GACGGACGCG ACGGCGCCAA TGCGAAACTC
GTCGACAACC CGCAGAGCTG GCTCGATTTC ACCGACCTCG TGCTGATCGA TCCGATCGGC
ACCGGCTGGA GCCGGACGGC AAAGGCCGAC GACGCCTCCA ACTATTACAA CGTCAGTGCC
GATGCTGAGA GCATCGCCAA GGCGATCGCG CTCTATGTCG CGCACAACAA CCGTTCCAAC
TCGCCGAAAT ATCTTCTCGG CGAAAGTTAT GGCGGCTTCC GCGCCGCCAA GGTCGCCTCG
GCGCTGCAGG AAAGCCAGGG CATCATCGTC GCCGGTGCGG TGATGCTCTC CCCCCTGCTC
GAGGGCCAGC TGATGTTCAA TGCCGATCAG TTCCCGCTCG GCGCCGCGCT GGAGCTGCCG
TCTCTGGCGG CAGCCGAACT CGACCGGCAC AAGGCCTTTG ACGAAGAGCA GCAGAAGGAG
GCCGAGAGCT TCGCGCTCGG GGACTATCTG ACGACGCTGG CCGGGCCGCC GCCGACGGGT
GCCGCCGCCG CCGCCTTTTA TGGCAGGATC GCCGCATTGA CCGGCATTCC CGAGGATATC
GTCACCCGCA ACCGCGGCTT CCTCGGCAGT TCCTTCGTCA AACATTCGGA CGCGGGCAGC
GGCGAGGTGA TGAGCTCATA CGACGCCTCC TTTGCAGCAC CCGATCCCTA TCCGGAATCG
GATTACGACC GCGGCGACGA CGCCATCCTC GACGGCTTCA CCCGCGCCTA TGGCGGCGCC
TTTGCCGACT ACGCCCGCAA CGAGCTCGGC TTCAAGACCG AGATGACCTA TTCGCTGCTC
GACGGCGATA TCAGCCGACG CTGGGAATGG GGCGGCGGGC GCGGCGGCGG ATCGCGATTC
CAGGCCAGCG CCACCGACGA CATCCGGCAG TTGCTCGCCG CAAACCCGGC CTTCCATCTG
CTGATTGCGC ATGGCTACAG CGATCTGGTG ACGCCTTATG GCGTCAGCCG TTACGTGGTC
GACCATCTGC CGCCCTCGCT CGCTGGCGGC CGCGTCGGGC TGAAGCTTTA TCGCGGCGGC
CATATGTTCT ATACGAAAGC AGATCAACGG GCCGCCTTCA CGGCGGATGC GAAGGCCCTC
TACGCCACAC ATCCGGTCGC TCAGCCGGCG GACTAA
 
Protein sequence
MRIRTLFLLT AFMASLAPAL SHAQESDPPR ANVQSGSTRD GVLKLLPTDS VTEHALTIGG 
RKLAYTATAG TLDLFGQDGA QTGAIFYTAY VARDSGANRP LTFAFNGGPG AASAFLHLGL
VGPKVLDFGP DGRDGANAKL VDNPQSWLDF TDLVLIDPIG TGWSRTAKAD DASNYYNVSA
DAESIAKAIA LYVAHNNRSN SPKYLLGESY GGFRAAKVAS ALQESQGIIV AGAVMLSPLL
EGQLMFNADQ FPLGAALELP SLAAAELDRH KAFDEEQQKE AESFALGDYL TTLAGPPPTG
AAAAAFYGRI AALTGIPEDI VTRNRGFLGS SFVKHSDAGS GEVMSSYDAS FAAPDPYPES
DYDRGDDAIL DGFTRAYGGA FADYARNELG FKTEMTYSLL DGDISRRWEW GGGRGGGSRF
QASATDDIRQ LLAANPAFHL LIAHGYSDLV TPYGVSRYVV DHLPPSLAGG RVGLKLYRGG
HMFYTKADQR AAFTADAKAL YATHPVAQPA D