Gene Rleg_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1556 
Symbol 
ID8012635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1534846 
End bp1536306 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content53% 
IMG OID644824142 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_002975384 
Protein GI241204288 
COG category[R] General function prediction only 
COG ID[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.388564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00312965 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCGGA AAATGGCAGC GCTGGTGATT GGGGTGGCAG ACTACTCCGA GGGAAACAAG 
CTCGCCAATC CCGTTCACGA TGCTGTTGAC CTAGGCGACA AATTGAGGGG TTACGGGTTC
GAGGTCATCG TCGTAACTGA CTGCACGAAA AGGGATATGG ACAAGCAGTT AAAGGAGTTC
CGTACGCTGC TAGAAACCCA CGACGTGGGG CTGTTCTTCT TTGCGGGCCA TGGAATGCAA
ATCGATGGGA CCAACTTCCT TCTTGCCACC GATACCGAAA TGGACACGGA GTTGGACGCA
AAGCACACGT CCCTTTCGCT CGACAGAGTG GTGGACGTCA TGGCGAAGTC GGCAGCGTCC
ACTAAGATCA TCGTACTCGA TGCATGTCGG AATAACCCCT GGGAGCGCGC TTGGCATCGT
GGCCCAGCAC TTAGGGGGCT TGCTTCGGTT TACGCGCCGA AGGGAACTAT AATTGGCTTT
GCGACATCTC CCGGAGAGGT TGCGTTGGAT GGCGCGGGCC GCAACGGGAC ATACACCGAA
GCATTACTCG AGCATATCGA CACTCCTGAT AGCTCCATCG AGACCATGTT CAAGCGCGTA
CGCAATACAG TTGCGGCATC GAGTGGAGGT CGGCAGACCA CGTGGGAACA CACCTCGCTA
TCCGGAGAGT TCTATTTTAA CCTGAGTTTG GGTAACTTGG TTGATGAATA CGATGGCACC
GCGCTTGCGG ACAGCCTCTT CGTCCTCGAT CCTGGCCGGA AATCGCACAA CATCATTGCC
GGACTGAAGA CATATAACTG GTATAAACAG AACCCAGCAC TAGATCGGCT CGACGCCGCC
TCTGCAAACA AAATGTCGAA AGACAACTTG TTCGTCCTCG GGCGGAATAT CTACCAGGCG
GCCTGCGGAT CAGCGGGCTC CGCAATCACG TTCATCGACA ATTTCATGGA TAAGACACGC
GGATTCAATC GGGAAGCAAG GAAGTCACTT CTGGACGGCA TCTTGTTCGA AGTCTTCTTC
GACTCCCAAG CGCATATGCG GGACAAAATG AAAAGACGGT ATTTTAACGA GGTTTTCGAG
TTGCAGCGCT TCGCCGACCT GAAGCAAAGT TTCGATTTCA TTGCAGAAGT GCTGACCACG
GCCGCTGGCA AGTTCTATGC CTTGCCTGGC AAGGGACACG ATCTGGCTGT AAGCGTTTCC
ACGAAGAGCG AGGATGGCAT GATCATCGTG GACGCAGTGT ATGTCGGAGG TGTGGACGTG
CTTCGGCGAG AAGACGATAA TGATGCCGAT GAACACGACC CACCTCGTTA TTGGAGGCTG
AGCCCACAGG AAATAAAGGA GAGACTTAGC GAAGAGCTTG TCGTACCTAC ACGCTTGTTG
AAGTTGACTT TCACTCCTGG TGCCGCTGTA AAGGAAGAGG AATTGGGGTT CCCAATGGGT
TGGACTGTCA GGCAAACATG A
 
Protein sequence
MNRKMAALVI GVADYSEGNK LANPVHDAVD LGDKLRGYGF EVIVVTDCTK RDMDKQLKEF 
RTLLETHDVG LFFFAGHGMQ IDGTNFLLAT DTEMDTELDA KHTSLSLDRV VDVMAKSAAS
TKIIVLDACR NNPWERAWHR GPALRGLASV YAPKGTIIGF ATSPGEVALD GAGRNGTYTE
ALLEHIDTPD SSIETMFKRV RNTVAASSGG RQTTWEHTSL SGEFYFNLSL GNLVDEYDGT
ALADSLFVLD PGRKSHNIIA GLKTYNWYKQ NPALDRLDAA SANKMSKDNL FVLGRNIYQA
ACGSAGSAIT FIDNFMDKTR GFNREARKSL LDGILFEVFF DSQAHMRDKM KRRYFNEVFE
LQRFADLKQS FDFIAEVLTT AAGKFYALPG KGHDLAVSVS TKSEDGMIIV DAVYVGGVDV
LRREDDNDAD EHDPPRYWRL SPQEIKERLS EELVVPTRLL KLTFTPGAAV KEEELGFPMG
WTVRQT