Gene Rleg_0190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0190 
Symbol 
ID8011419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp189146 
End bp190330 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content63% 
IMG OID644822782 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_002974040 
Protein GI241202944 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0949182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.567401 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGA CCTGGCGCCC GGCAACCCAA CTCGTCCACG GTGGCACGCT GCGTTCGCAA 
TATGGCGAGA CGTCCGAGGC AATCTATCTC ACCCAAGGCT TCGTCTACGA AACGTCCGAG
GCGGCCGAAG CCCGCTTCAA GGGCGAGACG GAGGGCTTCA TCTACGCCCG CTACGGCAGC
CCCACCAACG ACATGTTCGA AAAGCGCATG TGCATGCTCG AAGGCGCCGA AGACGCCCGC
GCCACCGCTT CCGGCATGGC CGCCGTCACC GCGGCGATCC TCTGCCAGCT GAAATCAGGC
GATCATATCG TCGCCGCGCG CGCCCTGTTC GGTTCCTGCC GCTGGGTCGT CGAGACGCTG
GCGCCGAAAT ACGGCATCGA CTGCACGCTG ATCGACGGCC GGGATCTGGC GAACTGGGAA
AAGGCGATCA CGCCGAAGAC CAAGGTGTTC TTCCTGGAAA GCCCGACCAA CCCGACGCTC
GAAGTGATCG ATATCGCTGG CGTCGCCAAG CTCGCCAACC AGGTCGGCGC CAAGGTCGTC
GTCGACAATG TCTTTGCCAC GCCACTTTTC CAGAAGCCCC TGGAGCTCGG CGCCCATATC
GTCGTTTATT CCGCCACCAA GCATATTGAC GGCCAGGGCC GCTGCCTCGG CGGTGTCGTT
CTTTCCGACA AGGAATGGAT CGACGAGAAC CTGCACGACT ACTTCCGCCA TACTGGGCCG
GCCATGTCGC CGTTCAATGC CTGGACACTG TTGAAAGGCA TCGAGACGCT GCCGCTGCGC
GTGCGCCAGC AGACCGAGAA TGCGGCAAAG ATCGCCGATT TCCTGGCCGA GCAGGGCAAG
GTCGCCAAGG TGATCTATCC CGGCCGCAAG GACCATCCGC AGGCCGATAT CATCGCCAAG
CAGATGACCG GCGGCTCGAC GCTGGTCGCC TTCGAGCTGA AGGGCGGCAA GGATGCGGCC
TTTGCGCTGC AGAACGCGCT CGATATCGTC AAGATCTCCA ACAATCTCGG CGACAGCAAG
AGCCTGATCA CCCATCCGGC GACGACGACG CACAAGAACC TGACGGATGA GGCGCGCGCC
GAACTCGGCA TTTCCCCGGG CACGGTCCGC CTTTCGGCCG GCATCGAGGA TACCGACGAC
CTGATCGAAG ATTTCGCCAA GGCGCTTGAC AAGGTCTTGG CCTGA
 
Protein sequence
MSKTWRPATQ LVHGGTLRSQ YGETSEAIYL TQGFVYETSE AAEARFKGET EGFIYARYGS 
PTNDMFEKRM CMLEGAEDAR ATASGMAAVT AAILCQLKSG DHIVAARALF GSCRWVVETL
APKYGIDCTL IDGRDLANWE KAITPKTKVF FLESPTNPTL EVIDIAGVAK LANQVGAKVV
VDNVFATPLF QKPLELGAHI VVYSATKHID GQGRCLGGVV LSDKEWIDEN LHDYFRHTGP
AMSPFNAWTL LKGIETLPLR VRQQTENAAK IADFLAEQGK VAKVIYPGRK DHPQADIIAK
QMTGGSTLVA FELKGGKDAA FALQNALDIV KISNNLGDSK SLITHPATTT HKNLTDEARA
ELGISPGTVR LSAGIEDTDD LIEDFAKALD KVLA