Gene Rmet_2459 details


Gene Information

Locus tag: Rmet_2459
Symbol:
ID: 4039282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007973
Strand:
Start bp: 2669567
End bp: 2670775
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 65%
IMG OID: 637977858
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_584605
Protein GI: 94311395
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00148709
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.252663
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAC CGTTCAACCC CGACGCCTAC GGCATTGATA CCCTCGGCGT GCGCGCCGGC 
ACGCTGCGCA CTGGCGAATT CATGGAGCAC TCCGAGGCGA TGTACCTGAC CTCGAGCTTC
TGCTTCAACA GTGCAGCCGA AGCCGCCGCA CGCTTCGCCA ACTCGGAAGA GGGTTACACC
TACTCGCGCT TCACGAATCC GACCGTGTCG ATGTTCCAGT CGCGCCTGGC CGCGCTGGAA
GGCGCGGAGG CCTGCATGGC CACGGCTTCG GGCATGAGCG CGATTCTGTC GGTGGCGCTC
GCCACGCTGC AGGCCGGCGA TCACCTGGTC AGCTCGCGTT CGATCTTCGG CTCGACGATG
ACGCTGTTCA ACTCGATCCT GGCCAAGTTC GGTGTGGAGA CGACCTACGT CGACGGCACC
GATCTGGCTG CCTGGCGTGC CGCTGTGAAG CCGAACACGA AGCTGTTCTT CCTGGAGACG
CCGTCGAACC CGCTGACCGA GGTGTCCGAT ATCGCGGCAG TTGCCGATAT TGCGCACAAT
GCCGGCGCGC TGTTCGTGGT CGACAACTGC TTCTGCTCAC CGGCTTTGCA GCAGCCGATC
AAGTTCGGTG CCGATGTGGT GGTGCATTCG GCCACCAAGC ACATCGATGG CCAGGGCCGC
GTGCTTGGCG GCGCGGTGGT CGGCAAGCAC GATTTCATCA TGGGCAAGGT GTTCCCGTTC
GTGCGTACGG CGGGCCCGAC GCTGTCGGCG TTCAACGCGT GGGTGATGCT CAAGGGCATG
GAAACGCTGG CGATCCGCAT GGAGCGTCAC TCGCAGAGCG CGTTGGCGAT TGCCGAGTTC
CTCGAGTCAC ATCCGGCCGT GAATCGTGTG TTCCACCCGG CGCTGAAGTC GCATCCGCAG
TACGAGATCG CCCAACGCCA GCAGAGCGGG GGCGGCGCGA TCGTGTCGTT CGAGTTGAAG
GGCGATAGCC CCGAAGCCAT GCGTGCTGCT GCGTGGCGCG TGATCGACAG CACGAAGCTG
TGCTCGATCA CCGGCAATCT CGGCGACACG CGCACGACGA TCACCCATCC GTACACCACC
ACCCACGGTC GCGTGGCGCC TGAAGCCAAG GCCGCCGCCG GCATCAGCGA AGGGCTGATC
CGACTGGCCG TTGGCCTGGA GTCCGTGGAG GATCTCAAGG CCGATCTGCT GCGCGGCCTG
GGCCAGTAA
 
Protein sequence
MSEPFNPDAY GIDTLGVRAG TLRTGEFMEH SEAMYLTSSF CFNSAAEAAA RFANSEEGYT 
YSRFTNPTVS MFQSRLAALE GAEACMATAS GMSAILSVAL ATLQAGDHLV SSRSIFGSTM
TLFNSILAKF GVETTYVDGT DLAAWRAAVK PNTKLFFLET PSNPLTEVSD IAAVADIAHN
AGALFVVDNC FCSPALQQPI KFGADVVVHS ATKHIDGQGR VLGGAVVGKH DFIMGKVFPF
VRTAGPTLSA FNAWVMLKGM ETLAIRMERH SQSALAIAEF LESHPAVNRV FHPALKSHPQ
YEIAQRQQSG GGAIVSFELK GDSPEAMRAA AWRVIDSTKL CSITGNLGDT RTTITHPYTT
THGRVAPEAK AAAGISEGLI RLAVGLESVE DLKADLLRGL GQ
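
The sequence blocks above are printed in groups of ten characters, so they need cleaning before any length or composition check. The following is a minimal sketch, not part of the original record. The function names (`clean_sequence`, `gc_fraction`, `expected_lengths`) are hypothetical, and the protein-length calculation assumes inclusive coordinates and a single terminal stop codon that is not translated.

```python
def clean_sequence(block):
    """Drop spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa).

    Assumes inclusive coordinates and one untranslated stop codon.
    """
    gene_len = end_bp - start_bp + 1
    return gene_len, gene_len // 3 - 1


# Coordinates taken from the Gene Information section of this record.
gene_len, protein_len = expected_lengths(2669567, 2670775)
print(gene_len, protein_len)  # 1209 402
```

Passing the Gene sequence block through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content listed in the record.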