Gene Rleg_2103 details

Gene Information

Locus tag: Rleg_2103
Symbol:
ID: 8013127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2093354
End bp: 2094595
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 61%
IMG OID: 644824689
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_002975919
Protein GI: 241204823
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
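
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the reported protein length excludes a single terminal stop codon.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated by the record itself): coordinates are 1-based
    and inclusive, and the CDS ends with exactly one stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2093354, 2094595)
print(gene_len, protein_len)  # prints "1242 413"
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields of this record.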


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.105912
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAAGA TAGTGCCGGC CAAGCCATAC GACGTCGAAG CCATCCGCCG GGATTTTCCG 
ATCCTAGCGG AGAAGGTGCA TGGCAAGCCG CTGGTCTATC TCGACAACGG CGCGTCGGCG
CAGAAGCCGC AGGTGGTGAT CGACGCCATC TCGCATGCCT ATAGCCATGA ATATGCCAAT
GTGCATCGTG GCCTGCACTA TCTCTCGAAT GCGGCCACGG ACGCCTATGA GGCAGCGCGC
GAGAAGGTCC GCCGCTTCCT CAATGCGCCT TCGGTGAACG ACATCGTCTT CACCAAGAAT
TCGACGGAAG CGATCAACAC CGTCGCCTAT GGCTGGGGCA TGCCGAAGAT TGGCGAAGGC
GACGAGATCG TGCTTACGAT CATGGAGCAC CATTCCAACA TCGTGCCCTG GCACTTCATC
CGCGAGCGGC AGGGCGCCAA ACTTGTCTGG GTGCCTGTCG ACGACGAGGG CGCCTTCCAT
ATTGAGGATT TCGAGAAGAG CCTGACGGAG CGCACCAAGC TCGTTGCCAT CACCCATATG
TCGAATGCGC TCGGCACAAT CGTTCCCGTC AAGGAAGTCT GCCGGATCGC GCATGAGCGC
GGCATTCCGG TGCTGATCGA CGGCAGCCAG GGCGCCGTGC ATCTGCCTGT TGACGTGCAG
GATATCGATT GCGACTGGTA CGTCATGACC GGCCACAAGC TCTACGGCCC GTCAGGCATC
GGCGTGCTTT ACGGCAAGAA GGAGCGGCTT TTCGAGATGC GCCCGTTCCA GGGCGGTGGA
GAGATGATCT TCGAGGTCGC CGAGGATATG GTCACTTATA ACGACCCGCC GCATCGCTTC
GAGGCCGGCA CGCCGCCGAT CGTGCAGGCG ATCGGGCTCG GTTATGCGCT CGACTACATG
GAGAAGATCG GCCGCGAGGC GATCGCCCGG CATGAGGCCG ATCTTGCCGC CTATGCGGTC
GAGCGGCTGA AATCCGTCAA TTCGCTGCGA GTCTTCGGGA CGGCGCCCGA CAAGGGCAGC
ATCTTTTCCT TCGAACTTGC CGGCATTCAT GCCCACGACG TCTCGATGGT GATCGACCGG
CAGGGTGTTG CAGTCAGGGC CGGCACGCAT TGCGCCATGC CGCTCTTGAA ACGCTTCGGC
GTCACCTCCA CATGCCGTGC ATCCTTCGGC ATGTACAATA CCCGCGCCGA GGTCGATGCC
CTGGCCGATG CGCTTGATTA TGCGCGCAAG TTCTTTGCTT GA
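
The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the Gene Information fields. A minimal sketch of that step follows; the helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported GC content is not known from this page.

```python
def clean_sequence(blocked):
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no argument splits on any run of spaces or newlines.
    return "".join(blocked.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, copied from the listing above.
first_line = "ATGGACAAGA TAGTGCCGGC CAAGCCATAC GACGTCGAAG CCATCCGCCG GGATTTTCCG"
print(len(clean_sequence(first_line)))  # prints 60
```

Passing the full listing through `clean_sequence` and then `gc_percent` gives a value to compare against the GC content field.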
 
Protein sequence
MDKIVPAKPY DVEAIRRDFP ILAEKVHGKP LVYLDNGASA QKPQVVIDAI SHAYSHEYAN 
VHRGLHYLSN AATDAYEAAR EKVRRFLNAP SVNDIVFTKN STEAINTVAY GWGMPKIGEG
DEIVLTIMEH HSNIVPWHFI RERQGAKLVW VPVDDEGAFH IEDFEKSLTE RTKLVAITHM
SNALGTIVPV KEVCRIAHER GIPVLIDGSQ GAVHLPVDVQ DIDCDWYVMT GHKLYGPSGI
GVLYGKKERL FEMRPFQGGG EMIFEVAEDM VTYNDPPHRF EAGTPPIVQA IGLGYALDYM
EKIGREAIAR HEADLAAYAV ERLKSVNSLR VFGTAPDKGS IFSFELAGIH AHDVSMVIDR
QGVAVRAGTH CAMPLLKRFG VTSTCRASFG MYNTRAEVDA LADALDYARK FFA