Gene Information |
Locus tag | Rleg_2103 |
Symbol | |
ID | 8013127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2093354 |
End bp | 2094595 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644824689 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_002975919 |
Protein GI | 241204823 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
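The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal Python sketch. It assumes the start and end positions are 1-based and inclusive (the usual GenBank convention, not stated in this record) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2093354
END_BP = 2094595

def gene_length(start_bp, end_bp):
    """Length in bp of a 1-based, inclusive interval (assumed convention)."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

def reverse_complement(seq):
    """Reverse complement of a DNA string.

    For a gene on the '-' strand, the coding sequence is the reverse
    complement of the replicon interval between start and end.
    """
    complement = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(complement)[::-1]

print(gene_length(START_BP, END_BP))  # 1242
print(protein_length(1242))           # 413
```

The `reverse_complement` helper is included because the record lists the strand as `-`; the gene sequence shown in the Sequence section already appears in coding orientation (it begins with ATG).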
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.105912 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAGA TAGTGCCGGC CAAGCCATAC GACGTCGAAG CCATCCGCCG GGATTTTCCG ATCCTAGCGG AGAAGGTGCA TGGCAAGCCG CTGGTCTATC TCGACAACGG CGCGTCGGCG CAGAAGCCGC AGGTGGTGAT CGACGCCATC TCGCATGCCT ATAGCCATGA ATATGCCAAT GTGCATCGTG GCCTGCACTA TCTCTCGAAT GCGGCCACGG ACGCCTATGA GGCAGCGCGC GAGAAGGTCC GCCGCTTCCT CAATGCGCCT TCGGTGAACG ACATCGTCTT CACCAAGAAT TCGACGGAAG CGATCAACAC CGTCGCCTAT GGCTGGGGCA TGCCGAAGAT TGGCGAAGGC GACGAGATCG TGCTTACGAT CATGGAGCAC CATTCCAACA TCGTGCCCTG GCACTTCATC CGCGAGCGGC AGGGCGCCAA ACTTGTCTGG GTGCCTGTCG ACGACGAGGG CGCCTTCCAT ATTGAGGATT TCGAGAAGAG CCTGACGGAG CGCACCAAGC TCGTTGCCAT CACCCATATG TCGAATGCGC TCGGCACAAT CGTTCCCGTC AAGGAAGTCT GCCGGATCGC GCATGAGCGC GGCATTCCGG TGCTGATCGA CGGCAGCCAG GGCGCCGTGC ATCTGCCTGT TGACGTGCAG GATATCGATT GCGACTGGTA CGTCATGACC GGCCACAAGC TCTACGGCCC GTCAGGCATC GGCGTGCTTT ACGGCAAGAA GGAGCGGCTT TTCGAGATGC GCCCGTTCCA GGGCGGTGGA GAGATGATCT TCGAGGTCGC CGAGGATATG GTCACTTATA ACGACCCGCC GCATCGCTTC GAGGCCGGCA CGCCGCCGAT CGTGCAGGCG ATCGGGCTCG GTTATGCGCT CGACTACATG GAGAAGATCG GCCGCGAGGC GATCGCCCGG CATGAGGCCG ATCTTGCCGC CTATGCGGTC GAGCGGCTGA AATCCGTCAA TTCGCTGCGA GTCTTCGGGA CGGCGCCCGA CAAGGGCAGC ATCTTTTCCT TCGAACTTGC CGGCATTCAT GCCCACGACG TCTCGATGGT GATCGACCGG CAGGGTGTTG CAGTCAGGGC CGGCACGCAT TGCGCCATGC CGCTCTTGAA ACGCTTCGGC GTCACCTCCA CATGCCGTGC ATCCTTCGGC ATGTACAATA CCCGCGCCGA GGTCGATGCC CTGGCCGATG CGCTTGATTA TGCGCGCAAG TTCTTTGCTT GA
|
Protein sequence | MDKIVPAKPY DVEAIRRDFP ILAEKVHGKP LVYLDNGASA QKPQVVIDAI SHAYSHEYAN VHRGLHYLSN AATDAYEAAR EKVRRFLNAP SVNDIVFTKN STEAINTVAY GWGMPKIGEG DEIVLTIMEH HSNIVPWHFI RERQGAKLVW VPVDDEGAFH IEDFEKSLTE RTKLVAITHM SNALGTIVPV KEVCRIAHER GIPVLIDGSQ GAVHLPVDVQ DIDCDWYVMT GHKLYGPSGI GVLYGKKERL FEMRPFQGGG EMIFEVAEDM VTYNDPPHRF EAGTPPIVQA IGLGYALDYM EKIGREAIAR HEADLAAYAV ERLKSVNSLR VFGTAPDKGS IFSFELAGIH AHDVSMVIDR QGVAVRAGTH CAMPLLKRFG VTSTCRASFG MYNTRAEVDA LADALDYARK FFA
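
The sequence fields are printed in blocks of ten characters separated by spaces, so they need cleaning before any computation. The sketch below, in plain Python with no external libraries, strips the spacing, computes a GC fraction, and translates a coding sequence. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons). The helper names are hypothetical, and only the first three blocks of the gene sequence are used for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First three ten-base blocks of the gene sequence above.
prefix = "ATGGACAAGA TAGTGCCGGC CAAGCCATAC"
print(translate(prefix))  # MDKIVPAKPY
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the reported GC content of 61%.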