Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1896 |
Symbol | |
ID | 6980635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 1939910 |
End bp | 1941151 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396619 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_002281407 |
Protein GI | 209549490 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00682669 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACAAGA TCGTGCCGGC CACGCCATAC GATGTCGAAG CCATCCGCCG GGATTTTCCG ATCCTGGCGG AGAAGGTGCA CGGCAAGCCG CTGGTCTATC TCGACAACGG CGCCTCGGCA CAGAAGCCGC AGGTGGTGAT CGACGCCATC TCGCACGCCT ACGCCCATGA ATATGCCAAT GTGCATCGCG GCCTGCACTA TCTCTCCAAT GCTGCGACGG ATGCCTATGA GGCGGCGCGC GAGAAGGTGC GCCGTTTCCT CAACGCTCCT TCGGTCAACG ACATCGTCTT CACCAAGAAT TCAACGGAAG CGATCAACAC CGTCGCCTAT GGCTGGGGCA TGCCCGAGAT CGGCGAGGGC GACGAGATCG TCCTGACGAT CATGGAGCAT CATTCCAACA TCGTGCCCTG GCACTTCATC CGTGAGCGGC AGGGTGCCAA ACTCGTCTGG GTGCCGGTCG ACGACGAGGG CGCCTTCCAT ATCGAGGATT TCGAGAAGAG CCTGACGGAG CGCACCAAGC TCGTCGCCAT CACCCATATG TCGAATGCGC TCGGCACGAT CGTTCCCGTC AAGGAAGTTT GCCGGATTGC GCATGAGCGC GGCATCCCGG TGCTGATCGA CGGCAGCCAG GGCGCCGTGC ATCTGCCTGT CGACGTGCAG GATATCGATT GCGACTGGTA TGTGATGACC GGCCACAAGC TCTACGGCCC GTCGGGCATC GGCGTGCTTT ACGGCAAGAA GGAGCGGCTT TCCCAAATGC GCCCCTTCCA GGGCGGCGGC GAGATGATTT TCGAAGTTGC CGAGGACGCG GTCACCTACA ATGATCCGCC GCACCGCTTC GAGGCCGGCA CGCCGCCGAT CGTCCAGGCG ATCGGGCTCG GCTACGCGCT CGACTATATG GAGAAGGTCG GCCGCGAGGC GATCGCCCGG CATGAGGCCG ATCTTGCCGC TTACGCTGTG GAACGGCTGA AAACGGTCAA TTCGCTGCGC GTCTTCGGAA CGGCGCCCGA CAAGGGCAGC ATCTTTTCCT TCGAGCTTGC CGGCATCCAT GCCCATGACG TCTCGATGGT GATCGACCGG CAGGGCGTTG CGGTGCGAGC CGGCACGCAT TGCGCCATGC CGCTCTTGAA ACGCTTCGGC GTCACCTCCA CATGCCGTGC ATCCTTCGGC ATGTACAATA CCCGCGCCGA GGTCGATGCC CTGGCCGATG CGCTTGAATA TGCGCGCAAG TTCTTTGCTT GA
|
Protein sequence | MDKIVPATPY DVEAIRRDFP ILAEKVHGKP LVYLDNGASA QKPQVVIDAI SHAYAHEYAN VHRGLHYLSN AATDAYEAAR EKVRRFLNAP SVNDIVFTKN STEAINTVAY GWGMPEIGEG DEIVLTIMEH HSNIVPWHFI RERQGAKLVW VPVDDEGAFH IEDFEKSLTE RTKLVAITHM SNALGTIVPV KEVCRIAHER GIPVLIDGSQ GAVHLPVDVQ DIDCDWYVMT GHKLYGPSGI GVLYGKKERL SQMRPFQGGG EMIFEVAEDA VTYNDPPHRF EAGTPPIVQA IGLGYALDYM EKVGREAIAR HEADLAAYAV ERLKTVNSLR VFGTAPDKGS IFSFELAGIH AHDVSMVIDR QGVAVRAGTH CAMPLLKRFG VTSTCRASFG MYNTRAEVDA LADALEYARK FFA
|
| |