Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2982 |
Symbol | |
ID | 5323859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 3129485 |
End bp | 3130762 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640791933 |
Product | cysteine desulfurase family protein |
Protein accession | YP_001328646 |
Protein GI | 150398179 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.234076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCCA ACGCACAGCT AGATATCGAT TTCGCGCGCA GTCAGTTCCC GGGCCTCGAA CTCGGCTGGA CCTTTTTCGA CAATGCCGGC GGATCGCAGA TCCTCAAGGG CGCGCTCGAG CGCATAAACA CCTTCCTTAT CGAGAAGAAC GTTCAGATCG GCGGTTCCTA TGCGGTGTCG CAGCATGCGG CCAAGGCGCT CTACGAGGCC CGCACAGCCG GGATGCATCT CGTTAATGCG AGCCGGCCGG AAGAAATCGT CTTCGGCAGT TCGACGACGG CGCTTCTCCA GAACCTGGCT CGCGCCATGG CGAGCCAGCT TGCGCCGGGC GACGAAATCA TCGTCACGGT GAGCGACCAC GAATCCAATA TCGGTCCTTG GGACCGGTTG CAGGCAGCGG GCATCGTTCT CAAGATCTGG CCGCTCGACA AAAAGACCCT GACCCTTGAC CTCGCCGACC TCGAACCTTT GATGGGCGCA CGCACGAAGC TCGTCTGCGT GCCCCATGCT TCCAACATTC TCGGCTCGAT CAACCCGGTC CGCGAAATTG CCGATTTCGT CCACGCCCGA AACGCGAAAC TATGCGTCGA CGCGGTGGCT TACGCGCCGC ACCGCGCCAT CGACGTACAG GCCTTCGACG CCGACTATTA TGTCTTCAGT CTCTACAAGA CCTATGGGCC GCACTATGCG CTGATGTACG GCAAATATGA TCTGCTGCTC GAGCTGGATA CGCTCTACCA CTACTTCTAC GGCAAGGATA AGGTTCCAGG AAAGCTCGAG CCAGGCAATC CGAACTATGA GCTCGCCTAT TCGACCTGCG GGATCGTCGA CTACCTTTGC GAGCTCGGCG CCCGCTCCGG AGAAGCCGGC ACTGTCAGGC GAAAGATCGA GGCCGCCTTC GATGCGGTGG CCAGCCAGGA GGACGCCCTG ACCGAGCGGC TCCTCTCCTA TCTGCGCTCG CGCAACGACT GCCGGATCAT CGGACAATCG GTCAATCGCG ACAGCCAGCG CGTGCCCACC ATTTCCTTCC GCTTCGACGG ACGCGATGCT GCAGAACTCT GCAAGGCGAT CGACGAGGAG AACATTGCCA TTCGCCACGG CGATTTCCAT TCCCGCCGGC TTGCCGAGTA TCTGGAGCTG ACCGATTACA ACGGCATGCT GCGCGTCTCC ATGGTGCATT ACAACACGAT CGAGGAAGTG GATCGTCTCA CGGCCGCCTT CGACCGGGTT CTTTCGACCG GCACCGGAAA CACAGCCGAT GCAAGGCGCG CCGGATGA
|
Protein sequence | MAANAQLDID FARSQFPGLE LGWTFFDNAG GSQILKGALE RINTFLIEKN VQIGGSYAVS QHAAKALYEA RTAGMHLVNA SRPEEIVFGS STTALLQNLA RAMASQLAPG DEIIVTVSDH ESNIGPWDRL QAAGIVLKIW PLDKKTLTLD LADLEPLMGA RTKLVCVPHA SNILGSINPV REIADFVHAR NAKLCVDAVA YAPHRAIDVQ AFDADYYVFS LYKTYGPHYA LMYGKYDLLL ELDTLYHYFY GKDKVPGKLE PGNPNYELAY STCGIVDYLC ELGARSGEAG TVRRKIEAAF DAVASQEDAL TERLLSYLRS RNDCRIIGQS VNRDSQRVPT ISFRFDGRDA AELCKAIDEE NIAIRHGDFH SRRLAEYLEL TDYNGMLRVS MVHYNTIEEV DRLTAAFDRV LSTGTGNTAD ARRAG
|
| |