Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3752 |
Symbol | |
ID | 5318742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 195709 |
End bp | 196737 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640775565 |
Product | extracellular solute-binding protein |
Protein accession | YP_001312498 |
Protein GI | 150375902 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.236177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTTGG TTAAACTGGC CGCGGCAGCG CTCGCCGCGG GCGCGACGTT TTTTGCCTAT TCGGCGAAGG CCGATGGAAA TCTCAATCTG ATCTGTTCGG CCGATGTGGT GATCTGCGAA CAGATGAAGG GCGCCTTCGA AAAGAAGTCC GGCGTCTCCG TCAACATGGT GCGCCTCTCT TCGGGCGAAA CCTACGCGAA GATCCGGGCC GAGGCGCGCA ACCCCAAGAC TGACATCTGG TGGGCCGGGA CGGGTGATCC CCATCTGCAG GCCGCTTCGG AGGGGCTGAC GGTCGAGTAC AAGTCGCCGA TGCTCGGCGA ATTACACGAA TGGGCGGTGA AGCAGGCCGA GAGCGCCGGC TATCGCACGG TCGGCGTATA TGCCGGCGCA CTCGGCTGGG GATACAACAC CGAGATCCTC AAGCAGAAGA ACCTGAAGGA GCCGAAATGC TGGGCGGATC TGCTCGATCC CTCCTTCAAG GGCGAAGTGC AAATCGCCAA TCCCAACTCT TCCGGAACCG CCTATACCGC GCTCGCGACT CTCGTCCAGA TCATGGGTGA GGAAGAGGCT TTCGATTACC TGAAGAAGCT GAACGCGAAC GTCTCGCAAT ATACGAAATC CGGCTCCGCA CCGGTGAAAG CCGCAGCACG GGGAGAGACC GCAATCGGCA TCGTATTCAT GCATGATGCT GTGGCGCAGA CGGTCGAAGG GTTTCCGGTA AAGTCGGTGG CGCCGTGCGA AGGCACCGGC TACGAAATCG GCTCGATGTC GATCATCAAG GGCGCAAAAA ACCTCGACAA TGCCAAGAAA TGGTATGACT GGGCTCTCTC CGCCGACGTG CAGTCCAGCA TGAAGGAGGC GAAATCGTTC CAGCTTCCTT CGAACAAGAC GGCCAAGGTG CCCGAGGAGG CCCCGAAGTT CGAAGACATC AAGCTGATTG ACTACGACTT CAAGACCTAT GGCGATCCGG CCAAGCGCAA GGAGCTCCTG GAGCGGTGGG ACCGGGAGAT CGGCGCGGCC GCGAACTGA
|
Protein sequence | MGLVKLAAAA LAAGATFFAY SAKADGNLNL ICSADVVICE QMKGAFEKKS GVSVNMVRLS SGETYAKIRA EARNPKTDIW WAGTGDPHLQ AASEGLTVEY KSPMLGELHE WAVKQAESAG YRTVGVYAGA LGWGYNTEIL KQKNLKEPKC WADLLDPSFK GEVQIANPNS SGTAYTALAT LVQIMGEEEA FDYLKKLNAN VSQYTKSGSA PVKAAARGET AIGIVFMHDA VAQTVEGFPV KSVAPCEGTG YEIGSMSIIK GAKNLDNAKK WYDWALSADV QSSMKEAKSF QLPSNKTAKV PEEAPKFEDI KLIDYDFKTY GDPAKRKELL ERWDREIGAA AN
|
| |