Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3366 |
Symbol | |
ID | 5324250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3566586 |
End bp | 3568445 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640792317 |
Product | extracellular solute-binding protein |
Protein accession | YP_001329022 |
Protein GI | 150398555 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAACT TCTGCAGGAC CATGAAGCCA GGCCTTGCGG CCTCGCTTCT GACAGCAGCG CTTCTCCTCC TCCCTGCTGC ATCGAATGCC GAGGAACAGC CCGCCTGGCG CCACGCTACC TCGTCGATCG GCGAGCCCAA ATACAAGACG GATTTTGCGC GTTTCAACTA CGTCAATCCC GATGCGCCAA AAGGCGGAGA GCTACAGCTT TCGGAGAACG GGACGTTCGA CTCCTTCAAT CCCATCCTCG CCAAGGGCGA GGTGGCTACG GGCGTCTCTT CGCTGGTCTT CGACACACTC CTGATGTCGT CCGAGGACGA GATCACCACC TCCTACGGCC TGCTTGCAGA GGGCGTTTCC TATCCGGCCG ATATTTCCTC CGCCACTTTT CGCCTGAGGG CAGAAGCCAG ATGGGCCGAC GGCAGACCGG TGAGACCGGA GGATGTCATC TTCTCCTTCG ATAGGGTGAA GGAGCACAAT CCGCTCTTTT CCAACTACTA CCGTCACGTG GTTTCAGCGG AAAAGACCGG CGAACGGGAC GTTACCTTCC GGTTCGACGA GAAGAACAAT CTCGAACTTC CGAACATTCT CGGCCAGTTT CCCATCCTGC CGAAGCACTG GTGGGAAGGT CAGGACGCAG AAGGCAAGAA GCGCGATATC GGACGCACGA CACTGGAACC GGTCATGGGC TCCGGTCCTT ACAAGATCGC CGCTTTCCAG CCTGGCGGCT CCATCCGTTT CGAATTGCGC GACGACTATT GGGGCAAGGA TCTCAATGTG AATGTCGGCA GGTACAATTT CCGCACGATC AACTACGTTT TCTTCAGTGA CCGAAGCGTG GAGTTCGAAG CCTTCCGTGC CGGCAATGTC CATTTCTACC GGGACAACAG CGCGAGCCAC TGGGCCACGG CCTACGACTT TCCGGCAATG AAGGACGGAC GGGTGATCCG CGAGGAAATC GAGAACCCGC TGCGCGCAAC CGGCGTGATG CAGGCCTTCG TACCCAATCT GCGTCGGGAA AAATTCAAGG ATCAGCGGGT ACGCGAAGCG TTGAACTACG CCTTCGACTT CGAAGACCTG AATCGCAGCC TCGCCCACAA TGCGTATCAG CGCGTCGACA GTTATTTCTG GGGCACCGAG CTTGCCTCTT CCGGTCTGCC CGAGGGGCGC GAAAAGGAGA TCCTCGAGGA ACTGAAGGAC AAGGTCCCCG CCGCCGTTTT CGACAAGCCC TACAAGAATC CCGTCAACGG CGATCCGCAG AAGGTACGCG ATAACTTGCG CAAGTCGCTC TCTCTCTTCA AGGAAGCGGG TTACGAACTC AAGGGCAGCC GATTGGTGAA CTCGAAGACC GGCGAGCCGT TCCGCTTCGA GATCCTTCTG CCCAATCCCT CACTCGAGCG TACGGTTACG CCCTTCGTGA ACAGCGTGAG GAAAATCGGC ATAGATGCTC GCATTCGCAC GGTCGACGAC TCGCAATATA CAAATCGCGT CAGAAGCTTC GACTATGACA TGATCTACGG CGTCTGGGCG CAGACTCTGG TGCCCGGCAA CGAACAGAGC GATTACTGGG GCTCGGCGTC GGTTGACCGG CCGGGATCCA TGAACTATGC CGGTATCGCC GATCCGGCCA TCGACGAACT CATCCGGAAA ATCATCTTCG CGCCGAACCG CGAGGAACTC GTCGCGACGA CACGGGCGCT CGACCGCGTC CTTCTCGCCC ATCATTACGT CGTGCCTCTT TTCTATTCGA AGGCCTACCG CATCGCCTAT TGGAGCCACC TGGCCCGCCC GGAGGAGCTG CCCTATTACG GGATGGATTT CCCGGCTGCG TGGTGGTCGA AGAGCGCCGC TGCCAAATGA
|
Protein sequence | MPNFCRTMKP GLAASLLTAA LLLLPAASNA EEQPAWRHAT SSIGEPKYKT DFARFNYVNP DAPKGGELQL SENGTFDSFN PILAKGEVAT GVSSLVFDTL LMSSEDEITT SYGLLAEGVS YPADISSATF RLRAEARWAD GRPVRPEDVI FSFDRVKEHN PLFSNYYRHV VSAEKTGERD VTFRFDEKNN LELPNILGQF PILPKHWWEG QDAEGKKRDI GRTTLEPVMG SGPYKIAAFQ PGGSIRFELR DDYWGKDLNV NVGRYNFRTI NYVFFSDRSV EFEAFRAGNV HFYRDNSASH WATAYDFPAM KDGRVIREEI ENPLRATGVM QAFVPNLRRE KFKDQRVREA LNYAFDFEDL NRSLAHNAYQ RVDSYFWGTE LASSGLPEGR EKEILEELKD KVPAAVFDKP YKNPVNGDPQ KVRDNLRKSL SLFKEAGYEL KGSRLVNSKT GEPFRFEILL PNPSLERTVT PFVNSVRKIG IDARIRTVDD SQYTNRVRSF DYDMIYGVWA QTLVPGNEQS DYWGSASVDR PGSMNYAGIA DPAIDELIRK IIFAPNREEL VATTRALDRV LLAHHYVVPL FYSKAYRIAY WSHLARPEEL PYYGMDFPAA WWSKSAAAK
|
| |