Gene Smed_3366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3366 
Symbol 
ID5324250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3566586 
End bp3568445 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content59% 
IMG OID640792317 
Productextracellular solute-binding protein 
Protein accessionYP_001329022 
Protein GI150398555 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAACT TCTGCAGGAC CATGAAGCCA GGCCTTGCGG CCTCGCTTCT GACAGCAGCG 
CTTCTCCTCC TCCCTGCTGC ATCGAATGCC GAGGAACAGC CCGCCTGGCG CCACGCTACC
TCGTCGATCG GCGAGCCCAA ATACAAGACG GATTTTGCGC GTTTCAACTA CGTCAATCCC
GATGCGCCAA AAGGCGGAGA GCTACAGCTT TCGGAGAACG GGACGTTCGA CTCCTTCAAT
CCCATCCTCG CCAAGGGCGA GGTGGCTACG GGCGTCTCTT CGCTGGTCTT CGACACACTC
CTGATGTCGT CCGAGGACGA GATCACCACC TCCTACGGCC TGCTTGCAGA GGGCGTTTCC
TATCCGGCCG ATATTTCCTC CGCCACTTTT CGCCTGAGGG CAGAAGCCAG ATGGGCCGAC
GGCAGACCGG TGAGACCGGA GGATGTCATC TTCTCCTTCG ATAGGGTGAA GGAGCACAAT
CCGCTCTTTT CCAACTACTA CCGTCACGTG GTTTCAGCGG AAAAGACCGG CGAACGGGAC
GTTACCTTCC GGTTCGACGA GAAGAACAAT CTCGAACTTC CGAACATTCT CGGCCAGTTT
CCCATCCTGC CGAAGCACTG GTGGGAAGGT CAGGACGCAG AAGGCAAGAA GCGCGATATC
GGACGCACGA CACTGGAACC GGTCATGGGC TCCGGTCCTT ACAAGATCGC CGCTTTCCAG
CCTGGCGGCT CCATCCGTTT CGAATTGCGC GACGACTATT GGGGCAAGGA TCTCAATGTG
AATGTCGGCA GGTACAATTT CCGCACGATC AACTACGTTT TCTTCAGTGA CCGAAGCGTG
GAGTTCGAAG CCTTCCGTGC CGGCAATGTC CATTTCTACC GGGACAACAG CGCGAGCCAC
TGGGCCACGG CCTACGACTT TCCGGCAATG AAGGACGGAC GGGTGATCCG CGAGGAAATC
GAGAACCCGC TGCGCGCAAC CGGCGTGATG CAGGCCTTCG TACCCAATCT GCGTCGGGAA
AAATTCAAGG ATCAGCGGGT ACGCGAAGCG TTGAACTACG CCTTCGACTT CGAAGACCTG
AATCGCAGCC TCGCCCACAA TGCGTATCAG CGCGTCGACA GTTATTTCTG GGGCACCGAG
CTTGCCTCTT CCGGTCTGCC CGAGGGGCGC GAAAAGGAGA TCCTCGAGGA ACTGAAGGAC
AAGGTCCCCG CCGCCGTTTT CGACAAGCCC TACAAGAATC CCGTCAACGG CGATCCGCAG
AAGGTACGCG ATAACTTGCG CAAGTCGCTC TCTCTCTTCA AGGAAGCGGG TTACGAACTC
AAGGGCAGCC GATTGGTGAA CTCGAAGACC GGCGAGCCGT TCCGCTTCGA GATCCTTCTG
CCCAATCCCT CACTCGAGCG TACGGTTACG CCCTTCGTGA ACAGCGTGAG GAAAATCGGC
ATAGATGCTC GCATTCGCAC GGTCGACGAC TCGCAATATA CAAATCGCGT CAGAAGCTTC
GACTATGACA TGATCTACGG CGTCTGGGCG CAGACTCTGG TGCCCGGCAA CGAACAGAGC
GATTACTGGG GCTCGGCGTC GGTTGACCGG CCGGGATCCA TGAACTATGC CGGTATCGCC
GATCCGGCCA TCGACGAACT CATCCGGAAA ATCATCTTCG CGCCGAACCG CGAGGAACTC
GTCGCGACGA CACGGGCGCT CGACCGCGTC CTTCTCGCCC ATCATTACGT CGTGCCTCTT
TTCTATTCGA AGGCCTACCG CATCGCCTAT TGGAGCCACC TGGCCCGCCC GGAGGAGCTG
CCCTATTACG GGATGGATTT CCCGGCTGCG TGGTGGTCGA AGAGCGCCGC TGCCAAATGA
 
Protein sequence
MPNFCRTMKP GLAASLLTAA LLLLPAASNA EEQPAWRHAT SSIGEPKYKT DFARFNYVNP 
DAPKGGELQL SENGTFDSFN PILAKGEVAT GVSSLVFDTL LMSSEDEITT SYGLLAEGVS
YPADISSATF RLRAEARWAD GRPVRPEDVI FSFDRVKEHN PLFSNYYRHV VSAEKTGERD
VTFRFDEKNN LELPNILGQF PILPKHWWEG QDAEGKKRDI GRTTLEPVMG SGPYKIAAFQ
PGGSIRFELR DDYWGKDLNV NVGRYNFRTI NYVFFSDRSV EFEAFRAGNV HFYRDNSASH
WATAYDFPAM KDGRVIREEI ENPLRATGVM QAFVPNLRRE KFKDQRVREA LNYAFDFEDL
NRSLAHNAYQ RVDSYFWGTE LASSGLPEGR EKEILEELKD KVPAAVFDKP YKNPVNGDPQ
KVRDNLRKSL SLFKEAGYEL KGSRLVNSKT GEPFRFEILL PNPSLERTVT PFVNSVRKIG
IDARIRTVDD SQYTNRVRSF DYDMIYGVWA QTLVPGNEQS DYWGSASVDR PGSMNYAGIA
DPAIDELIRK IIFAPNREEL VATTRALDRV LLAHHYVVPL FYSKAYRIAY WSHLARPEEL
PYYGMDFPAA WWSKSAAAK