Gene Smed_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2183 
Symbol 
ID5323043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2257047 
End bp2258636 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content62% 
IMG OID640791121 
Productextracellular solute-binding protein 
Protein accessionYP_001327851 
Protein GI150397384 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.226598 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAT TCACGAAGCG TCCCGATGGC CTCATCGTCC CGGCCGGCAT CAACCGGCGC 
GGCTTTCTCG CGGGCACAGC AGCGCTTGGC ATCGCCGCAG GGCTCGGCAG TTCACTCGTA
GGCGGTGAAG CGCGGGCGCA GGAACCGAAG CGAGGCGGCC ATCTGAAGCT CGGCCTCGAC
GGCGGCGCGA CAACCGATTC ACTCGACCCG GCAAGCTACA GCGGTTCCGT ATCCTTCGTA
ATCGGCCATC TCTGGGGCGA CACACTGGTC GAATCCCATC CCGAGACCGG CGCTCCGCTG
CCGTCAATCG CTTCCTCCTG GGAGTCGTCG GCCGACGCTT CGGTCTGGAC GTTCAAGATT
CGAAAGGACA TTGCGTTCCA CGATGGCAGG AAACTCGCCG TTGCAGACGT GATCGCAACC
TTGAAGCGGC ACTCCGACGA AGGCTCCAAA TCGGGCGCGC TGGGACTTAT GCGCTCCGTG
AAGGCAATTG AGGAAAAGGC CGGCGATCTC GTCCTGACGC TGACGGAAGG CAATGCCGAC
CTGCCGCTTC TGCTGACCGA CTATCACCTC ATCATTCAGC CGGACGGCGG CGTCGATAAT
CCGGCTTCGC CGGTCGGCAC CGGACCCTAC AAGCTCGCAA GCTACGAAGC CGGTATACGT
GCGACCTTTG AAAAGAACAC CGCCGACTGG CGCTCGGACC GCGGCTATGT CGACAGCGTC
GAAATCATCG TCATGAATGA CACCACGGCG CGCATTGCGG CGCTTTCCTC CGGGCAGGTG
CACTTCATCA ACGGCGTCGA ACCGAAAACC GTGCCGCTGC TCAAGCGCGC GCCCCGCGTG
GAAATTCTGC AGACTTCGGG CAAGGGCTTC TACAGCTTCC TCATGCATTG CGACACGAGC
CCCTTCGACA ATAACGACCT GAGGCTCGCG CTGAAATATG CGATCGACCG GCAGGCCATT
CTCGACCGTG TGCTCGGCGG CTTTGGCACG CTCGGCAACG ATTATCCGGT CAATGAAAAC
TACGCGCTCG CACCTGAAGG CATCGAGCAG CGTGCCTACG ATCCCGACAA GGCCGCCTTC
CACTACAAGA AGTCCGGCCA TGACCGGCCG ATCCTGCTGC GCACATCCGA CGCCGCCTTC
CCGGGCGCGG TCGACGCCTC CGTCCTCTTC CAGGAAAGCG CCCGCAAGGC CGGCATCGAA
ATCGAGGTAC GTCGCGAACC GGAAGACGGC TACTGGACCA ATGTCTGGAA TGTGCAGCCT
TTCTGCGCAT CCTACTGGGG CGGCCGCCCC ACACAGGACT CCCGCTATTC CACCTCCTAT
CTTTCGAGTG CGGAATGGAA CGACACGCGG TTCAAGCGAG AGGATTTCGA CAAGCTGCTC
CTGCAGGCGC GCTCCGAACT GGACGAAGCC AAGCGCAAGG CGCTCTACCA TACCATGGCT
CTCATGGTGC GCGATGAAGG CGGCCTCATT CTGCCGGTCT TCAACGACTA TGTGAACGCC
GCCTCCGCTT CGCTGAAAGG TTTCGTGCAC GACATCGGCA ACGACCTTTC GAACGGCTAT
GTCGGAAGCC GCGTCTGGTT CGACAGCTAA
 
Protein sequence
MTEFTKRPDG LIVPAGINRR GFLAGTAALG IAAGLGSSLV GGEARAQEPK RGGHLKLGLD 
GGATTDSLDP ASYSGSVSFV IGHLWGDTLV ESHPETGAPL PSIASSWESS ADASVWTFKI
RKDIAFHDGR KLAVADVIAT LKRHSDEGSK SGALGLMRSV KAIEEKAGDL VLTLTEGNAD
LPLLLTDYHL IIQPDGGVDN PASPVGTGPY KLASYEAGIR ATFEKNTADW RSDRGYVDSV
EIIVMNDTTA RIAALSSGQV HFINGVEPKT VPLLKRAPRV EILQTSGKGF YSFLMHCDTS
PFDNNDLRLA LKYAIDRQAI LDRVLGGFGT LGNDYPVNEN YALAPEGIEQ RAYDPDKAAF
HYKKSGHDRP ILLRTSDAAF PGAVDASVLF QESARKAGIE IEVRREPEDG YWTNVWNVQP
FCASYWGGRP TQDSRYSTSY LSSAEWNDTR FKREDFDKLL LQARSELDEA KRKALYHTMA
LMVRDEGGLI LPVFNDYVNA ASASLKGFVH DIGNDLSNGY VGSRVWFDS