Gene Smed_5067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5067 
Symbol 
ID5319369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp13381 
End bp14868 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content60% 
IMG OID640776847 
Productextracellular solute-binding protein 
Protein accessionYP_001313779 
Protein GI150377184 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.68424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGA AGATCCGGAG AATGACTGCC GGCGTTGCGA TGCTGCTGGC ATCCACACTT 
GCCTCTTCGC CCGCATGGGC CCAATCCATC ACCATCGCCA TCGGGTCGGA GCCTTCGACG
CTCGACCCGC AGCTCAGGGA TGACGGCGGC GAGCGGCAGG TCAACGACAA CATCTATGAG
ACGCTGATGG CACGGACGCC GACCGGCGAA CTCGTGCCGG GCCTTGCCGC GGAGGCTCCA
AAACAGGTCG ATGCCACGAC CTGGCAGTTC AAATTGCGGG AGGGCGTCAA GTTCCACAAC
GGCGAACCCT TCAATGCCGA TGCAGTGGTC GCTTCCGTCG CGCGTGTAAT TGATCCCGCG
AACAATTCCG AGCAGATGGC GTATTTCGGC ACGATCAAGG CGGCCGAAAA GGTCGATGAC
CTGACCGTCA ACCTTGTCAC GACAGGCCCG GATCCGATCC TGCCTTCGCG CATGTACTGG
ATGAAGATGA TCGCGCCGGG CTATTCCAAG GACGGCGATC TTGCCGGTGC GCCGGTCGGG
ACAGGCCCTT ACAAGTTCGA AAGCTGGAAC CGGGGCACGG ACCTGAAGCT CGTCGCGAAC
AGCGAATACT GGGGAGGGGA ACCGCAGATC GACGACGTCA CTTACCGCTT CGTGACGGAG
CCGGGCACGC GCCTTTCCGG ACTGCTTTCC GGCGAATTCG ACGTGATTAC GAACCTTCTG
CCGGAGTTCA CGACGAATGT GCCGAAGTTC GCCGCCGTTC CTGGCCTCGA GACATCCGTC
TTCGTTCTGG GCACGGACAA TGAGGTAACG AAAGACCCCA AGGTACGCGA GGCGCTCAAC
CTCGCCATCG ACCGCAAGGC CATGGTCGAG GGCCTTTTCA TGGGTTACGC GACGATCGCC
AAAGGGTCGC ATATCAATCC GGCCGCCTTT GGCTTCAACG AAAAGCTGGA GCATTATCCC
TACGACATCG AGAAGGCGCG GGCGCTGATC AAGGAGGCAG GCGCCGAGGG CAAGCCTCTC
GTCGTCGTCG GCGAATCCGG CCGCTGGCTG AAGGACCGTG AGCAGATCGA GGCAGTTGCG
GGTTACTGGG CCGAGACCGG ACTGAACGTC ACGACCGACA TACAGGAGTT CTCGCAATAT
CTCGACAGCC TGATGGGCGA CGGGCCCCGT CCCGACGCGA TCTTCATCGC CAATTCCAAC
GAGCTGCTCG ATGCCGACCG GGAAATGTCC TTCATCTACC ACAAGGACGG TGCTGCGGCC
TCGAATTCGG ACGCCGAGAT GGCCACCTTG ATCGAGGCGG CGCGCCTCGA AACGGATACG
GCCAAACGCA AGGCGCTTTA CGACGAGATC CAGAAGAAGG GGCATGACCT GAACTACACG
GTGCCGCTGT TTAATCTTCA GGACATCTAC GGAATGTCGG AACGAATGGA ATGGCAGCCA
CGTGTCGACG CGAAGATGAT GGTGAGCGAA ATGAAGGTCA CCGAATAG
 
Protein sequence
MKKKIRRMTA GVAMLLASTL ASSPAWAQSI TIAIGSEPST LDPQLRDDGG ERQVNDNIYE 
TLMARTPTGE LVPGLAAEAP KQVDATTWQF KLREGVKFHN GEPFNADAVV ASVARVIDPA
NNSEQMAYFG TIKAAEKVDD LTVNLVTTGP DPILPSRMYW MKMIAPGYSK DGDLAGAPVG
TGPYKFESWN RGTDLKLVAN SEYWGGEPQI DDVTYRFVTE PGTRLSGLLS GEFDVITNLL
PEFTTNVPKF AAVPGLETSV FVLGTDNEVT KDPKVREALN LAIDRKAMVE GLFMGYATIA
KGSHINPAAF GFNEKLEHYP YDIEKARALI KEAGAEGKPL VVVGESGRWL KDREQIEAVA
GYWAETGLNV TTDIQEFSQY LDSLMGDGPR PDAIFIANSN ELLDADREMS FIYHKDGAAA
SNSDAEMATL IEAARLETDT AKRKALYDEI QKKGHDLNYT VPLFNLQDIY GMSERMEWQP
RVDAKMMVSE MKVTE