Gene Smed_4593 details

Gene Information

Locus tag: Smed_4593
Symbol:
ID: 5319009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1091446
End bp: 1093347
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 62%
IMG OID: 640776394
Product: extracellular solute-binding protein
Protein accession: YP_001313326
Protein GI: 150376730
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
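
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1091446
end_bp = 1093347

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the script reproduces the listed gene length of 1902 bp and protein length of 633 aa.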


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.996188
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACCC GCAGAACCGC GCTCGGCCTT CTCGCAACGG CGGCCTTTCC TAAAACCTTG 
CTTGCGGCTG CAGGCGGGGC CGACCCGCTC GCTGCCCTCG TTCAGGAGGG CAAACTTCCC
CCGGTCGGCG AAAGGCTGCC GAAGACGCCG CGGGTGATCA ATGTCGCGGG GATGGGCCGC
AAAGCCGGGC GGCACGGCGG CACCATCCGC AGCCTGATCG GCAGTGCGAA GGACATTCGC
CTGATGACGA TCTATGGCTA TGCACGCCTG GTCGGGTACG ACGAGGAACT GAACCTACAG
CCTGACGTTC TCGAAAGCTA CGAAACGGTC GAGGACCGGA TTTTCACCTT CCACCTGCGC
GAGGGCCATA GGTGGTCCGA TGGCACGCCG CTGACCGCCG AGGACTTCCG TTACTGCTTC
GAAGACGTTC TGCTCAACGA GGACCTTTCC CCGGCAGGCC TGCCGACCTC GATGGTGATG
GATGGTCAAG CGCCCAAGCT CGAAATCGTC GACGAACGGA CGGTCCGCTA TTCCTGGCCG
ATGCCAAATC CGGTCTTCCT GCAGGAACTC GCCGCGCCGC AGCCGCTGAT CGTGGCGATG
CCTTCGGCCT ATCTCAAGCA GTTCCACAAA AAATATCAGG AAGAGGACAA GCTGAAGACC
TTACTGAAAG AACAGCGGGT CAAGAGGTGG AGCCAGCTCC ACATGCGCAT GGCACGGTCC
TACCGGCCGG AGAACCCGGA CCTGCCCACC CTCGACCCCT GGCGCAACAC GACGCCGTTG
CCTGCCGAAC AGTTCGTCTT CGAGCGCAAT CCCTATTACC ACCGGGTGGA CGAAAACGGC
CTGCAGCTGC CTTACATCGA CCGGTTCGTT CTCAGCGTGA GCTCGTCCGC GCTTATCCCG
GCAAAGACCG GCACGGGCGA GAGCGACCTG CAGGCAAATG GGATCGATTT CGTCGACTAT
ACCTATCTTA AGGATGCGGA GAAGCGGTAT CCGGTCGAGG TGAAGCTCTG GAAGAAGACC
TCGGGATCGC GCCTGGCATT GCTGCCGAAC CTCAACTGTG CCGACCCGGT GTGGCGGGCG
TTGCTAAGAG ACGTGCGCGT CCGTCGGGCT CTGTCGCTCG CAATCGACCG GCGCGAGATC
AACATGGCCG CCTTCTACGG GCTGACGAAG GAGAGCGCCG ATACTGTGCT GCCGGAAAGC
CCTCTCTTCC GTCCCGAATT CGCAAGTGCC TGGATCGCAC ATGACCCGGA GCAGGCGAAC
ACCCTGCTCG ACGCCGCAGG GCTGGCCAAA CGCGGCAGCG ACGGGATCCG TATTCTTCCC
GACGGCCGGA AGGCGCAGAT CGTCGTCGAG ACGGCCGGCG AGAGCACACT CGATACGGAC
GTGCTGCAGC TGATCACCGA CTATTGGCGT GAAGTCGGCA TTTCGCTCTT CATCCGTACC
TCTCAGCGCG ATACCTTCCG CAGCCGTGCG GTGGGGGGCG AGATCATCAT GTCGATATGG
TTCGGTATCG ACAACGGCGT GCCGACGGCT GACATGAGTC CTCACCAGCT CGCGCCGACA
GCGGACGACC AGCTGCAATG GCCCGTCTGG GGCTTGAACT ATATCTCCCA CGGCGAAATG
GGAGAAACGC CAGACCTTCC GGCCGTGGTC GAACTTCTGG AGCTGCTGAA GCGCTGGAGG
CATTCGGCCG ACGATGCGGA GCGGGCGGAC ATCTGGAGAA AGATGCTGTC CATCTATACG
GACCAGGTCT TCTCGATCGG GCTGGTCAAC AGTTCCCTGC AGCCTATCCT CGTGACCAAG
AAGCTGCGCA ATTTCCCGGA AGAGGGCCTT TGGGGTTTCG ACCCGACGAG CTATTTCGGC
GCCTACAAGC CCGACACCTT CTGGCTGGAA CAGGACGGCT GA
 
Protein sequence
MITRRTALGL LATAAFPKTL LAAAGGADPL AALVQEGKLP PVGERLPKTP RVINVAGMGR 
KAGRHGGTIR SLIGSAKDIR LMTIYGYARL VGYDEELNLQ PDVLESYETV EDRIFTFHLR
EGHRWSDGTP LTAEDFRYCF EDVLLNEDLS PAGLPTSMVM DGQAPKLEIV DERTVRYSWP
MPNPVFLQEL AAPQPLIVAM PSAYLKQFHK KYQEEDKLKT LLKEQRVKRW SQLHMRMARS
YRPENPDLPT LDPWRNTTPL PAEQFVFERN PYYHRVDENG LQLPYIDRFV LSVSSSALIP
AKTGTGESDL QANGIDFVDY TYLKDAEKRY PVEVKLWKKT SGSRLALLPN LNCADPVWRA
LLRDVRVRRA LSLAIDRREI NMAAFYGLTK ESADTVLPES PLFRPEFASA WIAHDPEQAN
TLLDAAGLAK RGSDGIRILP DGRKAQIVVE TAGESTLDTD VLQLITDYWR EVGISLFIRT
SQRDTFRSRA VGGEIIMSIW FGIDNGVPTA DMSPHQLAPT ADDQLQWPVW GLNYISHGEM
GETPDLPAVV ELLELLKRWR HSADDAERAD IWRKMLSIYT DQVFSIGLVN SSLQPILVTK
KLRNFPEEGL WGFDPTSYFG AYKPDTFWLE QDG
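
The gene and protein sequences above can be cross-checked against each other by stripping the 10-character grouping and translating codon by codon. The following is a minimal sketch, not the tool that produced this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). Only the first and last lines of the gene sequence are used here for brevity.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G, last position varying fastest).
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as displayed in the record.
first_line = "ATGATCACCC GCAGAACCGC GCTCGGCCTT CTCGCAACGG CGGCCTTTCC TAAAACCTTG"
last_line = "GCCTACAAGC CCGACACCTT CTGGCTGGAA CAGGACGGCT GA"

print(translate(first_line))  # MITRRTALGLLATAAFPKTL
print(translate(last_line))   # AYKPDTFWLEQDG
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give the fraction to compare with the GC content field.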