Gene Smed_5619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5619 
Symbol 
ID5319921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp585732 
End bp587312 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content61% 
IMG OID640777362 
Productextracellular solute-binding protein 
Protein accessionYP_001314294 
Protein GI150377699 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR02294] nickel ABC transporter, periplasmic nickel-binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.4089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTGA ACAGGCGTAC GTTCCTGCAG GGCGCATTCG GCGCCGTGGG ATTGGTCATG 
GCGCAGGGCG CCTTGTCCAA GCTTGTATAC GCGCAAGGGG CAAGCGGCAC GCTGCGGGTC
GCTATCGCAA AGCCCGCAGG TAACCTCGAT CCGCAAAGCC ACTACGCAAT CTGGGCGATA
CAGGACCTGA TGTTCGAACC GCTGGTCAAA TACGGCCGGG GAGGCCAGAT CGAACCTTGT
CTCGCGACCG ACTGGAAGAT CGAGGGCGGT GGCAAGACGC TACATCTCAC CTTGCGACAG
GGTGTTACCT TCCAGGACGG AACCAAGTTC GATGCCGCCG CGTGCAAGTG GAATCTCGAG
CGGTGGATGG GGCTCGACCA GTTCAGTTGG ATGAACTGCT CGAAGTATTT CGAGTCGCTC
GAAGTCGTTG ACGACTACCA CATCACCCTC CACTTCAACG AGCCCGTGCT GGCGCTGATG
CAAGAGCTTT CCTACACCCG GCCGCCACGC TTCCTCAGCC CGATGTCCGT TGGAGCCGAT
GGCAAGTTCA AAGAGCCGGT CGGTACGGGC CCTTGGCGCC AAGTCAAGGC GGATGACACC
GAAAGCGCAT TCGAGCGCTA CGACGGCTAT TGGGGTGACA AACCATCATA CGAGCGTCTT
GAGGCGAAGG TTATTCCCGA CCCGCGCTCG CGGGTCGCGG CACTGCGCAG CGGCGAGATC
GATCTGGTCG GCGGCTTCTG GATTGCGCCC TTGACCCCGG AAGAGGCCAA GCAACTCGAG
GCGGCCGCCG TCAACGTCGT CGTCGATCCG GGCAATGTTA CACTGGTGAT GGCGTTCAAT
CCCGATCGCG CCGCGGCACT CAAGGATTCG CAGGTACGCA AGGCGATCAG TATCGGCATC
GATCGTGCGG CAATCTCTCA GGTGCTCTAC CATGGCTATG CCAAGCCTGC GGGTAACTTG
TTCTCAAGCG CTTTGCCTTA TGCCGGCAAG CAGCATGGCG CTCCCGTCCG CGACGCGGCG
GCCGCGTCCG CGCTGCTGGA GAAGGCCGGC TGGACTGGTG GTCCTATTCG ATCCAAGGAT
GGCAAGCCGC TGACGCTCGA GATGGTCGTC AGTCCGGACG CAGTGCCGGG GTCACGGATC
ATCGCCGAAG TCATCCAGTC CGAGATGAAG GAGATCGGCA TCGACCTGGT GATCCGCTCG
GTCGACCATG CTTCCAAGCA CACCGACATG CTGGAACAGA AGTACGACCT CGGCTTCTTC
CTGACCTACG GCGCGCCTTA TGACCCGTTT GGCTCGATCG TCGGGCTGTG CCTGTCGACT
TTCAAGAATG ATGTCGAGGG CAAGCTGGTT ACCGATCCGG TTAACCTCGA TCCGTTGATC
AATGCGGCCA CGGCCGCAAC CGGAGACCAG ATCGAGCCGA CCATTCAGAA GGTCTACGAC
TGGCTGCGCG ACAACGACGC GATTGCGCCG CTGGTCTACG TACCAAGCAT CTGGGCGCAT
TCCAACCGGG TACAGGGCTT CACCAGTCCC GTCACCGAAT ACGACATGCC ATACGAAAAC
ATCGTTTTGG CCGCCGAGTA G
 
Protein sequence
MTVNRRTFLQ GAFGAVGLVM AQGALSKLVY AQGASGTLRV AIAKPAGNLD PQSHYAIWAI 
QDLMFEPLVK YGRGGQIEPC LATDWKIEGG GKTLHLTLRQ GVTFQDGTKF DAAACKWNLE
RWMGLDQFSW MNCSKYFESL EVVDDYHITL HFNEPVLALM QELSYTRPPR FLSPMSVGAD
GKFKEPVGTG PWRQVKADDT ESAFERYDGY WGDKPSYERL EAKVIPDPRS RVAALRSGEI
DLVGGFWIAP LTPEEAKQLE AAAVNVVVDP GNVTLVMAFN PDRAAALKDS QVRKAISIGI
DRAAISQVLY HGYAKPAGNL FSSALPYAGK QHGAPVRDAA AASALLEKAG WTGGPIRSKD
GKPLTLEMVV SPDAVPGSRI IAEVIQSEMK EIGIDLVIRS VDHASKHTDM LEQKYDLGFF
LTYGAPYDPF GSIVGLCLST FKNDVEGKLV TDPVNLDPLI NAATAATGDQ IEPTIQKVYD
WLRDNDAIAP LVYVPSIWAH SNRVQGFTSP VTEYDMPYEN IVLAAE