Gene Smed_0329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0329 
Symbol 
ID5321162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp358968 
End bp360563 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content60% 
IMG OID640789264 
Productextracellular solute-binding protein 
Protein accessionYP_001326022 
Protein GI150395555 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.889875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0616043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC TCACTACTCT TTTAGCGGCG ACGGCGCTCG CCACGCTTAT GGCCGGCACC 
GCCTGGTCGA AAACGTTCGT CTTCTGCTCG GAAGGTTCGC CGGAGGGCTT TGATCCTGGC
CTCTATACGG CCGGCACCAC GTTCGACGCT GCCGCCCACA CCGTTTACAG CCGTCTTCTC
GAGTTCAAGA AGGGCACGAC CGAAACGGAA CCCGGGCTCG CCGAAAGCTG GACGATCTCC
GACGACGGCC TCGAATACAC CTTCAAGCTG CGTCCCGGCG TCAAGTTTCA GACGACCGAA
TACTTCACTC CGACCCGTGA ATTGAACGCC GACGATGTCG TCTTTTCGAT CGAGCGCCAG
TGGAAATCGG ACCATCCATG GCATGGCTAT GTGACCGGCG GTTCCTGGGA ATATTTTGCC
GGCATGGGGC TGCCGGAACT GCTCGAGTCT GTCGAGAAGG TCGACGACAT GACCGTCAAG
ATCAAGCTGA AGCGCAAGGA AGCGCCGTTC CTGGCCAATC TTGCCATGCC CTTCGCGTCG
ATCATGTCGA AGGAATATGC CGACAAGCTG CAGGCCGAAG GCAAGATGAA CCAGCTCAAC
CAGATGCCGC TCGGCACCGG TCCCTTCGCC TTTGTCGCCT ATCAGCAGGA CGCGGTCATC
CGCTACAAGG CGCATCCGGA ATTCTGGGGC GGAAAGCAGA AGATCGACGA TCTGGTCTTT
GCGATCACCA CGGACGCGGC CGTTCGCTTC CAGAAGCTGC AGGCCGGCGA ATGCCACCTG
ATGCCCTATC CGAACGCGGC GGATGTCGAG GCAATGAAGG CCGATCCGAA CCTCAAGGTG
ATGGAGCAGG CCGGCCTCAA CGTCGCCTAT CTCGCCTATA ACACGACGCA GCCGCCCTTC
GATAAGCTCG AGGTGCGCAA GGCGCTGAAC AAGGCGATCA ACAAGGAGGC GATCGTCGAC
GCGGTCTTCC AGGGACAGGC GCAACCGGCG ACCAATCCGA TTCCGCCGAC CATGTGGTCC
TATAACGAGC AGATCGAAGA CGACACCTAT GATCCGGAAG CGGCGAAGAA GATGCTCGAG
GATGCCGGCG TGAAAGATCT TTCGATGAAG GTCTGGGCGA TGCCAGTGGC GCGTCCCTAC
ATGCTCAACG CCCGTCGCGC CGCCGAACTG ATGCAGGCCG ACTTCGCCAA GGTCGGCGTC
AAGGTCGAAA TCGTCTCCTA CGAATGGGCC GAATATCTCG AGAAGTCCAA GGCGAAGGAC
CGCGACGGTG CCGTGATCCT CGGCTGGACA GGCGACAACG GCGATCCGGA CAACTTCCTC
GACACGCTGC TCGGTTGCGA CGCCGTCGGC GGCAACAACC GCGCGCAATG GTGCAACCAG
GAGTTCGACG AACTCGTCAC GAAGGCGAAG GAAGCATCCG ACGTCGCAGA GCGCACCAAG
CTCTATGAAG AGGCGCAGGT CGTCTTCAAG CGCGAAGCCC CCTGGGCTAC GCTCGACCAC
TCGCTCTCCA TCGTCCCGAT GCGCAAGAAT GTCGAAGGCT TCGTGCAGAG CCCGCTCGGC
GACTTTGCTT TCGACGGCGT TGATATTGTA GAGTAA
 
Protein sequence
MKKLTTLLAA TALATLMAGT AWSKTFVFCS EGSPEGFDPG LYTAGTTFDA AAHTVYSRLL 
EFKKGTTETE PGLAESWTIS DDGLEYTFKL RPGVKFQTTE YFTPTRELNA DDVVFSIERQ
WKSDHPWHGY VTGGSWEYFA GMGLPELLES VEKVDDMTVK IKLKRKEAPF LANLAMPFAS
IMSKEYADKL QAEGKMNQLN QMPLGTGPFA FVAYQQDAVI RYKAHPEFWG GKQKIDDLVF
AITTDAAVRF QKLQAGECHL MPYPNAADVE AMKADPNLKV MEQAGLNVAY LAYNTTQPPF
DKLEVRKALN KAINKEAIVD AVFQGQAQPA TNPIPPTMWS YNEQIEDDTY DPEAAKKMLE
DAGVKDLSMK VWAMPVARPY MLNARRAAEL MQADFAKVGV KVEIVSYEWA EYLEKSKAKD
RDGAVILGWT GDNGDPDNFL DTLLGCDAVG GNNRAQWCNQ EFDELVTKAK EASDVAERTK
LYEEAQVVFK REAPWATLDH SLSIVPMRKN VEGFVQSPLG DFAFDGVDIV E