Gene Smed_2178 details


Gene Information

Locus tag: Smed_2178
Symbol:
ID: 5323038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2250833
End bp: 2252422
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 61%
IMG OID: 640791116
Product: extracellular solute-binding protein
Protein accession: YP_001327846
Protein GI: 150397379
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
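
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Smed_2178.
length = gene_length_bp(2250833, 2252422)
print(length, protein_length_aa(length))  # prints: 1590 529
```

Both results agree with the Gene Length and Protein Length fields reported for this locus.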


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.246301
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.618007
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATT ACAAAGACTA CTTGGCTCGT CAGGTCATGC TCGGCAAGAT GAAGCGGCGC 
GAGTTTCTCG GACGCGCAGC CGCTCTCGGC ATTGCCGCGT CGAGCGCAAA TATGCTCTTC
GCGTCCAGCG CCGCAGCGCA GGAGCCAAAA CGCGGCGGTC ACCTTAAGCT CGGCCTAGAA
GGGGCTGCTG CAACGGACTC GCGGGACCCC GCAAAGGCCC TGTCGCAATT CATGTTCGTC
GTCGGCCGCA ACTGGGGCGA CATGCTGGTC GAGAGCCATC CGACGACCGG CGAGCCGGTG
CCGGCACTTG CGGAATCCTG GGAACCCTCC GCCGATGCGT CCACCTGGAC CTTTACGATC
CGCAAGGGCG TGAAATTCCA TGACGGCAAG GAACTGACGA TCGACGACGT CATCAAGACT
CTGCAGCGAC ACACGGATGA AAAATCCGAG TCCGGCGCGC TCGGCGTGAT GAAATCCATC
AAGGAAATCA AGGCCGATGG CGATAAGCTC GTCCTGGTGC TGACGGAAGG CAATGCGGAC
CTGCCGCTGC TTTTGACCGA CTACCATTTG ATCATCCAGC CGAACGGCGG CACCGACAAT
CCCGATGCGA TGATCGGTAC CGGTCCCTAC AAGGTCGCAA GCTTCGAGCC GGGCATACGC
GCCACGTTCG AGAAGAACCC GGACGACTGG CGCACCGACC GCGGCTTCGT CGATTCCATC
GAATTGATCG CCATGAACGA CGCGACGGCG CGTGTCGCGG CGCTTTCCTC GGGCCAGGTC
CACTTCATCA ACCGCGTCGA TCCTAAAACC GTCAATCTGC TGAAGAAGGC GCCGACTGTT
GAAATCCTCA ACACGTCCGG CCGCGGCCAC TACGTGTTCA TCATGCATTG CAACACGGCG
CCCTTCGACA ACAACGATCT GCGCATGGCG CTGAAATATG CCATGGACCG CGAGACCCTC
GTAGAGCGCA TCCTCGGCGG CTACGGCAAG ATCGGGAACG ACTTCCCGAT CAACGACACC
TATGCGCTTT TCCCGGAAGG GATAGAGCAA CGCACTTACG ACCCCGACAA GGCCGCATTC
CACTACAAGA AATCCGGCCA TAGCGGTCCG GTGCTGCTGC GCACCTCCGA CGTCGCCTTC
CCGAACGCGG TCGACGCCGC GGTTCTCTAC CAGGCGAGCG CCAGGAAGGC CGGCATCGAG
ATCGAGGTCA AGCGCGAGCC CGGCGACGGC TACTGGTCCA ATGTCTGGAA CGTCCAGCCT
TTCTCGACAT CTTATTGGGG CGGACGCCCG ACCCAGGATC AGATGTACTC CACCGCCTAT
CTCTCGACGG CCGACTGGAA CGACACCCGT TTCAAGCGTC CGGATTTCGA CAAGATCCTG
CTTGAGGCGC GTTCCGAGCT GGACGAAGCC AAGCGCAAGG ACATGTACCG CACCATGGCG
ATGATGGTGC GCGACGAGGG CGGCCTGATC CTGCCCATGT TCAACGACTT CGTGAACGCG
GCCGGCAAGA CGGTGAAGGG CTATGTCCAC GACATCGGCA ACGACATGTC CAACGGCTAT
GTCGCGACCA GGGTATGGCT GGACGCCTGA
 
Protein sequence
MSDYKDYLAR QVMLGKMKRR EFLGRAAALG IAASSANMLF ASSAAAQEPK RGGHLKLGLE 
GAAATDSRDP AKALSQFMFV VGRNWGDMLV ESHPTTGEPV PALAESWEPS ADASTWTFTI
RKGVKFHDGK ELTIDDVIKT LQRHTDEKSE SGALGVMKSI KEIKADGDKL VLVLTEGNAD
LPLLLTDYHL IIQPNGGTDN PDAMIGTGPY KVASFEPGIR ATFEKNPDDW RTDRGFVDSI
ELIAMNDATA RVAALSSGQV HFINRVDPKT VNLLKKAPTV EILNTSGRGH YVFIMHCNTA
PFDNNDLRMA LKYAMDRETL VERILGGYGK IGNDFPINDT YALFPEGIEQ RTYDPDKAAF
HYKKSGHSGP VLLRTSDVAF PNAVDAAVLY QASARKAGIE IEVKREPGDG YWSNVWNVQP
FSTSYWGGRP TQDQMYSTAY LSTADWNDTR FKRPDFDKIL LEARSELDEA KRKDMYRTMA
MMVRDEGGLI LPMFNDFVNA AGKTVKGYVH DIGNDMSNGY VATRVWLDA
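
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before any computation. The sketch below shows one way to compute GC content and translate the gene sequence for comparison with the listed protein. It is an illustration under stated assumptions, not the database's own pipeline: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs only in the start codons it permits).

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumes the sequence starts in frame. An alternative start codon
    (e.g. GTG or TTG) would need to be read as M; this gene starts with ATG.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGAGCGATT ACAAAGACTA CTTGGCTCGT"))  # prints: MSDYKDYLAR
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check; `gc_percent` on the full gene sequence can be compared with the GC content field in the same way.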