Gene Smed_5469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5469 
Symbol 
ID5319771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp437084 
End bp438169 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content59% 
IMG OID640777230 
Productextracellular solute-binding protein 
Protein accessionYP_001314162 
Protein GI150377567 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.27299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGA ACAATGACGG GAACTGGTCG CGCAGGCGGT TCCTGAAAAC CACCGCGATC 
GGAGCTGCAG CACTTTCAAG TCCGGCAATT TGGACCTCGG CGAGGGCGCA GGGCAAGCGC
ATCATCGTCC GCGACGATGG CGGCATCTAC ACGAAGGCCT ATAACGCCGT GTACTACGGC
CCCTTCAAGG AAGCGACCGG CATCGAGGTG GTCGGCGTTC AGGCCAATGC CGAACCGACG
GCGCAAATTA AATCGATGGT CGACGCGGGC TCCTATACCT GGGACATGGC AAAGATCAGC
GAACCTGCAA TCGAGCTCCT GACCGACGGC GAGAAGAAAT ACCTCGAGCA ACATGGGCTT
GGCAGTGAAG CGGCGATCGC AAGCCTTCCA AAGCAGTATC TGTCTGACTA CGGCGTCGGA
ACCAACGTAT ATACCACGGT GCTCGCCTAC CGTTCGGACG CGTTCGAAGG ACAGGACGCG
CCGAAATCGT GGGCCGATTT CTACGACGTC GCGAAGTATC CGGGCAGGCG CGCTTTGCGC
AAGCATCCTT TCGATACGAT CGAACAGGCA CTGATGGCAG ATGGCGTGCC GGTGGCGAAC
GTGTATCCGT GCGACGTCGA CCGCGCGTTC AAAAAGCTCG ACACGATCAA GAGCGACGTA
GAAGTATTTT GGACGAGTGG CGCTCAGGTC GAGCAGATGC TGATCTCCGG CGAAGTCGAT
ATGATCCCGA CCTGGGTTTC GCGTGCGCAG GCTGCACGGT CGGCGGGGGC GCCGGTCGAG
ATCGTCTGGG ATCAGAATAT CTGGGGCCTC GACAGCTGGG CGATCCTTGC CGGTACCCCC
AACGCGGATG CATGCCGCGA ATTCATCAAG TTCGCATCCG ATCCGAAACG GCAGGCAGCC
CTTGTGGATT ATTTCCCCGC AGGCGTCACG CAGCCGGCGG CGTTCGACGA CATCGATCCG
AAGATCGCAA AAGATTGCCC GACGTTCCCG GAACACATCA AGCGCGGTGT GAAGATCGAC
GCCAAGTACT GGTTTGCAAA TCAGGCACAA GTCATCGAAC GTTACAATTC TTGGCTCGTG
AGCTGA
 
Protein sequence
MSMNNDGNWS RRRFLKTTAI GAAALSSPAI WTSARAQGKR IIVRDDGGIY TKAYNAVYYG 
PFKEATGIEV VGVQANAEPT AQIKSMVDAG SYTWDMAKIS EPAIELLTDG EKKYLEQHGL
GSEAAIASLP KQYLSDYGVG TNVYTTVLAY RSDAFEGQDA PKSWADFYDV AKYPGRRALR
KHPFDTIEQA LMADGVPVAN VYPCDVDRAF KKLDTIKSDV EVFWTSGAQV EQMLISGEVD
MIPTWVSRAQ AARSAGAPVE IVWDQNIWGL DSWAILAGTP NADACREFIK FASDPKRQAA
LVDYFPAGVT QPAAFDDIDP KIAKDCPTFP EHIKRGVKID AKYWFANQAQ VIERYNSWLV
S