Gene Smed_5555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5555 
Symbol 
ID5319857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp520226 
End bp522154 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content57% 
IMG OID640777304 
Productextracellular solute-binding protein 
Protein accessionYP_001314236 
Protein GI150377641 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.140103 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0740019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAAT TCAGCCGTCG CGCGTTTCTC TTTTCGAGCA CCGCCGCCGT TCTACTGCCT 
GTAATGCCAA TGATTGCACT GGCCAACTCG GCCAAAGAGC CTGCCATGTT GGAGGCGCTG
GCCAAGACGG GCAGCCTGCC AGCCGTTGCG GACCGCTTGC CGCTCAACCC GATGGTCGTC
ACCCCGCTGG ATCGGGTGGG GACCCATGGA GGCGACTGGA ACAGTGCAAT CGTTGGCGGG
GGATCCCTGT CGATGTTGTT CCGCTATCAG GCTTACGAGC CTCTGTTAAG GTATGCGCCG
GATTGGTCGG GCGTGGTGCC AAACGTTGCC GAACATTACG AAAGCAATGC CGACGCGACT
GAGTTCACGT TCAGGCTGCG CAAGGGCATG AAATGGTCGG ATGGCGAGCC CTTCACCACG
GAAGACATCC TGTTCTGGTA CGAGGATATC TTCAACTACG AGGGGCTCAA CGATGTCGGG
CAGAACCATC TGCGCGCCGG AGGCAAGAAG GCGCGCTTCG AAGCTGTCGA CGACGTCACG
TTCAAGGTGA TCTTCGCAGC ACCCAATGGA CTTTTTCCCC TCCGGCTCGC ATGGGCGAAC
GACGATCAGA CGACGCGGGC ACCGAAGCAT TACCTCAAGC AGTTCCACAT CAAGTACAAT
CCAAATGCCG AAGAGGAAGC CAAGACCAAG GGCGCCTCGG GATGGATCCA GCTTTTCCAG
CGGGAAGCTG GTCTCGTCGT AGACAACGAA TTCTTCCAGA ACTCGCAGCG CCCGGTCATT
CATGCCTGGA AGGTGGCCAT CGCGCCCGGT CAGAGTACCG ACCGTGCCGT TGCCGAGCGA
AATCCCTACT ACTGGAAAGT CGATACCGAG GGCAATCAAC TGCCTTATCT GGATCGGATC
GTCTACCAGA TGGTTTCCGA TCCGCAGGTC CTACTTCTGA AGGCGATGCA GGGCGAGATC
GATTTGATGG ATCAGTATAT TGCCACGCCT GCCAATCGTG CCGTGCTGTA CGATTCCCAA
GAGCAGGGCA GATTTGGATT CTACACGCTG ACCTCAACCG AAACAAACGA GATGGTTTTC
CAGCTCAATC TCAACCACCC CAATGAGGTG AAGCGCAAGC TCTACAACAA CAAGGACTTC
AGGGCAGCGC TCTCAATGGC GCTCGATCGC CAGGCCATCA TCGATACCGT GTTCATCGGA
CAGGGAACGA TCTCGCAGCC TGCCGTGCGA GCGGACGATC CGCTCTACAA CGAACGCCTC
GCAACGCAGT ACACGCAATA CGACCCCAAT CGCGCCAACG CTCTCCTGGA CAAGATCCTG
CCGAGCAAGG ATAGTGAAGG TTTCCGTCTC GATGAAGGCG GAAAACGGGT ATCGATCATT
TTTGAAATCG ATCAGGCGCG CGCCACCTTC CTCGACATCT TCCAACTTGC TCTGCCGATG
TTCCGGGCTG TCGGCGTTGA TGTCCAGATG AGAAGCATGG ACCGCTCGCT TTGGGAAGTG
CGCGTGCGGC AGGGTATCGA GTATGACGCG ACAGCCCATC GCTTTGGCGG CAATGGCGGC
ATCGCGGCAA TCCTTGACCC GCGTTATTTC ATTCCCAATA CGACAGAAGC GCTGTACGCG
AAAGGTTGGC AACTCTGGTA TCGAGATTCG CAATCCCAGG GTGCGGTGGA GCCACCGCAG
CCCGTCAGAA ACGCCTTGGC TCTCTACGAT CGGGTGCTCG CTTCGGCCGA TCCCGATGTG
CAAAAGAAGC TCATGGCCGA GATTCTGGAG ATCGCTGCCG ACCAGTTCTA TGTGTTCGGA
ATCTGCCTGC CCGCCGACAG CTATGGGGTG GTTAAAAACG ACATGCAGAA CGTCCCCGAG
GCGATGCCGA ACTCCTGGGG ATATCCGACG CCCGGACCTG TCAATCCCGA GACTTTCTTC
AAGGTCTGA
 
Protein sequence
MQEFSRRAFL FSSTAAVLLP VMPMIALANS AKEPAMLEAL AKTGSLPAVA DRLPLNPMVV 
TPLDRVGTHG GDWNSAIVGG GSLSMLFRYQ AYEPLLRYAP DWSGVVPNVA EHYESNADAT
EFTFRLRKGM KWSDGEPFTT EDILFWYEDI FNYEGLNDVG QNHLRAGGKK ARFEAVDDVT
FKVIFAAPNG LFPLRLAWAN DDQTTRAPKH YLKQFHIKYN PNAEEEAKTK GASGWIQLFQ
REAGLVVDNE FFQNSQRPVI HAWKVAIAPG QSTDRAVAER NPYYWKVDTE GNQLPYLDRI
VYQMVSDPQV LLLKAMQGEI DLMDQYIATP ANRAVLYDSQ EQGRFGFYTL TSTETNEMVF
QLNLNHPNEV KRKLYNNKDF RAALSMALDR QAIIDTVFIG QGTISQPAVR ADDPLYNERL
ATQYTQYDPN RANALLDKIL PSKDSEGFRL DEGGKRVSII FEIDQARATF LDIFQLALPM
FRAVGVDVQM RSMDRSLWEV RVRQGIEYDA TAHRFGGNGG IAAILDPRYF IPNTTEALYA
KGWQLWYRDS QSQGAVEPPQ PVRNALALYD RVLASADPDV QKKLMAEILE IAADQFYVFG
ICLPADSYGV VKNDMQNVPE AMPNSWGYPT PGPVNPETFF KV