Gene Smed_5080 details


Gene Information

Locus tag: Smed_5080
Symbol:
ID: 5319382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 27372
End bp: 28880
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 60%
IMG OID: 640776860
Product: extracellular solute-binding protein
Protein accession: YP_001313792
Protein GI: 150377197
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
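
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not appear in the protein. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(27372, 28880)
print(length)                  # 1509
print(protein_length(length))  # 502
```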


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0526207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAACA AATCTGTTCG CGTGATTGCG GGCGCCATGG TGGTCGCCGG ATGGGCCGGC 
TATGCCGCGG CGCAGAATGC CTCGGAAATC AAGATCGTCC TGCCGGAACA GCCGGCCAAT
CTCGAGCCCT GCGGTACCAT CATCACCAAT GTCGGCCAGA TACTGAGCCG CAACGTGGTC
GAGCCGCTGA CGATCATCGA TCCGAAAAGC GGCCAGCCAA CATCCGGTCT TGCGACCGAG
TGGAAGCAAA CGGATCCGAA CACGTGGCAG CTCAAGCTGC GCGAAGGCGT CAAATTCCAG
GATGGTGCCG CCTTCAATGC AGAGGCGGTC AAATTCTCGA TCGAGCGCAT GACCGGCGGC
AAGCTGACCT GCAGCAACAT TGCCAAATTC GGCAATGCCA AGCTCACCGT CACGCCGATC
GACGACCTCA CGGTCGAGAT CAAATCGGAT ACGCCGCAGC CTATTTTGCC GACGCTGCTC
AGCGTGGTCA TGATCGTCTC GCCGAACACC CCGGCGGACA AGGCCGTGAA CGATCCGGTC
GGAACCGGTC CTTTCAAGCT CTCGAGCTTT ACGCCACAGA CTGTCGTGTT GGAAGCCTTT
GACGGCTACT GGGGCGAGAA GCCGGCCATT GCCAGGGCGA GCTATGTCTG GCGCCCGGAA
TCCTCGATCC GTGCCGCCAT GGTGGAGACC GGCGAGGCCG ATCTGACGCC GTCCATCGCC
ATCCAGGATG CCACCAACCC GGAAACGGAC TTCGCCTATC TGAACTCGGA GACGACAGCG
ATCCGCATCG ATGCCGGGTT CGCTCCGCTC GACGACGTGC GGATTCGCAA GGCGCTGAAC
CTTGCGATCG ACTGGAATGG TCTTGCGCAG CTTTTCGGCG AGGACGTGCA GCGTGCTTCG
CAGATGGTTG TCACCGGCAT CAACGGTCAT GACGACAAGC TGGCGCCCTG GGCCTTCGAT
GCCGAAAAGG CCCGTGCGCT GATCGCCGAG GCCAAGGCTG CGGGCGTACC GGTCGATACC
GAAATCGAAC TGATCGGCCG CAACGGAATT TATCCCAACG GTACGGAAGC CATGGAAGCC
ATGATGGCCA TGTGGCAGGA TGTCGGTCTG AATGTGAAGC TGACGATGCT CGACGTGAAC
GATTGGCTCC GCTACCTGCA GAAGCCTTTC CCGGAAAGCC GCGGGCCGAA CCTTTTGCAG
ATGATGCATG ACAACAACAA GGGCGACGCC GCCTTCACCG TTCCGATCTT CTATACGTCG
GGCGGAAGCT ACTCGACCTT CAACGATGCG GCGTTCGACA AGGAGATCGC CGATGCCATG
GCTGCCACCG GCGAGGACCG TACGGCCAAG TTCAAGGCGA TCTTCGCGAA GGTGCATGAG
GAGCTTGCGG TCGATATCCC GATGTTCCAC ATGATCGGCT ACACCCGGGT GGGCAGCCGT
CTGGAGTGGA AGCCGGACAT CACGACCAAC AGCGAGATCC CGCTGGCCAA TATCGGCCTC
AAGGATTAA
 
Protein sequence
MGNKSVRVIA GAMVVAGWAG YAAAQNASEI KIVLPEQPAN LEPCGTIITN VGQILSRNVV 
EPLTIIDPKS GQPTSGLATE WKQTDPNTWQ LKLREGVKFQ DGAAFNAEAV KFSIERMTGG
KLTCSNIAKF GNAKLTVTPI DDLTVEIKSD TPQPILPTLL SVVMIVSPNT PADKAVNDPV
GTGPFKLSSF TPQTVVLEAF DGYWGEKPAI ARASYVWRPE SSIRAAMVET GEADLTPSIA
IQDATNPETD FAYLNSETTA IRIDAGFAPL DDVRIRKALN LAIDWNGLAQ LFGEDVQRAS
QMVVTGINGH DDKLAPWAFD AEKARALIAE AKAAGVPVDT EIELIGRNGI YPNGTEAMEA
MMAMWQDVGL NVKLTMLDVN DWLRYLQKPF PESRGPNLLQ MMHDNNKGDA AFTVPIFYTS
GGSYSTFNDA AFDKEIADAM AATGEDRTAK FKAIFAKVHE ELAVDIPMFH MIGYTRVGSR
LEWKPDITTN SEIPLANIGL KD
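
The relationship between the two sequences can be illustrated with a minimal sketch that strips the 10-character block formatting and translates the first line of the gene sequence. The names `clean_sequence` and `translate` are hypothetical helpers. The codon table is built from the standard NCBI base ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed above.
first_line = "ATGGGAAACA AATCTGTTCG CGTGATTGCG GGCGCCATGG TGGTCGCCGG ATGGGCCGGC"
print(translate(clean_sequence(first_line)))  # MGNKSVRVIAGAMVVAGWAG
```

The output matches the first 20 residues of the protein sequence listed above (MGNKSVRVIA GAMVVAGWAG).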