Gene Smed_5623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5623 
Symbol 
ID5319925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp590144 
End bp591151 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content60% 
IMG OID640777366 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001314298 
Protein GI150377703 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0190299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.692526 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACACG CACCTGTTCT CAAAGTCGAA AACCTGCAAA CACGCTTCAA GAGCGTCCAG 
CGGGGCAAGT ACGTCCATGC GGTCGACGAT GTTTCGATCG AGCTCTATCC AGGCGAGATC
GTCGGTTTGG TCGGCGAATC CGGCTGCGGA AAATCCACGC TCGGAAGAAC CATCGTCGGT
CTCGAGAAGG CAAGTGCTGG ACGGGTACTG CTCGACGGGG TCGACCTCAG CACGCTTTCG
GGCGCCGCAC TGCGAAACAG TCGTCGGGCT CTGCAGTACG TGTTCCAGGA TCCCTATTCG
TCCCTGAACG ATCGTCAGAC GGTTGGCGAG GCGATCGACG AAGCTCTATT GATCGATGGC
CTCAGGTCGG CGGACGAGAG AACTCGTCGG GCCAAGGAAC TATTGGAGCA GGTCGGTCTG
CCTCATACGG CAAGGGACCG TCACACACGC GAGCTATCGG GCGGCCAGCG TCAGCGCGTT
GCCATTGCCA GATCTCTCGC GGTGAACCCG CGAGTTCTGA TCTGCGACGA GCCGGTTAGC
GCCCTCGATC TCTCCATCCG GGCGCAAGTC ATGAACCTGT TCCTGCGCTT GCAGAAGGAT
CTGGGTGTCG CCTGCCTGTT CATCGCCCAT GACCTTGCAC TTGTGAGGCA GGCCGCCTCG
CGCGTTTACG TCATGTATCT AGGCAAGATC GTTGAGCATG GGCCGTCGCA GGAACTGTAC
GATCATCCTG GCCACCCATA CTCTCAGATG TTGCTGGCCT CCGTTCCCGA GGTCGACCCA
CGCGTTGAAA AGCTCCGCAG CGCTCCTTTG CTGAAGGGCG AAGTGCCAAG TCCGACCAAT
CCACCGTCCG GCTGCCGATT CCGGACACGT TGTCCGCTTG CGGTTGAGGA CTGCGCCCTA
AGAGCACCAG CATCACATGT CCTTTCGCCG GACCACAACG CCGCGTGCAT TTTTGCCCCC
GACCTTCATG GAGGGAAGCG CTCGGCTCTA ATTCACCAGG CTGCATAA
 
Protein sequence
MTHAPVLKVE NLQTRFKSVQ RGKYVHAVDD VSIELYPGEI VGLVGESGCG KSTLGRTIVG 
LEKASAGRVL LDGVDLSTLS GAALRNSRRA LQYVFQDPYS SLNDRQTVGE AIDEALLIDG
LRSADERTRR AKELLEQVGL PHTARDRHTR ELSGGQRQRV AIARSLAVNP RVLICDEPVS
ALDLSIRAQV MNLFLRLQKD LGVACLFIAH DLALVRQAAS RVYVMYLGKI VEHGPSQELY
DHPGHPYSQM LLASVPEVDP RVEKLRSAPL LKGEVPSPTN PPSGCRFRTR CPLAVEDCAL
RAPASHVLSP DHNAACIFAP DLHGGKRSAL IHQAA