Gene Smed_5559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5559 
Symbol 
ID5319861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp526584 
End bp528296 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content62% 
IMG OID640777308 
ProductABC transporter related 
Protein accessionYP_001314240 
Protein GI150377645 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0799333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGC CCGACACGCT GCTCTCGGTG CGCAATCTCT CGATTGATTT CCACCTGCGC 
ACGCACGTGC TCCACGCGGT GCGCAACGTC AGCTTCGACC TGAAGCGCGG CCAGACCATG
GCTCTCGTGG GCGAAAGCGG CTCGGGAAAA TCGGTGACCG CGCGCGCCTT GATGCGGATC
ATTGATAAGC CCGGCCGAAT GATCGGCGGC CAGATCCTGC TCGACGGCCC GAATGGCCCT
GTGGATGTGG CACGCTTCAA GGAAGGCAGC CGCGAGGTGC TCGCCATCCG TGGTGGGCGG
ATCGGGTTGA TCTTCCAGGA GCCAATGAGT TCCCTGTCGC CGGTCCACAC TATCGGATCG
CAGATCGTCG AAGCGGTGCG CCTGCATCGG CGGGTGTCCA AGTCGGAGGC GCGCGCCCGC
TGTGTCGAAC TGCTGCGTCA GGTCGAGATC CCGCAACCAG AGCTGATGGC CGACCGGTAT
ACCTTCGAGT TTTCCGGCGG CATGCGGCAA AGGGCGATGA TTGCGATGGC GCTCGCCTGC
GATCCGCAGG TGCTTATTGC CGACGAGCCG ACGACGGCGC TCGACGTGAC CACTCAGGCA
GAAATCCTCG ACCTCATCAA GCGGCTGCAG GACGCACGCG GCATGGCAAT GCTGCTTATC
ACCCACGACA TGGGCATCGT CGCCGAAGTC GCGGATGACG TTGCCGTCAT GCGCTTCGGC
AAGATCGTTG AACAGGGGCC GGTTGACGAC ATATTCCACG CCAGCCAGCA CCCGTATACG
CGCCAGCTTC TCGACGCGAC GGTCAAGCTC GAAAGCGGCG CGGCGATCCG TGCTTTGCCG
GCATCGCTGA CACCGTCCGT CGAGCCAGTG CTTTCTGTTC GCAATCTCAC CAAGATCTAC
GGCGCACCCT CGCGCATGTT CGCGCGCAGC GGCGGCCGCG GACTGGTGGC TGTCGATGAT
GCCAGCCTCG ACCTCTTCCC GGGCGAGAAC CTTGGCATCG TCGGCGAAAG CGGCTCCGGC
AAAACGACGC TTGGACGCAT GATCCTGCGC ATTGTCGAGC CGACGTCCGG GACCGTGACG
TATCGTGCCG ATGCGACGTC TGCACCTGTC GACGTCACCG CGCTCGGCAA GGTAGATCTG
CGGCGCTACC ATCAGGATGT ACGGCTGATC TTTCAGGATC CCTTCGCGTC GCTCAATCCA
CGCATGACCG TGAAACAGAT CATCGGCGAT CCGCTCGTCA TCTCCAAGGG CATGTCCGGC
AAGGCCGTGG AGGTACGGGT TGCCGAACTT ATGGGCAAAG TGGGTCTGGA CCCGCTCGCC
ATGGAGCGCT ACCCGCACGC ATTTTCGGGC GGTCAGCGCC AGCGCATCGG CATCGCCCGG
GCGCTTGCCC TCAATCCCAC GGTCATCGTT GCGGACGAAG CGACATCCGC TCTCGATGTC
TCGATCCGCA GCCAGATTCT CGATCTTATG ATCGACATCC AGAAGCAGTT GCATCTCAGC
TTCATCTTCA TCTCCCACGA CATCTCGGTC GTGCGCTATT TCTGCGATCG CGTCGCCGTC
ATGCACCGGG GCAAGATCGT TGAAGTCGGT GATGCCGAAA CTATCTGCAC CAACCCTTCG
CAGCCCTACA CGAGGCGGCT GATTTCCTCT GTTCCAAACC CGGATCCCCG CAACAAGCGC
ATGCTTCACC GCCTGCGCAC AGATCAAGTC TAA
 
Protein sequence
MTKPDTLLSV RNLSIDFHLR THVLHAVRNV SFDLKRGQTM ALVGESGSGK SVTARALMRI 
IDKPGRMIGG QILLDGPNGP VDVARFKEGS REVLAIRGGR IGLIFQEPMS SLSPVHTIGS
QIVEAVRLHR RVSKSEARAR CVELLRQVEI PQPELMADRY TFEFSGGMRQ RAMIAMALAC
DPQVLIADEP TTALDVTTQA EILDLIKRLQ DARGMAMLLI THDMGIVAEV ADDVAVMRFG
KIVEQGPVDD IFHASQHPYT RQLLDATVKL ESGAAIRALP ASLTPSVEPV LSVRNLTKIY
GAPSRMFARS GGRGLVAVDD ASLDLFPGEN LGIVGESGSG KTTLGRMILR IVEPTSGTVT
YRADATSAPV DVTALGKVDL RRYHQDVRLI FQDPFASLNP RMTVKQIIGD PLVISKGMSG
KAVEVRVAEL MGKVGLDPLA MERYPHAFSG GQRQRIGIAR ALALNPTVIV ADEATSALDV
SIRSQILDLM IDIQKQLHLS FIFISHDISV VRYFCDRVAV MHRGKIVEVG DAETICTNPS
QPYTRRLISS VPNPDPRNKR MLHRLRTDQV