Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5559 |
Symbol | |
ID | 5319861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 526584 |
End bp | 528296 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640777308 |
Product | ABC transporter related |
Protein accession | YP_001314240 |
Protein GI | 150377645 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.153225 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0799333 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAGC CCGACACGCT GCTCTCGGTG CGCAATCTCT CGATTGATTT CCACCTGCGC ACGCACGTGC TCCACGCGGT GCGCAACGTC AGCTTCGACC TGAAGCGCGG CCAGACCATG GCTCTCGTGG GCGAAAGCGG CTCGGGAAAA TCGGTGACCG CGCGCGCCTT GATGCGGATC ATTGATAAGC CCGGCCGAAT GATCGGCGGC CAGATCCTGC TCGACGGCCC GAATGGCCCT GTGGATGTGG CACGCTTCAA GGAAGGCAGC CGCGAGGTGC TCGCCATCCG TGGTGGGCGG ATCGGGTTGA TCTTCCAGGA GCCAATGAGT TCCCTGTCGC CGGTCCACAC TATCGGATCG CAGATCGTCG AAGCGGTGCG CCTGCATCGG CGGGTGTCCA AGTCGGAGGC GCGCGCCCGC TGTGTCGAAC TGCTGCGTCA GGTCGAGATC CCGCAACCAG AGCTGATGGC CGACCGGTAT ACCTTCGAGT TTTCCGGCGG CATGCGGCAA AGGGCGATGA TTGCGATGGC GCTCGCCTGC GATCCGCAGG TGCTTATTGC CGACGAGCCG ACGACGGCGC TCGACGTGAC CACTCAGGCA GAAATCCTCG ACCTCATCAA GCGGCTGCAG GACGCACGCG GCATGGCAAT GCTGCTTATC ACCCACGACA TGGGCATCGT CGCCGAAGTC GCGGATGACG TTGCCGTCAT GCGCTTCGGC AAGATCGTTG AACAGGGGCC GGTTGACGAC ATATTCCACG CCAGCCAGCA CCCGTATACG CGCCAGCTTC TCGACGCGAC GGTCAAGCTC GAAAGCGGCG CGGCGATCCG TGCTTTGCCG GCATCGCTGA CACCGTCCGT CGAGCCAGTG CTTTCTGTTC GCAATCTCAC CAAGATCTAC GGCGCACCCT CGCGCATGTT CGCGCGCAGC GGCGGCCGCG GACTGGTGGC TGTCGATGAT GCCAGCCTCG ACCTCTTCCC GGGCGAGAAC CTTGGCATCG TCGGCGAAAG CGGCTCCGGC AAAACGACGC TTGGACGCAT GATCCTGCGC ATTGTCGAGC CGACGTCCGG GACCGTGACG TATCGTGCCG ATGCGACGTC TGCACCTGTC GACGTCACCG CGCTCGGCAA GGTAGATCTG CGGCGCTACC ATCAGGATGT ACGGCTGATC TTTCAGGATC CCTTCGCGTC GCTCAATCCA CGCATGACCG TGAAACAGAT CATCGGCGAT CCGCTCGTCA TCTCCAAGGG CATGTCCGGC AAGGCCGTGG AGGTACGGGT TGCCGAACTT ATGGGCAAAG TGGGTCTGGA CCCGCTCGCC ATGGAGCGCT ACCCGCACGC ATTTTCGGGC GGTCAGCGCC AGCGCATCGG CATCGCCCGG GCGCTTGCCC TCAATCCCAC GGTCATCGTT GCGGACGAAG CGACATCCGC TCTCGATGTC TCGATCCGCA GCCAGATTCT CGATCTTATG ATCGACATCC AGAAGCAGTT GCATCTCAGC TTCATCTTCA TCTCCCACGA CATCTCGGTC GTGCGCTATT TCTGCGATCG CGTCGCCGTC ATGCACCGGG GCAAGATCGT TGAAGTCGGT GATGCCGAAA CTATCTGCAC CAACCCTTCG CAGCCCTACA CGAGGCGGCT GATTTCCTCT GTTCCAAACC CGGATCCCCG CAACAAGCGC ATGCTTCACC GCCTGCGCAC AGATCAAGTC TAA
|
Protein sequence | MTKPDTLLSV RNLSIDFHLR THVLHAVRNV SFDLKRGQTM ALVGESGSGK SVTARALMRI IDKPGRMIGG QILLDGPNGP VDVARFKEGS REVLAIRGGR IGLIFQEPMS SLSPVHTIGS QIVEAVRLHR RVSKSEARAR CVELLRQVEI PQPELMADRY TFEFSGGMRQ RAMIAMALAC DPQVLIADEP TTALDVTTQA EILDLIKRLQ DARGMAMLLI THDMGIVAEV ADDVAVMRFG KIVEQGPVDD IFHASQHPYT RQLLDATVKL ESGAAIRALP ASLTPSVEPV LSVRNLTKIY GAPSRMFARS GGRGLVAVDD ASLDLFPGEN LGIVGESGSG KTTLGRMILR IVEPTSGTVT YRADATSAPV DVTALGKVDL RRYHQDVRLI FQDPFASLNP RMTVKQIIGD PLVISKGMSG KAVEVRVAEL MGKVGLDPLA MERYPHAFSG GQRQRIGIAR ALALNPTVIV ADEATSALDV SIRSQILDLM IDIQKQLHLS FIFISHDISV VRYFCDRVAV MHRGKIVEVG DAETICTNPS QPYTRRLISS VPNPDPRNKR MLHRLRTDQV
|
| |