Gene Information |
Locus tag | Smed_1749 |
Symbol | |
ID | 5322607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1828694 |
End bp | 1830493 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640790687 |
Product | ABC transporter related |
Protein accession | YP_001327419 |
Protein GI | 150396952 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
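The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic, which the short sketch below checks. It assumes that the coordinates are 1-based and inclusive and that the gene length includes a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(1828694, 1830493)
print(length, protein_length(length))  # prints "1800 599"
```

Under these assumptions the result agrees with the listed Gene Length (1800 bp) and Protein Length (599 aa).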
| |
Plasmid Coverage Information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.84058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.153665 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTGC CAAGGCTTTG CCCGATGACT CTCCCTCTGC TTTCCGTGAA CGACCTGAGC GTCCAGGTGG CCACGCCGAC CGGGCCGAAG GTCGTCGTGG AGCGGCTGTC CTTCGATCTG CTTCCAGGCA AGACGCTTTG CCTTGCAGGT GAGAGCGGAT CTGGAAAATC GATGACGGCC CTGTCGCTGA TGCAACTGCT GCCGAAGCCG ACGGCGCGAA TCGCGGACGG CTCGGCCATT CTCGATGGTG ACGACATACT CGCTCTTCCC GAAGCACGCA TGCGGCAGGT TCGCGGGCGC AAGATCGGGA TGATCTTTCA GGAGCCCATG ACCTCGCTCA ACCCCGTGAT GACCGTCGGT GCGCAACTGG TCGGAGCGAT CACCGCCCAC GGCAGCCGAG GCCGGCGCGC TTCGCATGCG CGCGCCGTCG AACTCCTGGA CCAGGTCCAG ATTCCGGAAC CGGAGCGCCG GCTCGGCCAG TATCCGCACG AGCTTTCCGG TGGCTTGCGT CAGCGTATCG TCATCGCGAT GGCGCTCGCG CAGAACCCGC GCATTCTGAT CGCGGATGAA CCGACGACTG CGCTCGACGT AACGGTTCAG GCCCAGATCC TGGCGCTTAT CCGCAAGCTG CAGGCCGATC ACGGAATTTC GGTGATCATG ATCACCCATG ACATGGGCGT GGTCGCGGAG ATGGCCGACG AGGTGCTGGT GATGAAGCAT GGCCGAACTG TCGAGCATGC AACCACCAAG GATCTTTTCG CGCGTCCTGA GGATGCGTAT ACGAAGGAGC TTCTTTCGGC CGTGCCGCGT CTCGGCGAAA TGGCAGGGTC GGAGATACCC AAGCGCGCTG CGGGAAGGGC GGTCGCCGGA TCTTCGAGCG ACGCCGAGCA GCCGATGCTC GAGGTCAAAA ACCTGACGGT CCGTTTCGAC ATCAAGGGCG GCATTCTACA GCGCCCGGTC AAGCGCCTTC ACGCGGTGGA GGGCATCTCA TTCGATGTGC TGAAGGGGGA GACGCTGTCT CTTGTCGGAG AATCCGGCTG TGGCAAGTCG ACGACCGGAA AGGCTCTGTT GAACCTTTTG CCATGGGCGG GGGAGATACG CGTCAACGGA CGCTCCACCC GCGGCTTACG GGGGAGCACG ATGCGGCCCA TCCTTCGAGA CGTCCAGATG ATCTTCCAGG ATCCTTACGC ATCCCTTGAT CCGCGCATGC GGGTTGGCGA TCTGGTGGCG GAGCCTCTGG CAATCCATGG GCTGGCAACC GGCAGCGCAT TGCGTGACCG GGTGGAATAT CTCTTAAAGC GGGTTGGATT ATCGCCTGAG CAGATGAAGC GTTACCCGCA TGAGTTTTCC GGCGGGCAAC GGCAGCGCAT CTGCATTGCT CGGGCTTTGT CGCTGTCACC GAAGCTCATC GTCGCAGATG AATCCGTTGC GGCACTCGAC GTGTCGATCC AGGCTCAGGT GCTCGATCTC CTGCAGGACA TCCAGGACGA GACAGGCATC TCCTACCTGT TCATTTCTCA CGACATGGCG GTCGTCGAGC AGATCAGCCA TCGGGTAGCG GTCATGTATA TGGGCCGGTT CGTCGAAATG GGGACGCGGC GACAGATATT CGAGAATCCG CAGCATCCCT ATACGAAGAA ACTGATGGCT GCGGTCCCCG TGGCCGATCC GACGAGACCA CGTCGGGATT TCGTACCAAA AGCGGAAGAC CTGCCAAGTC CCGTGCGCCC ACTGGACTAT ACGCCATCCT TCCCGCCGGC ATCGGATCTG GGCGGCGGGC ATCTCGTCTG GAACGTTTGA
|
Protein sequence | MTLPRLCPMT LPLLSVNDLS VQVATPTGPK VVVERLSFDL LPGKTLCLAG ESGSGKSMTA LSLMQLLPKP TARIADGSAI LDGDDILALP EARMRQVRGR KIGMIFQEPM TSLNPVMTVG AQLVGAITAH GSRGRRASHA RAVELLDQVQ IPEPERRLGQ YPHELSGGLR QRIVIAMALA QNPRILIADE PTTALDVTVQ AQILALIRKL QADHGISVIM ITHDMGVVAE MADEVLVMKH GRTVEHATTK DLFARPEDAY TKELLSAVPR LGEMAGSEIP KRAAGRAVAG SSSDAEQPML EVKNLTVRFD IKGGILQRPV KRLHAVEGIS FDVLKGETLS LVGESGCGKS TTGKALLNLL PWAGEIRVNG RSTRGLRGST MRPILRDVQM IFQDPYASLD PRMRVGDLVA EPLAIHGLAT GSALRDRVEY LLKRVGLSPE QMKRYPHEFS GGQRQRICIA RALSLSPKLI VADESVAALD VSIQAQVLDL LQDIQDETGI SYLFISHDMA VVEQISHRVA VMYMGRFVEM GTRRQIFENP QHPYTKKLMA AVPVADPTRP RRDFVPKAED LPSPVRPLDY TPSFPPASDL GGGHLVWNV
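The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how a GC percentage such as the GC content field could be recomputed from the gene sequence. The helper names are hypothetical, and the database's exact rounding and handling of ambiguous bases are not stated in the record, so this version simply counts G and C over all characters.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first two blocks of the gene sequence are used here, for illustration;
# pass the full Gene sequence field to compute the value for the whole gene.
example = clean_sequence("ATGACATTGC CAAGGCTTTG")
print(example, gc_percent(example))
```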
|
| |