Gene Smed_1749 details


Gene Information

Locus tag: Smed_1749
Symbol:
ID: 5322607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1828694
End bp: 1830493
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 61%
IMG OID: 640790687
Product: ABC transporter related
Protein accession: YP_001327419
Protein GI: 150396952
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
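
The Start bp, End bp, Gene Length and Protein Length fields are consistent with one another. As a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon, the arithmetic can be checked as follows (the variable names are illustrative and not part of the record):

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 1828694
end_bp = 1830493

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not encode an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1800 599
```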


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.84058
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.153665
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATTGC CAAGGCTTTG CCCGATGACT CTCCCTCTGC TTTCCGTGAA CGACCTGAGC 
GTCCAGGTGG CCACGCCGAC CGGGCCGAAG GTCGTCGTGG AGCGGCTGTC CTTCGATCTG
CTTCCAGGCA AGACGCTTTG CCTTGCAGGT GAGAGCGGAT CTGGAAAATC GATGACGGCC
CTGTCGCTGA TGCAACTGCT GCCGAAGCCG ACGGCGCGAA TCGCGGACGG CTCGGCCATT
CTCGATGGTG ACGACATACT CGCTCTTCCC GAAGCACGCA TGCGGCAGGT TCGCGGGCGC
AAGATCGGGA TGATCTTTCA GGAGCCCATG ACCTCGCTCA ACCCCGTGAT GACCGTCGGT
GCGCAACTGG TCGGAGCGAT CACCGCCCAC GGCAGCCGAG GCCGGCGCGC TTCGCATGCG
CGCGCCGTCG AACTCCTGGA CCAGGTCCAG ATTCCGGAAC CGGAGCGCCG GCTCGGCCAG
TATCCGCACG AGCTTTCCGG TGGCTTGCGT CAGCGTATCG TCATCGCGAT GGCGCTCGCG
CAGAACCCGC GCATTCTGAT CGCGGATGAA CCGACGACTG CGCTCGACGT AACGGTTCAG
GCCCAGATCC TGGCGCTTAT CCGCAAGCTG CAGGCCGATC ACGGAATTTC GGTGATCATG
ATCACCCATG ACATGGGCGT GGTCGCGGAG ATGGCCGACG AGGTGCTGGT GATGAAGCAT
GGCCGAACTG TCGAGCATGC AACCACCAAG GATCTTTTCG CGCGTCCTGA GGATGCGTAT
ACGAAGGAGC TTCTTTCGGC CGTGCCGCGT CTCGGCGAAA TGGCAGGGTC GGAGATACCC
AAGCGCGCTG CGGGAAGGGC GGTCGCCGGA TCTTCGAGCG ACGCCGAGCA GCCGATGCTC
GAGGTCAAAA ACCTGACGGT CCGTTTCGAC ATCAAGGGCG GCATTCTACA GCGCCCGGTC
AAGCGCCTTC ACGCGGTGGA GGGCATCTCA TTCGATGTGC TGAAGGGGGA GACGCTGTCT
CTTGTCGGAG AATCCGGCTG TGGCAAGTCG ACGACCGGAA AGGCTCTGTT GAACCTTTTG
CCATGGGCGG GGGAGATACG CGTCAACGGA CGCTCCACCC GCGGCTTACG GGGGAGCACG
ATGCGGCCCA TCCTTCGAGA CGTCCAGATG ATCTTCCAGG ATCCTTACGC ATCCCTTGAT
CCGCGCATGC GGGTTGGCGA TCTGGTGGCG GAGCCTCTGG CAATCCATGG GCTGGCAACC
GGCAGCGCAT TGCGTGACCG GGTGGAATAT CTCTTAAAGC GGGTTGGATT ATCGCCTGAG
CAGATGAAGC GTTACCCGCA TGAGTTTTCC GGCGGGCAAC GGCAGCGCAT CTGCATTGCT
CGGGCTTTGT CGCTGTCACC GAAGCTCATC GTCGCAGATG AATCCGTTGC GGCACTCGAC
GTGTCGATCC AGGCTCAGGT GCTCGATCTC CTGCAGGACA TCCAGGACGA GACAGGCATC
TCCTACCTGT TCATTTCTCA CGACATGGCG GTCGTCGAGC AGATCAGCCA TCGGGTAGCG
GTCATGTATA TGGGCCGGTT CGTCGAAATG GGGACGCGGC GACAGATATT CGAGAATCCG
CAGCATCCCT ATACGAAGAA ACTGATGGCT GCGGTCCCCG TGGCCGATCC GACGAGACCA
CGTCGGGATT TCGTACCAAA AGCGGAAGAC CTGCCAAGTC CCGTGCGCCC ACTGGACTAT
ACGCCATCCT TCCCGCCGGC ATCGGATCTG GGCGGCGGGC ATCTCGTCTG GAACGTTTGA
 
Protein sequence
MTLPRLCPMT LPLLSVNDLS VQVATPTGPK VVVERLSFDL LPGKTLCLAG ESGSGKSMTA 
LSLMQLLPKP TARIADGSAI LDGDDILALP EARMRQVRGR KIGMIFQEPM TSLNPVMTVG
AQLVGAITAH GSRGRRASHA RAVELLDQVQ IPEPERRLGQ YPHELSGGLR QRIVIAMALA
QNPRILIADE PTTALDVTVQ AQILALIRKL QADHGISVIM ITHDMGVVAE MADEVLVMKH
GRTVEHATTK DLFARPEDAY TKELLSAVPR LGEMAGSEIP KRAAGRAVAG SSSDAEQPML
EVKNLTVRFD IKGGILQRPV KRLHAVEGIS FDVLKGETLS LVGESGCGKS TTGKALLNLL
PWAGEIRVNG RSTRGLRGST MRPILRDVQM IFQDPYASLD PRMRVGDLVA EPLAIHGLAT
GSALRDRVEY LLKRVGLSPE QMKRYPHEFS GGQRQRICIA RALSLSPKLI VADESVAALD
VSIQAQVLDL LQDIQDETGI SYLFISHDMA VVEQISHRVA VMYMGRFVEM GTRRQIFENP
QHPYTKKLMA AVPVADPTRP RRDFVPKAED LPSPVRPLDY TPSFPPASDL GGGHLVWNV
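
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a hypothetical helper, not a tool used by the database. It builds the standard codon assignments, which translation table 11 shares for elongation codons, and translates the first and last 30 bases of the gene sequence. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
# Standard codon assignments, enumerated in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 30 bases of the gene sequence, copied from the record.
first_bases = "ATGACATTGC CAAGGCTTTG CCCGATGACT"
last_bases = "GGCGGCGGGC ATCTCGTCTG GAACGTTTGA"

print(translate(first_bases))  # MTLPRLCPMT
print(translate(last_bases))   # GGGHLVWNV
```

Both results match the start and end of the protein sequence listed above. Passing the full 1800 bp gene sequence to the same function gives a way to check the complete 599 aa product.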