Gene Smed_4688 details

Gene Information

Locus tag: Smed_4688
Symbol:
ID: 5319330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1204404
End bp: 1206005
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 63%
IMG OID: 640776486
Product: ABC transporter related
Protein accession: YP_001313418
Protein GI: 150376822
COG category: [R] General function prediction only
COG ID: [COG4172] ABC-type uncharacterized transport system, duplicated ATPase component
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
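
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record itself, though the listed values are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1204404
end_bp = 1206005
gene_length_bp = 1602
protein_length_aa = 533


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Here 1206005 - 1204404 + 1 = 1602 bp, and 1602 / 3 - 1 = 533 aa, so both checks agree with the record.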


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.678377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.275572
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGAGA TGCAAGACGC CATTCTCGCG GTCCGTGGTC TCAAGGTCGA TTTCTCGACG 
CCCGATGGGA CCGTCGAAGC GGTAAAGGGA ATCGATCTCG ACGTCCGTTC CGGCGAGACT
CTCGCCGTCG TCGGCGAATC CGGCTCGGGC AAAAGCCAAA CCATGATGGG TATCATGGGC
CTGCTCGCCA ACAACGGAAC GGTGACCGGC TCGGCGCTTT ACCGCGGCCA GGAGCTCGTG
GGTCTTCCGC CGAAGGCCCT GAACAGAGTG CGCGGCTCGA AGATCACGAT GATCTTCCAG
GAGCCGATGA CCTCGCTCGA TCCGCTCTAC ACGATCGGTC GCCAGATCGC CGAGCCGATC
GTCCATCACC GCGGCGGTAC GTTCAAGGGT GCGCGCAAGC GCGTTCTCGA ACTCCTGGAG
CTCGTCGGCA TTCCCGAAGC GCAGCGCCGC ATCGACAGCT ACCCGCATGA GCTCTCCGGC
GGTCAGCGTC AGCGCGTCAT GATCGCCATG GCGCTTGCCA ACGAGCCGGA TATCCTGATT
GCCGACGAAC CGACGACCGC CCTGGACGTG ACCATCCAGG CCCAGATCCT CGATTTGCTC
AAGGCGCTGC AAACGCGTTT CGGCATGGCC ATCGTGCTGA TCACGCACGA CCTTGGGATT
GTCAAACACT TCGCCGAGCG GGTCGCGGTC ATGCGCCGCG GCGAGGTTGT GGAGCAGGGC
ACGACGGCCG ACATCTTCGA GCGGCCGGAG GCGGACTACA CCAGAATGCT GCTGGAGGCA
GAGCCGAGCG GACACAAGGC TCCGCCCCCG GACAATGCAC CGATCATCCT CGAGGGGCGC
AATGTCGGGG TCGACTATAC GATCCCCGGC GGCCTCTTCC GGGGCGCCTC GACCGCCTTC
CGGGCCGTCG ACGGCGTAAG CCTGAGGCTC AGGCAAGGCC AGACGATCGG CATTGTCGGC
GAATCCGGTT CGGGTAAGTC GACGCTCGGA CGGGCGCTGC TGAGGCTTTT GCCGAGTAGC
GGTTACTATC GCTTTGGTTC CACGGATATT TCGGGATTTG ACCGCGGCGC GATGCGGCCA
CTGCGCCGCC AGCTGCAGCT CGTATTTCAG GATCCTTACG GATCGCTCTC GCCGCGCCGG
ACCGTCGGCG AGATCATTAC CGAAGGCCTT CATGTGCATG AGCCCGACTT GAGCCGCGCC
GACCGCGACC GGCGGGCCAT CGCCGCACTG AAGGAGGTCG GCCTCGATCC CGCCTCGCGC
AACCGCTATC CGCATGAATT CTCCGGCGGC CAGCGCCAGC GCATCGCAAT TGCCCGCGCG
ATTATCCTGA AACCCAAGGT CGTCATTCTC GATGAACCGA CCTCAGCCCT CGATCGATCG
GTACAGGGGC AGGTGATCGC ACTCCTGCGC GATCTGCAGG AGAAGCACGG TCTTTCCTAC
ATCTTCATCA GCCACGATCT GTCAGTCGTG AAGGCGATGT CCGACTATGT GATCGTGATG
AAAAACGGCC GCATCGTCGA AGAGGGGGAA ACTGACGCGA TTTTCAAGGC GCCGCGGGAG
CCCTACACGA AGACGCTGAT CGGCGCGGCA TTCAACGTGT GA
 
Protein sequence
MTEMQDAILA VRGLKVDFST PDGTVEAVKG IDLDVRSGET LAVVGESGSG KSQTMMGIMG 
LLANNGTVTG SALYRGQELV GLPPKALNRV RGSKITMIFQ EPMTSLDPLY TIGRQIAEPI
VHHRGGTFKG ARKRVLELLE LVGIPEAQRR IDSYPHELSG GQRQRVMIAM ALANEPDILI
ADEPTTALDV TIQAQILDLL KALQTRFGMA IVLITHDLGI VKHFAERVAV MRRGEVVEQG
TTADIFERPE ADYTRMLLEA EPSGHKAPPP DNAPIILEGR NVGVDYTIPG GLFRGASTAF
RAVDGVSLRL RQGQTIGIVG ESGSGKSTLG RALLRLLPSS GYYRFGSTDI SGFDRGAMRP
LRRQLQLVFQ DPYGSLSPRR TVGEIITEGL HVHEPDLSRA DRDRRAIAAL KEVGLDPASR
NRYPHEFSGG QRQRIAIARA IILKPKVVIL DEPTSALDRS VQGQVIALLR DLQEKHGLSY
IFISHDLSVV KAMSDYVIVM KNGRIVEEGE TDAIFKAPRE PYTKTLIGAA FNV
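
The gene and protein sequences above can be compared programmatically. The following is a minimal sketch, not the method used by the source database: it strips the 10-character grouping, computes the GC fraction, and translates codons until the first stop. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used for 10-character grouping and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two 10-base groups of the gene sequence listed above.
print(translate("ATGACAGAGA TGCAAGACGC"))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would confirm that the two listings agree; `gc_content` on the full gene sequence can likewise be compared with the 63% reported in the record.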