Gene Smed_5077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5077 
Symbol 
ID5319379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp23806 
End bp25455 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content63% 
IMG OID640776857 
ProductABC transporter related 
Protein accessionYP_001313789 
Protein GI150377194 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.704503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCGT CGGACAAGCC AATACTGCGT ATCGACGGCC TGACGGTGGA CTTCCTGTCC 
GAAGGTGATC CGGTGCGCGC CGTGGACAAT GTCTCGTTCG ATGTCTGCCC GGGCGAGACG
CTCGTCATCC TCGGCGAGAG CGGATCGGGC AAGAGCGTCA GCACCGGTAC GGTGATGGGG
CTGATCGATT GCCCCCCGGG CGACATCGTT TCGGGCACCC TGGTATTCGA CGGGACCGAT
CTTTCCCGCC TCGACAACGA AGGCAGGCGT GAACTGAACG GTCGCCGTAT CGCCATGATC
TTCCAGGACC CGCTCGCCTA TCTCAATCCG GTCTATACTG TCGGCCGGCA GATCGCCGAG
GTTTTCGAAA GCCATGGCGC AGGCGAGGGC GGGGCGATGC GCGGAAGAGT CGTGCGCCTG
CTGGAACGGG TTGGGATCCC GGAAGCGGAA ACGCGGGTCG ATTACTATCC GCACCAGTTC
TCCGGCGGAC AGAGGCAACG CGTGATGATC GCGATGGCGA TTGCGCTCGA ACCGGACATT
CTGATCGCCG ACGAGCCGAC CACCGCGCTC GACGTCAGTG TCCAGGCGCA GATCCTCGAC
CTTCTGCGGG ACCTGCAGCG CGAAACCGGA ATGGCGCTGA TCATGATCAC CCACGATCTG
GAGGTCGCCG CGGCCATGGC GGACCGGATC ATCGTGATGA ATGCCGGCAA GGTGGTGGAG
AGCGGCAGGG CCGAGGATGT CTTCACCAAT CCGCGCCACA GCTATACCCG CCGGCTGATG
TCGGCGGTGC CTCATGGCGA CCCAAAGAAG CGAAGCCGGC CTGTCGAACA GGAGGTCCTG
CTGCAGGTCG CCCATCTGAG CAAGCACTAT AAGCTCGGCT CCGGCCCGTT TGCGCCCAAA
CGCGAGTTCA AGGCAGTGGA CGATGTGAGC TTTACGCTTC GTCGCGGCGA AACGGTCGGC
ATCGTCGGAG AGTCCGGTTC GGGTAAATCC AGCATTGCGC GCATGCTGCT GAGGCTCAAC
GAGCCGACAT CGGGCTCGGC GCTCTTTGCC GGCGAGGACA TCTTCAAGCT CGAGGGCAGG
GCGCTCAACG GATTTCGCCG GAAAGTGCAG ATGGTGTTTC AGGATCCGTT CGGCTCGATG
AACCCGCGCA TGAACGTCCG TTCGATCATT TCAGAACCCT GGGCGATCCA CCGGGATATC
CTGCCGCGCC AACGCTGGAA CGAACGGGTC GTTGAACTGC TGGAGCTTGT CGGCCTGAAG
CCGGAGCATG CGGAGCGCTA TCCGCATCAA TTTTCGGGCG GGCAGCGGCA ACGCATCGCC
ATTGCCCGGG CGCTCGCCAG CGAACCCGAG CTCATCGTCT GCGACGAAGC GGTCTCGGCG
CTCGACGTGT CGATCCAGAT GCAGGTCATC GAACTCCTGG CCGATCTCCG CCAGCGCCTC
GGCCTCTCCT ACATCTTCAT CACCCATGAT CTGCCCATCG TGCGTCAATT CGCAGACCGG
ATCCTGGTGA TGCAACGAGG CAAGATCGTC GAGGAGGGTG AGACGGAAGC TCTTTTCGTC
TCGCCTCGGC ACGAATACAC GCGAGCCCTG CTGAACGCCG TCCCCCAACC GAAATGGCTG
CAGCGCGATC CGACCCCGCT CGCGGGGTAG
 
Protein sequence
MTASDKPILR IDGLTVDFLS EGDPVRAVDN VSFDVCPGET LVILGESGSG KSVSTGTVMG 
LIDCPPGDIV SGTLVFDGTD LSRLDNEGRR ELNGRRIAMI FQDPLAYLNP VYTVGRQIAE
VFESHGAGEG GAMRGRVVRL LERVGIPEAE TRVDYYPHQF SGGQRQRVMI AMAIALEPDI
LIADEPTTAL DVSVQAQILD LLRDLQRETG MALIMITHDL EVAAAMADRI IVMNAGKVVE
SGRAEDVFTN PRHSYTRRLM SAVPHGDPKK RSRPVEQEVL LQVAHLSKHY KLGSGPFAPK
REFKAVDDVS FTLRRGETVG IVGESGSGKS SIARMLLRLN EPTSGSALFA GEDIFKLEGR
ALNGFRRKVQ MVFQDPFGSM NPRMNVRSII SEPWAIHRDI LPRQRWNERV VELLELVGLK
PEHAERYPHQ FSGGQRQRIA IARALASEPE LIVCDEAVSA LDVSIQMQVI ELLADLRQRL
GLSYIFITHD LPIVRQFADR ILVMQRGKIV EEGETEALFV SPRHEYTRAL LNAVPQPKWL
QRDPTPLAG