Gene Smed_3988 details

Gene Information

Locus tag: Smed_3988
Symbol:
ID: 5317914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 439834
End bp: 441477
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 64%
IMG OID: 640775796
Product: ABC transporter related
Protein accession: YP_001312729
Protein GI: 150376133
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
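
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(439834, 441477)
print(length)                     # 1644
print(protein_length_aa(length))  # 547
```

Both results match the Gene Length and Protein Length fields of this record.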


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.704503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAC ATCTGCTCGA GGTCCGTAAT CTCTCGGTCG AATTTCACAC CGCGGCCGGC 
GTGGTGCACG CCGTAAAATC GATCTCCTAT CACCTCGACA AGGGAGAGAC GCTGGCCATC
CTCGGCGAAA GCGGCTCCGG CAAGTCGGTA TCTTCCTCGG CGATCATGAA CCTCATCGAC
ATGCCGCCGG GGCGCATCAG CGGCGGCGAA ATCCTGCTCG ACGGCAGGGA CCTGCTGACG
ATGCCAGCCG AGGAGCGGCG CGAGGTCAAC GGCCGCCGGG TGGCCATGAT CTTCCAGGAT
CCGCTCAGCC ACCTCAATCC AGTCTACAGC GTCGGCTGGC AGATCGCGGA AGCCATGACC
ACCCATGGGC TCGCCGGCAG CAAGGCACGC GAGGAAGGAC TGCGGCTCCT TCGCCGCGTC
GGAATTCCCG AGCCCGAGCG GGCAATGCGC AAATATCCGC ACGAGTTCTC GGGTGGCCAG
CGCCAGCGCG TCATGATCGC AATGGCGTTG GCGCTGCGGC CGGATCTCTT GATAGCAGAC
GAACCGACCA CCGCACTCGA CGTGACCGTG CAGGCAGAAG TTCTAAAGCT TCTGAAGGAG
CTGCAGCGCG AAACAGGTAT GGCGGTGCTG ATCATCACGC ACGATCTCGG CGTCGTGGCG
GAAATCGCCG ACCGGGTGGT CGTGATGGAG AAAGGCACCC TCGTCGAGGC GGGGACCGTC
CGCGAGATCT ACAAGAACCC GCAGCATCCC TATACGCGCA AGCTCATCGC CGCCGCCCCC
GGAAAGGGCG TGATGCATGA GCCCGGCGCC CGCGCCGAGC CTCTGCTCAG CGTGCGCGAC
GTGCGCAAGA CCTACGGCTC GTTCGAAGCA CTGAAGGGCA TCTCCTTCGA CCTGATGCCC
GGCGAGACAA TGGCGGTGGT CGGGGAAAGC GGATCCGGTA AATCGACCCT GGCCCGTGCA
CTGCTTCGTC TCGACGAGCC GGACAGCGGC ACTGCCTTGT GGAAGGGACG CGACCTTTTC
GCGTTATCGC CTTCAGAGCT CTACAAGCTC CGGCGCGACC TCCAGATGGT GTTTCAGGAC
CCGACGCAGT CGCTCAATCC GCGCATGACG GTGTTTCAGC TGATCTCCGA AGCCTGGGTT
ATTCATCCGG ACATTCTGCC CAAGGCAAGA TGGCGCGAGC GCGTCGCGGA GCTCCTCGCG
CAGGTCGGTC TTTCGGCCGA ACACATGAGC CGCTATCCGC ACCAGTTTTC CGGCGGCCAG
CGCCAGCGCA TCGCCATCGC CCGGGCGCTT GCACTCGAGC CGCAGTTGAT CATCTGCGAT
GAAGCCGTCT CAGCGCTGGA CGTCTCCGTG CAGGCGCAGG TGATCGAGCT TCTCGACAAG
CTGCGGCGCG AAATGGGCAT CGCCTTCATC TTCATCGCCC ATGACCTTCC CGTCGTTCGT
GATTTCGCCG ATCACGTCAT GGTCATGCAG CAGGGAGAAA TCGTCGAGCT CGGCACCATC
CGCGAGGTCT TCGAAACGCC GCGCCAGGCC TATACGCGTG CGCTCCTGGC GGCCAGTCTC
AGTCCCGACC CCGACGCCAA AGCCGGTTGC CCTGAACCGC CGGCCGCCGA CGCCGAAATT
CTCATTCCGA AGCGGAGCCA CTGA
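
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of this record. The helper names are hypothetical, and only the first line of the sequence is used as sample input; the whole sequence would be passed in the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGGCTGAAC ATCTGCTCGA GGTCCGTAAT CTCTCGGTCG AATTTCACAC CGCGGCCGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```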
 
Protein sequence
MAEHLLEVRN LSVEFHTAAG VVHAVKSISY HLDKGETLAI LGESGSGKSV SSSAIMNLID 
MPPGRISGGE ILLDGRDLLT MPAEERREVN GRRVAMIFQD PLSHLNPVYS VGWQIAEAMT
THGLAGSKAR EEGLRLLRRV GIPEPERAMR KYPHEFSGGQ RQRVMIAMAL ALRPDLLIAD
EPTTALDVTV QAEVLKLLKE LQRETGMAVL IITHDLGVVA EIADRVVVME KGTLVEAGTV
REIYKNPQHP YTRKLIAAAP GKGVMHEPGA RAEPLLSVRD VRKTYGSFEA LKGISFDLMP
GETMAVVGES GSGKSTLARA LLRLDEPDSG TALWKGRDLF ALSPSELYKL RRDLQMVFQD
PTQSLNPRMT VFQLISEAWV IHPDILPKAR WRERVAELLA QVGLSAEHMS RYPHQFSGGQ
RQRIAIARAL ALEPQLIICD EAVSALDVSV QAQVIELLDK LRREMGIAFI FIAHDLPVVR
DFADHVMVMQ QGEIVELGTI REVFETPRQA YTRALLAASL SPDPDAKAGC PEPPAADAEI
LIPKRSH
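
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard genetic code and translates the first 60 bases of the gene. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, which does not matter here because the gene begins with ATG. The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCTGAAC ATCTGCTCGA GGTCCGTAAT CTCTCGGTCG AATTTCACAC CGCGGCCGGC"
print(translate(first_line))  # MAEHLLEVRNLSVEFHTAAG
```

The output matches the first 20 residues of the protein sequence listed above.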