Gene Smed_5591 details


Gene Information

Locus tag: Smed_5591
Symbol:
ID: 5319893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 557089
End bp: 558744
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 63%
IMG OID: 640777336
Product: ABC transporter related
Protein accession: YP_001314268
Protein GI: 150377673
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
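
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 557089
end_bp = 558744
gene_length_bp = 1656
protein_length_aa = 551

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the last codon is the
# stop codon, which does not contribute a residue to the protein.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under those assumptions the computed span matches the listed gene length, and the codon count minus one matches the listed protein length.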


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.395056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAACC TTGTAGAAAT CCGTGACTTG AAGGTCGAAG CCACCACCGA TTCCGGCCGC 
AAGGTCGAGA TCATCAAGGG CGTCAGCCTC GATGTCGCCG AGGGCGAGAT CGTCGCTCTG
ATCGGCGAAA GCGGCTCGGG CAAGACGACG ATCGCCCTGA CGCTGATGGG ATATGCCCGC
CCGGGCTGCC GCATCTCCGG CGGCAGCGTC TCGGTCGCCG GCAATGACTT GGTGACGCTG
ACGGAAAAGC AGCGAGCCAA GGTGCGCGGC ACCGAGGTCA CCTATGTTCC GCAATCCGCT
GCAGCCGCTT TCAACCCGGC CGCGACGATC ATGGACCAGG TGATCGAAAT CACCCGCATC
CACGGACTCA TGCCGGTCGT CGAGGCGCGC GCACGCGCCG TCGAGCTTTT CCGGGCGTTG
TCGCTACCCG AGCCCGAGAC GATCGGCAGC CGCTATCCGC ACCAGGTTTC CGGCGGACAA
CTGCAGCGTT TATCGGCCGC CATGGCCCTC ATCGGCGACC CGAAGCTCGT CATCTTCGAC
GAGCCGACAA CGGCGCTCGA CGTAACGACC CAAATCGAGG TGCTACGCGC CTTCAAGTCC
GTGATGAAGA AGGGCGGCAT CGCCGGCGTC TACGTCTCGC ATGACCTCGC GGTCGTCGCT
CAGATCGCCG ACCATATTGT CGTGTTGAAG GCCGGCGAGG TACAGGAGGC CGGCACCACC
GAGGAAATCC TTTCGAACGC CAAGCACCCC TATACGCGCG AACTCCTCTC GGCCTTTGAA
CCAAAGCCGC GGGAGGCTGC CGACGCAGCC GAGAGCGCAC CGGCTCCCCT GCTAAAGATC
GAAAATCTGG TCGCGGGATA CGGTGCGTCC AAGAGCGATG GCTTGCCGCT CGTGCGCGCC
GTTGAAGATG TGAGCCTCAA GGTGGAAAAG GGTCGCAACC TCGGCATCAT CGGCGAGTCG
GGTTGCGGCA AGTCGACGCT CGCCCGCGCC ATTGCCGGCA TATTGCCGGC CGCCGTCGGC
AAGATCGTCT TCGATGGCAA GGAACTCGGC CGCAGCGCCC GCGAGCGCAC GCGCGACCAG
CTGCGCGAAA TGCAGATCGT CTTTCAATAT GCCGACACCG CCCTCAACCC GGCGAAATCC
GTCGAGGACA TTCTCGACCG ACCGCTCGTC TTCTATCATG GGATGAATGC AAGGGCCCGC
AGCCTCCGCA TCGACGAGCT GCTCGACATG GTGCGCCTGC CTCGCAACCT GCGTCATCGC
CGGCCGGGCG AGCTTTCCGG CGGCCAAAAA CAGCGCGTCA ACTTCGCGCG CGCGCTCGCC
GCCGATCCAA AGCTGATCCT TTGCGATGAG ATCACTTCGG CACTCGACAC GGTCGTCGCC
GCCGCGGTCA TCGAGCTGCT GAAGGAATTG CAGCGCGAGC TCGGTCTATC TTACATCTTC
ATCAGCCACG ACCTCTCAGT GGTGGAAGCG ATCTGCGACG AGATTGTCGT GATGTATGGC
GGCAGAAAGG TTGAGGACAT CACTCCGGCC AGGATCAATG CGCCGCACCA CCCCTATTCG
CAGCTGCTCT TCTCCTCGGT GCCAAAGCTC GATCCCGCCT GGCTCGATGG CCTCGAGCAG
GATCCGGAAT TGGTCAGGGC CTACTGCCGG AGGTAG
 
Protein sequence
MVNLVEIRDL KVEATTDSGR KVEIIKGVSL DVAEGEIVAL IGESGSGKTT IALTLMGYAR 
PGCRISGGSV SVAGNDLVTL TEKQRAKVRG TEVTYVPQSA AAAFNPAATI MDQVIEITRI
HGLMPVVEAR ARAVELFRAL SLPEPETIGS RYPHQVSGGQ LQRLSAAMAL IGDPKLVIFD
EPTTALDVTT QIEVLRAFKS VMKKGGIAGV YVSHDLAVVA QIADHIVVLK AGEVQEAGTT
EEILSNAKHP YTRELLSAFE PKPREAADAA ESAPAPLLKI ENLVAGYGAS KSDGLPLVRA
VEDVSLKVEK GRNLGIIGES GCGKSTLARA IAGILPAAVG KIVFDGKELG RSARERTRDQ
LREMQIVFQY ADTALNPAKS VEDILDRPLV FYHGMNARAR SLRIDELLDM VRLPRNLRHR
RPGELSGGQK QRVNFARALA ADPKLILCDE ITSALDTVVA AAVIELLKEL QRELGLSYIF
ISHDLSVVEA ICDEIVVMYG GRKVEDITPA RINAPHHPYS QLLFSSVPKL DPAWLDGLEQ
DPELVRAYCR R
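
The relationship between the two sequences can be spot-checked by translating the first and last lines of the gene sequence. This is a minimal sketch, not the database's own method: it builds the codon table by hand using the standard amino-acid assignments, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as start codons, and this gene starts with ATG). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: used here as a stand-in for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First and last lines of the gene sequence, copied from above.
first_line = "ATGGTGAACC TTGTAGAAAT CCGTGACTTG AAGGTCGAAG CCACCACCGA TTCCGGCCGC"
last_line = "GATCCGGAAT TGGTCAGGGC CTACTGCCGG AGGTAG"

print(translate(first_line))  # MVNLVEIRDLKVEATTDSGR
print(translate(last_line))   # DPELVRAYCRR*
```

The first result matches the first 20 residues of the protein sequence, and the second matches its last 11 residues followed by the TAG stop codon.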