Gene Smed_2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2018 
Symbol 
ID5322877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2067873 
End bp2068961 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content61% 
IMG OID640790955 
ProductHflK protein 
Protein accessionYP_001327686 
Protein GI150397219 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0330] Membrane protease subunits, stomatin/prohibitin homologs 
TIGRFAM ID[TIGR01933] HflK protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTGGA GCAATCAGAA TGGCGGCGGC GGTGGCCCTT GGGGCGGCGG CGGCGGCAAT 
CAAGGGCCAT GGGGCCAGGG GCCCAACCGG CCACGCGGCG GAAGAGGTGG TCCGCCGGAT
CTGGAGGAGA TCATCCGGCG TGGCCAGGAT CAGCTGAAAA GCGTGGTTCC CGGGGGCTTC
AACGGCGGGA TCTTCGTCAT TGTCGGCCTG CTGGTTCTCG GTTTCATCCT GCTGAACTCC
ATCTATACCG TTCAGCCGGA CGAACGCGGC GTTGAGATGC GCTTCGGCAA GCCGAAGGAA
GAAATCTCCA TGCCGGGCTT GCACTATCAC TTCTGGCCGC TTGAAACGGT AGAGATCGTC
AAGGTGACCG AGCAGCAGCA GAATATCGGC GGTCGCACGG GCCAAACAAA TTCCGGCCTG
ATGCTGAGCG GCGATCAGAA CATCGTCAAC GTGCAGTTCT CGGTGCTGTT TTCGGTTACC
GATCCGAAGG CGTATCTCTT CAATGTCGAA AACCCGGCCG ACACGCTGCA GCAGGTGGCC
GAAAGCGCAA TGCGCGAGGT CGTCGGACGC CGTCCTGCGC AGGACATTTT CCGCGACAAC
CGTCAGGCGA TCGCCGCGGA TGTGAAGAAC ACGATCCAGG CGACCATGGA CAGCTATGGC
GCCGGCATAT CGGTGAATAC CGTGGCGATC GAGGATGCCG CCCCGCCACG CGAAGTGGCC
GATGCCTTCG ACGAAGTGCA GCGTGCCGAG CAGGACGAGG ATCGCTTCGT CGAAGAGGCC
AACCAGTATG CCAACCAGGT GCTCGGCAAG GCGCGCGGCC AAGGCGCGCA GATCCGGGAA
GAGGCGGCCG CATACAAGGA TCGGGTGGTC AAGGAGGCTC AGGGTGAGGC TCAGCGCTTC
ATCTCAGTCT ATGACGAGTA TTCCAAGGCT CCGGAAGTCA CGCGCAAGCG GCTCTATCTC
GAAACGATGC AAGGCGTTCT CGGCAAGTCC AAGAAGTTCA TCCTCGACGA GAAGAACGGC
CAGGGTGTGC TGCCCTATCT GCCGCTCAAT GAAATCGGCA GGCCCGTGCA GTCGGGAGGA
AACCAATGA
 
Protein sequence
MPWSNQNGGG GGPWGGGGGN QGPWGQGPNR PRGGRGGPPD LEEIIRRGQD QLKSVVPGGF 
NGGIFVIVGL LVLGFILLNS IYTVQPDERG VEMRFGKPKE EISMPGLHYH FWPLETVEIV
KVTEQQQNIG GRTGQTNSGL MLSGDQNIVN VQFSVLFSVT DPKAYLFNVE NPADTLQQVA
ESAMREVVGR RPAQDIFRDN RQAIAADVKN TIQATMDSYG AGISVNTVAI EDAAPPREVA
DAFDEVQRAE QDEDRFVEEA NQYANQVLGK ARGQGAQIRE EAAAYKDRVV KEAQGEAQRF
ISVYDEYSKA PEVTRKRLYL ETMQGVLGKS KKFILDEKNG QGVLPYLPLN EIGRPVQSGG
NQ