Gene Smed_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1839 
Symbol 
ID5322697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1917125 
End bp1918558 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content64% 
IMG OID640790777 
Producthypothetical protein 
Protein accessionYP_001327509 
Protein GI150397042 
COG category[R] General function prediction only 
COG ID[COG5565] Bacteriophage terminase large (ATPase) subunit and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.244964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.210765 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAC CCCACCCATC CGAAAGAACC GTTGCGTTCG GCCTCTCCGC GATGCTCAGG 
GAACAGATGT CGCTGATGGC GGAACTCCAC CGGCGGCAAC GAACGAATAT CCTTTCCAGC
TATCAGCCCT ATGCCAAGCA GCGGGAGTTC CATGCCGCCG GCGCAACCTT TCGCGAGAGG
CTGTTCATGG CGGGCAACCA GCTCGGCAAG ACGCTTGCGG GCGCGGCGGA GGCAGCCATG
CATCTGACCG GCCGCTATCC CGAATGGTGG CAGGGAAGAC GGTTCGACCG GCCCGTCGCA
ATGCTCGCGG GCTCGGAGTC CTATGAGCTG ACACGCGACG GGGTGCAGCG GTTGCTGATA
GGCCCGCCTC TGAATGAAGA TGAATGGGGC ACCGGATTCG TGCCCAAGGC AACGATCCAG
GCGACGACGC GCCGCTCCGG CGCTTCCGGG GCTCTCGACA GCGTAACGGT GCGGCACGTT
GCAGGCGGAG CCTCGACGCT GCTTTTCAAA GCTTACGAGC AGGGACGGGC CAAGTGGCAG
GCCAACACGG TGGACTATGT CTGGTTCGAC GAGGAGCCGC CTGAGGACGT CTATTTCGAA
GGGATCACCC GCACCAATGC GACCCGCGGT TCCATCGCCG TGACCTTCAC GCCGCTCAAG
GGCCTGAGCG CCGTGGTGGC CAGATACCTG ATGGAAAAGT CGGCGGACCG CGAGGTCACC
ACCATGACGA TCGAGGATGC GGAACATTAT ACGCCCGAGG AGCGCCGGCG GATCATCGAC
AGCTATCCCG CCCATGAGCG CGAGGCGCGC ACCAAGGGCG TGCCGGCTCT CGGCTCCGGA
CGGATCTTTC CCGTAACCGA GGAGAGCATT CGTGCCGATC CGTTCGATAT ACCGAAGCAC
TGGGTCCAGA TCGGCGGACT CGACTTCGGC TGGGACCATC CTTTCGCGGC TGTCGGCTGC
GCCTGGGACC GGGATGCTGA TGTCTTCTAT GTGACCAAGC TCTATCGCGA GCGGGAATCG
ACGCCGATCA TCCACGCGGC AGCCCTCAAA CCCTGGGGCG GAACCTTGCC CTGGGCGTGG
CCCCATGACG GGTTGCAGCA TGACAAGGGC AGCGGCGAGC AACTGGCGGC CCAGTACCGG
GCACAGGGGC TGGCGCTTCT TCCCGAAAGG GCGACCTTCG ACGACGGCAC GAACGGCGTC
GAAGCCGGGC TTTCCGACAT GCTGCAGCGG ATGCAGACCG GGCGCTGGAA GGTGTTTTCC
ACCTGCACGG AATGGTTCGA GGAATTCCGC CTGTATCACC GCAAGGACGG CAGGATCGTC
AAGGAGCGCG ACGACCTCCT CGCCGCCTCG CGCTACGCGC TGATGATGAA GCGCCATGCA
CGGGCAATCG GCGGCAACGC AAACTGGAAA TTCACCGCCC GAAAGGTTCT CTGA
 
Protein sequence
MSAPHPSERT VAFGLSAMLR EQMSLMAELH RRQRTNILSS YQPYAKQREF HAAGATFRER 
LFMAGNQLGK TLAGAAEAAM HLTGRYPEWW QGRRFDRPVA MLAGSESYEL TRDGVQRLLI
GPPLNEDEWG TGFVPKATIQ ATTRRSGASG ALDSVTVRHV AGGASTLLFK AYEQGRAKWQ
ANTVDYVWFD EEPPEDVYFE GITRTNATRG SIAVTFTPLK GLSAVVARYL MEKSADREVT
TMTIEDAEHY TPEERRRIID SYPAHEREAR TKGVPALGSG RIFPVTEESI RADPFDIPKH
WVQIGGLDFG WDHPFAAVGC AWDRDADVFY VTKLYRERES TPIIHAAALK PWGGTLPWAW
PHDGLQHDKG SGEQLAAQYR AQGLALLPER ATFDDGTNGV EAGLSDMLQR MQTGRWKVFS
TCTEWFEEFR LYHRKDGRIV KERDDLLAAS RYALMMKRHA RAIGGNANWK FTARKVL