Gene Smed_5012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5012 
Symbol 
ID5318751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1533461 
End bp1534639 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content54% 
IMG OID640776794 
Producthypothetical protein 
Protein accessionYP_001313726 
Protein GI150377130 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.563946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0503052 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGACA AAAGGTACTC GGACGCCCGC ACGATTTGCC GGGAGCATAT ATCGACAGGC 
GATCCATCCG GCTTTTGGCG GCATTACGAG AGCTTCGTTG ACGAGCTCGC AAGCTCCTGC
CGACAGCAAG AAGTGTCTGG CTCGAGAATC AAGATTGCTC TGTTCAACGA TACAGACTTC
AGGATCAATA TCGGATGCAG GCTGACCAGC CAGGGTCTCA AGCAACAAAT TCTAGATGCG
TTCCCCGCGG CTGAGATTAC GTCCATCGGT TTCAACTTCG CAGCCTTCAG AAAGGAGTTC
CCGAACTCCA CATCCGCTGG AGGATATGAA CTCTCGGACA TTGAGACCCG ACTTTCCACT
GCATATGGCG AAGACGCTGT TGACCATATA ACGGCCGCCG ATTTCGTGAT CCTTCAGCCA
GAGGGATCGT TGGACCACAG GACAACGGCA GAAGGGCTTG CAACCTTCTT CACTCCTATT
CTTACCGCCA GGAAGCTAGG GAAGCCATTT GCTGTATTGA ACGGAACGAT ACCAATCTAC
GAAGGCGAAC GATCGGACTA TCTCAAAGGA CTCTTTCGCG AACTCGGCCA TGTGGCCGCA
CGCGACGAAA TCTCGGCGGA GTATTACGGG ATCGAATTTC TGGCGGATGC TGCATTTCTT
CGGATATCGC CGGCGCCCGT CGCGGATCGC GATGGTTGCC TGATAACCAC GGGCGCCAGA
AACAATGCCG AAGAAGACGT CGAAATTCTA AAAGCTGCAC TGAAGATTTG CGAGGCGTGG
AAGCTTCGGC CTGTTGTTCT TACGCATGCA GTTGAACGAT TCTCTCCATA TGAGGCAGAG
ATCATCGACC GTGGCGGCAT CTTTGCTGAG ACAGCAAGCA TTGAACGTGC TGCCGAGACA
ATTTCAACTT GCCGACTTCA CATTGGTGGC CGATATCATA TGGCGATCTT CAGTCTCCTC
TGTAACGTTC CTTCTCTCCT GTTCGATGTT AAGACCCACA AGAATCAATG GCTTGAACGA
TACTCTCCTC TGATAACGCT TGTGCATCCG CACACGGACC TCGATGCCGC GGCAGCCGCG
GTGCTGAGCG GTGGCGTATC GCAGGGACAT CCGGCATCAA CGGGCGCGGA GAAATACGGT
CTTTTCCTGA AACGCGCTAT GGCTGAACAG CCGCTATAG
 
Protein sequence
MADKRYSDAR TICREHISTG DPSGFWRHYE SFVDELASSC RQQEVSGSRI KIALFNDTDF 
RINIGCRLTS QGLKQQILDA FPAAEITSIG FNFAAFRKEF PNSTSAGGYE LSDIETRLST
AYGEDAVDHI TAADFVILQP EGSLDHRTTA EGLATFFTPI LTARKLGKPF AVLNGTIPIY
EGERSDYLKG LFRELGHVAA RDEISAEYYG IEFLADAAFL RISPAPVADR DGCLITTGAR
NNAEEDVEIL KAALKICEAW KLRPVVLTHA VERFSPYEAE IIDRGGIFAE TASIERAAET
ISTCRLHIGG RYHMAIFSLL CNVPSLLFDV KTHKNQWLER YSPLITLVHP HTDLDAAAAA
VLSGGVSQGH PASTGAEKYG LFLKRAMAEQ PL