Gene Smed_0116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0116 
Symbol 
ID5320945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp129033 
End bp130190 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content66% 
IMG OID640789049 
Producthypothetical protein 
Protein accessionYP_001325811 
Protein GI150395344 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.488565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGAC TTGAGACAGC GATCAGAAAT GCGCTGGAAA GATCGGATCG AAGCAGTGCG 
GAGGTCCGGG CGCGCATCTA TCAGTCGGCG CGTCAGGCGC TTGAGAACGG CCTGCAAAAG
CAGCAGATCG AAGACCCGGA GGTCATCTCG GTGCAGCGCC ACCGGCTGGA AGCGGTCATT
CGGGCGATCG AGATGGAAGA GCGCGCTGCC TTGAAGGAGC GCGCACAGAC GCCGGTGGTC
AATCTCGACG AGGTGACGGC GAGGGGACAT GCGGTCGAGC GCGGCCCCGA AGCGCCAACC
CGAAGGCCCG AGTTGGAAAC GAAGCCGGAG GAACGTAGTC CCGCGCAAAC GGACGGGGGG
CTTGGGGCCT TGCGTCCCGA ACGCGACGGT CCGCTGGCTG CGACGAGAGC GGAGGGGAGC
GACGGTCGAT CGGAAGCCAC CGGAAGCACC GTCCCGCCCG CACCCGATCC AGGCCCTGCC
GACCGGCGGC CGCGCAAGCA CAGGCGCAGG CGCAGCCGTT TCTTCTCCTA TGCGATGATC
GTCGCGACGC TTGCTGCAGC GGCCGGGGTC GCCGTCTGGT GGATCCAGAC GAACGATCTG
CTGCGGTCGC CGACGGACAC CGGCGTTGCC AATCCGCCCG CGACGGTGGA TGCAGAGGAT
TTCGACGGCG CGGCCGGCTT GCAGACCCTG GGTGCCCAGG AAGGCTTTTC CGGTGATTGG
GTAGAGGTCT TTGCTCCCGG TGAGGCTGCA GCGGTCAAGC CGGGCCCGCG GGCGAGTGCG
GAACCCTTCG ACGGCGACGC CGGCGAGCGC CTGCGTTTGA TCTCGCAAGC GGCATCGAAG
GATGGCGACG TGGAAATCGA GATACCCGCC GATGTTCTTG CCCAGCTTTC GGGCAAATCG
TCGACATTCG CCCTGACGGT GCAGGCTGCC CCGGGCAAAG CGACCGAATT CTCCGTCGAA
TGCGATTTAG GGGCGCTCGG CGGCTGCGGC CGTCATCGTT TCACCGTACA CGACGAGCGG
ATCGATATGT TGTTCAAGAT CAATTTCGAT CGCGGTGCCG CACCGAGCGG CCCTGGAAAA
CTGGTGATCA ACAGCGACGT CGGCGGCGGC GGCAACAGCC TCGATCTCTT CGCGATCCGC
GTGCAGCCGG GCGGCTGA
 
Protein sequence
MSGLETAIRN ALERSDRSSA EVRARIYQSA RQALENGLQK QQIEDPEVIS VQRHRLEAVI 
RAIEMEERAA LKERAQTPVV NLDEVTARGH AVERGPEAPT RRPELETKPE ERSPAQTDGG
LGALRPERDG PLAATRAEGS DGRSEATGST VPPAPDPGPA DRRPRKHRRR RSRFFSYAMI
VATLAAAAGV AVWWIQTNDL LRSPTDTGVA NPPATVDAED FDGAAGLQTL GAQEGFSGDW
VEVFAPGEAA AVKPGPRASA EPFDGDAGER LRLISQAASK DGDVEIEIPA DVLAQLSGKS
STFALTVQAA PGKATEFSVE CDLGALGGCG RHRFTVHDER IDMLFKINFD RGAAPSGPGK
LVINSDVGGG GNSLDLFAIR VQPGG