Gene Smed_0024 details

Gene Information

Locus tag: Smed_0024
Symbol:
ID: 5320851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 19875
End bp: 21032
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 67%
IMG OID: 640788955
Product: hypothetical protein
Protein accession: YP_001325719
Protein GI: 150395252
COG category: [R] General function prediction only
COG ID: [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID:
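
The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 19875
end_bp = 21032

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon and does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1158 385
```

Both results match the Gene Length and Protein Length fields reported in the record.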


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.000201715
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAAGGAAA AAGATCGGCG ATCGAAGAAC GCCTCCGTGA CGGCGGCGAA GAGCCGGGGC 
GCCGAGGCGC GCGGCGGCCG TCATGAGCAA AGGCTGCCGC CAGCAAAAAG CGCAGGGCCC
GCGGCGAGGA CCGGATCAAA GACGGGGGCA GGAGGGCCAT CGCTGAAACC GGCACCGCGC
AAGGCCGAGC GGACCTCCGC CCAGAACAGC GCCCCGTTGC GCCGCCTTGA ACTCAGGACC
GGGGAGAAGC CTGCCGAGAC GGTGCCGCTG ATCCTCGCGA CCGCCGCCAC CGGCGGCTAT
CACCTGATCG ATAGCGGCGA TGGCGAGAAG CTCGAGCAAT ATGGCCCCTA TCGCATCGTC
CGTCCGGAGG CCCAGGCGCT CTGGCCGAAG GCCCTGTCTG CATCCATCTG GGAAAAAGCC
GATGCGGTCT TCACCGGCGA TACGGAAGAG GACGGGATGG GTCGCTGGCG GTTCCCGGGG
GATGTTCTCG GCGAGACCTG GCCGATGCAG CTCCTGGACA CGGATTTCCT CGGCCGGTTC
ACATCCTTCC GCCATGTCGG CGTCTTTCCG GAACAGCTCG CCCACTGGTC GTGGATGCGG
GACCAGGTTG CCGGCGCCGG CCGGCCCCTG AAGGTTCTCA ATCTCTTCGG CTATACCGGC
GTTGCTTCGC TCATCGCGGC GAAGGCGGGT GCGGAAGTAA CCCATGTCGA TGCCTCGAAA
AAAGCGATCG GCTGGGCGCG CGAGAACCAG GCAATGGCGC GAGCCGAGAA GCTGCCGATC
CGCTGGATCT GCGACGATGC CATGAAATTC ATCCAACGGG AGGAGCGGCG CGGCAGCCGC
TACGACGTCA TCCTCACCGA CCCGCCGAAG TTCGGCCGCG GCCCGAACGG CGAGGTTTGG
CAACTGTTCG ATCATCTCGC GGCGATGCTG GACGTCTGCC GCGAGATCCT GTCACCGGAC
GCGCGGGGCC TCGTGCTCAC CGCCTATTCG ATCCGTGCCA GCTTCTATTC GATTCACGAG
CTCATGCGGG AGACCATGCG CGGGCGCGGC GGGCGGGTGG AATCGGGCGA ACTCATCATT
CGCGAGGGCG GTCTCGACGG CGCCAGGCCG GGCCGGGCGC TCTCGACCTC CCTCTTCAGC
CGCTGGGTAC CGAAATGA
 
Protein sequence
MKEKDRRSKN ASVTAAKSRG AEARGGRHEQ RLPPAKSAGP AARTGSKTGA GGPSLKPAPR 
KAERTSAQNS APLRRLELRT GEKPAETVPL ILATAATGGY HLIDSGDGEK LEQYGPYRIV
RPEAQALWPK ALSASIWEKA DAVFTGDTEE DGMGRWRFPG DVLGETWPMQ LLDTDFLGRF
TSFRHVGVFP EQLAHWSWMR DQVAGAGRPL KVLNLFGYTG VASLIAAKAG AEVTHVDASK
KAIGWARENQ AMARAEKLPI RWICDDAMKF IQREERRGSR YDVILTDPPK FGRGPNGEVW
QLFDHLAAML DVCREILSPD ARGLVLTAYS IRASFYSIHE LMRETMRGRG GRVESGELII
REGGLDGARP GRALSTSLFS RWVPK
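
The relationship between the gene and protein sequences can be sketched in a few lines of Python. This is an illustrative sketch only, not the method the database used: it assumes the standard codon assignments (which NCBI translation table 11 shares with table 1) and treats an alternative start codon such as GTG as methionine when it opens the reading frame. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over bases T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), AMINO)}

# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence shown above.
first_line = "GTGAAGGAAA AAGATCGGCG ATCGAAGAAC GCCTCCGTGA CGGCGGCGAA GAGCCGGGGC"
last_line = "CGCTGGGTAC CGAAATGA"

print(translate_cds(first_line))  # MKEKDRRSKNASVTAAKSRG
print(translate_cds(last_line))   # RWVPK
```

The two printed fragments correspond to the first twenty and last five residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.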