Gene Smed_2501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2501 
Symbol 
ID5323368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2596562 
End bp2597440 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content62% 
IMG OID640791443 
ProductHemK family modification methylase 
Protein accessionYP_001328166 
Protein GI150397699 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.132727 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAA CCCTCGACAA CCTTCTTGCC GAGACCCGCG ACCGGTTGAA GGCCGCCGGC 
ATCGAATCGG CGGCACTCGA TGCGCGGCAC CTGGTCTCCG GCTTGCTCGA ACTCGCACTT
GCCGCGCTCG TGACGCGCGG GAGGGAGCCC GTCAGCGACG AGGATGCGGC GCGCATCCGT
GCCGCGGTCG AGCGCCGGGC TGCGCACGAG CCAGTCTATA GGATCCTCGG TGAGCGGGAA
TTTTCCGGCC TGAAGCTCAA GCTCTCGAAG GAGACGCTGG AGCCGCGCCC GGACACCGAG
ACCATGGTCG AATGCCTAAT TCCCCACGCC CGGCGGATCG CCTTGAAAAA AGGGAGTTGC
CGCATCGTCG ACCTTGGAAC GGGCACGGGT GCGATTTGTC TCGCGCTTCT CGATGCGGTA
CTTGACGCGC GCGGCCTCGG TACCGATATA TCGGAGGACG CATTGGCGAC GGCATGTGAA
AATGCCCGCA GGAATGGCTT GGCGGGGCGC TTCGAAACGC TTCGGAGCAA TTGGCTCGAG
GCGGTGAATG GCCGGTTCGA CATCATCGTC TCAAATCCGC CTTATATCCG GTCTAATGTC
ATTCCAGACC TTGAGCCGGA AGTGAAATTC CACGATCCGG CCGCCGCGCT TGATGGCGGA
GAGGATGGGC TGAATGCCTA TCGTGCCATC GCTTCCGACG CTGGTCGCCA TCTTGAACCA
GACGGCGTGA TAGGCTTGGA AATCGGTTTC GACCAAAAGC AAGCGGTAAC GGCGCTCTTC
GAGGCGCATG GCTTTCATAT GCTTTATGCC GCGAAGGACC TCGGCGGCAA CGACCGGGTC
CTGGTGTTCG AGCATGATCC TGCAGCGCCG CGCGTCTGA
 
Protein sequence
MPETLDNLLA ETRDRLKAAG IESAALDARH LVSGLLELAL AALVTRGREP VSDEDAARIR 
AAVERRAAHE PVYRILGERE FSGLKLKLSK ETLEPRPDTE TMVECLIPHA RRIALKKGSC
RIVDLGTGTG AICLALLDAV LDARGLGTDI SEDALATACE NARRNGLAGR FETLRSNWLE
AVNGRFDIIV SNPPYIRSNV IPDLEPEVKF HDPAAALDGG EDGLNAYRAI ASDAGRHLEP
DGVIGLEIGF DQKQAVTALF EAHGFHMLYA AKDLGGNDRV LVFEHDPAAP RV