Gene Smed_5853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5853 
Symbol 
ID5320155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp815252 
End bp816439 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content60% 
IMG OID640777548 
Productpeptidase M24 
Protein accessionYP_001314480 
Protein GI150377885 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.183484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACATCA ACGCAAGGGA CGCGCGGGCA GGGGAGCCGC CCTTCGATGC CGCGAAACTC 
GACAGACTGA TGGAACAGGC GGGTATCGAC GTTCTGCTCG CCACCTCCAA GCACAATACG
CAGTACCTGC TGGGCGGCTA TAAATTCATC TTCTTCGCCG CAATGGATGC GATCGGCCAC
AGCCGCTATC TGCCGATTGT CGTTTATGAG AAGGGCTCAC CCGATCATGC CGCTTATGTT
GGCAATCGCA TGGAGGGAGG AGAACATAAG AACAATCCGT TCTGGACGCC CGCCGTTCAT
ACAGCGACTT GGGGTACGCT CGACGCTGCA GAGCTTGCCG TAGAGCATCT GACGAAGATA
GGTAAGGCGA GTGCTCGCAT CGGTATTGAG CCGGCCTTTC TGCCGGCGGA TGCGCGTGAC
TTTCTGGCTT CTCGTCTCGA AGGTGCGCGG TTCATGGATG CGACGCACGC GCTGGAGCGA
CTGCGGGCGA TCAAGAGGCC TGAGGAACTG CAGATGCTCA AGCTGGCGTC CGAACTGATC
ACGGACTCAA TGCTCGCCAC CATCGCGGCA GCGCGGGAGG GTTCCACCAA GATCGAGATC
ATTGAACGGC TCAGGCGGGA GGAGACCAAT CGAGGGCTGC ATTTCGAGTA TTGCCTGCTG
ACCTTAGGTG CCAGTCACAA CCGTGCGGCC TCGCCGCAGG CGTGGGAGAA GGGCGAGGTG
CTTTCGATAG ATTCCGGGGG GAACCATTGC GGCTACATCG GGGACCTTTG CCGTATGGGA
GTGCTCGGAG ACCCCGATGC GGAGCTCGAA GATCTGCTGG CTGAAGTCGA GTCGATCCAG
CAGACGGCCT TCGCCAAGAT CAAGGCCGGG GCCGCGGCCA GTGAGATGAT TGCGGCCGCG
GAAGAGGTTT TGCAAAGCTC GCCATCGGCC GCCTTTACCG ATTTTTTCTG CCACGGCATG
GGGCTCATTA GCCACGAAGC TCCGTTTTTG ATGACCAACC ACCCGGTCGC CTATGAAGGC
AACGACGCGG ATCAGCCCCT GGAGGCAGGC ATGGTCATTT CTGTGGAGAC GACGATGCTT
CACCCGAAGC GCGGTTTCAT CAAGCTCGAG GATACGCTCG CCGTCACGAA CGGCGGATAC
GAGATGTTCG GCAACAGTGG GCGCGGCTGG AATCTCGGGG CGGCATAG
 
Protein sequence
MNINARDARA GEPPFDAAKL DRLMEQAGID VLLATSKHNT QYLLGGYKFI FFAAMDAIGH 
SRYLPIVVYE KGSPDHAAYV GNRMEGGEHK NNPFWTPAVH TATWGTLDAA ELAVEHLTKI
GKASARIGIE PAFLPADARD FLASRLEGAR FMDATHALER LRAIKRPEEL QMLKLASELI
TDSMLATIAA AREGSTKIEI IERLRREETN RGLHFEYCLL TLGASHNRAA SPQAWEKGEV
LSIDSGGNHC GYIGDLCRMG VLGDPDAELE DLLAEVESIQ QTAFAKIKAG AAASEMIAAA
EEVLQSSPSA AFTDFFCHGM GLISHEAPFL MTNHPVAYEG NDADQPLEAG MVISVETTML
HPKRGFIKLE DTLAVTNGGY EMFGNSGRGW NLGAA