Gene Smed_1488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1488 
Symbol 
ID5322346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1573881 
End bp1574888 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content66% 
IMG OID640790436 
Productpeptidase S58 DmpA 
Protein accessionYP_001327168 
Protein GI150396701 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3191] L-aminopeptidase/D-esterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0291995 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAGA AGGGCGCCAG AAATCTGATC ACGGACGTTC CGGGCTTGCT GGTGGGCAAT 
GCTGAAGACC ATCACCTGAA ATCCGGTGTC ACGACGGTGC TCTGCGATCC ACCGGCGACG
GCGGCGGTTC AGGTATTGGG TGGCGCCCCC GGCACGCGCG AAACCGACCT GCTCGCTCCG
CACAACATGG TGCAGGCGGT CGACGCCATC GTGCTTTCCG GTGGCTCGGC CTTCGGCCTT
GACGCAGCCT CCGGCGTGCA GGCGGCACTG CGCGAGATGG GGCGCGGTTT CGCCGTCGGG
CCACACCGCG TACCGATCGT GCCTGCGGCG ATCCTCTTCG ATCTTGCAAA TGGCGGCGGC
AAGGATTGGG CGCGCTACGC TCCCTACCGC GAGCTCGGCT ACGAAGCGGC ATATGCCGTA
TCCGCGGATT TCGTCACAGG CAGTTCCGGC GCAGGTACGG GCGCGCTCAC GGCCACCTTC
AAGGGAGGCC TCGGCTCAGC CTCGACGGTG CTGGCCAACG GAATCACGGT CGGTGCGCTG
GTGGCGGTGA ATGCCTTCGG TTCCGCGACC GTTGGCGAGA CACGCCATTT CTGGGCCGCG
CCCTTCGAGA TGGAAAGAGA GTTCGGCGGG CTTGGCCAGC CATCGCCCTG GCCGGCGGAC
GCCGCTACCC CCCGGTTCAA GTTTCGGGAG CGGCAAGTCT CCCCGGCAAA CACGACCATC
GCCGTCATCG CCACCGATGC GTTGCTCAGC AAAGCGGAGG CGAAGCGGCT GGCAATCGCA
GCCCATGACG GCTTTTCGCG CGCGCTCTGG CCCTCTCATA CGCCGCTCGA CGGCGATCTC
GTCTTCGCGC TTTCGACCGG CACGAGTGGA AAAGCTCCGT CGCTTCAGGA TTCCATCGAT
CTCAGCGCCG CAGCCGCCGC CACTATGGCC CGCGCGATCG CGCGCGGCGT TTATGACGCA
TGCGATACAG GCAACGATCT GATACCAGCC TGGTCGGCCC GCTTCTGA
 
Protein sequence
MMQKGARNLI TDVPGLLVGN AEDHHLKSGV TTVLCDPPAT AAVQVLGGAP GTRETDLLAP 
HNMVQAVDAI VLSGGSAFGL DAASGVQAAL REMGRGFAVG PHRVPIVPAA ILFDLANGGG
KDWARYAPYR ELGYEAAYAV SADFVTGSSG AGTGALTATF KGGLGSASTV LANGITVGAL
VAVNAFGSAT VGETRHFWAA PFEMEREFGG LGQPSPWPAD AATPRFKFRE RQVSPANTTI
AVIATDALLS KAEAKRLAIA AHDGFSRALW PSHTPLDGDL VFALSTGTSG KAPSLQDSID
LSAAAAATMA RAIARGVYDA CDTGNDLIPA WSARF