Gene Smed_0717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0717 
Symbol 
ID5321554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp768355 
End bp769317 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content64% 
IMG OID640789654 
Productproline iminopeptidase 
Protein accessionYP_001326408 
Protein GI150395941 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.922147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.435096 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTTCTC CATTGCGTAC TCTCTATCCG GAAATCGAGC CCTACGCCTC GGGTCGACTC 
GATGTGGGCG ACGGCCATTC GATCTACTGG GAGCGGGTGG GCACGCCCGG AGCGAAGCCG
GCGGTCTTCC TGCACGGTGG CCCGGGCGGT ACGATTTCGC CGAACCACCG GCGGCTCTTC
GATCCGGCGC TCTACGACGT GACGCTGTTC GACCAGCGCG GCTGCGGCAA GTCGGAGCCG
CATGCCGGGA TCGAGGCGAA CACGACCTGG CATCTCGTCG CCGATATCGA GCGGCTGAGG
GAAGCGGCCG GCGCGGACAA ATGGCTGGTT TTCGGCGGTT CCTGGGGTTC GACGCTGGCG
CTTGCCTATA CCGAAACCCA TCCCGGGCGG GTCTCCGAAC TCGTCGTCAG GGGCATTTAC
ACGCTGACCA GGGCCGAGCT CGACTGGTAC TATCAGTTCG GCGTTTCGGA ACTCTTCCCC
GACAAGTGGG AACGCTTCAT CGCCCCGATC CCGCCGGAAG AGCGCCATGA GATGATGCGC
GCCTACCATC GCCGCCTCAC GAGCGATGAC CGTGCGATAC GGCTTGCAGC GGCACGCGCC
TGGAGCATAT GGGAGGGCGA GACGATAACG CTTCTGCCGG AGCCGGCCAC CAGCACGCCC
TTCGAGGAAG ACGAATACGC GCTCGCCTTT GCCCGCATCG AGAACCATTT CTTCGTCAAT
GCCGGATGGC TGGAAGAGGG CCAATTGCTG CGCGATGCGC ATAAGCTCCG CGGCATTCCG
GGTGTGATCG TGCACGGCCG CTACGATATG CCGTGCCCGG CGAAATATGC ATGGCAATTG
CACAAGGCTT GGCCGGAAGC GGAATTCCAT CTGATCGAGG GGGCCGGGCA CGCCTATTCG
GAGCCCGGCA TTCTCGATCG GCTGATCCGA TCGACCGACA AATTCGCCGG CAAGGCCGAA
TAA
 
Protein sequence
MSSPLRTLYP EIEPYASGRL DVGDGHSIYW ERVGTPGAKP AVFLHGGPGG TISPNHRRLF 
DPALYDVTLF DQRGCGKSEP HAGIEANTTW HLVADIERLR EAAGADKWLV FGGSWGSTLA
LAYTETHPGR VSELVVRGIY TLTRAELDWY YQFGVSELFP DKWERFIAPI PPEERHEMMR
AYHRRLTSDD RAIRLAAARA WSIWEGETIT LLPEPATSTP FEEDEYALAF ARIENHFFVN
AGWLEEGQLL RDAHKLRGIP GVIVHGRYDM PCPAKYAWQL HKAWPEAEFH LIEGAGHAYS
EPGILDRLIR STDKFAGKAE