Gene Smed_4659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4659 
Symbol 
ID5319334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1169940 
End bp1171187 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content65% 
IMG OID640776457 
Productimidazolonepropionase 
Protein accessionYP_001313389 
Protein GI150376793 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.563946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0326494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGAA ACGACAACCC AGGCCCTGTC AGAACCAGCC TCTGGCGTAA CGCACGCCTT 
GCGACGCTTC GCGAAGATCT GGGCACCCTC GGAGTCGTCG AGAGCGGGGT CGTCGCCGCT
CGCGGCGAGC GGATCGTTTA TGCCGGCCCC GAGGCCGGTC TCACCTCGGA GCTTGCGCGC
GCAGACCAGA TCTTCGATTG CGAAGGCCGC TGGGTAACCC CCGCGTTGAT CGACTGCCAC
ACACATATCG TTCATGGCGG CAACCGGGCC CGGGAATTCC AGCTCCGCCT CGAAGGCGCG
ACCTATGAGG CGATCGCCCG CGCCGGAGGC GGCATCGCTT CGACCGTCGA GGCGACCAAC
GCCCTTTCCG TCGACGAACT TGTGGCGGCC GCACTGCCAC GGCTCGACGC CCTGCTCGCG
GAGGGCGTTT CGACCGTCGA GGTAAAGTCG GGCTATGGTC TCAACGTCGA GACCGAGCTC
AAGATGCTGC GCGCCGCCCG CCGGTTGGAG ACCCTGCGCC CGGTGCGCAT CGTCACCAGC
TATCTCGCAG CCCATGCGAC CCCGCCGGGA TATCAGGGCC GAAACGGCGA CTACATCGCC
GAGGTCGTCC TGCCGGGCCT CGCTGCAGCG CATACGGAAG GGCTTGTGGA TGCCGTCGAC
GGATTCTGCG AAGGCATAGC CTTTTCGCCG GCGGAGATCG CCTTGGTCTT CGACAAGGCG
AAGTCGCTCG GCCTTCCCGT GAAGCTTCAC GCCGAACAGC TTTCCGATCT CGGCGGCGCA
AAGCTCGCTG CTTCCTACAG CGCTCTTTCC GCCGACCACC TCGAATATCT CGACGCTGCG
GGCGCCGCCG CCATGGCAAA GGCCGGCACG GTCGCCGTCC TGCTGCCCGG CGCTTTCTAC
ACCCTCCGGG AAAAGCAGCT TCCACCCGTC GAAGCACTCC GCGCGGCGGG GACGCGCATG
GCTATCGCCA CCGATTGCAA TCCCGGAACC TCGCCGCTCA CCTCGCTGCT GCTCACAATG
AACATGTCCG CGACGCTCTT CCGCCTGACG TTGGAGGAAT GTCTCGCCGG AGTTACTCGC
GAGGCCGCCC GTGCACTCGG GGTGCTCGAC GAGACCGGTA CGATCGAAGC CGGCAAGTCC
GCGGACCTTG CGATCTGGAA CATCGATCAA CCGGCGGAGC TGATCTACCG CGTCGGCTTC
AATCCCCTGT GCGAGCGTGT TTTCAAGGGC GAAAGGGTTT CCCGATGA
 
Protein sequence
MDRNDNPGPV RTSLWRNARL ATLREDLGTL GVVESGVVAA RGERIVYAGP EAGLTSELAR 
ADQIFDCEGR WVTPALIDCH THIVHGGNRA REFQLRLEGA TYEAIARAGG GIASTVEATN
ALSVDELVAA ALPRLDALLA EGVSTVEVKS GYGLNVETEL KMLRAARRLE TLRPVRIVTS
YLAAHATPPG YQGRNGDYIA EVVLPGLAAA HTEGLVDAVD GFCEGIAFSP AEIALVFDKA
KSLGLPVKLH AEQLSDLGGA KLAASYSALS ADHLEYLDAA GAAAMAKAGT VAVLLPGAFY
TLREKQLPPV EALRAAGTRM AIATDCNPGT SPLTSLLLTM NMSATLFRLT LEECLAGVTR
EAARALGVLD ETGTIEAGKS ADLAIWNIDQ PAELIYRVGF NPLCERVFKG ERVSR