Gene Smed_3663 details


Gene Information

Locus tag: Smed_3663
Symbol:
ID: 5318677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 100349
End bp: 102109
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 65%
IMG OID: 640775476
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_001312409
Protein GI: 150375813
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit
TIGRFAM ID:
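
The start, end, gene length and protein length values in this record are consistent with one another, which can be checked with a short calculation. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(100349, 102109)
print(length)                     # 1761
print(protein_length_aa(length))  # 586
```

Both results match the Gene Length and Protein Length fields listed in this record.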


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0156041
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCA TCGACCCCAT CACCCTGACC GTTATCCAAT CCGGCCTCCA GCAGGTCTGC 
GACGAGATGG ACATGACCTT CTCGCGAGCT GCATTCTCGC CGATCATCGC CGAGGCGGAC
GACCGCTCCG ACGGCATCTA TTCCGCCGAG GATGGTTCGC TGATCGCCCA AGGCATCAAG
GGGCTGCCGG TCTTCGTCGG AACGATGCAG GCATCGACCC GCACTCTTAT CGAGTTCATT
CGCGACGGCC GCTGCCTGCC GCCGGAGGAG GATGACATCT ATGCGGTGAA CGACCCCTAT
CTTGGCGGCA CGCACCTGAT GGACGTCCGC TTTGCCACGC CGTTCTATCG TAACGGGGAG
ATTTTCTGCT GGCTCTCGAA CATCGGACAC TGGCCCGATA TCGGCGGCGC CGTGCCCGGC
GGGTTTTCGG CATCCGCCAC CTCCGTGGAG CAGGAAGGAC TTCGTTTCCC GCCGATCAAG
CTGTTCAAGC GCGGCGTTCT CGATCGCGAG CTTTTCTCGA TCATCAGCTC CAACATCCGC
GTCGCAGAGC AGCGCATAGG CGATATCCGG GCTCAGGCGG CCGCCTTGCG GGTCGGCAAG
GAGCAGTTCA CGGCCCTGCT CGACCGCTAT GGTGACGACA CGGTAGCGGC GGCCATTGCA
GAATTGAGGC GCCGCTCGGC AGCGCAGATG CGCGCCTCCA TCCGCACCAT CGCGCCCGGA
ACCTATCACG GCAAAGCCTT CATCGATTCC GATGGAGTGG TCAACGAACC GCTCACCATT
GCCCTGTCTG TGACCAAGAC CGGCGACGAC CGGCTGGTCT TCGATTTCGC GGGGTCCAGT
CCGCCCTGCC GCGGCCCGAT GAATTGCGTG CTGGCGACGA CCCACTCCTC GGTCTATCTC
GCCATGCGGC ACATCTTTCC GGAGATCCCG CTCAGCGCCG GCGCTTTCGA GCCGCTGGAG
ATCGTCAGCC CTGCTGGCAC GTTTCTTGAT GCGCAATATC CACGGCCGGT CTCAGGCTGT
GCGGCGGAAG TGTCGCAGCG CATCGCCGAG GCGGTCTTTT CCGCCCTCGT ACAGGCACTG
CCGGAGCGCG TGACCGCGGC TCCCGCCGGT TCCAGCGGCA ATTTCGCACT CGGCGGCAGC
GATCCGCTGC TTGGCCGCGA CTACGTGATG TACCACATTT CCGGTGGCGG TTACGGCGGC
AACGCCCGGG AAGACGGCCT GACCAACGGC TGCTCCACCA TCGGCATATC CAAGTCCGCC
CCGGTCGAGA TCACCGAGCA GGTTTTTCCG GTGTTCTTCC GCGAATACGC GATCCACGAG
GGCTCCGGCG GTGCGGGACG CAACCGGGGC GGCTTCGGCC TCAGCTACGA AGTGGAATTG
CTGCGCGGCG ACGCCCAGGC ATCCTTCGTG ATGGATCACG GCGCCTTCGG ACCCCAGGGG
GCCCTCGGCG GCGCGGATGG CGCAGTGGGC ACGATCACGG TCACGCGAGG TGGAAAAACC
TATCGGCCGG AACATCTGTC GAAGGAACAG GACATAGCAC TGACCGCCGG GGACCGGGTC
CGGGTGGAGA CCCCCGGTGG CGGCGGATAC GGCCTGGCCC ACGAACGGGA CGTCGAGGCC
GTGCTCAAGG ACGTGTTCCT CGGCTATTAC TCGCTGCAGC AGGCAGAAAG CCTCTTCGGC
GTCGTGATCG ATACGACAAG CGGCAAGCTT GACAGGGAAG CGACCGAAAA GTTGCGCCGC
CGCCGCTCGA AAGGCGCCTG A
 
Protein sequence
MSTIDPITLT VIQSGLQQVC DEMDMTFSRA AFSPIIAEAD DRSDGIYSAE DGSLIAQGIK 
GLPVFVGTMQ ASTRTLIEFI RDGRCLPPEE DDIYAVNDPY LGGTHLMDVR FATPFYRNGE
IFCWLSNIGH WPDIGGAVPG GFSASATSVE QEGLRFPPIK LFKRGVLDRE LFSIISSNIR
VAEQRIGDIR AQAAALRVGK EQFTALLDRY GDDTVAAAIA ELRRRSAAQM RASIRTIAPG
TYHGKAFIDS DGVVNEPLTI ALSVTKTGDD RLVFDFAGSS PPCRGPMNCV LATTHSSVYL
AMRHIFPEIP LSAGAFEPLE IVSPAGTFLD AQYPRPVSGC AAEVSQRIAE AVFSALVQAL
PERVTAAPAG SSGNFALGGS DPLLGRDYVM YHISGGGYGG NAREDGLTNG CSTIGISKSA
PVEITEQVFP VFFREYAIHE GSGGAGRNRG GFGLSYEVEL LRGDAQASFV MDHGAFGPQG
ALGGADGAVG TITVTRGGKT YRPEHLSKEQ DIALTAGDRV RVETPGGGGY GLAHERDVEA
VLKDVFLGYY SLQQAESLFG VVIDTTSGKL DREATEKLRR RRSKGA
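
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below illustrates this on the first and last blocks of the gene sequence; it is an illustration under stated assumptions, not the tool used to produce this record. The names `CODON_TABLE`, `clean` and `translate` are hypothetical. The amino acid assignments used are those shared by the standard code and table 11; the table 11 rule that alternative start codons are read as methionine is not implemented, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in
# NCBI translation tables 1 and 11); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and final blocks of the gene sequence in this record.
print(translate("ATGAGCACCA TCGACCCCAT CACCCTGACC"))  # MSTIDPITLT
print(translate("CGCCGCTCGA AAGGCGCCTG A"))           # RRSKGA
```

The two outputs match the first ten and last six residues of the protein sequence listed in this record.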