Gene Smed_3586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3586 
Symbol 
ID5318577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp13478 
End bp15187 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content66% 
IMG OID640775401 
ProductSMP-30/gluconolaconase/LRE domain-containing protein 
Protein accessionYP_001312334 
Protein GI150375738 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1414] Transcriptional regulator
[COG3386] Gluconolactonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.63579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACA AGGCAGTCTT AGGCGGGCTT GAAGATCGCG GGGCACCGGC GGGTGCGGCC 
GCCCTCGCCA AGGGTCTCGC TCTCCTCGAC CTCATAGCGG AGGCTCCGAA GCCGCTGCGC
TTCGCTGACC TGCAGAAGAT GAGCGGCGTG CCGAAGCCGA CGCTTGCGCG CATGCTGAAG
ACGCTGATGG TGTTTCGTCT GATCCGGCAG GACGAGGCAA CGGGCGCTTA TCTGCTCGGA
CACCGCTTCA TCGAGCTTTC GCACCGTGTG TGGGATAAAT TCGACCTCGT CAGCGCCGCC
GCCCCCGAAC TCGAACGCCT TTCGGCCGAA CTCGGGGAAA CGGTCGCACT TTGCCGCCTC
GACGGTCACC GTGTGGTTTA TCTGGAAGAG CGCTCGAGCG GCGGCCTGGG CGTGCTGATT
GCGGTCGGCC GGCGTGTGCC GGTCCACTGC ACCGCGGCCG GCAAGGCTCT GCTCGCGTTT
CAGGAGCCCT CCTTCGCCCG CTCGCTTGCT GGACAGCTCG ATTATGATCG CTTCACGCCG
CAGACCATTA CAGATCCGCA GGCGCTCGAG GCCGATCTCG TCCTGACACG CGCCCGGGGC
TATGCCGTTT CCTACGAGGA GCATCTCGCA GGGGTCAATA CTGTCGCTGC TCCGATCGCC
GGGCGCGACG GAGTGCCGCT CGGAGCGCTC GTGGTGCTGG GGCCAGCCTC CCGTCTCGAC
AGTTCCGCCA TCCATCCGGT CGGCCGCGAA TTGATGGCTG CGGCGCGACG GATCACCGGC
ACGGTCGGGG CCGTGGCCAT CAGCTCCGGC CCGCGGCCGC GGACGCGTGC GGGAGGACAG
AGCGATGTGC AATGCGTTCT GCCCTGGGGC GCCCAGCTCG GCGAAGCCCC AGTCTGGGTC
GGCAGGGAAA AGCGGCTCTA TTGGGTTGAT ATTCTCCACC CGGCCGTGCA TCGGTTCGAT
CCTGTTACCG GCAAAAACGA GACCTGCAAC ACAGCCAAGC TCGTCAGTGC CGTCCTTCCG
AGCGAGGACG GGCGGCTGGT CGTCGCCTCG CAGGACGGCG TCGAACGTTT CGACTTCGAG
CGCGGCCTGT TCACCCGGTT TGCCGAGCCG GAACCGGGGG TACCGGAAAA CCGCCTCAAT
GACGCAAAAG TCGACCCGCA TGGAAGGCTT TGGGTGGGAT CGATGCGCCT TGATGTGAGC
CGGCCGACCG GCAGCCTGTA TCGCCTGACG AAGACCGGTG AAGTCGCCCG CGCCGAAAGC
GGGTTCACGG TCGCCAACGG GCTGGCCTGG AGTCCCGATA GTTCGACCTT CTACTTCGTC
GATACGGTGC CGGGCATCAT CTATGCCTAC GACTTCGATG CGAGGGAGGG GAGCATCGCC
AACCGCCGCG TTTTCGTTAC CGTGCCCGAG GCGGAGGGGC GTCCGGACGG GCTTGCCGTC
GATGCCGATG GCGGCGTCTG GTGTGCAATC TGGGATGGAT GGCGCGTGAA CCGCTATCGG
CCCGATGGCC GGCTGGATCG CGCCGTCGAA CTGCCGGTGC CACGCCCCAC CAGCGTGGCC
TTCGGCGGGG ACGATCTTGC GACCTTGTTC ATCACCAGCG CGCGCACCCG TCTCCCGGCC
TCGACGCTGA CGGAAGCACC GCTCTCCGGC GGCATTTTCG CCTGTAACCC CGGTGAGCGC
GGCCTGGCCA CCTCTCTCTT CGGCGTCTGA
 
Protein sequence
MDDKAVLGGL EDRGAPAGAA ALAKGLALLD LIAEAPKPLR FADLQKMSGV PKPTLARMLK 
TLMVFRLIRQ DEATGAYLLG HRFIELSHRV WDKFDLVSAA APELERLSAE LGETVALCRL
DGHRVVYLEE RSSGGLGVLI AVGRRVPVHC TAAGKALLAF QEPSFARSLA GQLDYDRFTP
QTITDPQALE ADLVLTRARG YAVSYEEHLA GVNTVAAPIA GRDGVPLGAL VVLGPASRLD
SSAIHPVGRE LMAAARRITG TVGAVAISSG PRPRTRAGGQ SDVQCVLPWG AQLGEAPVWV
GREKRLYWVD ILHPAVHRFD PVTGKNETCN TAKLVSAVLP SEDGRLVVAS QDGVERFDFE
RGLFTRFAEP EPGVPENRLN DAKVDPHGRL WVGSMRLDVS RPTGSLYRLT KTGEVARAES
GFTVANGLAW SPDSSTFYFV DTVPGIIYAY DFDAREGSIA NRRVFVTVPE AEGRPDGLAV
DADGGVWCAI WDGWRVNRYR PDGRLDRAVE LPVPRPTSVA FGGDDLATLF ITSARTRLPA
STLTEAPLSG GIFACNPGER GLATSLFGV