Gene Smed_5979 details

Gene Information

Locus tag: Smed_5979
Symbol:
ID: 5320281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 936308
End bp: 937546
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 60%
IMG OID: 640777657
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001314589
Protein GI: 150377994
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
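
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal Python sketch, not part of the original record. The function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(936308, 937546)
print(length)                     # 1239
print(protein_length_aa(length))  # 412
```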


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.448623
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCTA ACCCGACCTC GATTCGTCAG CTGCTCGATC GCCGTCTATC GGGCTTCAGC 
CTGGAGCAGC CCTTCTACGT CTCGCCCGAG GTCTACGCGC TCGACCTGGA GCACATTTTC
TACATGGAAT GGCTCTATGC CGTTCCGGCC TGCCAGCTCG CCAAGACGGG TAGCTATGTC
ACGCTGCGCG TAGGCGCTTA TGAGGTCGTC ATCGTCCGTG GCCGTGACGG TGAAGTCCGT
GCCTTTCACA ATTCCTGTCG CCACCGCGGA TCGTTGATCT GCAAGGCGCG CCAGGGGCAG
GTGGCGAAGC TTGTCTGTCC CTACCACCAA TGGACCTATG AGCTTGACGG TAAACTGATC
TGGGCGAACG ACATGGGTCC CGATTTCCAT GCTTCGAAAT ACGGCCTGAA ACCCGTCAAC
CTGCGCAGTC TCGACGGCCT CATCTATATC TGTCTTTCCG ATACGCCGCC GGATTTCCAA
ACATTCGCAC AGCTGGCGCG CCCCTATCTG GAGGTTCACG ACCTCAAGGA TGCCAAGGTC
GCGTTCACCT CTACGATCAT CGAGAAAGGC AACTGGAAGC TGGTCTGGGA GAACAACCGC
GAGTGCTATC ATTGCAGCAG CAACCATCCG GCTCTCTGCC GCTCCTTCCC ACTCGACCCG
GAAGTTGCCG GTGTTCAGGC CGATGGCGGA GTATCTGAGA AGCTGCAGGC GCATTTCGAC
CTTTGCGAAG CCGCCGGCAC ACCGGCGCAA TTCGTCCTTG CTGGCGACGG TCAGTATCGC
CTCGCACGTA TGCCGCTGCA GGAAAAGGCG TTGAGCTATA CGATGGACGG CAAGGCCGCG
GTTTCCCGGC ATCTGGGCCG GGTTGCCCCG CCGGATGCCG GCACGCTCCT GATGTTCCAC
TATCCGTCGA CGTGGAACCA CTTCCTGCCG GATCACTCAC TCACCTTCAG GGTTATGCCG
ATCAGCCCGA CCGAAACCGA GGTCACGACG ACCTGGCTCG TACACAAGGA TGCGGTCGAA
GGAGTCGACT ACGACCTCAA GCGCCTGACG GAGGTCTGGA TCGCCACCAA TGACGAAGAT
CGCGAGATCG TCGAAACGAA CCAGCAAGGG ATCCTCTCTC CGGCTTACGT GCCCGGTCCC
TATTCACCGG GTCAGGAAAG CGGCGTCATG CAGTTCGTCG ACTGGTATGC GGCCTGGCTG
GAGCGCGCCC TTGCGCCGCG TCAAGTGGCT GCGGAGTGA
 
Protein sequence
MTANPTSIRQ LLDRRLSGFS LEQPFYVSPE VYALDLEHIF YMEWLYAVPA CQLAKTGSYV 
TLRVGAYEVV IVRGRDGEVR AFHNSCRHRG SLICKARQGQ VAKLVCPYHQ WTYELDGKLI
WANDMGPDFH ASKYGLKPVN LRSLDGLIYI CLSDTPPDFQ TFAQLARPYL EVHDLKDAKV
AFTSTIIEKG NWKLVWENNR ECYHCSSNHP ALCRSFPLDP EVAGVQADGG VSEKLQAHFD
LCEAAGTPAQ FVLAGDGQYR LARMPLQEKA LSYTMDGKAA VSRHLGRVAP PDAGTLLMFH
YPSTWNHFLP DHSLTFRVMP ISPTETEVTT TWLVHKDAVE GVDYDLKRLT EVWIATNDED
REIVETNQQG ILSPAYVPGP YSPGQESGVM QFVDWYAAWL ERALAPRQVA AE
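
As a cross-check of the two sequences above, the sketch below translates a nucleotide string and computes its GC content. It is an illustration under stated assumptions rather than the method used to produce this record. The function names are hypothetical, the spaced 10-base blocks are joined before use, and only unambiguous A/C/G/T bases are expected. The codon-to-amino-acid assignments of translation table 11 are taken to be the same as the standard code, since the two tables differ only in their permitted start codons.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACAGCTA ACCCGACCTC GATTCGTCAG"))  # MTANPTSIRQ
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content reported in the record.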