Gene Smed_4540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4540 
Symbol 
ID5319041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1027229 
End bp1028599 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content63% 
IMG OID640776341 
ProductFAD linked oxidase domain-containing protein 
Protein accessionYP_001313273 
Protein GI150376677 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTTA TCGAAGAACT CATGGCCGCA CTCGGCAATG CCGTGCTCAC CGGCGACCGG 
ATCACGGAGC GTTACCGCAG CGATGCGAGC CTGTCGGGGC GGACCCTTCC CCTGGCGGTC
GTTCGCCCGG GAAGCGTCGA CGAGGTCGCC GTGGCTCTCA AGATCTGCAA CGCGCACGGC
CAGTGCGTGG TGCCGCAGGG CGGGTTGACC GGGCTTGCCG GCGGCGCCAA TCCGCGCGCC
GGCGACATCG CCGTTTCGCT GGAGAGACTT TCCGGCATTG AAGAGGTCGA CGCGACCGCT
TCGAGCATGA CGGTGCTCGC GGGAACGCCG CTCGAAGTCG CCCAGCGCGC GGCCGAAGAC
GCGGGCTTTC TGCTGCCGAT CGATCTTGGT GCGCGGGGTA GCTGCCAGAT CGGCGGCAAC
CTGGCCACGA ATGCCGGCGG CATCCGGGTC ATCCGCAATG GCGTGGCACG GGACAACGTG
CTGGGGCTCG AAGCCGTGCT TGCCGACGGC ACCGTCGTCT CGTCAATGAA CAAGATGATC
AAGAACAACA CCGGCTACGA TCTCCGGCAG TTGTTCATCG GTTCGGAAGG CACGCTCGGC
ATCATCACGC GCGCGGTGCT GCGCCTTCGT CCTTTGCCAA CCGGCCGGCT GACGGCACTT
TGCGCGCTCG ACAGCTATTC GGAAGTCGTT ACCCTGCTCA AGCGGGCACA GCAGGAGCTC
CCCGGCCTCG GCGCCTATGA GGCGATGTGG GAGAGCTACT TCCGTTTCAA TTCCGAGGCC
GACGGGCTCA GGCTCTTCGA ATCCTGCCCG GCTTTCGCAG TCATTGTGGA GCAGGATTTG
CAAGGCCATG ATGCGGAGAG CGAGCGGTTC GAAGCGTTTC TCGGGAGAGC CTTGGAGGAC
GGTGTGATCG GAGACGCGCT CGTCGCTCAA TCGCAGAAGG AGGCACAAGC CTTCTGGCGA
ATTCGAGAGG GTCATGCGCT CGACCGGCTG CCCCTGCTCC TGAACTTCGA TGTCAGCCTG
GCGATCGGCG ATATCGGCCG CTTCGCCGAT GAATGCGGCC AGGCGCTTCG GGCAAAGTTC
CCCGAAGCAC ATGTGTCCTT CTTCGGCCAT GTCGGCGACA GCAACCTCCA TATCGCCTTT
TCCGATCCGG GCGCTACCGA AGAAACGATC CACGCAGTGG ACGATATCGT TTACGCGCTG
GTTGGGACCT ATCGCGGATC GGTGTCTGCC GAACACGGGA TCGGCTTGCT GAAGCGCGAC
TTTCTCCACT ATTCCCGCAG CCCGGCGGAG CTCGAACTCA TGCGGCGGAT AAAGAGCGCT
CTCGATCCCA ACGGAATCCT CAACCCCGGC AAAGTTCTGG GCTCGGTTTA G
 
Protein sequence
MSVIEELMAA LGNAVLTGDR ITERYRSDAS LSGRTLPLAV VRPGSVDEVA VALKICNAHG 
QCVVPQGGLT GLAGGANPRA GDIAVSLERL SGIEEVDATA SSMTVLAGTP LEVAQRAAED
AGFLLPIDLG ARGSCQIGGN LATNAGGIRV IRNGVARDNV LGLEAVLADG TVVSSMNKMI
KNNTGYDLRQ LFIGSEGTLG IITRAVLRLR PLPTGRLTAL CALDSYSEVV TLLKRAQQEL
PGLGAYEAMW ESYFRFNSEA DGLRLFESCP AFAVIVEQDL QGHDAESERF EAFLGRALED
GVIGDALVAQ SQKEAQAFWR IREGHALDRL PLLLNFDVSL AIGDIGRFAD ECGQALRAKF
PEAHVSFFGH VGDSNLHIAF SDPGATEETI HAVDDIVYAL VGTYRGSVSA EHGIGLLKRD
FLHYSRSPAE LELMRRIKSA LDPNGILNPG KVLGSV