Gene Smed_5468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5468 
Symbol 
ID5319770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp435582 
End bp437003 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content61% 
IMG OID640777229 
ProductFAD linked oxidase domain-containing protein 
Protein accessionYP_001314161 
Protein GI150377566 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00404471 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTTC TGGGGCAATT GGCCGAAATA GTCGGCGGTG AGATGATTAT TACCGGCGTG 
GCGGAAATGG CATCGGCAAC TGAAGACTGG CGGGGGCGCT ACCATGGAGC CGCCCTCTGC
GTCACGCGCC CGGCCGATAC CACACAGGTC TCCGAAATCG TCGCTTGTTG TCATCGTCAT
GGTGTCCCGG TGCTGCCCCA GGGCGGCAAT ACCGGCCTCG TGGGTGGTAG CGTGCCTGCC
TCGACCGGCG TCGCCCCAGT CATCGTAAGC CTCGACCGGA TGCGCCGCAT CAGGAGCGTC
GACCCGGTAA ATAGTACAAT AGAGGTTGAG GCAGGATGCG TGCTCGCCAA TGTGCACGAT
GCCGCAAAGT CGGCCAATCG CTTTTATCCT GTGAGCCTCG GATCGGAAGG GTCCTGCCAG
ATTGGCGGTA CGATCGCCAC CAACGCCGGT GGCACATCCG TCCTTCGATA CGGCACTACG
CGGGACAACG TGCTGGGCCT CGAAGTCGTG CTGCCCGATG GGACGATTTG GTCCGGTCTC
ACGGGGCTTC GCAAGAACAA CACGGGTTAT GACCTGAAGC ACCTTTTCAT CGGCTCCGAG
GGTACACTTG GCATCATAAC GGCGGCCGTG CTGAAGCTGC ATCCCTTTCC GGCGCGGACG
GCCGTCGCAT GGGCCGGACT GGATTGCCCG GAAGACGCGC TGAAGATGCT GACGCTCATC
CAGGGCACCT ACGGTGCCAA ACTTTCCGGA TTCGAGTTGA TGAACCGCCT GCAGCTCGAT
CTCGTCGTCA AGCACGTTCC TCAACGCCGA TCGCCGATCG AAACCGAGCA CGATTGGCAC
CTTCTCATCG AATTGTCGGA TTCCGGCGGT GAGGGTGATC TGGACGAGGC GCTGCAGGCC
GTTCTGGAGA AGGGCTTCTC GGCCGAGCTC GTTGCCAACG CAGTGATCGC CGCAAGCGAG
GCCCAGCGGG CTGCACTATG GGAAGTGCGA CATAGCGTTT CGGAGGCAAA CAAGAAGGCG
GGAGTCGGCC TGACGACAGA TTGCGCCGTT CCGGTCTCCG CAACGGCCGA CTTTATCAGC
AAGGCGACCG AGAACGTTCA TGCGATAGCC CCCGCCGCAT CCGTAGTTGT CGTCGGACAT
GTGGGGGACG GCAATATCCA CTTCATTCCG TTCTTCACCT TCGACGCCTG GAACGGCCTC
GCCGAGCGCG ACGCGCTTTC CCTACGTATC CGACACGCGG TCAACGACGT CGCCGCCGCT
CTCGGGGGCA CGTTCAGCGC GGAACATGGT GTCGGACAAG TGCAATTGGC TGAGATGGAC
CGGTACAAGC AGCCGGCCGA ACTCGCCCTG ATGCGGGTGG TTAAAGCCGC AATCGACCCT
AAAGGCCTCT TCAATCCAGG CCGTCTGCTT CCAAACACCT GA
 
Protein sequence
MNVLGQLAEI VGGEMIITGV AEMASATEDW RGRYHGAALC VTRPADTTQV SEIVACCHRH 
GVPVLPQGGN TGLVGGSVPA STGVAPVIVS LDRMRRIRSV DPVNSTIEVE AGCVLANVHD
AAKSANRFYP VSLGSEGSCQ IGGTIATNAG GTSVLRYGTT RDNVLGLEVV LPDGTIWSGL
TGLRKNNTGY DLKHLFIGSE GTLGIITAAV LKLHPFPART AVAWAGLDCP EDALKMLTLI
QGTYGAKLSG FELMNRLQLD LVVKHVPQRR SPIETEHDWH LLIELSDSGG EGDLDEALQA
VLEKGFSAEL VANAVIAASE AQRAALWEVR HSVSEANKKA GVGLTTDCAV PVSATADFIS
KATENVHAIA PAASVVVVGH VGDGNIHFIP FFTFDAWNGL AERDALSLRI RHAVNDVAAA
LGGTFSAEHG VGQVQLAEMD RYKQPAELAL MRVVKAAIDP KGLFNPGRLL PNT