Gene Sare_2091 details


Gene Information

Locus tag: Sare_2091
Symbol:
ID: 5704670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2406012
End bp: 2407322
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 68%
IMG OID: 641271576
Product: monooxygenase, FAD-binding
Protein accession: YP_001536947
Protein GI: 159037694
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
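
The reported lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 2406012
end_bp = 2407322

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1311
print(protein_length)  # 436
```

Under these assumptions the results agree with the Gene Length and Protein Length fields.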


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00849976
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTCGGAG ACCGGGCCGT AGTGCTCGGT GGGAGTGTTA CCGGAATGTT CGCGGTGAGC 
GCGCTCGCGG AGGGCTACCG CGAGGTGATC GTGGTGGACC GTGACGAACT GATCGGCGTA
CGTGAGGCCA GGCGTGGTTC GCCGCAGGCA CGTCACATCA ACGGGCTGCT GGCCCGCGGG
GCGCGGGCGT TGGAGGACCT GTTCCCGGAG ATCACCGCCG AGATGGTGGC CGCCGGCTGT
CCGTTGACCG ACCTTGCCGG CACCGTGCGC TGGTACTTCA ACGGAAAGCC GCTGAAGCAG
ACCCGGGCCG GCCTCACCAA CGTGGCCGCC CGCCGGCCGG TTATGGAGGC GCACTGCCGG
GACCGGGTGC AGGCCATGCC GAACGTCCGA TTCATGGAGC GGTACGACAT CACCGGCCTG
GTGCACACCC CGGACGGGAG CAGGGTCACC GGCGTACGCG TTCAGCCCCA CGGGGACGGC
GGTGCCGAGG AGGTTCTGGA GGCCGATCTG GTCGTCGACA CGACCGGCCG TGGCTCCCGT
ACCCCGGTGT GGCTGGAGGC GATGGGCTAT CCGAGGGTCG AGGAAGAGGG CACCAAGATG
GGGCTCGGCT ACGCGACCCG GCACTACAAG CTCCGGTACG ACCCCTTCGG CACCGACCAC
TCGATCGTCT GCGTGGCGTC GCCGGCATCG CCCCGTGGTG CCATCTGCAC CAAGACCGAC
TCCAACACGG TTGAGTTGAC CACGTACGGC ATCCTTGGTG ACCACCCGCC GACCGATCCC
GACGGCTTCA ACGCCTTCGT GAAGACACTC GCTGCGCCCG AGATCTATGA GGCGATCATT
GACGCGGAAC CGCTCGACGA TCCGGTGTTG TTCCGCTTCC CGACCACCCT GCGGCGGCGG
TACGAGCGGA TGGGTCGCTT TCCCGAGGGC CTTGTCGTCA TGGGTGACGC GGTCTGCACC
CCGAACCCGG TGTTCGCCCA GGCGCAGACC CTCTCCGCGT TGCAGGCGCT CGCTCTCCGC
GACGAGCTGC GACGTGGGAT CGTGCCGAAC TCGACGGAGT TCATGGCCAC GGTCGGTCGC
ATCGTCGATC CCGCCTGGGA TATGACCGAA GGCATCAACC TGAGCTACCC GGGGGTCGAG
GGCAAGCGCA CCCGCAAGGT GTTGCTACTA CACGCGTACA TGCGTCGACT GCACGATGTG
GCGAGCCGGG ACGGAAGCGT GACCGAGGCG TTCATGCGGG CTGCCAGTCT GGTTGATTCA
CCGGCGGCCC TGATGCGTCC AGGCCTGGTG TGGCGGGTAC TGCGGGGCTG A
 
Protein sequence
MVGDRAVVLG GSVTGMFAVS ALAEGYREVI VVDRDELIGV REARRGSPQA RHINGLLARG 
ARALEDLFPE ITAEMVAAGC PLTDLAGTVR WYFNGKPLKQ TRAGLTNVAA RRPVMEAHCR
DRVQAMPNVR FMERYDITGL VHTPDGSRVT GVRVQPHGDG GAEEVLEADL VVDTTGRGSR
TPVWLEAMGY PRVEEEGTKM GLGYATRHYK LRYDPFGTDH SIVCVASPAS PRGAICTKTD
SNTVELTTYG ILGDHPPTDP DGFNAFVKTL AAPEIYEAII DAEPLDDPVL FRFPTTLRRR
YERMGRFPEG LVVMGDAVCT PNPVFAQAQT LSALQALALR DELRRGIVPN STEFMATVGR
IVDPAWDMTE GINLSYPGVE GKRTRKVLLL HAYMRRLHDV ASRDGSVTEA FMRAASLVDS
PAALMRPGLV WRVLRG
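
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The following is a minimal sketch, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first block of the gene sequence is embedded for brevity.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 60 bases of the gene sequence, as displayed above.
first_block = "ATGGTCGGAG ACCGGGCCGT AGTGCTCGGT GGGAGTGTTA CCGGAATGTT CGCGGTGAGC"
print(translate(first_block))  # MVGDRAVVLGGSVTGMFAVS
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way gives values that can be compared with the protein sequence and the GC content field.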