Gene Sare_2333 details

Gene Information

Locus tag: Sare_2333
Symbol:
ID: 5707961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2685508
End bp: 2687118
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 67%
IMG OID: 641271811
Product: monooxygenase FAD-binding
Protein accession: YP_001537182
Protein GI: 159037929
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
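
The coordinates and lengths listed above can be cross-checked against each other. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2685508, 2687118)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Both values agree with the Gene Length and Protein Length fields of this record.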


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.307173
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACAAACT CCGGCGAGTG GACCGACGTG CTGATCGTGG GCGGCGGCCC GGTCGGGATG 
GCGTTGGCAC TGGACCTGAG GTACCGCGGA ATCGACTGCA TGGTTGTCGA AGCGGGCGAC
GGCGAGGTCC GGCACCCCAA GGTGAGTACG ATCGGCCCGC GCTCGATGGA GCTGTTTCGC
CGTTGGGGTC TCGCGGACAC CATCCGTAAC GCGGGATGGC CCGCCGACCA TCCCCTGGAC
ATCGCCTGGG TCACCAGAGT CGGTGGCCAC GAGGTGCACC GGTATCGGCG GGGCACGACG
GCGAACCGTG GGCCCTACGT ACACACCCCT GAGCCCGACC AGATCTGTCC GGCGCACTGG
CTGAATCCAC TGTTGCAACG GGCCGTCGGC GTGCACCCCA CCGGTCCACT GCGGCTCAGG
ACGACCGTGG ACCGCGTGCG TCAGGCGGCC GACCATGTCG AGGCCACCCT CGTCGAGGAC
GGTTCCGGAA GGACCGGCAC CGTCCGCGCA CAATTCCTGG TTGCCTGCGA CGGGGCTGCC
TCACCCATTC GGCAGGCCTG CGGCATCGAC GCACCGCCAC GCCACACGAC GCAGGTCTTT
CGGAACATCC TCTTCCGTGC CCCGGAACTC AAGCGGCAAC TCGGTGACCG CGTCGCCCTT
GTCTACTTCC TTGTGCGGTC ATCCACATTG CGCTTCCCCA TGCGCTCGCT CAACGGCAGT
GACCTCTACA ACCTGGTCGT TGGTGTGGAC GCTGCCGCTC AGGTGGATGG CAGGTCGTTG
ATCACCGATG CCATCGCCTT CGACACACCA GTGGAGCTGC TCAGCGACAG CCAGTGGCAT
CTCACGCACC GGGTGGCCGA CAGCTACCGA GCGGGACGGG TCTTCCTCGC GGGTGACGCC
GCCCACACAC TCTCGCCCTC TGGTGGCTTC GGGCTCAACA CCGGATTCGG CGATGTCGCC
GATCTGGGCT GGAAGCTCGC CGCCGCACTG AACGGCTGGG CGGGGTGCCA CCTGCTGGAC
ACGTACGAAG CCGAGCGCAG GCCGATCGCC CTGGAGAGCC TGCAGGAGGC GAACCTCAAC
CTGCAACGCA CCATGCGCCG GCACGTTCCG GCCGAGATCC ACGCGGACGG TCCGGCGGGT
GAACAGGCTC GCGCGGAAAT GGCCGAGCAG CTGGTGCGCG GCGGAGCGCA TCGCGAGTTC
GACGCACCTG AGATTCACTT CGGGCTGTCC TACCGATCCC CGGCCGTTAT CTCTGACCCA
CTGGTCCCAC CCCGTCAGGG CCAGCCGGAT GCCGCCTGGC GCCCGGGCAG CGATCCCGGC
TACCGCGCCG CGCATGCCTG GTGGGATACC GAGACCTCCA CACTCGACCT GTTCGGTCAC
GGCTTCGTCC TGCTGTCCTT TACCGAGGGG GCAGACGTGT CTGCCGTGGA GCGGGCATTC
GCCAAACGAG CCGTACCGCT GACCGTTCGA CGCGGGAGCG ACCCGGAGAT AGCCAAGCTC
TACGAGCGTT CTCTCGTGCT GGTTCGTCCC GATGGCCATG TGGCCTGGCG AGGCGACGAA
CTGCCCGCTG ATCTGGGGAA GTTCGTCGAC ACGATCCGAG GCGAATATTG A
 
Protein sequence
MTNSGEWTDV LIVGGGPVGM ALALDLRYRG IDCMVVEAGD GEVRHPKVST IGPRSMELFR 
RWGLADTIRN AGWPADHPLD IAWVTRVGGH EVHRYRRGTT ANRGPYVHTP EPDQICPAHW
LNPLLQRAVG VHPTGPLRLR TTVDRVRQAA DHVEATLVED GSGRTGTVRA QFLVACDGAA
SPIRQACGID APPRHTTQVF RNILFRAPEL KRQLGDRVAL VYFLVRSSTL RFPMRSLNGS
DLYNLVVGVD AAAQVDGRSL ITDAIAFDTP VELLSDSQWH LTHRVADSYR AGRVFLAGDA
AHTLSPSGGF GLNTGFGDVA DLGWKLAAAL NGWAGCHLLD TYEAERRPIA LESLQEANLN
LQRTMRRHVP AEIHADGPAG EQARAEMAEQ LVRGGAHREF DAPEIHFGLS YRSPAVISDP
LVPPRQGQPD AAWRPGSDPG YRAAHAWWDT ETSTLDLFGH GFVLLSFTEG ADVSAVERAF
AKRAVPLTVR RGSDPEIAKL YERSLVLVRP DGHVAWRGDE LPADLGKFVD TIRGEY
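
The relationship between the gene sequence and the protein sequence can be checked with a short script. The following is a minimal sketch, not the tool used to produce this record. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate` and `gc_percent` are hypothetical. The example input is the first 60 bases of the gene sequence above, which should give the first 20 residues of the protein sequence.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... matches the amino acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_row = "ATGACAAACT CCGGCGAGTG GACCGACGTG CTGATCGTGG GCGGCGGCCC GGTCGGGATG"
print(translate(first_row))  # MTNSGEWTDVLIVGGGPVGM
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content field listed in this record.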