Gene Sare_3904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3904 
Symbol 
ID5704977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4444717 
End bp4446057 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content71% 
IMG OID641273329 
Productkynurenine 3-monooxygenase 
Protein accessionYP_001538686 
Protein GI159039433 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCGG ACCACGACGA CGAGGTCGCC GTCGTCGGCG CCGGGCTCTC CGGATGCTTG 
CTCGCTGCCT TCCTCGCCCG GCGCGGCTAC CCGGTCACCC TCTACGAACG CCGGCCCGAC
CCCCGGACCG GTCGGGTCGA CCGGGGTCGC TCCATCAACC TGGCGCTCTC CGAACGCGGG
CTGGACGCGT TGCGCCGGAT CGGCCTGGAC GCACGGGTGA TGGCCGAGGC GTTGCCGATG
CGTGGCCGGA TGATCCACCC GGTTGACGGC GAGCCGCAGT TCCAGGCGTA CAGCGCGGCC
GGGGACCGGG CGATCAACTC GATCAGCCGG GGGGCGTTGA ACAACGCCCT GCTGACCGAG
GCCGCCGCGC TGCCGGGCGT ACAGGTCGCC TTCGACCACC GGCTGGTCGA CCTCGACCCG
GGTACTGGGG AGATGACCTT CGAGACTCCG CAGGGCAAGG TCACCGCTAC CGCGCCAGTG
GTCCTGGGCG CGGACGGGGC CGGCTCGGCG GTCCGTGGGC AGTTGCTCGG CCACGGGCTG
CTGCGGGAGA GCCTGGACTT TCTCGACTAC GGCTACAAGG AGCTGACCAT TCCGCCGTTG
GGCGGAAACT TCGCGCTCGA CCCGGAGGCA CTGCACATCT GGCCGCGGGG TACCTCGATG
ATGATCGCGC TGCCGAACCC GGACCGCTCC TTCACCTGCA CGCTTTTCTG GCCCACCCAC
GGCACGGCGA GTTTCGCCTC GCTGGGCAGC CCGGCGGCGA TCGAGCGGCA CTTCGCGCAG
CACTACCCCG ACCTGCTCCC GCTCGCGCCG AACCTGGTCG ACGAGTATCT GCACAACCCG
GTCGGGGTGC TCGGTACCGT CCGTTGCGAC CCCTGGCAGG TGAACGGGAC CGTCGGGCTG
CTCGGCGACG CGGCGCACGC CATCGTGCCG TTCTACGGGC AGGGCGCCAA CTGTGCCTTC
GAGGACGTGG TGGAACTGGA CCGCTGCCTC GACGAGTGTG CCGACGACTG GTCCGCCGCC
CTCCCGTTGT ACCAGCACCG CCGACAGGGC AACGCCGAGG CGATCGCCCA GATGGCGCTG
GCCAACTTTG TCGAGATGCG GGACCGGGTC GCCTCGCCGC TGTTCCAGCT CGGCCGAAGG
GTGGAGCACA CGCTGGAGCG GGCGTTGCCC GGCCGGTACG TGTCCCGGTA CGAGCTGGTG
TCCTTCTCGA CCACCCCATA CGCCGAGGTG CGCCGTCGGG TCCGCTATCA ACACCAGGTG
CTCGGTGCGG TGGTCGGGGG TGCGGCGGCC CTGCTGGCCG GCGCGGTCGG GGCGGCGCTC
CGGCGACGGA GGCGCGGATG A
 
Protein sequence
MSADHDDEVA VVGAGLSGCL LAAFLARRGY PVTLYERRPD PRTGRVDRGR SINLALSERG 
LDALRRIGLD ARVMAEALPM RGRMIHPVDG EPQFQAYSAA GDRAINSISR GALNNALLTE
AAALPGVQVA FDHRLVDLDP GTGEMTFETP QGKVTATAPV VLGADGAGSA VRGQLLGHGL
LRESLDFLDY GYKELTIPPL GGNFALDPEA LHIWPRGTSM MIALPNPDRS FTCTLFWPTH
GTASFASLGS PAAIERHFAQ HYPDLLPLAP NLVDEYLHNP VGVLGTVRCD PWQVNGTVGL
LGDAAHAIVP FYGQGANCAF EDVVELDRCL DECADDWSAA LPLYQHRRQG NAEAIAQMAL
ANFVEMRDRV ASPLFQLGRR VEHTLERALP GRYVSRYELV SFSTTPYAEV RRRVRYQHQV
LGAVVGGAAA LLAGAVGAAL RRRRRG