Gene Sala_2865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2865 
Symbol 
ID4080658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3016583 
End bp3017839 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content64% 
IMG OID638011249 
Productlinalool 8-monooxygenase 
Protein accessionYP_617903 
Protein GI103488342 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.961693 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.635546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCACCC AGATTCGCAT CGCCCCCGAA ACCGTCAATC CGCTCGACGT CAGCCGCGCC 
GAGCTGTGGA GCGAGGATCG CTGGCAGGAA CCGATGCGCC AGTTGCGCGC CGAATCACCC
ATCTATTATT GCCCCGAATC GAAGTTCGGC CCCTATTGGT CGGTGACGAC GTACAAACCG
ATCCAGCATA TCGAGGCGCT GCCGAAGATT TTCTCCTCGA GCTGGGAATA TGGCGGCATC
ACCGTCGCCG GCGACGGGGT CGAGCATCTG AAAGAGGGCG AGATCCCGAT GCCGATGTTC
ATCGCGATGG ACCCGCCGCA GCACACCGCG CAGCGCCGCA CCGTCGCCCC CGCCTTCGGT
CCGTCGGAGA TCGAGCGGAT GCGCGCCGAC ACGCAGGCGC GCACCGCCGC GCTGATCGAC
ACCCTGCCCG TCGGCGAGGC GTTCGACTGG GTCGAGAAAG TGTCGATCGA GCTGACCACC
GACATGCTCG CGATCCTGTT CGACTTTCCC TGGGCCGACC GCCACAGGCT GACCGCCTGG
TCCGACGCGC TCGGCGACAT CGAAAGCTTC AACACGCTCG AAGAACGGCA GGCGCGGCTC
GCGACCGCCT TTGAAATGGG CGCGGCGTTC AAGGAGTTGT GGGACCACAA GGCCAGGAAT
CCGGGCAAGC ACGACCTCAT CTCGATCATG CTTCAGTCGG ATGCGATGAG CCATATGAGC
CATGAGGAGT TCATGGGGAA TCTCATTCTG CTGATCGTCG GCGGCAACGA CACGACGCGC
AACTCGATGT CGGCCTATGC CTATGGCCTG CACTGCTTTC CCGAGGAACG CGCCAAGCTG
GAGGCGAACC ACGATCCCGA ACTGGCGGTC AATGCGATGC ACGAGATCAT CCGCTGGCAG
ACGCCGCTCG CGCACATGCG CCGCACCGCG ATGGAGGACA CCGAACTGTT CGGCCACCAG
ATCAAGGCGC GCGACAAGAT CGCGCTCTGG TATGCCTCGG CGAACCGCGA CGAAAGCATC
TTTCCCGACG GCGACCGCAT CATCGTCGAC CGCGAAAATG CGCGCCGCCA CCTTGCCTTC
GGCTATGGCA TCCACCGCTG CGTCGGCGCG CGCGTCGCCG AACTGCAGCT TACGACGCTG
ATTTCGGAGA TGCAGAAGCG CCGCCTGCGC GTCAACGTCG TCGGCGAGCC CCAGCGCGTC
CATGCGTGCT TCGTCCATGG CTATCGACAC CTGCCGGTCG AACTCGAACG ATATTGA
 
Protein sequence
MATQIRIAPE TVNPLDVSRA ELWSEDRWQE PMRQLRAESP IYYCPESKFG PYWSVTTYKP 
IQHIEALPKI FSSSWEYGGI TVAGDGVEHL KEGEIPMPMF IAMDPPQHTA QRRTVAPAFG
PSEIERMRAD TQARTAALID TLPVGEAFDW VEKVSIELTT DMLAILFDFP WADRHRLTAW
SDALGDIESF NTLEERQARL ATAFEMGAAF KELWDHKARN PGKHDLISIM LQSDAMSHMS
HEEFMGNLIL LIVGGNDTTR NSMSAYAYGL HCFPEERAKL EANHDPELAV NAMHEIIRWQ
TPLAHMRRTA MEDTELFGHQ IKARDKIALW YASANRDESI FPDGDRIIVD RENARRHLAF
GYGIHRCVGA RVAELQLTTL ISEMQKRRLR VNVVGEPQRV HACFVHGYRH LPVELERY