Gene Sala_1698 details


Gene Information

Locus tag: Sala_1698
Symbol:
ID: 4081101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1788271
End bp: 1789713
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 66%
IMG OID: 638010072
Product: carotenoid oxygenase
Protein accession: YP_616744
Protein GI: 103487183
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
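
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1788271
END_BP = 1789713


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon (an assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, "bp")
    print(protein_length(n_bp), "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1443 bp) and Protein Length (480 aa).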


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.459113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGCC AGGTCGAAAC ACTGATCCGC AGCGCCGTCG TCAAAACCAT CGGCAAGGTC 
GCCGATTTCA ATCGCCGCCG CCTCCCCCGC CCCACCGACG CCCACCCCTT CCTCTCCGGC
ATCCACAGGC CGATGACCGA AGAATTGACG ATCGAGGCGC TGCGCGTCGA TGGCGAAATC
CCGGCCGCGC TACGCGGCCG TTACCTGCGC AACGGCCCGA ATCCGGCAAT GCCGCCCGAT
CCCGCCAGCT ATCACTGGTT CATCGGCGCC GGCATGGTCC ACGGCATCCG CATCGAAGGC
GGCCAGGCCG TCTGGTATCG CAACCGCTGG GTGCGTGGCA GCGAGGCGTG CGCTGCGCTT
GGCGAGGAGC TGCCGCCCGG CCCGCGCGAA GAGCGCAACG ACGCGCCGAA CACCAATGTC
GTGGGTCTCG CCGGGCGGAC GTTCGCAATC GTCGAGGCGG GCGGCACGCC GGTCGAGCTC
GATCATGAGC TCGGCACCAT CGCGCATAAC CCTTTCGACG GCACCCTCGC GGGTGCGTTC
ACCGCGCACC CACACGCCGA CCCCTTCACC GGGGAAACGC ACGCGATCAC CTATCGCAGC
GACGAACCGA ACAAGGTCTG GCACGTCGTG CTCGACGAAC AGGCGCATGT CGTGCGCGAA
GAACCGATTG CGGTCAGCGA CGGCCCCTCG ATCCACGACT GCGCGCTGAC CGAAAATTAC
GTGCTCGTCT TCGACCTGCC CGTTACTTTC TCGATGAAAC GGCTGCTCGC AGGTTATCGT
TTTCCCTATA TGTGGAACGA AAATCACCCG GCGCGCGTCG GCCTGCTCCC GCGCGAGGGC
CGTGGCGACG ACATCGTCTG GGTGCCGGTC GATCCCTGCT ATGTCTTTCA CCCCGCCAAC
GCCTTTGAAA CCGCGGACGG CCGGGTGATC GTCGATGTCG TCGCGCACGA AACGATGTTC
GCCACGTCGA AGCGCGGCCC CGACAGCGAA AAGTCGCGCA TGGAACGCTG GACGATCGAT
CCCGTCGCGC GCACGACGAC ACGCACCGTG ATCCACGACC ATGCGCAGGA GTTTCCGCGC
TATGACGAGC GGCTGACGAC GCGGCCCTAT CGCTATGTCT ACAGCATCGC GATCCCCGAC
GGCCGTTCAG CCGAATGGGC GCTCGCCGAT ACCCGGCTGT TCCGCCACGA TCTCGAAACG
GGCACGACCG CCATCCACGA CTTCGGCTCC GGCCGCCACC CCGGCGAGTT TGTATTCGTT
CCGCGCAAGG CGGCGGGCGC CGAGGATGAC GGCTGGCTGA TCGGCCTCGT CGTCGACATG
AACGACGAGA CCACCGACCT CGTCATCCTC AACGCCGACG ATTTCACCGG GCCGCCGCAA
GCCGTCGTCC ATCTGCCGCA CCGCGTTCCG CCGGGGTTTC ATGGCAATTG GGTCGCGGAC
TGA
 
Protein sequence
MASQVETLIR SAVVKTIGKV ADFNRRRLPR PTDAHPFLSG IHRPMTEELT IEALRVDGEI 
PAALRGRYLR NGPNPAMPPD PASYHWFIGA GMVHGIRIEG GQAVWYRNRW VRGSEACAAL
GEELPPGPRE ERNDAPNTNV VGLAGRTFAI VEAGGTPVEL DHELGTIAHN PFDGTLAGAF
TAHPHADPFT GETHAITYRS DEPNKVWHVV LDEQAHVVRE EPIAVSDGPS IHDCALTENY
VLVFDLPVTF SMKRLLAGYR FPYMWNENHP ARVGLLPREG RGDDIVWVPV DPCYVFHPAN
AFETADGRVI VDVVAHETMF ATSKRGPDSE KSRMERWTID PVARTTTRTV IHDHAQEFPR
YDERLTTRPY RYVYSIAIPD GRSAEWALAD TRLFRHDLET GTTAIHDFGS GRHPGEFVFV
PRKAAGAEDD GWLIGLVVDM NDETTDLVIL NADDFTGPPQ AVVHLPHRVP PGFHGNWVAD
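
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch using only the Python standard library, not the tool that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it strips the spaces used to group the sequence into blocks of ten. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon -> amino acid map in the conventional TCAG ordering.
# Assumption: table 11 uses the same codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGGCAAGCC AGGTCGAAAC ACTGATCCGC"))
```

Applied to the first three blocks of the gene sequence, the sketch yields the first ten residues of the protein sequence, MASQVETLIR.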