Gene Sala_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1008 
Symbol 
ID4081696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1033569 
End bp1035065 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content62% 
IMG OID638009368 
Productcarotenoid oxygenase 
Protein accessionYP_616058 
Protein GI103486497 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0158946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.495511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCC AGCTTGTCGA AACGATCCAG TCGACGCTTC AGCCGAACGA CCATCCCTAT 
ATGCAGGGTG CGTGGCGTCC GACCTATAAT GAGTGGAACG CGATATTTTC CAATGGCGAT
GCCGAGGTGA TCGGGACGAT CCCCGATGAC ATCGACGGCG TCTATGTTCG CACCGGCGAG
AACCAGATTC ACGAGCCGAT CGGCCGTTAC CACCCCTTCG ACGGCGATGG TTTCATTCAC
GCCATCTCGT TCAAAGGAGG CCGCGCGAGC TATCGCAGCC GCTTCGTGCG GACCAAGGGA
TTCGCGGCCG AGAAGGAAGC CGGGCGTTCG TTATGGGCGG GCCTGATGGA GCCGCCGCAC
AAGTCGACGC GCCCCGGCTG GGGGGCGCAG GAGTGGCTCA AGGATTCCTC CTCGACCGAT
GTGGCGATCC ATGCGGGGAA GATCATCTCG ACCTTCTATC AGTGCGGCGA AGCCTACCGG
CTCGATCCCT TCACGCTCGA ACAGTTCGGC ACCGAAAGCT GGGTGCCGCT CGACGGCATT
TCGGCGCATT GCAAGGTCGA CCTCGCGACG GGCGAGCTGA TGTTCTTCAA TTACTCGAAA
CACGCCCCCT ACATGCACTA CGGTGTCGTC GGCGCGGACA ACAAGCTCAA ACATTATATC
CCGGTGCCGC TGCCCGGACC ACGGCTGCCG CACGACATGG CGTTCACCGA ACATTACACG
ATCCTCAACG ACATGCCGCT CTATTGGAAT GAGGAGCTGC TCAAAAAGAA TCTGCACGTC
GTCCAGTTCC ACCCCGACCA GAAAACGCGC TTCGCGATCA TCCCGCGCCA CGGCCAGCCC
GAAGACATCC GCTGGTTCGA GGCCGAGCCG ACCTACACGC TCCACTGGCT CAACGCCTGG
GAGGAAGGCG ACGAGATCAT CCTCGACGGC TATTATCAGG AAGAGCCGAT GCCCAAATCC
TATCCGAACG CGCCCGAGGG GCTCGAACGG ATGATGGCCT ATCTCGATCA GGGGCTTTTG
AAGCCGCGCC TCCACCGCTG GCGCTTCAAC CTCAAGACCG GCGCGACGGT CGAGGAGCGG
CTTGACGACC GCGATCTTGA GTTCGGCATG TTCAATCATC GCTACGCGGG CAAGCCCTAT
CGCTACGCCT ATAGCGCGAT CCCCGAGCCC GGCTGGTTCC TGTTTCGCGG GATCGTCAAG
CACGATCTCG ATTCCCGGAC GAGCGAGGCA TATGAGTTCG GCCGCGGTCG TTTCGGCAGC
GAGGCACCGT TCGCGCCGCG CATCGGCGCC AGGGACGAGG ACGACGGCTA CCTCGTATCC
TTCATCGCGG ATCTTGAAAC CGACCAGTCC GAATGCGTGC TGATCGACGC GAAGAATATA
ACGGCGGGGC CGGTGTGCCG GATCATCCTG CCCGAGCGCA TCTGTTCGGG CACGCACAGC
GTGTGGGCGA GCGGCAATGA TATCGGCATG GGCGAAAACA GCGTGCTGGC CGCCTGA
 
Protein sequence
MTAQLVETIQ STLQPNDHPY MQGAWRPTYN EWNAIFSNGD AEVIGTIPDD IDGVYVRTGE 
NQIHEPIGRY HPFDGDGFIH AISFKGGRAS YRSRFVRTKG FAAEKEAGRS LWAGLMEPPH
KSTRPGWGAQ EWLKDSSSTD VAIHAGKIIS TFYQCGEAYR LDPFTLEQFG TESWVPLDGI
SAHCKVDLAT GELMFFNYSK HAPYMHYGVV GADNKLKHYI PVPLPGPRLP HDMAFTEHYT
ILNDMPLYWN EELLKKNLHV VQFHPDQKTR FAIIPRHGQP EDIRWFEAEP TYTLHWLNAW
EEGDEIILDG YYQEEPMPKS YPNAPEGLER MMAYLDQGLL KPRLHRWRFN LKTGATVEER
LDDRDLEFGM FNHRYAGKPY RYAYSAIPEP GWFLFRGIVK HDLDSRTSEA YEFGRGRFGS
EAPFAPRIGA RDEDDGYLVS FIADLETDQS ECVLIDAKNI TAGPVCRIIL PERICSGTHS
VWASGNDIGM GENSVLAA