Gene Sala_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2021 
Symbol 
ID4079958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2131015 
End bp2132301 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content64% 
IMG OID638010397 
Productcytochrome P450 
Protein accessionYP_617065 
Protein GI103487504 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00496503 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAAAAC CTATATTGCC CGGAGAGGTC GCCGCCGCGG TCGTCAATCC CGCCGCCTAT 
GGGGCGTGGA AGCCGCTTCA TGAACAGCTT GCCTGGGCGC GGGCGAACAT GCCGCTGGCG
GTTGCGGAGA ATCCGAACCA CGATCCTTTC TGGCTCGTCA CGCGCCATGC CGACGTCATG
GCGATCAGCC GCGATCCGCA ACGCTTTGCC AACGGCATCC GGCCGACGGT GCTGACCGAC
CGCGCGGGGG AGGCGCTGGC GCGCGCGGCG ACGCCGGGGG GCGATGGCCA TCTGGTTCGC
TCGCTCGTCC AGATGGATGC GCCCGATCAT ATGAAATACC GACTGCTGAC GCAGAGCTGG
TTCATGCCCA GGAATCTGAA GACGATCGAG GACCGGATTC GGCAGATCGC GCGCGACACC
GTTGAGCACA TGCTGGAGGC AGGGGGATCA TGCGATTTCG CGCGCGATGT GGCGGCGCAT
TATCCGCTGC GCGTCATCAT GGATATATTG GGCGTGCCGC CCGAGGACGA ACCGCGGATG
CTGATGCTGA CGCAGCAATT GTTCGGACCG ACCGATCCCG AACTCAACCG CAGCCGTGAA
GCAATCACCA GTTCCGAACA GGCGATCGCG ATGCTGCATT ATGTCATCGC GGACTTCGAG
GCGTATTTCG GGGCGCTGAC CGCCGACCGC CGCGCCAACC CGCGCGAGGA TATTGCGACG
GTGATCGCCA ATGCCATGGT CGATGGCGAG CAGATTCCCG ACCGCGAACT CGCCGGCTAT
TATATGATCA TCGCGACCGC GGGCCACGAC ACGACGAGCG CGTCCACCGC CGGGGCGATC
ATGGAACTGG CCCGCAATCC CACGCTGTTT CAGCGGTTTC GCGATGCGGA GAGCGACAAG
GCGGGGCTGA TCGAGGAAGC GATCCGCTGG ACGACGCCGG TGCAGCATTT CATGCGCAGT
GCCCGGCAAG ATGTCGAAAT GGGCGGGCAG ACGATCCGCG AAGGCGACTG GCTGATGCTG
AATTATGTTT CCGCAAACCG CGACGAGGGG GTCTTTGTCG ATCCGTTCAT GTTCGATCCT
GACCGCGCGA AGAACCCGCA GATCGCCTTT GGTTTCGGCG CGCATGTCTG CCTGGGGCAG
CATCTGGCGC GGCTGGAGAT GCGGATTTTG ATGGAGGAGT TACTGCCGCG GCTGACCAGC
CTGGAGCTGG CGGGCGAGCC CGCGCGCGTC GAATCGGTGT TCGTCGGCGG GCTGAAGCGG
CTGCCGATCC GGTTTGAAGC GGCGTAG
 
Protein sequence
MTKPILPGEV AAAVVNPAAY GAWKPLHEQL AWARANMPLA VAENPNHDPF WLVTRHADVM 
AISRDPQRFA NGIRPTVLTD RAGEALARAA TPGGDGHLVR SLVQMDAPDH MKYRLLTQSW
FMPRNLKTIE DRIRQIARDT VEHMLEAGGS CDFARDVAAH YPLRVIMDIL GVPPEDEPRM
LMLTQQLFGP TDPELNRSRE AITSSEQAIA MLHYVIADFE AYFGALTADR RANPREDIAT
VIANAMVDGE QIPDRELAGY YMIIATAGHD TTSASTAGAI MELARNPTLF QRFRDAESDK
AGLIEEAIRW TTPVQHFMRS ARQDVEMGGQ TIREGDWLML NYVSANRDEG VFVDPFMFDP
DRAKNPQIAF GFGAHVCLGQ HLARLEMRIL MEELLPRLTS LELAGEPARV ESVFVGGLKR
LPIRFEAA