Gene Sala_2861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2861 
Symbol 
ID4080654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3012207 
End bp3013742 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content63% 
IMG OID638011245 
Producthypothetical protein 
Protein accessionYP_617899 
Protein GI103488338 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID[TIGR00562] protoporphyrinogen oxidase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.245537 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAG CCCTTGGTTC ACAAAACCAC CCGACGAAGG TTGACGTGGC GATCATCGGC 
GCGGGACCGG CCGGCCTCAC CGCCGCCTAT CTGCTGACCA GGCAGGGCTA TGGCGTGACG
GTGATCGAAA AGGATCCGAC CTATGTCGGC GGGATCAGCC GCACGGTCGA ACATGAGGGC
TTTCGTTTCG ACATCGGCGG GCACCGCTTT TTCTCCAAAA GCCGCGAAGT CGTCGATCTG
TGGAACGAGA TTCTGCCCGA CGACTTCATC GAGCGCCCGC GGATGAGCCG CATCTATTAT
GAGGGCAGGT TTTACAGTTA TCCGCTGCGC GCGTTCGAGG CGCTGTGGAA CCTCGGCGTC
GTGCGGTCGA CCTTGTGCAT GATGAGCTAC GCCAGGGCGA AGCTGCTGCC GAAGAAGACG
GTGCGCAGTT TCGAGGACTG GACGATCAAC CAGTTCGGGA ACAAGCTTTA TTCGATCTTT
TTCAAGACCT ATACCGAAAA GGTGTGGGGA ATGCCGTGTG ACGAAATGTC GGCCGACTGG
GCGGCGCAGC GCATCAAGGG GCTTTCGCTC GGCGGCGCAG TGATCGACGG GCTGAAACGC
AGCCTGGGGC TCAACAAGAA GCCCAATGAC GGCATGGCGA CCAAGACCTT GCTCGAAACC
TTCCGTTATC CGCGGCTCGG GCCGGGCATG ATGTGGGAGG CGGCGGCGCG CAAGGTCGTG
GCGGGCGGCA ATCATCTGCT GATGGGCCAC AGCTTCAAGC AGCTGACGCA GGACCAGAAA
ACCGGGCGCT GGCGGATGAC CGCGACCGGG CCCGACGGCG ATGTCGTGAT CGACGCGGGC
CATGTCATCA GCTCGGCGCC GATGCGCGAA CTGGCGGCGC GCATCCATCC GCTCCCGGCC
TGCGCGCTGA CCGCGGCGCC CAAGCTCAAT TACCGCGACT TCCTGACCGT CGCGCTCAAG
ATTCGGTCGG AGGATCTGTT CCCCGACAAC TGGATCTATA TCCACGACAG CCAGGTGCGG
GTCGGCCGGG TGCAGAACTT CCGCAGCTGG TCGCCCGAAA TGGTCCCCGA TCCGGCGATC
GCCTGCGTCG GCCTCGAGTA TTTCTGTTTC GAGGGCGACG GACTCTGGTC GTCGAGCGAC
GCCGATCTCA TCGCCCTGGC GACGAAGGAA ATGGCGATCC TTGGCCTGTG CAAGGCCGAG
GATGTCGTCG GCGGCGCGGT GGTGCGGCAG GAAAAGGCCT ATCCCGTCTA TGACGATGAC
TATGCCGCGC ATGTCGAGAC GATGCGGTCC GAGCTGGAGG CGCGGTATCC GACGCTGCAC
ATGGTCGGGC GGAACGGCAT GCACCGCTAT AACAACCAGG ATCATGCGAT GATGACGGCG
ATGCTGACCG TGCGCAACAT CGTCGCCGGC GAGAAGCTTT ACGACATCTG GGGCGTCAAC
GAGGACGCCG AATATCATGA AAGCGGCACC GAGGGTGAAC GCGCCGCGCT GGCGTCGGTG
CGCGACGTCC CGGCGCGCGT CGCCAGGGCC GCCTGA
 
Protein sequence
MSEALGSQNH PTKVDVAIIG AGPAGLTAAY LLTRQGYGVT VIEKDPTYVG GISRTVEHEG 
FRFDIGGHRF FSKSREVVDL WNEILPDDFI ERPRMSRIYY EGRFYSYPLR AFEALWNLGV
VRSTLCMMSY ARAKLLPKKT VRSFEDWTIN QFGNKLYSIF FKTYTEKVWG MPCDEMSADW
AAQRIKGLSL GGAVIDGLKR SLGLNKKPND GMATKTLLET FRYPRLGPGM MWEAAARKVV
AGGNHLLMGH SFKQLTQDQK TGRWRMTATG PDGDVVIDAG HVISSAPMRE LAARIHPLPA
CALTAAPKLN YRDFLTVALK IRSEDLFPDN WIYIHDSQVR VGRVQNFRSW SPEMVPDPAI
ACVGLEYFCF EGDGLWSSSD ADLIALATKE MAILGLCKAE DVVGGAVVRQ EKAYPVYDDD
YAAHVETMRS ELEARYPTLH MVGRNGMHRY NNQDHAMMTA MLTVRNIVAG EKLYDIWGVN
EDAEYHESGT EGERAALASV RDVPARVARA A