Gene Sala_2146 details


Gene Information

Locus tag: Sala_2146
Symbol:
ID: 4080143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2253691
End bp: 2255886
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 65%
IMG OID: 638010524
Product: catalase/peroxidase HPI
Protein accession: YP_617188
Protein GI: 103487627
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0376] Catalase (peroxidase I)
TIGRFAM ID: [TIGR00198] catalase/peroxidase HPI
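
Note: the start and end coordinates, the gene length, and the protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2253691, 2255886)
print(length_bp)                  # 2196
print(protein_length(length_bp))  # 731
```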


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.423847
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACC AGACCCCCAT CGGAAGCGGC TGCCCCGTCC ACCAGCCCGG CGGCGTTCGC 
TCGCTGCTCG GCCGCACCAA CAAGGACTGG TGGCCCGACA TGCTGGCGAC CGAGATACTG
ACTCCGAACG GGCCGTCGAA CCCGATGGGT GAGGATTTCG ATTATGCCAA GGCGTTCAAG
TCGCTCGACT ATTATGCGCT GAAGGACGAT CTCAAGGCGC TGATGACCGA CAGCCAGCCC
TGGTGGCCCG CCGATTATGG CCATTACGGG CCCTTTTTCA TCCGAATGGC GTGGCACGCC
GCGGGCACCT ATCGCACCGC CGACGGCCGC GGCGGCGCCA ACAGCGGGCA ACAGCGTTTC
GCGCCGCTCG ACAGCTGGCC CGACAACGGC AATCTCGACA AGGCGCGCCG CCTGCTTTGG
CCGATCAAGC AGAAATATGG CAACAAGATC AGCTGGGCCG ACCTGTTCAT CCTGGCTGGC
AATGTCGCGA TCGAAAGCAT GGGCGGTCCG GTGTTCGGCT TTGGCGGCGG GCGCGTCGAT
GTCTATGAAC CCGAGCGCGA CATCTATTGG GGCAGCGAAG ACAAATGGGT CAATCAGGGC
GTGCAGACGC GCATCGACCC GGCGAAGGGG ATGGAGACGA TCGAAGGTCC GCTCGCCGCG
ATCCAGATGG GCCTGATCTA CGTCAATCCC GAGGGGCCGC AGGGCAACCC CCACGACGAT
GAGGGGATGG CGCGCGACAT GAAGGAAACC TTCAAGCGCA TGGCGATGAA CGACGAGGAA
ACCGTTGCGC TCACCGCTGG CGGCCATACT TTTGGCAAGG CGCACGGCAA TGGCGACCCT
TCGCTGCTCG GCCCCGCGCC CGCGGGCAGC GACCTTGCCG CGCAGGGTTT CGGCTGGGTC
AGCAGCCACG AGAGCGGCGG CATCGGCGAA CATGCCGTCA CCAGCGGCAT CGAGGGCGCG
TGGACCAACA CCCCGCGCGA GTGGACCGAG AATTATTTCC GCCTGCTGTT CGACTATGAC
TATGAACTTG TGAAGTCGCC CGCCGGTGCC TGGCAGTGGC AGCCGATCAA CCAGAAAGAG
GAGGATATGG CCCCGGCGGC GTGGGATCCC GGCATCAAGG TCCCGACGAT GATGACCACC
GCCGACATGG CGCTGAAGCG CGATCCCGCC TATCGCGCGA TCAGCGAGCG GTTCCGCAAC
GACCATGAAG CCTTCAAGGA CGCCTTCGCG CGCGCCTGGT TCAAGCTCAC GCACCGCGAC
ATGGGGCCGA AGGTCCGTTA TCTCGGCCCC GAAGTCCCTG ACGAGGATCT GATCTGGCAG
GATCCGATCC CCGCGGGCAC CAAGCCCTCG GACGCCGAAG TTCAGGCGGT GAAGGACAAG
ATCGCCGCGA GCGGTCTGAC CGTCAGCCAG CTCATCAAGA CCGCCTGGGC GTCGGCCAGC
ACGTTCCGCA AGTCCGATTT CCGCGGCGGC GCCAATGGCG CGCGCGTGCG CCTCGCGCCG
CAAAAGGACT GGGAGGTCAA CGAACCCGCG ATGCTCGCCA GGGTGCTGGA CACGCTCGAT
GGCCTGCGCG GCAGCCTGTC GATGGCCGAT GCGATCGTGC TCGGCGGCGT GGTCGGGCTT
GAAAAGGCGA TCAGGGATGC GGGCTTCAAC GTCGCCGTGC CGTTTACGGG CGGCCGCGGC
GATGCGACGC AGGAGCAGAC CGACGTCGAA AGCTTTGAGG TGATGGAGCC CGAGGCCGAC
GCCTTCCGCA ACTATGTGGG CAAGAAGAAG CTCGCGGTGA AGGTGGAGGA AATGATGCTC
GACAAGGCGT CGCTGCTCGG CCTGTCGGTG CCCGAAATGA CCGTGCTGAT CGGCGGGCTG
CGGGTGCTCG GCGCCAATCA TGGCGAGCGC GGCCACGGCC ACTTCACCAG GCGGTCGGGT
CAGCTCACCA ACGATTTCTT CGTCAACCTG CTCGACATGA CCAATGTGTG GAAGGCGGTC
GAGGGATCGA ACGACCAGGA ATATGTCGCC ACCGACCGCA CGACCGGCGG CGAGACCTGG
CGCGCGACTC GGGCCGATCT GATCTTCGGT TCCAATTCGG AACTGCGCGC GGTGGCCGAA
GTCTATGCCG AGAACGGCCA TGAAGAGAAG TTCGTGCGCG ACTTCGTGAA GGCGTGGACC
AAGGTGATGA ACGCCGACCG TTTCGACCTC GCCTGA
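
Note: the GC content reported in the gene information can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over all bases, and that the spaces and line breaks in the listing are formatting only. The function name `gc_content` is illustrative.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the listing is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, pasted with its 10-base spacing.
first_row = "ATGAACGACC AGACCCCCAT CGGAAGCGGC TGCCCCGTCC ACCAGCCCGG CGGCGTTCGC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of the first row gives a value that can be compared with the 65% listed in the gene information.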
 
Protein sequence
MNDQTPIGSG CPVHQPGGVR SLLGRTNKDW WPDMLATEIL TPNGPSNPMG EDFDYAKAFK 
SLDYYALKDD LKALMTDSQP WWPADYGHYG PFFIRMAWHA AGTYRTADGR GGANSGQQRF
APLDSWPDNG NLDKARRLLW PIKQKYGNKI SWADLFILAG NVAIESMGGP VFGFGGGRVD
VYEPERDIYW GSEDKWVNQG VQTRIDPAKG METIEGPLAA IQMGLIYVNP EGPQGNPHDD
EGMARDMKET FKRMAMNDEE TVALTAGGHT FGKAHGNGDP SLLGPAPAGS DLAAQGFGWV
SSHESGGIGE HAVTSGIEGA WTNTPREWTE NYFRLLFDYD YELVKSPAGA WQWQPINQKE
EDMAPAAWDP GIKVPTMMTT ADMALKRDPA YRAISERFRN DHEAFKDAFA RAWFKLTHRD
MGPKVRYLGP EVPDEDLIWQ DPIPAGTKPS DAEVQAVKDK IAASGLTVSQ LIKTAWASAS
TFRKSDFRGG ANGARVRLAP QKDWEVNEPA MLARVLDTLD GLRGSLSMAD AIVLGGVVGL
EKAIRDAGFN VAVPFTGGRG DATQEQTDVE SFEVMEPEAD AFRNYVGKKK LAVKVEEMML
DKASLLGLSV PEMTVLIGGL RVLGANHGER GHGHFTRRSG QLTNDFFVNL LDMTNVWKAV
EGSNDQEYVA TDRTTGGETW RATRADLIFG SNSELRAVAE VYAENGHEEK FVRDFVKAWT
KVMNADRFDL A
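
Note: the protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the database's own method. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and that the sequence contains only A, C, G, and T. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate in frame 0 and stop at the first stop codon.

    Whitespace is ignored; a codon with a base other than A, C, G, T
    raises KeyError.
    """
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First ten codons and the final row of the gene sequence above.
print(translate("ATGAACGACC AGACCCCCAT CGGAAGCGGC"))          # MNDQTPIGSG
print(translate("AAGGTGATGA ACGCCGACCG TTTCGACCTC GCCTGA"))   # KVMNADRFDLA
```

Both results match the start and end of the protein sequence listed above.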