Gene Sala_1404 details


Gene Information

Locus tag: Sala_1404
Symbol:
ID: 4081760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1458198
End bp: 1459868
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 63%
IMG OID: 638009770
Product: cytochrome-c oxidase
Protein accession: YP_616451
Protein GI: 103486890
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02891] cytochrome c oxidase, subunit I
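
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1458198
END_BP = 1459868
GENE_LENGTH_BP = 1671
PROTEIN_LENGTH_AA = 556


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 1459868 - 1458198 + 1 = 1671 bp
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 1671 / 3 = 557 codons, minus the stop codon = 556 aa
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the reported gene length of 1671 bp and protein length of 556 aa.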


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.29775
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATA TCGCAGCGAC TGCACCCGCG CACGGCCATG CCGATCACGC GCACGACCAT 
GATCACGACA CTCCGGGCTT TTTCGTCCGC TGGTTCATGT CGACGAACCA CAAGGACATC
GGCACGCTCT ACCTGATCTT CGCGATCGTC GCGGGGATCG TGGGCGGCGT GCTGTCGGGC
ATGATGCGCC TCGAACTCGC CGAACCGGGC GTGCAATATC TGGGCGGCTG GGCCGAAATG
CTCGGCGGCG CGGGCGACGA GGTTGCGGGC AAGCATTTCT GGAACGTGAT GATCACCGCG
CACGGGCTGA TCATGGTCTT CTTCATGGTC ATGCCCGCGA TGATCGGCGG CTTCGGCAAC
TGGTTCGTGC CGCTGATGAT CGGCGCGCCC GACATGGCCT TCCCGCGCAT GAACAATGTC
TCCTTCTGGC TGACCTTCGT GGCGTTCGTC ATGCTCGTCG GATCGATGTT CGTGCCCGGC
GGCAGCGGCC TTGGCGCCGG CACCGGCTGG ACGGTCTATG CGCCGCTGTC GACCAGTGGT
TCGGCCGGGC CCGCGGTCGA CATGGCGATC TTCTCGCTTC ACCTTGCGGG CGCGGCGTCG
ATCCTCGGCG CGATCAACTT CATCACGACC ATCTTCAACA TGCGCGCGCC GGGCATGACG
CTGCACAAGA TGCCGCTGTT CGTATGGTCG GTGCTCGTCA CCGCCTTCCT GCTGCTGCTC
GCGCTTCCGG TGCTTGCCGC GGCGATCACG ATGCTGCTCA CCGACCGCAA TTTCGGCACC
ACCTTCTATG ATGCGGCGGG CGGCGGCGAT CCCGTGCTTT ACCAGCATCT CTTCTGGTTC
TTCGGCCACC CCGAAGTCTA TATCATGATC CTGCCGGGCT TCGGCATCGT CAGCCAGGTC
ATCTCGACCT TCAGCCGCAA GCCGGTGTTC GGCTATCTCG GCATGGCCTA TGCCATGGTG
GCGATCGGTG TCGTCGGCTT CATCGTGTGG GCGCACCACA TGTTCACCGT CGGCATGAGC
GTGAACCTGA AGATGTATTT CACCGCGGCG ACGATGGTGA TCGCGGTCCC GACCGGCATC
AAGATATTCA GTTGGATCGC GACGATGTGG GGCGGGTCGA TGAGCTTCAA GACCCCGATG
GTCTGGTCGC TGGGCTTCAT CTTCATGTTC ACCGTCGGCG GCGTGACCGG CGTGGTTCTC
GCCAATGGCG GCGTCGATAC CGCGCTTCAC GACACCTATT ATGTCGTCGC GCACTTCCAC
TATGTGCTTT CGCTCGGCGC GGTCTTCTCG CTCTTCGCCG GTTTCTATTA CTGGTTCCCG
AAGATGTCGG GCCGGATGTA CAACGAGGCG CTGGGCCAGC TGCACTTCTG GATCTTCTTC
ATCGGCGTGA ACGTGCTCTT CTTCCCCCAG CACTTCCTGG GTCAGCAGGG AATGCCGCGC
CGTTACCCCG ACTATCCCGA CGCCTATGCC TTCTGGCACC TCATCTCGTC CTGGGGCTAT
GTCATCATGG GCGTGGGCGT GCTGATCTTC TTCGCGAACA TTCTCTGGTC GCTGGTCGCG
GGGAAGAAGG CGGCGGACAA TCCGTGGGGC GAAGGCGCAA CGACGCTCGA ATGGACGCTG
TCGAGCCCGC CGCCGTTCCA CCAGTTCAAC GAACTGCCGC GTATCGCCTG A
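
The GC content field can in principle be recomputed from the gene sequence as displayed. This is a minimal sketch, assuming the sequence is pasted in with its 10-base spacing intact; `gc_content` is a hypothetical helper, and how the database rounds its reported 63% is not stated on the page.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence as displayed on the page.
    first_line = (
        "ATGACCGATA TCGCAGCGAC TGCACCCGCG CACGGCCATG CCGATCACGC GCACGACCAT"
    )
    print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 1671-base sequence instead of the first line gives the figure to compare with the GC content field.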
 
Protein sequence
MTDIAATAPA HGHADHAHDH DHDTPGFFVR WFMSTNHKDI GTLYLIFAIV AGIVGGVLSG 
MMRLELAEPG VQYLGGWAEM LGGAGDEVAG KHFWNVMITA HGLIMVFFMV MPAMIGGFGN
WFVPLMIGAP DMAFPRMNNV SFWLTFVAFV MLVGSMFVPG GSGLGAGTGW TVYAPLSTSG
SAGPAVDMAI FSLHLAGAAS ILGAINFITT IFNMRAPGMT LHKMPLFVWS VLVTAFLLLL
ALPVLAAAIT MLLTDRNFGT TFYDAAGGGD PVLYQHLFWF FGHPEVYIMI LPGFGIVSQV
ISTFSRKPVF GYLGMAYAMV AIGVVGFIVW AHHMFTVGMS VNLKMYFTAA TMVIAVPTGI
KIFSWIATMW GGSMSFKTPM VWSLGFIFMF TVGGVTGVVL ANGGVDTALH DTYYVVAHFH
YVLSLGAVFS LFAGFYYWFP KMSGRMYNEA LGQLHFWIFF IGVNVLFFPQ HFLGQQGMPR
RYPDYPDAYA FWHLISSWGY VIMGVGVLIF FANILWSLVA GKKAADNPWG EGATTLEWTL
SSPPPFHQFN ELPRIA
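
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below assumes that table 11 assigns amino acids exactly as the standard genetic code does and differs only in its permitted start codons, which does not matter here because this gene begins with ATG. `CODON_TABLE` and `translate` are hypothetical names for illustration.

```python
from itertools import product

# Amino acids for all 64 codons, in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate("ATGACCGATA TCGCAGCGAC TGCACCCGCG"))  # MTDIAATAPA
```

The first ten residues match the start of the protein sequence, and the last line of the gene sequence translates to SSPPPFHQFNELPRIA followed by the TGA stop codon.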