Gene Saro_0914 details

Gene Information

Locus tag: Saro_0914
Symbol:
ID: 3918000
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 967462
End bp: 969150
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 62%
IMG OID: 640443648
Product: cytochrome-c oxidase
Protein accession: YP_496193
Protein GI: 87198936
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02891] cytochrome c oxidase, subunit I
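
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(967462, 969150)
    print(length, "bp")                      # Gene Length field
    print(protein_length_aa(length), "aa")   # Protein Length field
```

With the values from this record, 969150 - 967462 + 1 gives 1689 bp, and 1689 / 3 - 1 gives 562 aa, matching the Gene Length and Protein Length fields.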


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00560774
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCACCA TACCAGCTAA CGAAGGCTTC GATGCGCATC ACGATGCGCA CGACCACGAC 
CATGATCATC CGGGCTTCTT CGCCCGGTGG TTCATGTCCA CCAACCACAA GGACATCGGT
ACGATGTACC TGATCTTCGC GATCTTCGCG GGGGTCATCG GCGGCGCGGT TTCGGGCATG
ATGCGTGCCG AACTCGCACA GCCGGGCATC CAGTACCTGG GCATGTTCGC CGAATTCCTC
GGCGAGAAGA ACCCGAGCTT CGACCAGCAG CTCCACCTGT GGAACGTGCT GATCACCGCG
CACGGCCTGA TCATGGTGTT CTTCATGGTC ATGCCCGCGA TCATCGGCGG CTTCGGCAAC
TGGTTCGTTC CGATCATGAT CGGCGCGCCG GACATGGCGT TCCCGCGCAT GAACAACGTC
TCGTTCTGGC TGACCTTCGT CGGCTTCTGC TCGCTGATGT TCTCGGCCTT CGTGCCGGGC
GGCACCGGGC AGGGCGCCGG CACCGGGTGG ACGGTCTACG CGCCGCTCTC CACTTCGGGC
TCGGTCGGTC CGGCCGTCGA CTTCGCGATC TTCTCGCTGC ACCTTGCGGG CGCGGGTTCG
ATCCTGGGTG CGATCAACTT CATCACCACC ATCTTCAACA TGCGCGCGCC GGGCATGACC
CTGCACAAGA TGCCGCTGTT CGTGTGGTCT GTGCTGGTTA CCGCGTTCCT GCTGCTGCTG
TCGCTGCCGG TCCTTGCCGC CGCGATCACC ATGCTGCTGA CCGACCGCAA CTTCGGCACG
ACGTTCTTCG ATCCGGCCGG CGGCGGTGAC CCGGTTCTGT ACCAGCACCT GTTCTGGTTC
TTCGGCCACC CCGAAGTGTA CATCATGATC CTGCCGGGCT TCGGCATCAT CAGCCAGATC
ATCTCGACCT TCAGCCGCAA GCCGGTCTTC GGCTACCTCG GCATGGCCTA CGCCATGGTC
GCGATCGGCG TCGTCGGGTT CATCGTGTGG GCTCACCACA TGTACACCAC CGGCCTCGAC
GTGAACACGA AGATGTACTT CACCGCCGCA ACCATGGTCA TCGCGGTGCC GACCGGCATC
AAGATCTTCT CGTGGATCGC GACGATGTGG GGCGGCAGCC TCGAGTTCAA GTCGCCGATG
GTCTGGGCGA TCGGCTTCAT CTTCCTCTTC ACCGTGGGCG GCGTGACGGG CGTCGTGCTG
GCCAACGGCG GCGTGGACGA CAACCTGCAC GACACCTATT ACGTCGTCGC GCACTTCCAC
TACGTGCTGT CTCTGGGTGC CGTTACCGCG CTCTTCGCCG GGTTCTACTA CTGGTTCCCG
AAGATGAGCG GCCGCATGCA CTCGGAGTTC CTTGCGCACC TGCAGTTCTG GATCTTCTTC
GTCGGCGTGA ACCTGATCTT CTTCCCGATG CACTTCCTCG GGATCCAGGG CATGCCGCGC
CGCTATCCGG ACTACGCGCT GGCCTATGCC AAGTGGAACG AAGTGGCCTC GATCGGCTAC
GCCATCATGG CGGTATCGAT CGTGATCTTC TTCATCAACA TCCTCTACGC ATTCCTTGCC
GGCAAGAAGG CGGAAGCCAA CTACTGGGGC GAAGGTGCGA CGACGCTGGA ATGGACGCTG
TCCTCGCCGC CGCCGTTCCA CCAGTTCGAG ACACTGCCGG TGATCGACGA CAAGGCCCAC
GCACACTGA
 
Protein sequence
MATIPANEGF DAHHDAHDHD HDHPGFFARW FMSTNHKDIG TMYLIFAIFA GVIGGAVSGM 
MRAELAQPGI QYLGMFAEFL GEKNPSFDQQ LHLWNVLITA HGLIMVFFMV MPAIIGGFGN
WFVPIMIGAP DMAFPRMNNV SFWLTFVGFC SLMFSAFVPG GTGQGAGTGW TVYAPLSTSG
SVGPAVDFAI FSLHLAGAGS ILGAINFITT IFNMRAPGMT LHKMPLFVWS VLVTAFLLLL
SLPVLAAAIT MLLTDRNFGT TFFDPAGGGD PVLYQHLFWF FGHPEVYIMI LPGFGIISQI
ISTFSRKPVF GYLGMAYAMV AIGVVGFIVW AHHMYTTGLD VNTKMYFTAA TMVIAVPTGI
KIFSWIATMW GGSLEFKSPM VWAIGFIFLF TVGGVTGVVL ANGGVDDNLH DTYYVVAHFH
YVLSLGAVTA LFAGFYYWFP KMSGRMHSEF LAHLQFWIFF VGVNLIFFPM HFLGIQGMPR
RYPDYALAYA KWNEVASIGY AIMAVSIVIF FINILYAFLA GKKAEANYWG EGATTLEWTL
SSPPPFHQFE TLPVIDDKAH AH
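
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the pipeline used to produce this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it does not handle alternative start codons; the `translate` helper is a hypothetical name.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for layout
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGGCCACCA TACCAGCTAA CGAAGGCTTC"))
```

Applied to the first 30 bases of the gene sequence, this yields MATIPANEGF, the first ten residues of the protein sequence. The last nine bases, GCACACTGA, translate to AH followed by the stop codon TGA.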