Gene Sare_0983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0983 
Symbol 
ID5703709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1107034 
End bp1108791 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content65% 
IMG OID641270498 
Productcytochrome c oxidase subunit I type 
Protein accessionYP_001535885 
Protein GI159036632 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.615245 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000770913 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGACCACCG TCGCACCGAA GCCGGTCGTG ACCCGGCCCT GGCCGGTCCG AGAGCCGGTC 
AAGGGTTCGG CTTTCGCCCG GCTGCTGCGT ACCACGGACG CGAAGCAGAT CGGGATCATG
TATATGGTCA CCGCGTTCGT ATTCTTCATG ATCGGCGGCC TGATGGCCCT GATCATGCGG
GCCGAGCTGG CGCGGCCCGG GCTGCAGTTC CTGACGCCCG AGCAGTTCAA CCAGCTGTTC
ACGATGCACG GCACGATCAT GCTGCTGTTC TTCGCGACGC CGATCGTGTT CGCCTTCGGT
AACTACATCG TGCCGATCCA GATCGGCGCG CCGGACGTCT CGTTCCCGCG GCTGAACAGC
TTTGCCTACT GGCTCTTCCT GTTCGGCGGC ACGTTGGCCA CCGCCGGCTT CATCACGCCC
GGTGGTGCCG CCGACTTCGG CTGGTTCGCG TACACCCCGC TGAGCAGCGT GGAGCACTCC
CCGGGCGTCG GCGCGAACAT GTGGATCGTC GGGCTGGCGA TCTCTGGTCT GGGCACCATC
CTCGGCTCGG TCAACATGAT CACCACGATC CTGACCCTGC GCGCGCCTGG CATGACCATG
TTCCGGATGC CGATCTTCAC CTGGAACATG CTGGTCACCA GCCTGCTGGC GCTCCTGATC
TTCCCGCTGC TGGCCGCTGC GCTGTTCGCG CTCGCCGCCG ACCGGATCCT CGGCGCCCAC
GTGTACGACC CGGCGACCGG CGGCCCGCTG CTCTGGCAGC ACCTCTTCTG GTTCTTCGGG
CACCCCGAGG TGTACATCAT CGCGCTGCCG TTCTTCGGCA TCATCTCCGA GATCATTCCG
GTCTTCTCCC GCAAGCCGAT CTTCGGCTAC AAGGGCCTGG TCGCCGCGAC CGTCGCCATC
GCCGCCCTGT CGATGAGTGT CTGGGCGCAC CACATGTTCG CCACCGGTCA GGTGCTGCTG
CCGTTCTTCA GTTTCCTGAG CTTCCTCATC GCCGTCCCCA CCGGGATGAA GTTCTTCAAC
TGGATCGGCA CCATGTGGCG GGGGCAGCTC AGCTTCGAGT CGCCGATGCT GTTCTCAATC
GGCTTCCTGG TCACCTTCCT CTTCGGTGGC CTCACCGGCG TACTGCTGGC CAGCCCGCCG
CTGGACTTCC ACGTCTCGGA CTCCTACTTC GTGGTGGCCC ACTTCCACTA CGTGCTGTTC
GGCACGATCG TGTTCGCCGT CTTCGCCGGC ATCTACTTCT GGTTCCCGAA GATGTTCGGC
CGGATGCTCG ACGAGCGGCT CGCCAGGGTC CATTTCTGGC TGACCATGGT CGGCTTCCAC
ACCACCTTCC TGGTGCAGCA CTGGCTGGGT AACGAGGGCA TGCCGCGGCG GTACGCCGAC
TACCAGGTCA TTGACGGCTT CACCACGCTG AACATGATTT CCACGGTCGG CGCGTTCATC
ACCGGTATCT CGACGCTGCC GTTCATCTAC AACTGCTGGA AGTCGTACAA GGCGGGACCG
GTGGTCGAGG TCGAGGACCC CTGGGGGCAC GGCAACTCGT TGGAGTGGGC GACCAGCTCG
CCGCCGCCGT TGCGTAACTT CGACCGGATG CCGCGGATCC GTTCCGAGCG GCCCGCGTTC
GACTACAAGT TCCCCGAGCT GGCCGCCGGT CAGACCCTGG CCGGCCCGCC CGAGGGTGGA
GCCAAGCTGC TGACCAGCGA GTCCGACGGT GGCGCCAGTT ACCAGGAGGA CGTGGCGAGC
GACCGGGACC GCCACTGA
 
Protein sequence
MTTVAPKPVV TRPWPVREPV KGSAFARLLR TTDAKQIGIM YMVTAFVFFM IGGLMALIMR 
AELARPGLQF LTPEQFNQLF TMHGTIMLLF FATPIVFAFG NYIVPIQIGA PDVSFPRLNS
FAYWLFLFGG TLATAGFITP GGAADFGWFA YTPLSSVEHS PGVGANMWIV GLAISGLGTI
LGSVNMITTI LTLRAPGMTM FRMPIFTWNM LVTSLLALLI FPLLAAALFA LAADRILGAH
VYDPATGGPL LWQHLFWFFG HPEVYIIALP FFGIISEIIP VFSRKPIFGY KGLVAATVAI
AALSMSVWAH HMFATGQVLL PFFSFLSFLI AVPTGMKFFN WIGTMWRGQL SFESPMLFSI
GFLVTFLFGG LTGVLLASPP LDFHVSDSYF VVAHFHYVLF GTIVFAVFAG IYFWFPKMFG
RMLDERLARV HFWLTMVGFH TTFLVQHWLG NEGMPRRYAD YQVIDGFTTL NMISTVGAFI
TGISTLPFIY NCWKSYKAGP VVEVEDPWGH GNSLEWATSS PPPLRNFDRM PRIRSERPAF
DYKFPELAAG QTLAGPPEGG AKLLTSESDG GASYQEDVAS DRDRH