Gene Sare_2295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2295 
Symbol 
ID5705885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2636224 
End bp2638224 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content67% 
IMG OID641271773 
Productcytochrome c oxidase subunit I type 
Protein accessionYP_001537144 
Protein GI159037891 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0269783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAGC GGGTCACCAC GGAACCACAG GACCGGGGTC CGGCGATCCT GGCGCCGGCC 
CGCTTCGGCG GCTACCCGGG CCCGCGACGG CCGGCGTTGC CCGGCGGCAA GCTGGCACGA
TGGCTGGCCA CCACCGATCA CAAGCAGATC GGCCTGCTCT ACCTGATCAC GTCCTTCGGC
TTCTTCGTGC TGGCCGGCAT CGAGGCGATG CTGATCCGGG GCGAGCTGGC CCGGCCCGGA
CTCCAGTTCC TCTCCCCCGA GCAGTACAAC CAGTTGTTCA CCACCCACGG TCTCGCGATG
TTGCTGCTCT TCGCCACGCC GGCGGCACTG GGCACAGCCA ACTACATCGT CCCGATCCAG
ATCGGGGCGC CGGACGTCTC GTTCCCCCGG CTGAACGCCT TCGGCTACTG GCTGTTCCTC
TTCGGCGGCC TGCTGCTCTA CGGCAGTTTC CTGACCCCAG CCGGATCCGC CGACTTCGGC
TGGTTCGCCT ACACTCCGCT GAGCCAGGCC GAGAACTCCC CCGGAATCGG CCCCGACATG
GTCCTGGTCG GGCTGGTGCT CGGTGGCCTC GGCACCATCC TCACGGCGGT CAACCTGATC
ACCACCATCA TCACCCTCCG GGCACCCGGC ATGACCATGT TCCGTATGCC GATCTTCACC
TGGAACATGC TGTTCACCAG TGTGCTGATC CTGATGGTGT TCCCACTGCT GGCCGCCACG
CTGCTCGCCC TGCTCGCCGA CCGCCTCCTC GGCGCGCACG TCTTCGACCC GGCCAGCGGC
GGGCCGCTGC TCTGGCAGCA CCTGTTCTGG TTCTGGGGAC ATCCCGAGGT CTACATCATC
GCAATCCCCT TCTTCGGGAT CATCACCGAG ATCATTCCGG TCTTCGCCCG CAAGCCCGTC
TTCGGGTACA CCGGTCTGGT GCTCGCCACC ACCGCCATCA CCGTGCTGTC CATGGCGGTG
TGGGCACACC ACATGTTCGG TACCGGTCAG GTACTGCTGC CGTTCTTCAG CATCCTGAGC
TACCTGATCG CCGTACCGAC GGGGGTGAAG TTCTTCAACT GGATCGGCTC CATGTGGAAG
GGGCAGCTCA CCTTCGAAAC ACCGATGCTC TTCTCCATCG GCTTCCTGGT CACCTTCCTG
TTCGGCGGTC TCACCGGAGT GCTGCTGGCC AGCCCGCCCG TGGACTTCCA CGTGACCGAC
AGCTATTTCG TGGTGGGGCA CTTCCACTAC GTGCTCTTCG GCACGGTGGT CTTCGCCTTC
TTCGGCGGGA TCTACTTCTG GTTCCCGAAG ATGACCGGCC GACTACTCGA CGAACGGCTC
GGCAAGGCGC ACTTCTGGAC CATGTTCCTG GGCTTCCACG GCACCTTCCT GGTCCAGCAC
TGGCTGGGCA ACGAGGGAAT GCCCCGCCGA TACGTCGACT ACCTGCCCGG CGACGGCTTC
ACCATCCTGA ACACCATCTC GACCGTCTCC TCGTTCGTAC TCGGGGCATC TACTCTGTTC
TTCATCTGGA ACGCCTGGAA GTCCTGGCGA TACGGACCCG TGGTCAACGT GGACGACCCT
TGGGGCTTCG GTAACTCCCT GGAGTGGGCG ACCACCTGCC CGCCGCCGCT GCGCAACTTC
GACCGGATAC CCAGGATCCG CTCCGAGCGG CCGGCCTTCG ACGCCAAGTA CGGCCCACTC
GTCGCCGACC TCGGCCGCGA CCTGCCGCAG CGCATCACCC GGCCACCGCA GGACCTCCGC
GACGAACTGC ACGCGGAGAG GCGGCCACCC GAGGCGCCCA GCGCCGGGGG CGCGGTCGGC
GCGCGCGAGG CGGTGGCCTA CCAACCCGCC CCCGAGTCCG GGGCGCGACC GGTCGAGGTA
CCGGAGCCGG ACATGGTGCG TCGTCCCGGC TTCGAGGAGA CCGACGAGCC CGAGGGGACC
GACCTCGAAG CCCAGGACGA ACAGCAGCAA AACGACCGCT GGCGCCACCC GCGCGGCCAC
GGTGACCCGA CCGAAAGCTG A
 
Protein sequence
MPKRVTTEPQ DRGPAILAPA RFGGYPGPRR PALPGGKLAR WLATTDHKQI GLLYLITSFG 
FFVLAGIEAM LIRGELARPG LQFLSPEQYN QLFTTHGLAM LLLFATPAAL GTANYIVPIQ
IGAPDVSFPR LNAFGYWLFL FGGLLLYGSF LTPAGSADFG WFAYTPLSQA ENSPGIGPDM
VLVGLVLGGL GTILTAVNLI TTIITLRAPG MTMFRMPIFT WNMLFTSVLI LMVFPLLAAT
LLALLADRLL GAHVFDPASG GPLLWQHLFW FWGHPEVYII AIPFFGIITE IIPVFARKPV
FGYTGLVLAT TAITVLSMAV WAHHMFGTGQ VLLPFFSILS YLIAVPTGVK FFNWIGSMWK
GQLTFETPML FSIGFLVTFL FGGLTGVLLA SPPVDFHVTD SYFVVGHFHY VLFGTVVFAF
FGGIYFWFPK MTGRLLDERL GKAHFWTMFL GFHGTFLVQH WLGNEGMPRR YVDYLPGDGF
TILNTISTVS SFVLGASTLF FIWNAWKSWR YGPVVNVDDP WGFGNSLEWA TTCPPPLRNF
DRIPRIRSER PAFDAKYGPL VADLGRDLPQ RITRPPQDLR DELHAERRPP EAPSAGGAVG
AREAVAYQPA PESGARPVEV PEPDMVRRPG FEETDEPEGT DLEAQDEQQQ NDRWRHPRGH
GDPTES