Gene Sare_3532 details


Gene Information

Locus tag: Sare_3532
Symbol:
ID: 5704600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4073008
End bp: 4073979
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 66%
IMG OID: 641272959
Product: cytochrome c oxidase subunit II
Protein accession: YP_001538325
Protein GI: 159039072
COG category: [C] Energy production and conversion
COG ID: [COG1622] Heme/copper-type cytochrome/quinol oxidases, subunit 2
TIGRFAM ID: [TIGR02866] cytochrome c oxidase, subunit II
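
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the Start bp and End bp coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4073008
END_BP = 4073979
GENE_LENGTH_BP = 972
PROTEIN_LENGTH_AA = 323


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 972
    print(expected_protein_length(GENE_LENGTH_BP))  # 323
```

Under those assumptions the reported values agree with each other: 4073979 - 4073008 + 1 = 972 bp, and 972 / 3 = 324 codons, one of which is the stop codon, leaving 323 aa.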


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.636307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000170854
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGGTCGCAA GGAGTTCGGA GGTACGGTCG TCGGCCGTAC GGCACAGCGC TTCCCCGGGA 
GTCGGCGGGC GTCGGGCGCG GGGTGTCGGC CGGCTGGCCG GGCTCGGTCT CGGCGGAGCG
GCGCTGCTGG TCCTGCTCAC GGGCTGCGAC GTCGGCGCCA CGTTCGCCGG CTTCGGATGG
CCGCAGGGAG GCATCACCCC CGAGGCCAAC CGGATGTACG ACCTGTGGAT CGCGTCGTGC
ATCGCGGCGC TCGCGGTCGG TGTGTTCGTG TGGGGCCTCA TCTTCTGGTG CGTCGTGCGT
TACCGGAAGC GGGGTAACGA ACTGCCCGTG CAGACGCGCT ACAACCTGCC GATGGAGTTC
CTCTACACCA TCGCTCCGAT TCTGATCGTC TCCGTGCTCT TCTACTACAC GGCGATCGTG
CAGACCGACG TGGGGAAGAC CTCCCGGAAC CCGGACGTCA CCGTCGAGGT GGTCGCCTTC
AAGTGGAACT GGCAGTTCAA CTACCGCGAC GGGCAGGGCG TGGAGGCGAA CACGATCGCC
TCGGTTCTCG GTACCAGCGA GGTCATCCCG ATCCTCGTGT TGCCGTCCGA GCGGTCGATC
CGCTTCGAGG AGACCAGCCG CGACGTCATC CACTCGTTCT GGGTGCCAGA GATGCTGTTC
AAGCGCGACG TCTTCCCCGG TAGCATCCGC AATGTCTTCG AGGTCTCCGA GCTCGAGGGT
GAGGGCGCGT ACGTGGGCCG TTGCGCCGAG CTGTGCGGCA CGTACCACGC CTTCATGAAC
TTCGAACTTC GGGTCGTCTC GCCGGAGAGG TACGACCGTT TCATCGCGCT CAAACAGGAC
GGCCAGTCCA CGCAGGAGGC GCTGACCGCA ATCGGCGAGA ACCCGTATGC GACGACCACC
GAACCGTTCG AAACGCGGCG TACCGAAGCG AACTTCAACC CCGACAAGCC GGCAAACGGC
TCGGGTAACT GA
 
Protein sequence
MVARSSEVRS SAVRHSASPG VGGRRARGVG RLAGLGLGGA ALLVLLTGCD VGATFAGFGW 
PQGGITPEAN RMYDLWIASC IAALAVGVFV WGLIFWCVVR YRKRGNELPV QTRYNLPMEF
LYTIAPILIV SVLFYYTAIV QTDVGKTSRN PDVTVEVVAF KWNWQFNYRD GQGVEANTIA
SVLGTSEVIP ILVLPSERSI RFEETSRDVI HSFWVPEMLF KRDVFPGSIR NVFEVSELEG
EGAYVGRCAE LCGTYHAFMN FELRVVSPER YDRFIALKQD GQSTQEALTA IGENPYATTT
EPFETRRTEA NFNPDKPANG SGN
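
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated in code. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments, that an alternative start codon such as GTG at the first position is read as methionine, and that the sequences are pasted in with the 10-base spacing shown above. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons listed for translation table 11 (bacterial code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # A recognised start codon in first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 20 bases of the gene sequence listed above.
    print(translate("GTGGTCGCAA GGAGTTCGGA"))  # MVARSS
```

The first six residues produced from the opening bases match the start of the listed protein sequence (MVARSS), with the GTG start codon read as M. Passing the full gene sequence to `gc_content` gives a value that can be compared against the GC content field.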