Gene Acid345_2995 details


Gene Information

Locus tag: Acid345_2995
Symbol:
ID: 4071550
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3547730
End bp: 3549364
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 58%
IMG OID: 637985014
Product: cytochrome-c oxidase
Protein accession: YP_592070
Protein GI: 94970022
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02891] cytochrome c oxidase, subunit I
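
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3547730, 3549364
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1635 544
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.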


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0459294
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACAA CAGCAATCAA TCAGTCCGTG GAAAAAGAGA CCTACCTCAA TGCAGGGTAC 
GGTCTGAAGT CATGGCTCTT GACGAAGGAC CACAAGCGCA TCGCGATCCT CTACTTGATC
TCGATCACCG TGTTCTTCGC AATTGGCGGG TTCTTCGCCA TGCTCATCCG TCTTGAGCTG
CTGACCCCCG CGGGCGACTT GGTCGAAGCC GACACCTACA ACAAGCTCTT CTCGATGCAT
GGCATCATCA TGGTGTTCTT CTTCCTGATC CCGTCCATCC CTGCGACGCT CGGCAATTTC
CTCGTGCCCC TCATGGTCGG CGCCAAGGAC TTGGCCTTCC CGCGCATCAA CCTGCTGAGC
TGGTATCTCT ACATCATTGG CGGAACCATG GCGCTCGTCG CCATGTTCAT GGGCGGCGTT
GACACCGGCT GGACCTTCTA CACTCCGCTC AGCACCGAGT ACGTCAATAC TAACGTGATC
CCGGTTGCCC TCGGCGTCTT CGTCGCCGGG TTCTCGTCCA TCTTCACTGG ACTGAATATC
ATCGTGACCA TCCACCGCAT GCGCGCTCCC GGTATGACCT GGAGCCGCCT GCCACTCTTC
ATCTGGTCGC ATTATGCCGC CAGCCTGATC ATGGTCCTCG GTACGCCGGT TGTTGCTATC
ACCCTGGTGC TCGCCGCTCT CGAGCGCGCC TTCCACATCG GCATCTTCAA CCCGCAGCTT
GGCGGAGACC CGGTGCTCTT CCAGCACCTC TTCTGGTTCT ATTCGCATCC CGCCGTCTAC
ATCATGATTC TGCCTTCGAT GGCGGTGATC TCCGAGATCG TGCCCTGCTT CACGCGTAAG
CGCATCTTCG GATATGAATT CGTTGCGCTC TCCTCGATCG GCATCGCCGT CCTCGGCTTC
CTCGTGTGGG CGCACCACAT GTTCGTCGCC GGAATCTCGG TGTACGCTGC TCTGGTTTTC
TCGCTTCTGA GCTACCTCGT CGCCATCCCG TCCGCCGTAA AGGTCTTCAA CTGGACGGCT
ACGATGTTCA AAGGCTCCAT CAGCTTCGAG ACCCCGATGC TCTACGCCTT CGGGTTCATT
GGACTGTTCA CCATCGGCGG ACTCACCGGC TTGTTCCTCG CCAACCTCGG CGTCGACATC
CACGTCCACG ACACTTACTT CGTGATCGCG CACTTCCACT ACATCATGGT CGGCGGTGCC
ATCATGGGTT ATCTCGGCGG ACTCCACTTC TGGTGGCCCA AGATGACCGG CCGCATGTAT
CCCGAAGCCT GGGCAAAGCT CTCGGCGCTG CTCGTCTTCG TCGGCTTCAA CCTCACCTTC
TTCCCGCAAT TCGTTCTCGG ATACATGGGC ATGCCGCGTC GCTATCACGC CTACGCCCCT
GAATTCCAGG TTCTGAACGT GCTCTCCACC GCCGGCGCTT CGGTGCTCGC CGTGGGATAT
CTGTTCCCGC TCTTCTATTT CCTGTGGTCG CTGAAGTATG GGCAGATCGC ACCCAACAAT
CCGTACAACG CCGTTGGTTT GGAGTGGATG ACGCAATCGC CGCCACCCGC CCACAACTTC
GATAAGACAC CTGTTGTCAC CTGGGAAGCC TACGATTACG AGAACCAGCC CCAGGAGGAG
GTCCCCGTTG TCTAG
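
The gene sequence above is displayed in space-separated groups of 10 bases, so it needs to be cleaned before any computation. As a rough sketch, the snippet below strips the grouping whitespace and computes a GC percentage that could be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes the input contains only A, C, G, T characters and whitespace.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly whitespace-grouped) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage with the first displayed line of the gene sequence only;
# pass the full block to evaluate the whole gene.
first_line = "ATGGCAACAA CAGCAATCAA TCAGTCCGTG GAAAAAGAGA CCTACCTCAA TGCAGGGTAC"
print(len(clean_sequence(first_line)), round(gc_percent(first_line), 1))
```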
 
Protein sequence
MATTAINQSV EKETYLNAGY GLKSWLLTKD HKRIAILYLI SITVFFAIGG FFAMLIRLEL 
LTPAGDLVEA DTYNKLFSMH GIIMVFFFLI PSIPATLGNF LVPLMVGAKD LAFPRINLLS
WYLYIIGGTM ALVAMFMGGV DTGWTFYTPL STEYVNTNVI PVALGVFVAG FSSIFTGLNI
IVTIHRMRAP GMTWSRLPLF IWSHYAASLI MVLGTPVVAI TLVLAALERA FHIGIFNPQL
GGDPVLFQHL FWFYSHPAVY IMILPSMAVI SEIVPCFTRK RIFGYEFVAL SSIGIAVLGF
LVWAHHMFVA GISVYAALVF SLLSYLVAIP SAVKVFNWTA TMFKGSISFE TPMLYAFGFI
GLFTIGGLTG LFLANLGVDI HVHDTYFVIA HFHYIMVGGA IMGYLGGLHF WWPKMTGRMY
PEAWAKLSAL LVFVGFNLTF FPQFVLGYMG MPRRYHAYAP EFQVLNVLST AGASVLAVGY
LFPLFYFLWS LKYGQIAPNN PYNAVGLEWM TQSPPPAHNF DKTPVVTWEA YDYENQPQEE
VPVV
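
The protein sequence can be checked against the gene sequence by translating the latter. The sketch below is a minimal illustration, assuming that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons, which this sketch does not model). The name `translate` is a hypothetical helper written for this example, and unknown codons are reported as `X`.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a whitespace-grouped DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown in this record.
print(translate("ATGGCAACAA CAGCAATCAA TCAGTCCGTG"))  # MATTAINQSV
```

The printed residues match the first ten residues of the protein sequence listed above; translating the full gene sequence the same way allows a residue-by-residue comparison.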