Gene Hoch_2420 details

Gene Information

Locus tag: Hoch_2420
Symbol:
ID: 8544806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3347356
End bp: 3349026
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 62%
IMG OID: 646387119
Product: cytochrome c oxidase, subunit I
Protein accession: YP_003266850
Protein GI: 262195641
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02891] cytochrome c oxidase, subunit I
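
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
start_bp = 3347356
end_bp = 3349026
gene_length_bp = 1671
protein_length_aa = 556


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 3349026 - 3347356 + 1 gives 1671 bp, and 1671 / 3 - 1 gives 556 aa, which agrees with the listed gene and protein lengths.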


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.040842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.534979
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGACCA CTACCGCTAA GGAACCGAAT TACCTCGAGG TAAAGCGCGG GTTGATGTCG 
TGGCTCGTCA CGCTCGATCA CAAGCGGATC GGGGTCATGT ACCTCATCGG TATCCTGACC
GCGCTGCTCA TCGGCGGCCT GTTCGCGCTG CTGGTGCGTA TCGAGCTATT CTCGCCCGGC
GAGCTGTTTT CGCCCGATCA GTACAACCAG ATCTTCACTC TGCACGGCGC GGTGATGGTG
TTCCTGGTCA TCATTCCAGG ACTGCCGGCC GCGCTGGGTA ACTTCATCTT ACCGATTCAG
CTCGGCGCAC CCGATGTCGC GTTCCCGCGC ATCAACCTGT CGAGCTTCTA TCTGTGGTGC
ACGGGCGCCG CGCTGCTGGT GGCGACCATC GTCGTCGGCG CAGTGGACAC GGGCTGGACC
TTCTACACGC CGTACAGCAT GACCACCGAC AAGGGCACAG CCGTGATCCT GTCGGTGCTG
GGCGTGTTCC TGCTCGGCTT CAGCTCGATC TTCACCGGTC TCAACTTCCT GGTGACGATC
CACAAGTTCC GCGCCAAGGG CATGGGCTGG TTCAACATGC CGCTCAACCT GTGGGCGCTG
TACGCCACCG CCGTCATCCA GGTGCTGGCC ACGCCGGTGC TGGGTATCAC CGTGCTGCTG
CTCTTCGTCG AGAAGGTCAT GCACATCGGT ATCTTCGACC CCAACCTCGG CGGCGACCCG
GTGCTGTTCC AGCACTTCTT CTGGTTCTAC TCGCACCCGG CCGTGTACAT CATGATCATC
CCGGCCATGG GCGTGATCTC CGAGATCATC ACGACCTTCT CGCGCAAGCA CATCTTCGGC
TATCGCTTCA TCGCGTACTC GAGCATCTCG CTGGCCCTGC TCAGCTTCCT GGTCTGGGGC
CATCACATGT TCGTGAGCGG CCAGTCGAAG ATGGCGGCCA TGATCTTCTC GGCGCTCACC
TTCACGGTCG GCATACCGTC GGCCATCAAG GTGTTCAACT GGACCGCCAC CCTGTACAAG
GGCGAGATCT ATCTCAAGAC GCCGATGTTG TACGCGCTGT CGTTCGTCCT GCTGTTCACG
ATCGGCGGCT TGACCGGCTT GTTCCTCGGC ATCTTGAGCG TGGACGTGCA CCTGCACGAC
ACCTACTTCG TCGTCGCTCA CTTCCACTAC GTGATGATGG GCTCGACCCT GGTCGCCTTC
CTGGCGGCGC TTCACTACTG GTTCCCGAAG ATGAGCGGCA AGATGTACCC GGAGAAGCTG
GCTCAGATCT GCTCGGTGTT CGTGTTCATC GGCTTCAACC TCACCTTCCT GCCGCAGTTC
GTGATGGGCG CCCGCGGCAT GCCGCGCCGC TACTGGGACT ACGACCCCGA GTTCACGCTG
ATGCACCGGC TCTCGACCAT CGGCGCGCTG ATCCTCGGCA TCACGCTGTT CATCGTGGTG
GTGTACCTCG CCATCGCGGC GCTGCGCGGC AAAGTGAAGG CCGCCGACAA TCCCTGGGGC
GCGTCCACCC TCGAGTGGCA GACGACCTCG CCGCCGCCGC TGTACAACTT CCACGAGGCG
CCCGAGAAGC CGCTGCTGTA CCACTACGAG GAATTCGAGT ACGACGAGTC CATCGACGGG
TATGTCTACC GGCCCAACGA AGACCGCCAC ATCCCGATCG CGCATCACTA A
 
Protein sequence
MSTTTAKEPN YLEVKRGLMS WLVTLDHKRI GVMYLIGILT ALLIGGLFAL LVRIELFSPG 
ELFSPDQYNQ IFTLHGAVMV FLVIIPGLPA ALGNFILPIQ LGAPDVAFPR INLSSFYLWC
TGAALLVATI VVGAVDTGWT FYTPYSMTTD KGTAVILSVL GVFLLGFSSI FTGLNFLVTI
HKFRAKGMGW FNMPLNLWAL YATAVIQVLA TPVLGITVLL LFVEKVMHIG IFDPNLGGDP
VLFQHFFWFY SHPAVYIMII PAMGVISEII TTFSRKHIFG YRFIAYSSIS LALLSFLVWG
HHMFVSGQSK MAAMIFSALT FTVGIPSAIK VFNWTATLYK GEIYLKTPML YALSFVLLFT
IGGLTGLFLG ILSVDVHLHD TYFVVAHFHY VMMGSTLVAF LAALHYWFPK MSGKMYPEKL
AQICSVFVFI GFNLTFLPQF VMGARGMPRR YWDYDPEFTL MHRLSTIGAL ILGITLFIVV
VYLAIAALRG KVKAADNPWG ASTLEWQTTS PPPLYNFHEA PEKPLLYHYE EFEYDESIDG
YVYRPNEDRH IPIAHH
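
The sequences above are printed in blocks of 10 characters, so they need to be cleaned before use. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate the gene into protein. It is an illustration only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG).

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGTCGACCA CTACCGCTAA GGAACCGAAT TACCTCGAGG TAAAGCGCGG GTTGATGTCG"
print(translate(first_line))  # MSTTTAKEPNYLEVKRGLMS
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the complete protein.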