Gene Information |
Locus tag | Haur_0104 |
Symbol | |
ID | 5731997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 136808 |
End bp | 137887 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277226 |
Product | cytochrome c oxidase subunit II |
Protein accession | YP_001542884 |
Protein GI | 159896637 |
COG category | [C] Energy production and conversion |
COG ID | [COG1622] Heme/copper-type cytochrome/quinol oxidases, subunit 2 |
TIGRFAM ID | [TIGR02866] cytochrome c oxidase, subunit II |
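The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both are assumptions about this record's conventions, and the function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(136808, 137887)
print(length)                           # 1080
print(expected_protein_length(length))  # 359
```

Under those assumptions, both values agree with the Gene Length and Protein Length rows of the table.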
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.100023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAATC GTTCGCCTGC CAAGGCTCTG CTGCGCCCTA CGGCAACCCT GTTGCTTGGA AGCGTTGTAC TGGCAGCGTG CGGTCAGAAG ACCCCTCAGA CGACCCTGAA CCCAGCAAGC GAGAGCACCC GCGCAATTTA CAATCTCTCG GAATTGTTGT TTTGGTTGGG CGTTGTGGTC TTTTTGATCG TACAAACCTG GTTGATCGTT TCGATCATCA AGTATCGGCA AAAAGATAGT TCGCAGATTC CCACACAGAT CCATGGCAAT ACGAAGGTTG AAATTGCTTG GACAATCGTG CCAGCAATTA TTGCGATTGT CATTTTCGTC TTTACCTTCG ACACGATTCG CAAAATCGAG TTTATGCCCG ACGAAGCCGC TGGCAATACC TTAAATGTTA AGGTTATCGG CCATCAGTGG TGGTGGGAGT TCCAGTATCC TGATATTAAG GATGCCAGCG GTAAGCCTTT GGTCACTGCT AACGAGCTAT GGATTCCATC AGGAAGCTAT ATCGACGTGA AAATGACCTC GGTTGACGTA ATCCACGACT TCTGGATTCC TGGCTTGGCT GGCAAGCGCG ACGTGATGCC CAATCGCGAG AGTGGCTTGT GGTTTAAAGC CGATGACGTG GCCGATGGTT CGCCAGCAGT ATTTTGGGGT CAATGCGCCG AATACTGTGG TGGCCAACAT GCTTATATGA AAATGCGCGT GGTTGTGGCC AGCCCTGCCG ACTTCCAAAA ATGGTCAAGC GAGCAAAGCC AAGTGGCGGT TAACACCACC TTGCCCGAAT CGTTTACCAA AAATTGTATC GGTTGTCACG TGGTGCGTGG CACCAACGCC GCTGGTATTA CCGGCCCCGA CTTGACCCAC TTCGGTGGCC GCATGACGAT TGCCGCTGGC ACGACCGATA ACACCCGTGA ACATCTCTAT GCCTGGTTAG ACGACCCTGA TGCAGTCAAA CGTGGCAACA TTATGACCAC CGCGATCAAG GCCGATACCC TGACCGAAGC AGAAATTACC GAGTTAGTCG ATTATCTGGA AAGCCTCAAT CCTGGCACCA GTGTTAAAGG TCAGCAGTAG
|
Protein sequence | MPNRSPAKAL LRPTATLLLG SVVLAACGQK TPQTTLNPAS ESTRAIYNLS ELLFWLGVVV FLIVQTWLIV SIIKYRQKDS SQIPTQIHGN TKVEIAWTIV PAIIAIVIFV FTFDTIRKIE FMPDEAAGNT LNVKVIGHQW WWEFQYPDIK DASGKPLVTA NELWIPSGSY IDVKMTSVDV IHDFWIPGLA GKRDVMPNRE SGLWFKADDV ADGSPAVFWG QCAEYCGGQH AYMKMRVVVA SPADFQKWSS EQSQVAVNTT LPESFTKNCI GCHVVRGTNA AGITGPDLTH FGGRMTIAAG TTDNTREHLY AWLDDPDAVK RGNIMTTAIK ADTLTEAEIT ELVDYLESLN PGTSVKGQQ
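The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (sequence must be non-empty)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence shown above.
raw = "ATGCCCAATC GTTCGCCTGC CAAGGCTCTG"
seq = clean_sequence(raw)
print(len(seq))                   # 30
print(f"{gc_fraction(seq):.0%}")  # 60%
```

Applying the same functions to the full gene sequence would give a value to compare against the GC content row; how the source database rounds that figure is not stated in the record.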