Gene Haur_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0104 
Symbol 
ID5731997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp136808 
End bp137887 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content52% 
IMG OID641277226 
Productcytochrome c oxidase subunit II 
Protein accessionYP_001542884 
Protein GI159896637 
COG category[C] Energy production and conversion 
COG ID[COG1622] Heme/copper-type cytochrome/quinol oxidases, subunit 2 
TIGRFAM ID[TIGR02866] cytochrome c oxidase, subunit II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.100023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAATC GTTCGCCTGC CAAGGCTCTG CTGCGCCCTA CGGCAACCCT GTTGCTTGGA 
AGCGTTGTAC TGGCAGCGTG CGGTCAGAAG ACCCCTCAGA CGACCCTGAA CCCAGCAAGC
GAGAGCACCC GCGCAATTTA CAATCTCTCG GAATTGTTGT TTTGGTTGGG CGTTGTGGTC
TTTTTGATCG TACAAACCTG GTTGATCGTT TCGATCATCA AGTATCGGCA AAAAGATAGT
TCGCAGATTC CCACACAGAT CCATGGCAAT ACGAAGGTTG AAATTGCTTG GACAATCGTG
CCAGCAATTA TTGCGATTGT CATTTTCGTC TTTACCTTCG ACACGATTCG CAAAATCGAG
TTTATGCCCG ACGAAGCCGC TGGCAATACC TTAAATGTTA AGGTTATCGG CCATCAGTGG
TGGTGGGAGT TCCAGTATCC TGATATTAAG GATGCCAGCG GTAAGCCTTT GGTCACTGCT
AACGAGCTAT GGATTCCATC AGGAAGCTAT ATCGACGTGA AAATGACCTC GGTTGACGTA
ATCCACGACT TCTGGATTCC TGGCTTGGCT GGCAAGCGCG ACGTGATGCC CAATCGCGAG
AGTGGCTTGT GGTTTAAAGC CGATGACGTG GCCGATGGTT CGCCAGCAGT ATTTTGGGGT
CAATGCGCCG AATACTGTGG TGGCCAACAT GCTTATATGA AAATGCGCGT GGTTGTGGCC
AGCCCTGCCG ACTTCCAAAA ATGGTCAAGC GAGCAAAGCC AAGTGGCGGT TAACACCACC
TTGCCCGAAT CGTTTACCAA AAATTGTATC GGTTGTCACG TGGTGCGTGG CACCAACGCC
GCTGGTATTA CCGGCCCCGA CTTGACCCAC TTCGGTGGCC GCATGACGAT TGCCGCTGGC
ACGACCGATA ACACCCGTGA ACATCTCTAT GCCTGGTTAG ACGACCCTGA TGCAGTCAAA
CGTGGCAACA TTATGACCAC CGCGATCAAG GCCGATACCC TGACCGAAGC AGAAATTACC
GAGTTAGTCG ATTATCTGGA AAGCCTCAAT CCTGGCACCA GTGTTAAAGG TCAGCAGTAG
 
Protein sequence
MPNRSPAKAL LRPTATLLLG SVVLAACGQK TPQTTLNPAS ESTRAIYNLS ELLFWLGVVV 
FLIVQTWLIV SIIKYRQKDS SQIPTQIHGN TKVEIAWTIV PAIIAIVIFV FTFDTIRKIE
FMPDEAAGNT LNVKVIGHQW WWEFQYPDIK DASGKPLVTA NELWIPSGSY IDVKMTSVDV
IHDFWIPGLA GKRDVMPNRE SGLWFKADDV ADGSPAVFWG QCAEYCGGQH AYMKMRVVVA
SPADFQKWSS EQSQVAVNTT LPESFTKNCI GCHVVRGTNA AGITGPDLTH FGGRMTIAAG
TTDNTREHLY AWLDDPDAVK RGNIMTTAIK ADTLTEAEIT ELVDYLESLN PGTSVKGQQ