Gene Haur_3759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3759 
Symbol 
ID5735623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4727893 
End bp4729269 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content50% 
IMG OID641280911 
Productcytochrome bd ubiquinol oxidase subunit I 
Protein accessionYP_001546523 
Protein GI159900276 
COG category[C] Energy production and conversion 
COG ID[COG1271] Cytochrome bd-type quinol oxidase, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCAT TGATTCTTGC TCGTTGGCAA TTTGCCGTTA CAACGGTTTA TCACTTTTTG 
TTTGTACCGC TAACAATCGG GCTTTCGTTT TTTGTGGCTT TGCTACAAAC CATCTACTAT
CGCACTGGCG ATATCACCTA TAAACGTATG ACCAAATTTT GGGGCCATCT GTTTTTGATC
AATTTTGCGA TTGGCGTGGC CACCGGGATT GTCCAAGAGT TTCAGTTTGG CATGAACTGG
TCGGAATATT CGCGCTTTGT CGGCGATATT TTCGGCGCTC CGTTAGCAAT TGAAGCCTTG
ATGGCCTTCT TTATTGAATC GACCTTCTTA GGAATTTGGA TCTTCGGCTG GGATCGCATA
CCCAAGCTCG CTCACTTGGC TTCGATTTGG CTGGTAGCGA TTGCCACGAT GCTCTCCAGT
TTGTGGATTT TGATCGCCAA TTCGTTTATG CACCAGCCAG TTGGCTATGT GTTACGCAAT
GGGCGTGCCG AAATGGCTGA TTTTTGGGCC TTGCTGACCA ACGGCCACGT GTGGGTGCAA
TGGCCGCATA CCGTCACCGC AGCCATGGTC ACGGCGGCAT TTTTCGTTTT GGGCATTAGC
GCATGGCAAT TGCGCAAACA ACCCAAGCCT GCCGATCGCC TGATTTTTCA GCGCTCATTC
AAATTGGCCT TGGGCTATGC CTTGGTTTCA ACAATTTTGG TGATGGTGGT CGGGCATAAT
CAAGCTCAAT ACATGATCAA AGTGCAACCA ATGAAAATGG CCGCCGCCGA AGCATTGTGG
GAAACCGCTG ATCCCGCGCC GATGTCGTTG TTTACGGTGG CCGATGTGCC CGAACAACGC
GATCGCTTCG TGGTCAAAGT TCCAGGTTTG CTGAGCTTTT TGGCCTACAA CCGCTTTGAT
GGCGAAGTCA AAGGCATCAA AGATCTCCAA GCTGAATTTG AACAAACCTA CGGCCCGGGC
AATTACACCC CCGATGTCTT TATGACCTAC TGGATGTTTC GGATTATGGT TGGCTCAGGC
ATGGCGATGT TTGGTTTGGC GATCATTGGC CTGTTTTTCG TGTTGCGCAA GCGCTTGAAC
TTCCCTAATT GGTATTTGTG GTTCATGACT TTGGCGCTTA GTTTGCCCTA TATCGCCAAC
GCCAGCGGCT GGATTTTCAC CGAAATGGGC CGCCAACCAT GGATCGTCTA TGGCTTGCTG
CGCACCAGCG ATGGGGTTTC GACGGCAGTT TCTGGCGGCA GCGTGCTGAC CTCGTTAATT
GGGTTTAGTT TGATCTATCT GGTCTTGATT GGCGTGATGG TCTTTTTGAT CGTGCGCGAA
GCCAATCATA TTCCCGATGC TAGCCCTGCG ACGGTCGAAG CTGATTTAGC GCTCTAA
 
Protein sequence
MDALILARWQ FAVTTVYHFL FVPLTIGLSF FVALLQTIYY RTGDITYKRM TKFWGHLFLI 
NFAIGVATGI VQEFQFGMNW SEYSRFVGDI FGAPLAIEAL MAFFIESTFL GIWIFGWDRI
PKLAHLASIW LVAIATMLSS LWILIANSFM HQPVGYVLRN GRAEMADFWA LLTNGHVWVQ
WPHTVTAAMV TAAFFVLGIS AWQLRKQPKP ADRLIFQRSF KLALGYALVS TILVMVVGHN
QAQYMIKVQP MKMAAAEALW ETADPAPMSL FTVADVPEQR DRFVVKVPGL LSFLAYNRFD
GEVKGIKDLQ AEFEQTYGPG NYTPDVFMTY WMFRIMVGSG MAMFGLAIIG LFFVLRKRLN
FPNWYLWFMT LALSLPYIAN ASGWIFTEMG RQPWIVYGLL RTSDGVSTAV SGGSVLTSLI
GFSLIYLVLI GVMVFLIVRE ANHIPDASPA TVEADLAL