Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0105 |
Symbol | |
ID | 5731998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 137897 |
End bp | 139744 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277227 |
Product | cytochrome c oxidase subunit I type |
Protein accession | YP_001542885 |
Protein GI | 159896638 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTG ACGTAAGTAC CCTCCATCAC GCCCCAGCTA AGCAACGGCG GGGTTGGTTG GAATGGATCA CCACTGTCGA TCATAAAAAA ATTGGCATCA TGTATGCTGT GACTGCCTTT TTATTCTTTG TGATCGGTGG GCTTGAGGCG TTGCTCATCC GTTTGCAGCT TGGTACGCCC CAAAATACGC TGCTGACACC CGAAGCTTAT AACCAAGTCT TTACGATGCA CGGCACAACC ATGGTGTTTA TGGCGATTAT GCCTGTCAAC GCTGGGTTTA TGAACTATTT TGTGCCCTTG ATGATTGGCG CAGGCGATAT GGCCTACCCG CGCATGAACG CCATGAGCTA TTGGCTCTTG TTGTTCGGCG GGATTGTCAT GTATTCCAGC TTTGTGTTCG GCGGCGGCGC TCCCGATGCT GGCTGGTTTG CTTATGCTCC ACTCACCTCA ACCACCTACT CGGTTACCCG AGGGATGGAT TATTGGGTGC TAGGTTTGCA GCTGCTGGGG GTTTCCTCGC TGGCTGGCTC GGTCAACATC ATCGTAACAA TCATCAGGCT ACGAGCACCT GGCATGCGCT TTAATCGGAT GCCGCTGTTC GTTTGGATGA GCTTCGTTAC CTCATTCTTG TTGATTTTTG CCTTGCCAAG CATCACGGTT GGGATCACCT TGCTGTTCTT TGACCGCAAC TTTGGTACCA ACTTCTTCTT GCCTGCTGCT GGCGGCGACC CACTGTTGTG GCAACATTTG TTCTGGTTCT TCGGTCACCC CGAAGTGTAC ATTATGATTC TGCCCGCCTT CGGTGTGGTT TCAGAAATGT TGCCAGTTTT CTCACGCAAG CCGATTTTCG GTTACGAGTT TGTGGCCTAC TCTGGGGTTG CAATCGGGGT GTTGGGCTTC ACCGTGTGGG CACACCACAT GTTTGCCACC AACTTGGGCG TGATTGCCGA CACCTTCTTT GCTGCCGCTT CGATGTTGAT TTCTGTCCCG ACGGGGGTCA AAATCTTCAA CTGGCTAGCA ACCTTGTGGC GCGGTGAGTT GCGCTTCAAA ACCCCAATGC TGTTCAGCCT TGGCTTTATC GCCATGTTCG TGATCGGCGG GATTTCCGGG GTTTCGTTGG CGGCTGCGCC GTTCGACTTA CAAGTTACCG ATAGCTACTT TGTGGTTGCT CACTTCCACT ATGTGCTGTT TGGTGGCGCT GTCTTTGCCT TGTTTGGTGC ATCATACTAT TGGTTCCCCA AAATTACTGG CAAGATGATG AGCGAACGCA TCGGCAAATG GCACTTCTGG ATTTTAATGC TGGGCTTTAA TTTGACCTTC TTCCCGCAAC ATATGCTTGG GCTGCAAGGT ATGCCACGGC GGGTCTGGAC GTATCAACAA AACCAAGGCT GGGATTTCTA CAACTTGCTT TCAACGATTG GGGCATTTTG TATCGCCTTG GGTACGTTGG TCTTTATCGT CAACTTTATT ATCAGCTTGC GCAAGGGCGC TCCTGCGGGC AACGACCCAT GGGATGCCGC AACCCTGGAA TGGGCGACCA CCTCGCCACC ACCAGCCCAC AACTTCGATG TGGAATACAT TGTGCATAGC CGTCGCCCCT TGTGGGATAA CAAATATAGC GGCGAAGGTC GAGGCATGAC GATCAATTAT GATTTCCATC CCCACTTGCC ACCGCCATCG TTTGCCCCAA TCATCTTCTC GTTTGGCTTG TTTGTGCTAG CTTATGGCAT GCTCAACCTC GCCTCTGCGC CATTGATTGG GATTCCATTG ATCTTGGTGG CCTTGGCAAT TGCCTTCGTT GGCATGAACC GCTGGGTCGG CGAAATTGCC CAAGATCCGG TGCTGTAA
|
Protein sequence | MATDVSTLHH APAKQRRGWL EWITTVDHKK IGIMYAVTAF LFFVIGGLEA LLIRLQLGTP QNTLLTPEAY NQVFTMHGTT MVFMAIMPVN AGFMNYFVPL MIGAGDMAYP RMNAMSYWLL LFGGIVMYSS FVFGGGAPDA GWFAYAPLTS TTYSVTRGMD YWVLGLQLLG VSSLAGSVNI IVTIIRLRAP GMRFNRMPLF VWMSFVTSFL LIFALPSITV GITLLFFDRN FGTNFFLPAA GGDPLLWQHL FWFFGHPEVY IMILPAFGVV SEMLPVFSRK PIFGYEFVAY SGVAIGVLGF TVWAHHMFAT NLGVIADTFF AAASMLISVP TGVKIFNWLA TLWRGELRFK TPMLFSLGFI AMFVIGGISG VSLAAAPFDL QVTDSYFVVA HFHYVLFGGA VFALFGASYY WFPKITGKMM SERIGKWHFW ILMLGFNLTF FPQHMLGLQG MPRRVWTYQQ NQGWDFYNLL STIGAFCIAL GTLVFIVNFI ISLRKGAPAG NDPWDAATLE WATTSPPPAH NFDVEYIVHS RRPLWDNKYS GEGRGMTINY DFHPHLPPPS FAPIIFSFGL FVLAYGMLNL ASAPLIGIPL ILVALAIAFV GMNRWVGEIA QDPVL
|
| |