Gene Information |
Locus tag | Huta_0042 |
Symbol | |
ID | 8382302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 41238 |
End bp | 42998 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644971100 |
Product | cytochrome c oxidase, subunit I |
Protein accession | YP_003128964 |
Protein GI | 257051131 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0382499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGGCG AACAGCTCGC ATTGACGGTC TTGATGGGAA TTTTGCTGGT CGGGGTTGCG GCGTTTCTCA CGCGTATCGA AGACTGGCGG AGCTACGCGT CACCGACGGC CAGTGGCGGG ACCGTCAGCG AGACAGGATA CGGCCACCGG GAGAAGCCCG CCGGGGTGCT CCGGTGGCTC ACGACAGTCG ATCACAAGGA CATCGGATTG CTCTACGGGC TCTTTGCACT GATTGCGCTC GCGGTCGGTG GCCTGGCCGT CATGGTCATG CGGCTCGAAC TGGTCACTCC GTCTTCGAAC ATCGTGAGTG CGAGCTTCTA CAACGCGTTG CTCACGAGCC ACGGCATCAC AATGTTGTTC CTCTTCGGAA CGCCGATCAT CGCGGCCTTC GGGAACTACT TCATCCCGCT ACTGATCGGG GCTGACGACA TGGCCTTCCC CCGGATCAAC GCGATCGCGT TCTGGCTGTT GCCGCCGGGC GCGTTACTCA TCTGGAGTGG CTTTCTCATC CCGGGGATCG CCCCCTCCCA GGCGTCCTGG ACGATGTACG CGCCGCTGTC GGTCGAACAG CCCCATCTCG GGACGGACAT GATGTTGCTG GGACTCCATC TGACGGGCGT CTCGGCGACG ATGGGGGCGA TCAACTTCAT CGTCACAATC TTCACCGAAC GCGGTGATGA CGTCTCCTGG GCGAACTTAG ACATCTTCTC CTGGACGATG CTCACCCAGT CGGGGATCAT CCTCTTCGCG TTCCCGCTGC TGGGCAGCGC ACTCATCATG TTGCTGCTCG ATCGGAACCT CGGGACGCTG TTCTTCGCCG TCGACGGCGG GAACCCGATA CTGTGGCAAC ATCTGTTCTG GTTCTTCGGC CATCCGGAGG TGTACATACT GGTGTTGCCA CCGATGGGGC TGATAAGCTA CATCATCCCG CGGTTCTCGG GTCGGAAGCT GTTCGGGTTC AAGTTCGTCG TCTACTCGAC GCTGGCGATC GGCGTGCTCT CCTTCGGCGT CTGGGCCCAC CACATGTTCG CGACGGGGAT CGACCCGCGA CTCCGGGCGA GTTTCATGGC CGTCTCCTTG GCGATCGCGA TACCGAGCGC AGTCAAGACG TTCAACTGGA TCACGACGAT GTGGAACGGC CGCATCAGGC TAACTACCCC GATGCTGTTC TGCATCGGAT TCGTGGCAAA CTTCATCATC GGCGGCGTGA CTGGCGTGTT CCTGGCGTCG ATCCCGATCG ACCTGATCCT GACGGACACT TACTACGTCG TCGGGCACTT CCATTACGTG ATCATGGGGG CGATCGCCTT CGCGGTGTTC GCCGGCGTCT ACTACTGGTT CCCCATTTAC ACTGGGCGGA TGTACCAGCG CACGCTGGGC AAGTGGCACT TCTGGCTGAC CATGATCGGC ACGAACGTGA CGTTCTTCGC CATGATCCTG CTGGGCTACG TCGGCCTGCC GCGCAGACTT GCAACCTACA ACGCCATCAC CGTCGGCCCG ATCGACGTCA TCACGCTCCT CCACCAGGCC GCCACCGTCG GCGCACTGAT CCTGTTCGTG GGGCAACTCG TCTTCGTCTG GAACATTCTC CAGTCGTGGC TCGACGGCCC GAAACTCACC GACGGCGACC CGTGGGATCT CAAGGACGAC GGCCTGTTCA CCCGTGAGTT TGCCTGGAAC GAGGATCGAA TCACTGCGGA CGAGACGGAA GCAGACGCCG ATCTATGGGC CGACGGCGGC GAATCTGACG AGACCCAATA G
|
Protein sequence | MAGEQLALTV LMGILLVGVA AFLTRIEDWR SYASPTASGG TVSETGYGHR EKPAGVLRWL TTVDHKDIGL LYGLFALIAL AVGGLAVMVM RLELVTPSSN IVSASFYNAL LTSHGITMLF LFGTPIIAAF GNYFIPLLIG ADDMAFPRIN AIAFWLLPPG ALLIWSGFLI PGIAPSQASW TMYAPLSVEQ PHLGTDMMLL GLHLTGVSAT MGAINFIVTI FTERGDDVSW ANLDIFSWTM LTQSGIILFA FPLLGSALIM LLLDRNLGTL FFAVDGGNPI LWQHLFWFFG HPEVYILVLP PMGLISYIIP RFSGRKLFGF KFVVYSTLAI GVLSFGVWAH HMFATGIDPR LRASFMAVSL AIAIPSAVKT FNWITTMWNG RIRLTTPMLF CIGFVANFII GGVTGVFLAS IPIDLILTDT YYVVGHFHYV IMGAIAFAVF AGVYYWFPIY TGRMYQRTLG KWHFWLTMIG TNVTFFAMIL LGYVGLPRRL ATYNAITVGP IDVITLLHQA ATVGALILFV GQLVFVWNIL QSWLDGPKLT DGDPWDLKDD GLFTREFAWN EDRITADETE ADADLWADGG ESDETQ
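
The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are inclusive 1-based coordinates, that the Gene Length includes the stop codon, and that the displayed gene sequence is already in coding orientation (it begins with ATG and ends with TAG). The function names and the partial codon table are hypothetical helpers for illustration; the table covers only the codons found in the first 30 bases.

```python
# Values copied from the record above.
START_BP = 41238
END_BP = 42998
FIRST_30_BASES = "ATGGCAGGCGAACAGCTCGCATTGACGGTC"  # first three 10-base groups


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Partial codon table (assumption: only the codons needed for the prefix).
PARTIAL_CODONS = {
    "ATG": "M", "GCA": "A", "GGC": "G", "GAA": "E", "CAG": "Q",
    "CTC": "L", "TTG": "L", "ACG": "T", "GTC": "V",
}


def translate_prefix(dna: str) -> str:
    """Translate complete codons of a short prefix using the partial table."""
    usable = len(dna) - len(dna) % 3
    return "".join(PARTIAL_CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length), translate_prefix(FIRST_30_BASES))
```

Under these assumptions the computed values agree with the Gene Length, Protein Length, and the first ten residues of the Protein sequence listed in the record.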