Gene Huta_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0042 
Symbol 
ID8382302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp41238 
End bp42998 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content61% 
IMG OID644971100 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_003128964 
Protein GI257051131 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0382499 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGCG AACAGCTCGC ATTGACGGTC TTGATGGGAA TTTTGCTGGT CGGGGTTGCG 
GCGTTTCTCA CGCGTATCGA AGACTGGCGG AGCTACGCGT CACCGACGGC CAGTGGCGGG
ACCGTCAGCG AGACAGGATA CGGCCACCGG GAGAAGCCCG CCGGGGTGCT CCGGTGGCTC
ACGACAGTCG ATCACAAGGA CATCGGATTG CTCTACGGGC TCTTTGCACT GATTGCGCTC
GCGGTCGGTG GCCTGGCCGT CATGGTCATG CGGCTCGAAC TGGTCACTCC GTCTTCGAAC
ATCGTGAGTG CGAGCTTCTA CAACGCGTTG CTCACGAGCC ACGGCATCAC AATGTTGTTC
CTCTTCGGAA CGCCGATCAT CGCGGCCTTC GGGAACTACT TCATCCCGCT ACTGATCGGG
GCTGACGACA TGGCCTTCCC CCGGATCAAC GCGATCGCGT TCTGGCTGTT GCCGCCGGGC
GCGTTACTCA TCTGGAGTGG CTTTCTCATC CCGGGGATCG CCCCCTCCCA GGCGTCCTGG
ACGATGTACG CGCCGCTGTC GGTCGAACAG CCCCATCTCG GGACGGACAT GATGTTGCTG
GGACTCCATC TGACGGGCGT CTCGGCGACG ATGGGGGCGA TCAACTTCAT CGTCACAATC
TTCACCGAAC GCGGTGATGA CGTCTCCTGG GCGAACTTAG ACATCTTCTC CTGGACGATG
CTCACCCAGT CGGGGATCAT CCTCTTCGCG TTCCCGCTGC TGGGCAGCGC ACTCATCATG
TTGCTGCTCG ATCGGAACCT CGGGACGCTG TTCTTCGCCG TCGACGGCGG GAACCCGATA
CTGTGGCAAC ATCTGTTCTG GTTCTTCGGC CATCCGGAGG TGTACATACT GGTGTTGCCA
CCGATGGGGC TGATAAGCTA CATCATCCCG CGGTTCTCGG GTCGGAAGCT GTTCGGGTTC
AAGTTCGTCG TCTACTCGAC GCTGGCGATC GGCGTGCTCT CCTTCGGCGT CTGGGCCCAC
CACATGTTCG CGACGGGGAT CGACCCGCGA CTCCGGGCGA GTTTCATGGC CGTCTCCTTG
GCGATCGCGA TACCGAGCGC AGTCAAGACG TTCAACTGGA TCACGACGAT GTGGAACGGC
CGCATCAGGC TAACTACCCC GATGCTGTTC TGCATCGGAT TCGTGGCAAA CTTCATCATC
GGCGGCGTGA CTGGCGTGTT CCTGGCGTCG ATCCCGATCG ACCTGATCCT GACGGACACT
TACTACGTCG TCGGGCACTT CCATTACGTG ATCATGGGGG CGATCGCCTT CGCGGTGTTC
GCCGGCGTCT ACTACTGGTT CCCCATTTAC ACTGGGCGGA TGTACCAGCG CACGCTGGGC
AAGTGGCACT TCTGGCTGAC CATGATCGGC ACGAACGTGA CGTTCTTCGC CATGATCCTG
CTGGGCTACG TCGGCCTGCC GCGCAGACTT GCAACCTACA ACGCCATCAC CGTCGGCCCG
ATCGACGTCA TCACGCTCCT CCACCAGGCC GCCACCGTCG GCGCACTGAT CCTGTTCGTG
GGGCAACTCG TCTTCGTCTG GAACATTCTC CAGTCGTGGC TCGACGGCCC GAAACTCACC
GACGGCGACC CGTGGGATCT CAAGGACGAC GGCCTGTTCA CCCGTGAGTT TGCCTGGAAC
GAGGATCGAA TCACTGCGGA CGAGACGGAA GCAGACGCCG ATCTATGGGC CGACGGCGGC
GAATCTGACG AGACCCAATA G
 
Protein sequence
MAGEQLALTV LMGILLVGVA AFLTRIEDWR SYASPTASGG TVSETGYGHR EKPAGVLRWL 
TTVDHKDIGL LYGLFALIAL AVGGLAVMVM RLELVTPSSN IVSASFYNAL LTSHGITMLF
LFGTPIIAAF GNYFIPLLIG ADDMAFPRIN AIAFWLLPPG ALLIWSGFLI PGIAPSQASW
TMYAPLSVEQ PHLGTDMMLL GLHLTGVSAT MGAINFIVTI FTERGDDVSW ANLDIFSWTM
LTQSGIILFA FPLLGSALIM LLLDRNLGTL FFAVDGGNPI LWQHLFWFFG HPEVYILVLP
PMGLISYIIP RFSGRKLFGF KFVVYSTLAI GVLSFGVWAH HMFATGIDPR LRASFMAVSL
AIAIPSAVKT FNWITTMWNG RIRLTTPMLF CIGFVANFII GGVTGVFLAS IPIDLILTDT
YYVVGHFHYV IMGAIAFAVF AGVYYWFPIY TGRMYQRTLG KWHFWLTMIG TNVTFFAMIL
LGYVGLPRRL ATYNAITVGP IDVITLLHQA ATVGALILFV GQLVFVWNIL QSWLDGPKLT
DGDPWDLKDD GLFTREFAWN EDRITADETE ADADLWADGG ESDETQ