Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2018 |
Symbol | |
ID | 7402037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2012226 |
End bp | 2013674 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643709089 |
Product | FO synthase subunit 2 |
Protein accession | YP_002566666 |
Protein GI | 222480429 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.595846 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCGG CGCGGGACCC CGACGCGACA GACCGCGACG CCGCCGACCT CGACGACGAC TTCGGCTTCG CCGAGCCGGC GACCGACCAG TCGTTCGAGA ACGCGCTCGC GAAGGCCCGC GACGGCGACC GCCTCTCAAT CGACGACGCG ACCGAACTGC TCGCGACCGG CACGGACACC GAGGGGGTCG ACCCCGTCCG CAAAGAGCGG GTCCTCGAAC TCGCCGACCG CCGGCGCCAT GAGGAGGTCG GCGACGAGAT CACGTTCGTC GCCAACCTCA ACAACAACGT CACGACCGCC TGCAACACGG GCTGTCTGTT CTGCAACTTC AAGGACTCGG CGCACGCCTT CGAGGCCGAC AGCGACGCCG ACCACGTTGG CTTCACGAAG ACGCCCGCCG AGTCCCGCGC GATCGTCGAA GACGCCCTCG ACATGGGCGT CTACGAGGTG TGCTCGGTGT CCGGGCTCCA CCCCGCGCTC GCGCTGAACG AGGAGCACCA CGAGATCCTT CAATCCTACG ACGACCCGGA AAGCGAGGTG AACTACAAAC CGCCCGAGGA GTACGCCACG GATCCGGGCA CCTACGTCGA GCAGATCGAG GCGATGTCGG TCGGCGGGAT CCACCTGCAC TCGATGACGC CCGAGGAGGC GTACCACGCC GCCCGCGGCA CCGACTGGGA CTACGAGACG GTCTACCGCG AACTCGCTGC GGCCGGACTC GACTCCGCTC CCGGCACCGC CGCGGAGATC CTCGTCGACG AGGTCCGCGA CGTGATCTGC CCCGGGAAGA TCCGCACCGA CGACTGGGTC GCGGCCATGG AGGGCGCGAT GGCGGCCGGC CTCGATGTCA CCTCAACGAT GATGTACGGT CACGTCGAGA CGGTCGAACA CCGCGCGAAA CACCTCGGAG TGATCCGCGA TCTACAGGAC CGCACCGGCC GGATTACGGA GTTCGTCCCC CTCTCCTTTA TCCACCAGAA CACGCCCCTC TACCGCCACG GCGTTGTCGA CTCCGGCCCC TCTCACGACG AGGACGAACT CGTGGTGGCG GTCGCGCGCC TCTTCTTGGA CAACGTCGAT CACGTGCAGG CCTCGTGGGT GAAGTCGGGC GACGCGCACG GACTGAAACT GCTCAACTGC GGCGCCGACG ACTTCATGGG CACCATCCTC TCCGAGGAGA TCACCAAACG CGCCGGCGGC GAGTACGGGG AGTTCCGCTC GTTCGACGAC TACGTCGACA TGATCACGGC GATCGGCCGC ACGCCGGTCG AGCGCTCGAC CGACTACCGC ACCCGCCGGC GGATCGATCC CGACGACAAT CCTCACGGAC CGCGGCTCGG TCCTCGCGCC GACGGCACGC CGATGCTCTC GGACTCGTCG TCCGGCGGGT CCGGTAAGTC TGGCGCGTCT GGCGCGTCCG ACGGGGAGTC GTCGGGCGCC GACGACTGA
|
Protein sequence | MNPARDPDAT DRDAADLDDD FGFAEPATDQ SFENALAKAR DGDRLSIDDA TELLATGTDT EGVDPVRKER VLELADRRRH EEVGDEITFV ANLNNNVTTA CNTGCLFCNF KDSAHAFEAD SDADHVGFTK TPAESRAIVE DALDMGVYEV CSVSGLHPAL ALNEEHHEIL QSYDDPESEV NYKPPEEYAT DPGTYVEQIE AMSVGGIHLH SMTPEEAYHA ARGTDWDYET VYRELAAAGL DSAPGTAAEI LVDEVRDVIC PGKIRTDDWV AAMEGAMAAG LDVTSTMMYG HVETVEHRAK HLGVIRDLQD RTGRITEFVP LSFIHQNTPL YRHGVVDSGP SHDEDELVVA VARLFLDNVD HVQASWVKSG DAHGLKLLNC GADDFMGTIL SEEITKRAGG EYGEFRSFDD YVDMITAIGR TPVERSTDYR TRRRIDPDDN PHGPRLGPRA DGTPMLSDSS SGGSGKSGAS GASDGESSGA DD
|
| |