Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1046 |
Symbol | |
ID | 7400118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1037875 |
End bp | 1039023 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643708114 |
Product | Luciferase-like monooxygenase |
Protein accession | YP_002565713 |
Protein GI | 222479476 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.974969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCG AATTCGCGTA CTGGGTGCCG AACGTCAGCG GCGGGCTGGT CGTCTCCGAC TGGGAGACGG AGACGGACTG GACGTACGAC TACAACCTCG ACCTCGCCCG CACGGCGGAG TCGGTCGGGT TCGACTACGC GCTGGCACAG GCGCGCTTCT TCGGCAGCTA CGGGGCGGAC AAACAGCTAG AGGCGCTGTC GATCGCCAAC GCGCTCGCCG CACAGACCGA CGAGCTACAC GTGATCGGCG CGGTCCACCC CGGGCTGTGG GAGCCCGGCC CGCTGGCGAA CTTCATCGCC ACGTCCGACC GCATCAGCAA CGGACGGTTC TCGATCAACA TCGTCTCCGG CTGGTTTAAA GGGGAGTTCA CCGGATTCGG CCAGCCGTGG CTCGAACACG ACGAGCGGTA CGCCCGCTCG TCGGAGTTCA TCGAGGTGCT GAAAGCGCTG TGGACTGAGG AGCGCGCGAC CTACGACGGC CGGTTCTACA CGATCGGGAA GGACATCGAG GGGTTCGAGG GCGCGCCGCT GGAGCCCAAA CCGGTGCAGG ATCCGTACCC ACAGGTGTTC CAAGGCGGGA ACTCGCAGGC GGCTCGGGAG ATGGCCGCGA AACACTCGGA TGTGCTCTTC ATCAACGGCG GGAGCCTCCA AGAGATCCGC GCCGTCATCG ACGACGTCGA GGAGTACGCC GAGGAGTTCG GCACGGAGCC GCCGCGGTTC GCCGCGAACG CGTTCGTCAT CCAGCGCGAC ACCGAGTCGG AGGCCAAGGA GGTCCTGGAG GGGATCATCG AGAACGCCAC CGACGAGGCG GTCGACGCGT TCAAAGACCA GGTGAAGGAG GCCGGCCAGT CGTCGGGCGA GGGCGAGGGG ATGTGGGCCG ACTCCGACTT CGAGGACCTC GTGCAATACA ACGACGGCTT CAAGACCGGA CTCATCGGGA CCGACGACCA GATCGTCGAG CGGATCCGGA AGCTGGATGC GATCGGCGTC GACATCGTCC TCGCGGGCTT CCTCGACTTC GAGGCGGAGC TCGAACGCTT CGGTGAGACG ATCATCCCGG CGATCGACGA GGCCGACTCC CTCGACCCCG ACGAGGTCGA CGCGGTCGAC GAGATCGAGG AAGTCGGCGG CGCGAAGGTG GCCCGATAG
|
Protein sequence | MDTEFAYWVP NVSGGLVVSD WETETDWTYD YNLDLARTAE SVGFDYALAQ ARFFGSYGAD KQLEALSIAN ALAAQTDELH VIGAVHPGLW EPGPLANFIA TSDRISNGRF SINIVSGWFK GEFTGFGQPW LEHDERYARS SEFIEVLKAL WTEERATYDG RFYTIGKDIE GFEGAPLEPK PVQDPYPQVF QGGNSQAARE MAAKHSDVLF INGGSLQEIR AVIDDVEEYA EEFGTEPPRF AANAFVIQRD TESEAKEVLE GIIENATDEA VDAFKDQVKE AGQSSGEGEG MWADSDFEDL VQYNDGFKTG LIGTDDQIVE RIRKLDAIGV DIVLAGFLDF EAELERFGET IIPAIDEADS LDPDEVDAVD EIEEVGGAKV AR
|
| |