Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0210 |
Symbol | |
ID | 7402139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 224479 |
End bp | 227577 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643707273 |
Product | FAD linked oxidase domain protein |
Protein accession | YP_002564885 |
Protein GI | 222478648 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.23019 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGCG AGCCCACGCG AGGAGATAAC ACGCCCGGCT TGGATACGTC GGCGGCCGCG CTCGGTCACG AGCGACCGGA CGTGCCGGCC TATCGAGCGC TCGCGGAGGA TCTCCGCGAG CGCGTCGACG GCGAGGTTCA GTTCGACGAG TACGCGCAGG TGCTGTACGC CACCGACGGC AGTATTTATC AGGCGCGACC GGCCGGCGTC GTGACGCCGC GCTCGGTCGC GGACGTGCAG GCAACGATGC GTGTCGCGGC CGACCACGGG GTTCCCGTCA TCCCGCGGGG TGCGGGCTCC TCGCTCGGGG GACAGACCGT CGGGCCAGGG TGTGTCGTGC TCGACTTCTC GACGCACATG GACGAGATTC AGGAGGTCCG GCCCGACGAT CGTCGAGCGG TGGTCCAGCC CGGAGTCGTC CAGGACCAGC TCGACGACCG ACTGACCGAG GACGGCTTGA AGTTCGCGCC CGACCCGGCC TCCTCGGCGC GGGCGACCGT CGTTGGCGGC ATCGGCAACA ACTCCACCGG TGCGCACTCG GTGCGGTACG GGATCACCGA CGCGTACACG GAGGAGTTGC AGGTCGTCCT CGCGGACGGC TCGCTGATCC ACACCCGCGA GGTCGTCCTC GACTCGCCGG AGTACGAGGA GATCGTCTCG AAGAACGATC GGGAGGCCGC TCTCTACGAG ACCACCCGAA AGCTCGTCGA GGAGAACGAA GCCGAGATCG ACGAGAAGTA CCCGAACCTC AAGCGCTCCG TCTCCGGGTA CAATCTCCAC AAGGTCATCT ACGAGAACGA CGACGGCGAG GACGTGATCA ACCTCTCGAA GCTGTTCGTC GGCGCCGAGG GGACGCTCGG AACGATCGTC GAGGCCGAGG TGTCGCTCGT CAGCCGCCCC GAGGAGACGG CGCTCGCGCT GTACACCTTC GACTCGCTGG TCGACGCGAT GAAGGCGGTC CCGGAAGCCT TGGAGTTCCC GGTGAGCGCG GTCGAGCTGA TGGACGACGA GGTGTTCGAC CTCGCTGCGG GCTCTCAGGA GTTCGCGCAG TACGCCGAGC CGATTCCGGA CCGCGCTACC GCGGCGCTCA TGCTGGAGTG GGACTCGGAG CTCGTCGACG ACTTCGAGGC GGCGATCGCC GACACGAACG CCCACTTCGT CGAGGAAGGC GACGCCTTCG ACGTGCTGGA GGCGTACACT CCCGAGGACC AAGAAGACCT CTGGAAGCTC CGGAAGGCGG CCATCCCGCT ACTGATGAGC ATGCAGGGCG ACCCGAAACC GTACCCGTTC ATCGAAGACG CGACGGTGCC GCCCGAGGAA CTCGCGGAGT ACGTCGGGCA GTTCGAGGAG GTGCTCACCG ACCACGACAC CTCGGCCGCC TACTTCGCGC ACGCCGGCAG CGGCACCCTT CACATTCGAC CCATCCTCTC GCTGAAAGAG GAGGAAGGCG TCGAGAAGAT GCACTCCATC TCCGAGGACG TCACCGACCT CGTCTTGGAA CACCACGGCG CCTTCTCGGG CGAGCACGGC GACGGGCTCG CCCGCACCGA GTTCAACCCG AAGATGTACG GCGAGGCGCT CTGGAGCGCG TTCCAGGAGC TCAAATCGAC GTTCGATTCC GAGTGGCGGA TGAACCCGGG GAAGGTCGTC TACGTCGACG GCGAGACCGC CGACGAGCGC GGCTACCCCG ACACCGCCGC TGACACGGAC ATGCGCGAGA ACCTCCGGTA CGGTCCTGCC TATCAGTCGA TCGAGCCGCA GACGACGCTG GACTTCTCAG AGGAGGGCGG GTTCTCCCAT CTCGTCGAGC TGTGTAACGG CTGTGGCACC TGCCGGGAAG TCGACTCCGG CGTGATGTGT CCGACCTACC GCGCCTCCGA GGAGGAGATC CAAGCGACCC GCGGCCGGGC GAACATGCTT CGGGCCGCCA TCAGCGGCGA GCTCGACGAC GACGAGATCC ATTCCGACCG GTTCCAAGAG GAGGTGCTCG GACTCTGCGT CGGCTGTAAG GGCTGTAAGA GCGACTGCCC GACCGGCGTC GACCTCGCGA AGCTCAAAGC CGAGGTGAAA CACGAGCACC ACGAGGAGGA GGGCTCCGGG CTCCGCGAGC GGATCTTCCG GGACATCGAC CGCTTCTCAG CGATCGGGAG CGCGCTCGCA CCGGTGTCGA ACGCGGCGAC GAAGATTCCC GGCGCTCGCG CGGTGATGGA CGCGGTCGCG GGGATCGCCC CGGACCGCGA GCTGCCGACG TTCCGCTCCG AGAGCTTCGA GGAGTGGTTC GCGTCCCGCG GCGGATCGAC GATCGACCCC GCCGAGGCGG TCGACACGGT CGCGCTGTTC CCCGACACGT ACACCAACTA CAGCTACCCG GCGGCGGGCA AGGCCGCCGT CGAGGTGCTT GAGGCGGCCG GCGTCCGCGT GGAAGTACCG GACGATCTGG CGCCCTCGGG CCGGGCGGCG TTCTCGACCG GCTTCCTCAA CGACGCCCGC GAGCGCGCGG CAACCAACGT GGCGGCGCTC GCGCCCCGCG TCCGCGACGG GCAGTCAGTC GTCTTCGTCG AGCCCTCGGA CGCGGTGATG TTCCAGGATG AGTACCTCGA TCTCCTCGAC GGCGACGATG TTGAGGCGGT GTCGGCCGCC GCGTACGGCG TCTTAGAGTA CCTCGACGCC GGCCGCGTCG ACGAGCAGTT GGCGTTCGAT GCGCCTGCGG AGTCGCTCAC GTATCACGGC CACTGCAACC AGAAGGCGAC GAACAAGGAC CACCACGCGG TCGGGGTACT CCGCCGGGCC GGCTACGACG TGGACCCGCT CGACTCCTCG TGTTGCGGGA TGGCCGGCTC GTTCGGCTAC GAGTCGGAAC ACTACGACAT CTCGAAGGCG ATCGGCCGGA TCCTCTTCGA TCAGGTCGAG GAGAGCGGCG GCGAGACGGT GACCGCGCCG GGCGCCTCCT GCCGCTCGCA GCTGGGAGAC CGTGACGGCG CGGAGAACCC ACCGCACCCG ATCGAGAAGG TCGCCGAGGC GGTGACCGGG GCCGCATCCG ACGCCGTCGC CGACGCGGGC GCCGCCGAGG CCGCGAGCCC GTCGCCCGCC GACGACTGA
|
Protein sequence | MASEPTRGDN TPGLDTSAAA LGHERPDVPA YRALAEDLRE RVDGEVQFDE YAQVLYATDG SIYQARPAGV VTPRSVADVQ ATMRVAADHG VPVIPRGAGS SLGGQTVGPG CVVLDFSTHM DEIQEVRPDD RRAVVQPGVV QDQLDDRLTE DGLKFAPDPA SSARATVVGG IGNNSTGAHS VRYGITDAYT EELQVVLADG SLIHTREVVL DSPEYEEIVS KNDREAALYE TTRKLVEENE AEIDEKYPNL KRSVSGYNLH KVIYENDDGE DVINLSKLFV GAEGTLGTIV EAEVSLVSRP EETALALYTF DSLVDAMKAV PEALEFPVSA VELMDDEVFD LAAGSQEFAQ YAEPIPDRAT AALMLEWDSE LVDDFEAAIA DTNAHFVEEG DAFDVLEAYT PEDQEDLWKL RKAAIPLLMS MQGDPKPYPF IEDATVPPEE LAEYVGQFEE VLTDHDTSAA YFAHAGSGTL HIRPILSLKE EEGVEKMHSI SEDVTDLVLE HHGAFSGEHG DGLARTEFNP KMYGEALWSA FQELKSTFDS EWRMNPGKVV YVDGETADER GYPDTAADTD MRENLRYGPA YQSIEPQTTL DFSEEGGFSH LVELCNGCGT CREVDSGVMC PTYRASEEEI QATRGRANML RAAISGELDD DEIHSDRFQE EVLGLCVGCK GCKSDCPTGV DLAKLKAEVK HEHHEEEGSG LRERIFRDID RFSAIGSALA PVSNAATKIP GARAVMDAVA GIAPDRELPT FRSESFEEWF ASRGGSTIDP AEAVDTVALF PDTYTNYSYP AAGKAAVEVL EAAGVRVEVP DDLAPSGRAA FSTGFLNDAR ERAATNVAAL APRVRDGQSV VFVEPSDAVM FQDEYLDLLD GDDVEAVSAA AYGVLEYLDA GRVDEQLAFD APAESLTYHG HCNQKATNKD HHAVGVLRRA GYDVDPLDSS CCGMAGSFGY ESEHYDISKA IGRILFDQVE ESGGETVTAP GASCRSQLGD RDGAENPPHP IEKVAEAVTG AASDAVADAG AAEAASPSPA DD
|
| |