Gene Hlac_0210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0210 
Symbol 
ID7402139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp224479 
End bp227577 
Gene Length3099 bp 
Protein Length1032 aa 
Translation table11 
GC content68% 
IMG OID643707273 
ProductFAD linked oxidase domain protein 
Protein accessionYP_002564885 
Protein GI222478648 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.23019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGCG AGCCCACGCG AGGAGATAAC ACGCCCGGCT TGGATACGTC GGCGGCCGCG 
CTCGGTCACG AGCGACCGGA CGTGCCGGCC TATCGAGCGC TCGCGGAGGA TCTCCGCGAG
CGCGTCGACG GCGAGGTTCA GTTCGACGAG TACGCGCAGG TGCTGTACGC CACCGACGGC
AGTATTTATC AGGCGCGACC GGCCGGCGTC GTGACGCCGC GCTCGGTCGC GGACGTGCAG
GCAACGATGC GTGTCGCGGC CGACCACGGG GTTCCCGTCA TCCCGCGGGG TGCGGGCTCC
TCGCTCGGGG GACAGACCGT CGGGCCAGGG TGTGTCGTGC TCGACTTCTC GACGCACATG
GACGAGATTC AGGAGGTCCG GCCCGACGAT CGTCGAGCGG TGGTCCAGCC CGGAGTCGTC
CAGGACCAGC TCGACGACCG ACTGACCGAG GACGGCTTGA AGTTCGCGCC CGACCCGGCC
TCCTCGGCGC GGGCGACCGT CGTTGGCGGC ATCGGCAACA ACTCCACCGG TGCGCACTCG
GTGCGGTACG GGATCACCGA CGCGTACACG GAGGAGTTGC AGGTCGTCCT CGCGGACGGC
TCGCTGATCC ACACCCGCGA GGTCGTCCTC GACTCGCCGG AGTACGAGGA GATCGTCTCG
AAGAACGATC GGGAGGCCGC TCTCTACGAG ACCACCCGAA AGCTCGTCGA GGAGAACGAA
GCCGAGATCG ACGAGAAGTA CCCGAACCTC AAGCGCTCCG TCTCCGGGTA CAATCTCCAC
AAGGTCATCT ACGAGAACGA CGACGGCGAG GACGTGATCA ACCTCTCGAA GCTGTTCGTC
GGCGCCGAGG GGACGCTCGG AACGATCGTC GAGGCCGAGG TGTCGCTCGT CAGCCGCCCC
GAGGAGACGG CGCTCGCGCT GTACACCTTC GACTCGCTGG TCGACGCGAT GAAGGCGGTC
CCGGAAGCCT TGGAGTTCCC GGTGAGCGCG GTCGAGCTGA TGGACGACGA GGTGTTCGAC
CTCGCTGCGG GCTCTCAGGA GTTCGCGCAG TACGCCGAGC CGATTCCGGA CCGCGCTACC
GCGGCGCTCA TGCTGGAGTG GGACTCGGAG CTCGTCGACG ACTTCGAGGC GGCGATCGCC
GACACGAACG CCCACTTCGT CGAGGAAGGC GACGCCTTCG ACGTGCTGGA GGCGTACACT
CCCGAGGACC AAGAAGACCT CTGGAAGCTC CGGAAGGCGG CCATCCCGCT ACTGATGAGC
ATGCAGGGCG ACCCGAAACC GTACCCGTTC ATCGAAGACG CGACGGTGCC GCCCGAGGAA
CTCGCGGAGT ACGTCGGGCA GTTCGAGGAG GTGCTCACCG ACCACGACAC CTCGGCCGCC
TACTTCGCGC ACGCCGGCAG CGGCACCCTT CACATTCGAC CCATCCTCTC GCTGAAAGAG
GAGGAAGGCG TCGAGAAGAT GCACTCCATC TCCGAGGACG TCACCGACCT CGTCTTGGAA
CACCACGGCG CCTTCTCGGG CGAGCACGGC GACGGGCTCG CCCGCACCGA GTTCAACCCG
AAGATGTACG GCGAGGCGCT CTGGAGCGCG TTCCAGGAGC TCAAATCGAC GTTCGATTCC
GAGTGGCGGA TGAACCCGGG GAAGGTCGTC TACGTCGACG GCGAGACCGC CGACGAGCGC
GGCTACCCCG ACACCGCCGC TGACACGGAC ATGCGCGAGA ACCTCCGGTA CGGTCCTGCC
TATCAGTCGA TCGAGCCGCA GACGACGCTG GACTTCTCAG AGGAGGGCGG GTTCTCCCAT
CTCGTCGAGC TGTGTAACGG CTGTGGCACC TGCCGGGAAG TCGACTCCGG CGTGATGTGT
CCGACCTACC GCGCCTCCGA GGAGGAGATC CAAGCGACCC GCGGCCGGGC GAACATGCTT
CGGGCCGCCA TCAGCGGCGA GCTCGACGAC GACGAGATCC ATTCCGACCG GTTCCAAGAG
GAGGTGCTCG GACTCTGCGT CGGCTGTAAG GGCTGTAAGA GCGACTGCCC GACCGGCGTC
GACCTCGCGA AGCTCAAAGC CGAGGTGAAA CACGAGCACC ACGAGGAGGA GGGCTCCGGG
CTCCGCGAGC GGATCTTCCG GGACATCGAC CGCTTCTCAG CGATCGGGAG CGCGCTCGCA
CCGGTGTCGA ACGCGGCGAC GAAGATTCCC GGCGCTCGCG CGGTGATGGA CGCGGTCGCG
GGGATCGCCC CGGACCGCGA GCTGCCGACG TTCCGCTCCG AGAGCTTCGA GGAGTGGTTC
GCGTCCCGCG GCGGATCGAC GATCGACCCC GCCGAGGCGG TCGACACGGT CGCGCTGTTC
CCCGACACGT ACACCAACTA CAGCTACCCG GCGGCGGGCA AGGCCGCCGT CGAGGTGCTT
GAGGCGGCCG GCGTCCGCGT GGAAGTACCG GACGATCTGG CGCCCTCGGG CCGGGCGGCG
TTCTCGACCG GCTTCCTCAA CGACGCCCGC GAGCGCGCGG CAACCAACGT GGCGGCGCTC
GCGCCCCGCG TCCGCGACGG GCAGTCAGTC GTCTTCGTCG AGCCCTCGGA CGCGGTGATG
TTCCAGGATG AGTACCTCGA TCTCCTCGAC GGCGACGATG TTGAGGCGGT GTCGGCCGCC
GCGTACGGCG TCTTAGAGTA CCTCGACGCC GGCCGCGTCG ACGAGCAGTT GGCGTTCGAT
GCGCCTGCGG AGTCGCTCAC GTATCACGGC CACTGCAACC AGAAGGCGAC GAACAAGGAC
CACCACGCGG TCGGGGTACT CCGCCGGGCC GGCTACGACG TGGACCCGCT CGACTCCTCG
TGTTGCGGGA TGGCCGGCTC GTTCGGCTAC GAGTCGGAAC ACTACGACAT CTCGAAGGCG
ATCGGCCGGA TCCTCTTCGA TCAGGTCGAG GAGAGCGGCG GCGAGACGGT GACCGCGCCG
GGCGCCTCCT GCCGCTCGCA GCTGGGAGAC CGTGACGGCG CGGAGAACCC ACCGCACCCG
ATCGAGAAGG TCGCCGAGGC GGTGACCGGG GCCGCATCCG ACGCCGTCGC CGACGCGGGC
GCCGCCGAGG CCGCGAGCCC GTCGCCCGCC GACGACTGA
 
Protein sequence
MASEPTRGDN TPGLDTSAAA LGHERPDVPA YRALAEDLRE RVDGEVQFDE YAQVLYATDG 
SIYQARPAGV VTPRSVADVQ ATMRVAADHG VPVIPRGAGS SLGGQTVGPG CVVLDFSTHM
DEIQEVRPDD RRAVVQPGVV QDQLDDRLTE DGLKFAPDPA SSARATVVGG IGNNSTGAHS
VRYGITDAYT EELQVVLADG SLIHTREVVL DSPEYEEIVS KNDREAALYE TTRKLVEENE
AEIDEKYPNL KRSVSGYNLH KVIYENDDGE DVINLSKLFV GAEGTLGTIV EAEVSLVSRP
EETALALYTF DSLVDAMKAV PEALEFPVSA VELMDDEVFD LAAGSQEFAQ YAEPIPDRAT
AALMLEWDSE LVDDFEAAIA DTNAHFVEEG DAFDVLEAYT PEDQEDLWKL RKAAIPLLMS
MQGDPKPYPF IEDATVPPEE LAEYVGQFEE VLTDHDTSAA YFAHAGSGTL HIRPILSLKE
EEGVEKMHSI SEDVTDLVLE HHGAFSGEHG DGLARTEFNP KMYGEALWSA FQELKSTFDS
EWRMNPGKVV YVDGETADER GYPDTAADTD MRENLRYGPA YQSIEPQTTL DFSEEGGFSH
LVELCNGCGT CREVDSGVMC PTYRASEEEI QATRGRANML RAAISGELDD DEIHSDRFQE
EVLGLCVGCK GCKSDCPTGV DLAKLKAEVK HEHHEEEGSG LRERIFRDID RFSAIGSALA
PVSNAATKIP GARAVMDAVA GIAPDRELPT FRSESFEEWF ASRGGSTIDP AEAVDTVALF
PDTYTNYSYP AAGKAAVEVL EAAGVRVEVP DDLAPSGRAA FSTGFLNDAR ERAATNVAAL
APRVRDGQSV VFVEPSDAVM FQDEYLDLLD GDDVEAVSAA AYGVLEYLDA GRVDEQLAFD
APAESLTYHG HCNQKATNKD HHAVGVLRRA GYDVDPLDSS CCGMAGSFGY ESEHYDISKA
IGRILFDQVE ESGGETVTAP GASCRSQLGD RDGAENPPHP IEKVAEAVTG AASDAVADAG
AAEAASPSPA DD