Gene Hlac_1400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1400 
Symbol 
ID7400719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1406425 
End bp1409496 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content69% 
IMG OID643708461 
ProductFAD linked oxidase domain protein 
Protein accessionYP_002566058 
Protein GI222479821 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCA ACACGCCGGA TCCGGATCCG CGGGACGGGG GGGACTTCGA TTACACTGGC 
GGCGCGGTAG AGCGCCCGGG CCTCGTCGAC GCGCTGGAAG ACCGGGTCGA TGGCAACGTG
CGGTTCGACG AGTACTCGAA GCGGCTGTAC GCGACCGACG CCTCGGCGTA CGAGGTCACA
CCGATCGGCG TCGTGGTGCC GGAGTCGACG GCGGACGTGG CGGCGGTCCA CGAGTACTGC
TACGAAGAGG GGATTCCGGT GTTGCCCCGC GGGGGCGGCA CGTCGCTCGC GGGCCAGACC
GTCAACGAGG CGGTCGTCCT CGATCTCACC GGTTCGATGG ACGAGGTGCT GTCGACGGAT
CCGGAGGCGG GGACCGCCCG AGCGCAGGCG GGGGCGTACG TCGGCGACCT CAACGCCGCG
GTCGAGCCCC ACGGGCTGAA GTTCGCGCCC GATCCGGCGT GGCGCGACAA GTCCGCGATC
GGCGGCGCAA TCGGCAACAA CTCCACCGGG TCGCACTCGC TGAAGTACGG GAAAACCGAC
CACTACGTCG AGGAGCTGGA GGTCGTGTTA GCCGACGGGA CGGTGACGAC CTTCGGAGAG
GTCGCCGTCG AGGAGCTACG AGAGTCCGCA GACGCCGAGA GCGACGACCT CCTACCGCGG
ATCCACGCCG AGATCGTCCG GATCCTCGAC GAGGAGGCGG GCGCGATCGA CGAGCGCTTC
CCGGAGATGA AGCGGAACGT CTCCGGCTAC AACCTCGACC GCCTGCTCGC CGAGTACCGC
GGCGAGTACG GCGAGGCGGG CGTCGTCAAC CTCGGTCGGC TCATGGCCGG CAGCGAAGGG
ACCCTCGCGA CGGTCACGGA AGCGACCGTC TCGCTTGTCG AGATTCCAGA GACGAAGGCG
GTGGCGCTGC TCACCTACGA CGACCTGCTG GACGCGATGG AGGATGTGGC GCCGATCCTC
GACCACGACC CCGCCGCCGT CGAGGTGATG GACGACGTGC TCCTCGGCCT CGCGGCCGAC
ACGCCCGAGT TCGAGGGCGT CGTGGGGATG TTGCCCGAGG GGACCGACTC CGTCCTCCTC
GTCGAGTTTT ACGCCGACAG CGACGTGGAT GGGAAACAGA AGGTCGCGGA CCTGATCGCG
GATCGGGTGG GTGCGACGGC CGACCGTGCC CGCCCGGCGA CGATGGCGGA GCCGAGCGAC
GGCGCGGCCG AGACGACGAG CCAGCCCCGC CGCGCGGTCG ACGCGATGGA AGCGCACGAC
CCCGAAAAGC GCGACCGGTT CTGGAAGATG CGCAAGGCCG GACTCCCGAT CCTCCTGTCG
CGCACCACAG ACGAAAAGCA CATCTCCTTC ATCGAGGACT GCGCGATCCC GCCCGAACAC
CTCCCCGAGT ACACCCGCGA GTTCCAGGAG ATCCTCGACG ACAACGACAC CTTCGCGACC
TTCTACGCGC ACGCCGGCCC GGGCGTACTC CACATCCGGC CGCTGATCAA CACGAAAGAC
GTGGACGACG TGGACGCGAT GGTCGACATC GCCGACCGCG TCACCGACGC CGTGGTCCGG
CTCGGCGGGT CGGTGTCGGG CGAACACGGG GACGGCCGCG CCCGGACCCA GTGGAACCGG
AAGCTGTACG GCGACGACCT CTGGGACGCC TTCCGCGACC TCAAGACCGC CTTCGACCCC
GACTGGCTGC TCAATCCCGG AAACGTCTGC GGCGACCACG ACATGCGCGA GAACCTCCGG
TTCGACTCGG AGTACGAGTA CGACGCCGGC TTCGACCCCG CGATGGAGTG GGCGATCGAC
AACGGGATGC AGGGGATGGT CGAGCTCTGT CACGGCTGTG GCGGGTGTCG CGGCCCGCAG
GAGACCACCG GCGGCGTGAT GTGTCCGACC TATCGCGCCT CCGAAGAAGA GATACAGGCG
ACCCGCGGCC GGGCGAACAT GCTCCGCGGA GCCATGAACG GCGAACTCCC CGACGACCCG
ACCGACGACG AGTTCGTCAC CGAGGTGATG GATCTGTGTG TCGGCTGTAA GGGCTGTACG
AAGGACTGTC CGAGCGGCGT CGACATGGCG AAGCTCAAAG CCGAGGTCGA ACACGCCCAC
CACGAGGAAC ACGGGATCGA CCTGCGGACC CGACTGCTCG GCAACTTCGA ATCGCTCGCG
CCGATCGGCT CGACGCTCGC GCCCCTCTCG AACCTCCCCG GGAAGATCCC CGGCGCCGGC
CTCGTCATGG AGAAGGCCCT CGGGATCGCG AAGGAGCGTG ACCTCCCCAC GTTCCGCTCG
GAGAGCCTGA CCGACTGGTT CGAGGCCCGC GGCGGCGCGG GGATTCCCCG AGCCGAGGCC
GACCGCGACG TACTGCTGTT CCCCGACGTG TACACCACCT ACACGAACCC CGGTGCGGGG
AAGGCCGCGG TGCGCGTGCT GGAGGCCGCG AACTGCCACG TCCGAATCCC CGATGTGGAC
GGGAGCGGGC GCCCCCCGCA CTCGAAGGGG ATGCTCGACG AGTCGCGCGC GGCGGCGAGA
GACGCCGTGG AGACGCTCGT ACCAGATGTG GCCGACGGCT GGGACGTGGT GGTCGTCGAG
CCCACCGACG CCGTGATGCT GCAGACCGAC TACCACGACC TGCTCGACGG CGACGCCAGC
GACGTGCCCG ACGCCGACGT GGCGACCGTC TCGGCGAACA CCTACGGCGT CATGGAGTAC
GTCGACGCCC ACCGGCTCGA CGAGGATATC GACCTCGACG CGCCCACCGA ATCGCTCACG
TATCACGGGC ACTGCCACCA GAAGGCGACG AAGAAGGACC ATCACGCGGT CGGCGTGCTC
CGACGTGCGG GGTACGGGGT CGATCCGCTC GACTCCTCCT GCTGTGGGAT GGCGGGCTCG
TTCGGCTACG AGGCCGAGCA CTACTCGATG AGCAAGGCGA TCGGCCGGAT CCTCTTCGAT
CAGGTCGCCG AGAGCGACGG TGACGAGGTC GTTGCTCCCG GTGCCTCCTG CCGGAGCCAA
CTGAAGGAGC GCGACGGCGG TGCCCCCGAG CCGCCGCACC CGGTCGAGAA GCTGGCGGCG
GCGCTGGCGT AG
 
Protein sequence
MATNTPDPDP RDGGDFDYTG GAVERPGLVD ALEDRVDGNV RFDEYSKRLY ATDASAYEVT 
PIGVVVPEST ADVAAVHEYC YEEGIPVLPR GGGTSLAGQT VNEAVVLDLT GSMDEVLSTD
PEAGTARAQA GAYVGDLNAA VEPHGLKFAP DPAWRDKSAI GGAIGNNSTG SHSLKYGKTD
HYVEELEVVL ADGTVTTFGE VAVEELRESA DAESDDLLPR IHAEIVRILD EEAGAIDERF
PEMKRNVSGY NLDRLLAEYR GEYGEAGVVN LGRLMAGSEG TLATVTEATV SLVEIPETKA
VALLTYDDLL DAMEDVAPIL DHDPAAVEVM DDVLLGLAAD TPEFEGVVGM LPEGTDSVLL
VEFYADSDVD GKQKVADLIA DRVGATADRA RPATMAEPSD GAAETTSQPR RAVDAMEAHD
PEKRDRFWKM RKAGLPILLS RTTDEKHISF IEDCAIPPEH LPEYTREFQE ILDDNDTFAT
FYAHAGPGVL HIRPLINTKD VDDVDAMVDI ADRVTDAVVR LGGSVSGEHG DGRARTQWNR
KLYGDDLWDA FRDLKTAFDP DWLLNPGNVC GDHDMRENLR FDSEYEYDAG FDPAMEWAID
NGMQGMVELC HGCGGCRGPQ ETTGGVMCPT YRASEEEIQA TRGRANMLRG AMNGELPDDP
TDDEFVTEVM DLCVGCKGCT KDCPSGVDMA KLKAEVEHAH HEEHGIDLRT RLLGNFESLA
PIGSTLAPLS NLPGKIPGAG LVMEKALGIA KERDLPTFRS ESLTDWFEAR GGAGIPRAEA
DRDVLLFPDV YTTYTNPGAG KAAVRVLEAA NCHVRIPDVD GSGRPPHSKG MLDESRAAAR
DAVETLVPDV ADGWDVVVVE PTDAVMLQTD YHDLLDGDAS DVPDADVATV SANTYGVMEY
VDAHRLDEDI DLDAPTESLT YHGHCHQKAT KKDHHAVGVL RRAGYGVDPL DSSCCGMAGS
FGYEAEHYSM SKAIGRILFD QVAESDGDEV VAPGASCRSQ LKERDGGAPE PPHPVEKLAA
ALA