Gene Hlac_0428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0428 
Symbol 
ID7401046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp446324 
End bp447736 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content66% 
IMG OID643707493 
ProductFAD dependent oxidoreductase 
Protein accessionYP_002565101 
Protein GI222478864 
COG category[R] General function prediction only 
COG ID[COG0579] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.106038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.210825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGA ACACTGATCT GATCATCGTC GGCGGGGGGA TAAGCGGGGC GTCGCTTCTG 
TACACGGTGG CGAAGTTCAC CGACATCGAT GACGTGACGC TGATCGAAAA AGAGCGGGAG
ATCGCCGCGA TCAACTCCCA CCGGACGAAC AACTCCCAGA CGCTGCACTT CGGCGACATC
GAGACGAACT ACACCCTCGA AAAGGCCGAG GAGGTCAAGG AGGGCGCCGA GCTGCTGGCG
GGCTACCTCG AAGGGACCGA CCCGGACCGG GAGATGCACA GCAAGCGCAG CAAGATGGTC
CTCGGCGTCG GCGACGAGGA GGCCGCCAAG CTGGAAGAGC GCTACCATCA AAACGGATTC
GGCGACCTCT ACCCGAAGCT CCGAGAGATC GGCCGCGAGG AGATCGAGGA GCTAGAGCCG
AAGGTCGTCG AGGGCCGCGA CCCGACCACG GCGCTCAAGG CGCTCCAGAC ACCGGACGGC
TATGTCGTCG ACTACGGCGC GGTCGCGAAG TCGTTCGTCG ACGCCGCCCG CGAGGAGGAC
GGTGTCGGCG TCCACCTCGG CACCGCGGTC GAGAACGTCG ACGAGGGGTA CGACGGGTTC
ACGGTCGAGA CCGACGACGG CGACTTCGAG GCCGACGCGG TCGTCGTCGC CGCCGGCTCC
CACAGCCTCC AGTTCGCCAA GGAGATGGGG TACGGCGAGC ACATGTCGCT GCTTCCGGTC
GCGGGGAGCT TCTTCCTCGC CGACGACCTG TTGAACGGGA AGGTGTACAC GCTGCAGATG
AAGAAGCTCC CGTTCGCGGC GGTCCACGGC GACGCCGACG TGCACGACGG CGGAGTCACG
CGCTTCGGCC CCACCGCGAA GCTCGTCCCC GCACTCGAAC GCGGCCGACT CTCCACCGTG
AGCGACTTTG CCGACGTGTT CGGCTTCTCG CCGGCGTCGG TCCTGAGCTA CGTCAACATC
CTCTCGGACC GCATCCTCTT CCCCTACGTC GTGCGCAACC TCGTGTACGA CATCCCCGAG
ATCGGCAAGC GCGCGTTCCT CCCGGACGTC CAGAAGGTCG TGCCGAGCGT CGACGTCGAC
GACATCGAGC GCGCGAAGGG GTACGGCGGC GTGCGCCCGC AGATCGTCAA TACGGAGACC
AAGGAGCTCG ACATGGGCGA GGCGAAGATC ACCGGCGACG GGGTTCTGTT CAACATCACG
CCCTCGCCGG GCGCGTCGAC CGCGCTGAAG AACGCGATGA CCGACGTACA CACGGTTCTC
GACTTCTTCG ACGAGGACCA CGAGTTCGAC GAGGCGGCGT TCCGCGAGGC GACGATCGAG
AACTTCCCAC GGGTCGACGC CGACGCGGAG GCAGGCGCTG AGACCGAGGA CGACGGCGCC
GACGATCACG CTACCGCCGA AGCCGACGAC TGA
 
Protein sequence
MSENTDLIIV GGGISGASLL YTVAKFTDID DVTLIEKERE IAAINSHRTN NSQTLHFGDI 
ETNYTLEKAE EVKEGAELLA GYLEGTDPDR EMHSKRSKMV LGVGDEEAAK LEERYHQNGF
GDLYPKLREI GREEIEELEP KVVEGRDPTT ALKALQTPDG YVVDYGAVAK SFVDAAREED
GVGVHLGTAV ENVDEGYDGF TVETDDGDFE ADAVVVAAGS HSLQFAKEMG YGEHMSLLPV
AGSFFLADDL LNGKVYTLQM KKLPFAAVHG DADVHDGGVT RFGPTAKLVP ALERGRLSTV
SDFADVFGFS PASVLSYVNI LSDRILFPYV VRNLVYDIPE IGKRAFLPDV QKVVPSVDVD
DIERAKGYGG VRPQIVNTET KELDMGEAKI TGDGVLFNIT PSPGASTALK NAMTDVHTVL
DFFDEDHEFD EAAFREATIE NFPRVDADAE AGAETEDDGA DDHATAEADD