Gene Hlac_0288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0288 
Symbol 
ID7401214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp310553 
End bp311950 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content67% 
IMG OID643707351 
ProductDNA photolyase FAD-binding 
Protein accessionYP_002564963 
Protein GI222478726 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.155544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.119697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTAT TTTGGCACCG ACGCGATCCG CGCACCCGGG ACAACGTCGG ACTCGCGGCG 
GCCGCACGGA CGGGGACCGT CGTCCCCGTC TTCGTCTACG ACACCGACCT GTTCGGGACG
ATGGGTGCAC GCCAGCGGGC CTTCTTCCTC CGACACGTAA AGCGACTGAA GGAGCGCTAC
CGAGAGTTCG GAAGCGACCT CGTCGTCCGC GCGGGCGACC CGGAGAAGGT CCTCGTCGAT
CTCGCCGACG AGTACGACGC GGAGGCGGTC TTCTACAACG AGTACTACCG TCCCGCGAGG
CGGAACCGCC AGCGGGCCGT CGAGGGGGCG CTCGAGGGCT TCGGTGTCGA AACGAATGCA
CGGACCGATG CCGTGTTAGT CGATCCCGGC CGGCTGGAGG AGCGCTACGC GAATCACGGT
CGATTTCACG ATGAGTGGGA GACCGTCCCT AAGCCCGAGC CGTCCCCGGA ACCAGACGCG
GACGCGCTCG TGAACATGCG CGACGAAAAA ACGGTCTCTG AGATTGACCC CGACATCGAC
CCCGACATCG ACCTGCCGGC GACCGGCTAC CGGGCCGCAC GCGAGCGGTT CGACGGCTTC
CTCGAACACG GAATCCGGTC GTACGCCGAC ACGCGCGACG ACCTTGCCCG GGCCGTCGAG
GCGCCGACGC ACGCGGTCTC GCGAATGTCG CCGTACCTCG CAACGGGGGC GATCGGGATC
CGGGAGATGT GGGCGGACGC GACCGACGCG TTCGAGGCGG CGACGGGCGA CGAGCGCCGC
AACGTCGACA AGTACCGCGA CGAGCTGTCG TGGCGCGAGC AGATGTACCA CCTGCTGTAC
TACACCCCCG ATCTGGCCGT CGCGAACTAC AAATCGTTCC CGAACGAGAT CGCGTGGCGC
GAGGACGACA CGGCGTTCGA GGCGTGGACG CGCGGCGAGA CCGGCTACCC GCTCGTCGAC
GCCGGGATGC GCCAGCTGAA CGCGGAGGGG TACGTCCACA ACCGCCCGCG ACAGGTGGTC
GCGAGCTTTC TCACGAAACA CCTCCTGATC GACTGGCGGC GCGGGGCGCG CTACTTCACC
ACACAGCTGA TCGACCACGA CCACGCCTCG AACCACGGCG CGTGGCAGTG GACCGCCTCC
ACCGGCACCG ATTCGGTGGA TGTGCGCATC TTCGATCCGG TGGCACAGAT GGCGAAGTAC
GACGCTGACG CGACGTTTGT GAAAGAATAC GTCCCCGAAC TGCGAGACGT GCCCGCCGAG
GAGATAGTCG ACTGGCCGAC TCTCTCGCGG GTCGAGCGCG AGACGCTGGC GCCGGAGTAC
CCGCATCCGA TCGTCGACCG GAACGAGGGG TACGAGCGGG CGCAGCGGGT GTTCGAGGAA
GCGCTCGGGA AGCGGTGA
 
Protein sequence
MQLFWHRRDP RTRDNVGLAA AARTGTVVPV FVYDTDLFGT MGARQRAFFL RHVKRLKERY 
REFGSDLVVR AGDPEKVLVD LADEYDAEAV FYNEYYRPAR RNRQRAVEGA LEGFGVETNA
RTDAVLVDPG RLEERYANHG RFHDEWETVP KPEPSPEPDA DALVNMRDEK TVSEIDPDID
PDIDLPATGY RAARERFDGF LEHGIRSYAD TRDDLARAVE APTHAVSRMS PYLATGAIGI
REMWADATDA FEAATGDERR NVDKYRDELS WREQMYHLLY YTPDLAVANY KSFPNEIAWR
EDDTAFEAWT RGETGYPLVD AGMRQLNAEG YVHNRPRQVV ASFLTKHLLI DWRRGARYFT
TQLIDHDHAS NHGAWQWTAS TGTDSVDVRI FDPVAQMAKY DADATFVKEY VPELRDVPAE
EIVDWPTLSR VERETLAPEY PHPIVDRNEG YERAQRVFEE ALGKR