Gene Hmuk_2541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2541 
Symbol 
ID8412085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2446241 
End bp2447641 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content69% 
IMG OID645020882 
ProductDeoxyribodipyrimidine photo-lyase 
Protein accessionYP_003178356 
Protein GI257388583 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.272644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATCC ACTGGCATCG ACGCGACCTC CGGACGACCG ACAACGCCGG CCTGGCCGCA 
GCGACGGCCG ACAGCCCGGT CGTGCCGGTG TTCGTCTTCG ACGACGCCGT CCTCGACCAC
GCCGCACCGC CCCGCGTGGC GTTCATGCTG GACGCGCTCG ACTCGCTGCG GGCACAGTAC
CGCGACCGCG GGAGCGACCT CGTGATCGCT CACGGCGATC CGACGGCCGA GATCCCGCGG
CTGGCCGAGG CGTTCGGAGC CGACGGCGTG ACCTGGGGCG AGGCCTACTC CGGGCTCGGA
ATCGAGCGCG ACATCGCCGT CCGGCAGGCC CTCGACGACG TGGGCGTCGA ACGCGAGGCG
GTCACCGATT CGGTTCACCA TCGCCCCGGC GAGATCACGA CCAACGACGG CGATCCGTAC
TCGGTGTTCA CGTACTTCGG GCGCAAGTGG CACGACCGGG AGAAAGAGGA CCCCTACGAC
GCGCCCGGCC CGGACGAACT GGCCGACGTG TCCGGCGATC CCCTGCCGTC GGTGGGAGAC
CTGGGGTTCG AGGAACCACA GGCAGAGATC CCTCCGGCGG GGACGGAGCC GGCCCGGGAG
CTCCTCGACG CGTTCTGCGA GGACGACATC TATCGGTACG AGGACCGCCG AGACTACCCC
GCAGACGACT GCACCTCACG GCTCTCGGCT CACCTCAAGT TCGGGACGAT CGGCATCAGG
GAGGTGTACG AGCGGACCGC GAGCGCGGCG GCAGCGGCCG ACGACGAGGA ACGGCGCGAA
TCCGTCGCGG AGTTCCAGTC GCAGTTGGCC TGGCGGGAGT TCTACACGCA GGTCCTCTTT
GCCAACCGGT CGGTCGTCAC GGACAACTAC AAGACCTACG AGCGCCCGCT CCAGTGGCGC
GACGACCCCG AGGCGCTCCA GGCCTGGAAG GACGGCGAGA CGGGATACCC GATCGTCGAC
GCCGGGATGC GCCAGCTCCG CCAGGAGGCG TTCGTGCACA ACCGCGTCCG GATGATCGTC
GCCTCCTTTC TCACCAAGGA CTTGCTGATC GACTGGCGAG CGGGATACGA GTGGTTCAAA
GAGCGTCTGG TGGACCACGA CACCGCGAAC GACAACGGCG GGTGGCAGTG GGCCGCCTCG
ACGGGAACCG ACGCCCAGCC GTACTTCCGG ATCTTCAATC CGATGACTCA GGGCGAGCGG
TACGACCCCG ACGCGGAGTA CATCAAGACG TACGTCCCCG AACTGCGTGA CGCCGAGCCG
TCGGTGATCC ACGAGTGGCC CGACCTCTCG CTGACCCAGC GTCGCAACGC CGCCCCGGAG
TACCCCGACC CCATCGTCGA CCACAGCGAG CGGCGCGACC AGGCTCTGGA GATGTTCGAG
ACCGCCCGCG GCGAGAGCTG A
 
Protein sequence
MRIHWHRRDL RTTDNAGLAA ATADSPVVPV FVFDDAVLDH AAPPRVAFML DALDSLRAQY 
RDRGSDLVIA HGDPTAEIPR LAEAFGADGV TWGEAYSGLG IERDIAVRQA LDDVGVEREA
VTDSVHHRPG EITTNDGDPY SVFTYFGRKW HDREKEDPYD APGPDELADV SGDPLPSVGD
LGFEEPQAEI PPAGTEPARE LLDAFCEDDI YRYEDRRDYP ADDCTSRLSA HLKFGTIGIR
EVYERTASAA AAADDEERRE SVAEFQSQLA WREFYTQVLF ANRSVVTDNY KTYERPLQWR
DDPEALQAWK DGETGYPIVD AGMRQLRQEA FVHNRVRMIV ASFLTKDLLI DWRAGYEWFK
ERLVDHDTAN DNGGWQWAAS TGTDAQPYFR IFNPMTQGER YDPDAEYIKT YVPELRDAEP
SVIHEWPDLS LTQRRNAAPE YPDPIVDHSE RRDQALEMFE TARGES