Gene Hmuk_2235 details

Gene Information

Locus tag: Hmuk_2235
Symbol:
ID: 8411775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2153665
End bp: 2155116
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 68%
IMG OID: 645020578
Product: cryptochrome, DASH family
Protein accession: YP_003178055
Protein GI: 257388282
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID: [TIGR02765] cryptochrome, DASH family
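
The coordinate and length fields above can be cross-checked with a short calculation. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2153665, 2155116)
print(length, protein_length(length))
```

Under these assumptions the result is 1452 bp and 483 aa, which matches the Gene Length and Protein Length fields.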


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.14399
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.570848
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACCG TTCTCGTCTG GTTCCGCCGC GATCTGCGCT GTCACGACAA CGCGACGTTG 
CGACGCGCCG TCGCCGAGGC CGACACCGTC GTGCCGCTGT ACTGTCTCCC GGATCGACTG
ACCGGCGAGG GGATGTTCGG GCTCGACAGG GTCGGTCCCC ATCGGGCGCA GTTCCTGATC
GAGAGCCTCG CGGACCTGCG CGAGTCGTTG CGGGACCGGG ACGGCGAACT GTACGTTCGC
AGCGGCGACC CCGGGACGGT CGTCCCCGAG GCCGCTGAGG AGTTCGACGC CGACGCGGTC
TACTGGCAGG CGCTCCCGGG TCCCGAAGAG CGGGACGAAG CTGGCAGCGT TCGGGCGGGG
CTGGCCGACG CCGGGATCGA CTCCGAGACG TTCTGGACGC ACACGCTGTA CCACCGCGAC
GACCTCCCCA GACCGCCCGA CGAGATCGAG GACACCTTCA CGCCGTGGAA GGACCGAACC
GAAGCGAAGG CGACCGTCCG ACCGCCCAAA CCGGCCCCGG AGTGGGTCCA CGCCCCCAAC
GGCGGCCGGC GCGCCAGCAG CGGTGCCGAC GATCTCCCCA CGCTCGCGGA CTTCGGCTTC
GGCGAGGACG AGGCGACGGT CGACGACCGC GGCGTCCTCG ACTGGACCGG CGGCGAGACG
GCGGGGCTGG ATCGCGTCGC GACGTACGTC TGGGAGCGTG ACTGCCTGCG GGAGTACCGC
GAGACGCGCA ACGGCCTCGT GGGTGCCGAC TACTCCTCGA AGTTCTCGCC GTGGCTCTCC
TTTGGCTGTC TCTCGCCGCG TCAGATCCAC CGCGAGGTCG AGCAGTACGA GACCGATCGG
GTGGAAAACG ACTCGACGTA CTGGCTCGTC TTCGAGCTGA CCTGGCGGGA CTTCTTCCAG
TACCAGCTCG CGAAGTACGG CGCGAAGTGG TTCCAGCCCG GCGGCATCCG CGACCGGGAC
GACATTCGGT GGCGGCGCGA CCGTGCGCAG TTCGAACGCT GGGCGCGTGG CGAGACGGGG
ATCCCCTTCG TCGACGCCAA CATGCGCGAG CTGAACGCGA CGGGATACGT GAGCAATCGC
GGCCGCCAGA ACGTCGCCTC GTTTCTCTCG AACAACCTCC GGATCGACTG GCGGCTCGGG
GCGGCATACT TCGAGTCGCG GCTGGTCGAC TACGACGTGG CCTCGAACTG GTGTAACTGG
GCGTACCAGT CACAGGTCGG CAACGACTCG CGAGACAGCT ACTTCGAGAT CGTCGGCCAG
GCGACACACT ACGATCCCGA GGGGGCGTAC GTCACTCGCT GGTGTCCGGA ACTGTCGGCA
CTTCCGCCGG AGTACGTCCA CGAGCCCTGG ACGATGAGCG AGCACGAGCA GGCCGACTAC
GGCGTCGAGC TGGGGACCGA CTACCCCGCG CCGATGATCG ACCTCGAAGC GTCCTACGAG
AAGCTACGCT GA
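
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following Python sketch shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a small sample.
sample = "ATGAGTACCG TTCTCGTCTG"
print(f"{gc_fraction(sample):.2f}")
```

Applying `gc_fraction` to the full listing should give a value comparable to the GC content field, assuming that field was computed over the same 1452 bp.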
 
Protein sequence
MSTVLVWFRR DLRCHDNATL RRAVAEADTV VPLYCLPDRL TGEGMFGLDR VGPHRAQFLI 
ESLADLRESL RDRDGELYVR SGDPGTVVPE AAEEFDADAV YWQALPGPEE RDEAGSVRAG
LADAGIDSET FWTHTLYHRD DLPRPPDEIE DTFTPWKDRT EAKATVRPPK PAPEWVHAPN
GGRRASSGAD DLPTLADFGF GEDEATVDDR GVLDWTGGET AGLDRVATYV WERDCLREYR
ETRNGLVGAD YSSKFSPWLS FGCLSPRQIH REVEQYETDR VENDSTYWLV FELTWRDFFQ
YQLAKYGAKW FQPGGIRDRD DIRWRRDRAQ FERWARGETG IPFVDANMRE LNATGYVSNR
GRQNVASFLS NNLRIDWRLG AAYFESRLVD YDVASNWCNW AYQSQVGNDS RDSYFEIVGQ
ATHYDPEGAY VTRWCPELSA LPPEYVHEPW TMSEHEQADY GVELGTDYPA PMIDLEASYE
KLR
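
The protein sequence can be checked against the gene sequence by translating it codon by codon. The Python sketch below builds a codon table from the amino acid string of NCBI translation table 11 (the table named in this record) and stops at the first stop codon. It is a minimal illustration, not the pipeline that produced this record: it does not treat alternative start codons specially, which is unnecessary here because the gene begins with ATG. The name `translate` is an illustrative choice.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence (ten codons).
print(translate("ATGAGTACCG TTCTCGTCTG GTTCCGCCGC"))  # prints MSTVLVWFRR
```

The ten residues printed match the first block of the protein sequence, and translating the final twelve bases (`AAGCTACGCT GA`) gives `KLR` followed by the TGA stop codon.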