Gene Hmuk_3369 details


Gene Information

Locus tag: Hmuk_3369
Symbol:
ID: 8409447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013201
Strand:
Start bp: 171079
End bp: 172659
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 67%
IMG OID: 645018292
Product: Deoxyribodipyrimidine photo-lyase
Protein accession: YP_003175813
Protein GI: 257373039
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
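
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(171079, 172659)
print(length)                  # 1581
print(protein_length(length))  # 526
```

Both values agree with the Gene Length and Protein Length fields listed in this record.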


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.149993
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG GGCCGACAGC GGTGACACCC GACGCCGACG ACGTGCCGAC GACGACGGAC 
GGCTGTGTCG TCTGGCACCG CCGCAATCTC CGGACGACCG ACCACGGCGC GCTCTCGTAC
GCGGGCGAGG AGTACGACAC CGTCTTGCCG CTGTTCGTCT TCGACCCGCA GTTCTACGGT
AACGACGGAC TCGCCTGTGA CGCCAGACTC CTCCTCCTCC ACGAGTCCGT CGAGAGTCTC
CGGCGACTGT ACGAGGCTGT CGGGGGAACG CTCAGCTACG CCCACGGTGA TCCGCTGTCG
GTGCTCTCGG CTCTGGCAGC GGCCGGCTGG GACATCGTCG CGACGGCCGA CCCGACCGCA
CGCTACGGGC GTCAGCGAGA CGACCGGGCC GCGGCACACT GTGACGTGCG CTTCGTCGAC
AGTGACGGGC TGGTTCGAGA CCAGGCGTGG CCGCGTGCGG GGTGGAGCGA CCGCGTCGAG
GCGTGGTTCG AGTCGTCGCC CCACTCCTGG GACCCGGCCG ACGTTTCGTT CGCCTCGCTG
CCGGGGACGG TGTCCGTCGC CGATATCGAG CGTACGTACG ACGTACACGC GGCGAAGACG
ACGGTCCCAC CCGGCGGCCG AGCGGCGGGG GCCAGACGAC TCCGACAGTT CGTCGCAGAC
ATCGAGCAGT ATCCCGGCAA CATCTCCGCG CCGACAGACG CACGGACCGG TACCAGCGGG
CTCTCGCCGC ACCTCCGGTT CGGATCGCTG TCGGTACGGG AAGTCTACCG GTACGTGATG
GACAACGCAC CGCCGTGTAC CGGCCGCGAG ATGTTCGTCT CCAGGCTCTA CTGGAACAAA
CACTACCACC AGAAACTGGA AGACTGGAGC GGCTGGACGA CGACGGCGGT CAACCCCGCA
ATGCGAGGGT GTCGGGCGGA GTCACACGAC CCCGAGCTGG TGACTGCCTG GAAGCGGGGG
ACCACCGGCT TTCCCATGGT CGACGCGTCC ATGCGGTGTC TCGTCGAGAC CGGGTGGCTC
AACTTCCGGA TGCGAGCGAT GTGTGCGAGC GTCTTCGCCG ACCTGTTCCA GCAGCCCTGG
CAGATCGGCG CGGACTTCTA TCACTACCAC CTGATCGACG CCGATCCGGC GATAAACTAC
ACCCAGTGGC AGTCACAGGC CGGTACGGTC GGAACGAACC TCATGCGCAT CTACAACCCG
ATCAAACAGG TGCGTGACAA CGATCCCGAC GGCACGTTCG TCTCGACGTA CGTCCCGGAA
CTGGCTCCTC TCCCGGCCGA GTACCTGCCA CGCCCCGAAA AGACACCGCT CCACGTTCAG
GAAGCGTGTG GCGTCGAGAT CGGCACGGAC TATCCGTACC CCGTCGTCGA GTACGAGGCA
GCCAGACAGC GTGCGATCGA GCGCTACGAA CGTCTCGAAC CGGCCGCCCG CCAGGCCCTC
CAAGAGCCGG CGGTCGCTCG ACGAGCGTCA CTGTCCTCGC AGTCACGGCC ACCGGACGGG
TCGGACGACA CGTCGACGGT GACCGAAGCG GGTCCACAGC AGCGCGGCCT GGACGCGTAC
ACGGAGACAG ACAAGGAGTG A
 
Protein sequence
MSDGPTAVTP DADDVPTTTD GCVVWHRRNL RTTDHGALSY AGEEYDTVLP LFVFDPQFYG 
NDGLACDARL LLLHESVESL RRLYEAVGGT LSYAHGDPLS VLSALAAAGW DIVATADPTA
RYGRQRDDRA AAHCDVRFVD SDGLVRDQAW PRAGWSDRVE AWFESSPHSW DPADVSFASL
PGTVSVADIE RTYDVHAAKT TVPPGGRAAG ARRLRQFVAD IEQYPGNISA PTDARTGTSG
LSPHLRFGSL SVREVYRYVM DNAPPCTGRE MFVSRLYWNK HYHQKLEDWS GWTTTAVNPA
MRGCRAESHD PELVTAWKRG TTGFPMVDAS MRCLVETGWL NFRMRAMCAS VFADLFQQPW
QIGADFYHYH LIDADPAINY TQWQSQAGTV GTNLMRIYNP IKQVRDNDPD GTFVSTYVPE
LAPLPAEYLP RPEKTPLHVQ EACGVEIGTD YPYPVVEYEA ARQRAIERYE RLEPAARQAL
QEPAVARRAS LSSQSRPPDG SDDTSTVTEA GPQQRGLDAY TETDKE
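
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might strip that layout and estimate the GC fraction of a DNA string. The helper names are hypothetical, and the example input is only the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record (six blocks of ten bases).
first_line = "ATGAGCGACG GGCCGACAGC GGTGACACCC GACGCCGACG ACGTGCCGAC GACGACGGAC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Applying `gc_fraction` to the full gene sequence would allow a comparison with the GC content field reported in the Gene Information section.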