Gene Hmuk_0760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0760 
Symbol 
ID8410274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp728225 
End bp729538 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content67% 
IMG OID645019095 
Productprotein of unknown function DUF21 
Protein accessionYP_003176598 
Protein GI257386825 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.332278 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA TCGCACTCTC GCTGGGGGGT ATCGCGCTCG CGCTCTTTCT GGTCCTGTTG 
AACGGCTTCT TCGTCGCCTC GGAGTTCGCC TTCGTCCGGA TCAGGGCGAC CTCCGTCCAG
CAACTCGTCG AAGAGGGGGC CGCCGGTGCT GGCGTGCTGG ACGACGTGAT GGACAACCTC
GACGACTACC TCGCGACGAC GCAACTGGGG ATCACGGTCG CGTCGCTGGG ACTGGGGTGG
GTCGGTGAGC CCGCCATCGC CGACCTGCTG GAGCCGGTGC TGGCTCCGAT TCTGCCACCG
AGTCTGCTCC ACGCCGTCGC CTTCGCCGTC GGGTTCACCG TCATCACCTT CCTCCACGTG
GTCTTCGGAG AACTCGCGCC CAAGACGCTG GCTATCGCGG ACGCCGAGAA GATCTCGCTG
CTCGTGGCCG CGCCGATGAA GTTCTTCTTC TATCTCCTCT ATCCGGGCAT CGTCGTGTTC
AACGGCTCTG CGAACTTCTT CACGCAGTTG ATCGGCGTCG AGCCCGCTTC CGAGAGCGAG
GAGACGCTCG AAGAAGAGGA GATTTTGCGG GTGCTGAACC AGTCCGGGCA GGCGGGCCAC
GTCGACGCGG GCGAAGTCGA GATGATCCAG CGCGTCTTCG AGTTCGACGA CCGGTCCGTC
CGCGAGGTGA TGGTCCCGCG ACCTGACGTG ATCAGTGTCA CGGCCTCCAC GCCGGTGACG
GAACTGCGTT CGATCGTCCT CGACGCCGGG CACACGCGTT ATCCCGTGGT CGAGGGCGAC
GACGGCGACC AGGTGGTCGG CTTCGTCGAC GCCAAGGACG TGCTTCGGGT GCTGGATGCC
GGCGACGAGT CCCCGGCGAC CGCCGGGGAC ATCGCGCGGG ACCTCCCGAT GGTTCCGGAG
TCGACCCGCA TCGACGACCT CCTGCGGGAG TTTCAGGACG AGCAGCGCCA GATGGCGATC
GTGATCGACG AGTGGGGCGC CTTCGAGGGG ATCGCCACCG TCGAGGACGT CCTCGAGACG
CTCGTGGGCG ACCTCCAGGA CGGCTTCGAC GCGGCGACGG GGGAACCCTC GATCGACGCG
CGCGACGACG GTTCGTACCG CGTCGACGGT GCAGTCCCCC TCTCGACGGT CAACGACGAA
CTGGACGCCA CCTTCGAGAG TCCCGCCTTC GAGACGATCG GTGGCCTCGT GCTGGATCGG
CTGGGCCGGG CCCCGAAGGC CGGGGACACG GTCGAGACCG ACGGCTACCT GATCACCGTC
GTGAGCGTCG ACGGCGCGCG CGTCTCGGTC GTCGACGTGG AACCGGCGAC GTGA
 
Protein sequence
MTDIALSLGG IALALFLVLL NGFFVASEFA FVRIRATSVQ QLVEEGAAGA GVLDDVMDNL 
DDYLATTQLG ITVASLGLGW VGEPAIADLL EPVLAPILPP SLLHAVAFAV GFTVITFLHV
VFGELAPKTL AIADAEKISL LVAAPMKFFF YLLYPGIVVF NGSANFFTQL IGVEPASESE
ETLEEEEILR VLNQSGQAGH VDAGEVEMIQ RVFEFDDRSV REVMVPRPDV ISVTASTPVT
ELRSIVLDAG HTRYPVVEGD DGDQVVGFVD AKDVLRVLDA GDESPATAGD IARDLPMVPE
STRIDDLLRE FQDEQRQMAI VIDEWGAFEG IATVEDVLET LVGDLQDGFD AATGEPSIDA
RDDGSYRVDG AVPLSTVNDE LDATFESPAF ETIGGLVLDR LGRAPKAGDT VETDGYLITV
VSVDGARVSV VDVEPAT