Gene Hmuk_2924 details


Gene Information

Locus tag: Hmuk_2924
Symbol:
ID: 8412476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2812295
End bp: 2813860
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 66%
IMG OID: 645021270
Product: PKD domain containing protein
Protein accession: YP_003178736
Protein GI: 257388963
COG category: [R] General function prediction only
COG ID: [COG3979] Uncharacterized protein contain chitin-binding domain type 3
TIGRFAM ID:
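
The reported lengths can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon while Protein Length does not. The record does not state either convention. The helper names gene_length and protein_length are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2812295, 2813860)
print(length_bp, protein_length(length_bp))  # 1566 521
```

Under these assumptions both values agree with the Gene Length and Protein Length fields above.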


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0135075
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAGA CACGACGGAA CATTCTACGC AAAGCATCGG CACTGACAGC ACTCGGCATC 
GGAGCCAGCG GCATCGCGGC TGGCGCAGAC TGCAGTAGCG TCCCGGAGTG GGACGCCGAC
GCCACCTACA CCGGCGGCGA CCAGGTCACC TACGACGGTG CGCTCTGGAC CGCCGAGTGG
TGGACGCAGG ACGAGCCGTC CGAGAGCGCC AACGTCTGGA CGCGGGAGGG CGCTTGTGGC
GGGAACGGCG GCGACGACGG CGACGACGGC GACGACAGTG ACAGCGCGAA CTGCGACGAC
TACCCCGAGT ACGATTCGGG GGCGACCTAC ACCGGCGGCG ACCGAGTGAT CTACGACGGT
CGGCTCTGGG AAGCCGAGTG GTGGACCAAG GGCACCGAAC CCGCCGAGAG CCAGAACGTC
TGGACGCTGG TCGGCACGTG TGGGAACTTC GCACCCACTG CGGTCGCCTC GGCCTCGCCC
TACTCGCCGG AGGTCGGCGA GACGGTCACC TTCGACGGCT CCGACTCTTC CGATCAGGAC
GGTTCGGTCA CGAGCTACGA GTGGAGCTTC GACGACGGTA CCACCGAGAC GGGCGAGACC
GTCACCAGAA GCTACGACGC GAACGGCGAG TACACGGCGA CGCTGACCGT CACCGACGAC
GCTGGCGCGA CCGCCAGCGA CTCCGTGAGC GTCGCGGTCG GCGACACCAG TGGCGGCTCC
GAGAAGGAAG ACGACGTGTT CGCGCCCTAC CAGGGCACGT GGGGAAGTCT CGTCGACGGG
ACGCTGAACG TCGACACCGA TCGGGTCGTC GTTTCGTTCG TCGGCGACGC GACCGACGAC
GGCGAGATCA ATCCCGGCTG GCTCACCTCC GGCGGCCAGC GTCCGCTCAC CGACTACACC
GACGAGATTC AGACGCTCCA GGACAACGGT ATCGAGGTCT GGGTCGCCAT CGGTGGCTGG
GACGGCCGCA CCGTCGCGCG GGACGCGACC GACGCGACGG AGCTCAAGAA CGTCTACGCC
GACATCCTCG ATACGCTCGG GGTCACCCAC CTCGACATCG ACGACGAGAA CGCCAACGAG
GCGGGCCGTG ACGGCAGCGT CTACGAGATC CGCAACGAAG CGCTCGCGAT GCTGCAAGAC
GAGCGCCCCG AGGTGAAGAT CTCCTACACC GTCCCGGCAG GACAGGGCGG CATCGAGAAC
CGCGACTATT CGCCCGCCAA GGACATGGTC AGCGATGCCG TCCAGCAGGG AATCGATCTG
TCCTACGTCA ACATCATGAC TATGGGCTTC TCGGGCGATT ACACCTCGAT CATCCCCTCG
GCCGGCCAGG GCACTGTCGA CTGGCTGGCC AACGTCTACC CGGACAAATC CGAGCAGGAA
CGCTGGGAGA TGCTGGGTGT GACGCCGAAC GTCGGCGAGG ACAACTTCAC GACCGACGAC
GCCAGTGCCA TCGTCGACTG GGCGGAAAAC GAGGATCTCG GACTGCTGAG CTTCTGGGCG
CTGTACAAGT CCAGTGCTGC CGAACAGGCG GAGATCTTCG CCACGTTCGA GTCCGACGAG
GACTGA
 
Protein sequence
MKQTRRNILR KASALTALGI GASGIAAGAD CSSVPEWDAD ATYTGGDQVT YDGALWTAEW 
WTQDEPSESA NVWTREGACG GNGGDDGDDG DDSDSANCDD YPEYDSGATY TGGDRVIYDG
RLWEAEWWTK GTEPAESQNV WTLVGTCGNF APTAVASASP YSPEVGETVT FDGSDSSDQD
GSVTSYEWSF DDGTTETGET VTRSYDANGE YTATLTVTDD AGATASDSVS VAVGDTSGGS
EKEDDVFAPY QGTWGSLVDG TLNVDTDRVV VSFVGDATDD GEINPGWLTS GGQRPLTDYT
DEIQTLQDNG IEVWVAIGGW DGRTVARDAT DATELKNVYA DILDTLGVTH LDIDDENANE
AGRDGSVYEI RNEALAMLQD ERPEVKISYT VPAGQGGIEN RDYSPAKDMV SDAVQQGIDL
SYVNIMTMGF SGDYTSIIPS AGQGTVDWLA NVYPDKSEQE RWEMLGVTPN VGEDNFTTDD
ASAIVDWAEN EDLGLLSFWA LYKSSAAEQA EIFATFESDE D
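
The protein sequence can be checked against the gene sequence by translating the coding strand, and the GC content can be recomputed from the same string. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted with the 10-base spacing shown above, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in its permitted start codons). The function names clean, gc_percent and translate are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
first_30 = "ATGAAACAGA CACGACGGAA CATTCTACGC"
print(translate(first_30))  # MKQTRRNILR
```

Passing the full gene sequence to translate and gc_percent would allow a comparison with the protein sequence and the GC content field in this record.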