Gene Hmuk_0034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0034 
Symbol 
ID8409531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp29842 
End bp31197 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content73% 
IMG OID645018372 
Productconserved repeat domain protein 
Protein accessionYP_003175892 
Protein GI257386119 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.559435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTCA GCTCGATTCC GGCGGCGGCC ACGCGCTCTG CCGACGTGAC CGACGAAGCG 
GCCGAGTTCG AGATCGAGCC GGGCACGGTC GTCGACCGCC GGACGAGGCG GTGGTACGCC
GTCACCGTCT TCGCGCTCCT GGCGCTCGGA GCGGGGGTCC TGACCCGCGA GTCCGGCCTC
CTGCTGACCA GTGCCTTCGG GATCGCCTTC GCGGGCTACG GGCAGGTCAC CTCGCCGCCG
CCGGTCGAGG TGAGCGTCGA GCGCTCGATC AGCGACGACG CTCCCGACAC CGACGACACC
GTCGCGGTGA CCGTGACGGT GCGCAACGAG AGCGACCGGA CGATGCCGGA CCTGCGGCTC
GTCGACGGCG TCCCGCCGAA GATGACCGTG GCCGAGGGGT CACCCCGCCT GGCGACCGCG
CTCCGACCGG GCGAGGAGAC GACGTTCGCC TACTCGCTGC GGGCTCGGCG GGGTCGCCAC
GAGTTCGAGC CGACGACGAT CCTCACTCGG GACGCGTCCG GAGCCACGGA GCGACGCGGC
ACGGTCGACG CGCCAGACAC CGTCGACTGC GAGGCGTCGC TGCCGAGCCA GAGCGTCTCC
TTTCCGCTGC GGTCACAGAC CACCCGTCAC ACGGGACGGT TCCCGGCGGA CACCGGCGGG
CCCGGCGTGG AGTTCTACGC GACCCGCGAG TACCGGCCGG GCGACCCGCT GAACCGGGTC
GACTGGAACC GCACCGCCCG GACTGGTGAC CTCACCACCG TCCAGTACCG GGTCGAACGC
AGCGTCTCGG TCGTGCTGGT GGTCGACGCC CGACAGGCAG CGTACGCCGC CCCAGCGCCA
CAGGCCCGGA CGGCGCTCGA CGCGGCCGTC GACGCGGCGG GTCACGCCTA CGTCTCGCTG
ACCGACGCGG GCCACGACGT GGGGCTGACG GCGCTGTCGC CGACGGAGTG TTGGCTCTCG
CCGGGCAACG GGGACGAACA CCGCGTTCGA GCACGCGAGT TCCTCTCGAC GGAGCCGGCG
CTCTCTCCGT CTGGGCCCGA CGCGGAGACC TCCCTCTACG CCGCGGTCCA GCGGATCAGG
CGTCGCGCGC CGACGGACGC CCAGATCGTC GTGTTCTCGC CGCTGACCGA CGATCGCGTG
GCCGGCTCTG CGATCCGACT CGACGCCAAC GGCCACCGGA CGACGGTGAT CTCGCCGGAC
CCGACGGCCG ACGACTCCGT CGGTCACCGA CTGGCCGGAG TCAGGCGGTC GCTGCGCATC
GCCGACCTCC GGCAGCGCAA CATCCCGGTC GTCGACTGGG ACGGGACGGA ACCGTTCCCG
CACGCGCTCG CCCGCTGGGA CGGGGGGTCG CGATGA
 
Protein sequence
MSFSSIPAAA TRSADVTDEA AEFEIEPGTV VDRRTRRWYA VTVFALLALG AGVLTRESGL 
LLTSAFGIAF AGYGQVTSPP PVEVSVERSI SDDAPDTDDT VAVTVTVRNE SDRTMPDLRL
VDGVPPKMTV AEGSPRLATA LRPGEETTFA YSLRARRGRH EFEPTTILTR DASGATERRG
TVDAPDTVDC EASLPSQSVS FPLRSQTTRH TGRFPADTGG PGVEFYATRE YRPGDPLNRV
DWNRTARTGD LTTVQYRVER SVSVVLVVDA RQAAYAAPAP QARTALDAAV DAAGHAYVSL
TDAGHDVGLT ALSPTECWLS PGNGDEHRVR AREFLSTEPA LSPSGPDAET SLYAAVQRIR
RRAPTDAQIV VFSPLTDDRV AGSAIRLDAN GHRTTVISPD PTADDSVGHR LAGVRRSLRI
ADLRQRNIPV VDWDGTEPFP HALARWDGGS R