Gene Hmuk_0108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0108 
Symbol 
ID8409605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp105925 
End bp107850 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content75% 
IMG OID645018433 
Productconserved repeat domain protein 
Protein accessionYP_003175953 
Protein GI257386180 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGCG GGCGTCTCCT CGCGGCCGTC GGGCTCGTGG CCTTCGCGGT CGGCGTCGGA 
GCGCTCGCCG CGCCGGGGGT GTTCGGGCTC GGCGTCCAGC GCTACGCCGT CGTCGTGATC
GGTCTGCTGG CGGTCGCCGC GGCGATTCGC GTCGTGCAGG GACGACGCCA CAGCCCCAGA
CGGCGCGGCG AGACGGCGAC GCCGGAGGAG CTGCCCGCGG TCGCCTCGCC GGGCGAGGAG
CTGGACGCCG TGCTGGCGGC GTTCGACCCC ACGCGCTACG GGACGGCCGA CCGCCGCCGG
AAGCAACTCC GGCGCGTGGC CACCGAAGTG CTGACGCGCT ACCGGGGCGA CAGCGAAGCG
ACGGCCAGCG AGGCGATCGA GCGGGGCACC TGGACCGACG ATCCAGTCGC CGCCGAGTTC
CTCGCAGACG AGCGCTCGCG GCTCCCGCTC ACCGATCGAC TCCGCACGCG CCTGGGCGGG
CAGTCGGCCT ACTGGCAGGG CATCGAGCAC ACCGCTCGCG CGATCGCCGA GACGGCCGGC
GTCGACACCG ACGACCGCGA CGGCTCCCCC CTCCCGGGAC TGGACGGGGT CGTCGGATCG
ATCGACGGCC GGGACGCGGG GCTCGACCGG GCCAGCGCCG TCGCGGATCC GACACCGACG
ACGCGCTCGA CGGGACACTG GCGAGGGATC AGCGCGGCCG CGCTGGCGGC GCTCGGCGTC
GGCGTGCTCG CCGAACGGGC GGGCGTCGTC CTCGTCGCCG TCGTCGGGAT CGCCTACGCG
GCTGCCGCCC GCCAGCGGAC GCTCTCCGCG CCGACGCTCT CGGTCGAGCG CACCGTCGAG
CCGGCGGATC CGGAGCCCGA CGAGCCCGTC ACGGTGACGC TGACCGTCAC CAACGAGGGC
GAGGAGGCGG TGTGGGATCT GCGCCTCGTC GACGGCGTTC CGCCCGCCCT CTCCGTGGTC
GAGGGGTCCC CGCGGCTCGG GACGGCGCTG TGGCCGGGGG CCAGTGCGAC GGTCACCTAC
GAGGTGGCGG CCGAGCGCGG TCGCCACGAG TTCGGTCCCG CGCAGGTCCT CGTCCGAACG
CTGTCGGGCA GCGTCGAACG CGAACAGACC GTCGAGCCCG CGACGACGAC GGAAATCACC
TGCGTCCCGC CGCTCCGGCC GATCGACGAA CCGGTCCCGC TGCGCCGCCA GCCCACTCGC
CACACCGGGC GCGTCGAGAC CGCCACCGGC GGCGAGGGCG TCGAGTTCTT CGCGACGCGC
GCGTACCGCT CGGGGGACCC GCTGTCGCGG ATCGACTGGA ACCGTCACGC CAGAACCGGT
GAGCTGGCGA CCCTGGAGTT TCGCGAGGAG CGATCCGCGA CGGTAGTGCT GGTGATCGAC
CGCGACGAGG CGGCCGCGGT CGGTCCCTCA CAGCGGGGCC AGACCGCCGT CCAGCGCTCC
GTCGACGCCG CGAGCCGACT GTTCGCGACG CTGCTCGACG ACGGCAACCG CGTCGGCATC
GCCGCGCTCG GGGCTCGCGA CTGCTGGCTC GCGCCGGGTG CGGGCGACGC CCACCGCGTT
CGGGGCCGGG AGCTGCTCGC GACCCATCCC GCGCTCGCTC CCGGCGAGAC GCCCGAGAGC
GTCTTGCCGT TCGGCTGGCT CGCCTCCCTG CGGAGTCAAC TGCCCGGCGA CGCGCAGGTC
GTCCTGTTCT CGCCGCTGTG CTCGCCCGCC GTCGCCACCG TCGCCCGCCA GCTCGACGCC
GGGGGTCACC TCGTGACCGT CGTCAGCCCG AACCCGACGA CGACGGCCTC CGGGGCCGGC
CAGCTCGCGA CGGTGGCGCG GCGCTTCCGG ATCGCCGACC TCCGGGCCGC CGGGATCCCC
GTCGTCGACT GGCCGTGGGC GCAGTCGCTG CCGGTGGCCC TGGCCCGGAC GAGGTGGTCG
AAATGA
 
Protein sequence
MSRGRLLAAV GLVAFAVGVG ALAAPGVFGL GVQRYAVVVI GLLAVAAAIR VVQGRRHSPR 
RRGETATPEE LPAVASPGEE LDAVLAAFDP TRYGTADRRR KQLRRVATEV LTRYRGDSEA
TASEAIERGT WTDDPVAAEF LADERSRLPL TDRLRTRLGG QSAYWQGIEH TARAIAETAG
VDTDDRDGSP LPGLDGVVGS IDGRDAGLDR ASAVADPTPT TRSTGHWRGI SAAALAALGV
GVLAERAGVV LVAVVGIAYA AAARQRTLSA PTLSVERTVE PADPEPDEPV TVTLTVTNEG
EEAVWDLRLV DGVPPALSVV EGSPRLGTAL WPGASATVTY EVAAERGRHE FGPAQVLVRT
LSGSVEREQT VEPATTTEIT CVPPLRPIDE PVPLRRQPTR HTGRVETATG GEGVEFFATR
AYRSGDPLSR IDWNRHARTG ELATLEFREE RSATVVLVID RDEAAAVGPS QRGQTAVQRS
VDAASRLFAT LLDDGNRVGI AALGARDCWL APGAGDAHRV RGRELLATHP ALAPGETPES
VLPFGWLASL RSQLPGDAQV VLFSPLCSPA VATVARQLDA GGHLVTVVSP NPTTTASGAG
QLATVARRFR IADLRAAGIP VVDWPWAQSL PVALARTRWS K