Gene Hmuk_0316 details

Gene Information

Locus tag: Hmuk_0316
Symbol:
ID: 8409814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 304614
End bp: 306014
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 72%
IMG OID: 645018641
Product: LVIVD repeat protein
Protein accession: YP_003176160
Protein GI: 257386387
COG category: [S] Function unknown
COG ID: [COG5276] Uncharacterized conserved protein
TIGRFAM ID:
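
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 304614
end_bp = 306014

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1401 bp) and Protein Length (466 aa).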


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCGCC GCCCCCTCCT TCGAACGCTC GGCAGCGGTC TCGCGCTCGG GAGCGCCGGC 
CTCGCGAGCG GGCACCCGAC TGCCACCAGC GACGGGACGC CGCCCGCCGA GACGCCGGAC
AGCCAGCCCC TCGGGACGGT CTCGATCGAG AACGTCCGCG AGATGGTCCT GAATCCCGAC
GGGACGGTCG CCTACGTCGC CACCGTCGAC GGCTTCGCGG TCGTCGACGT GAGCGATCCG
ACCGAGATGC GGGTGCTGGC TCGCGAGCGG CTCCTGGCCG ACCACGCCGA CGGGCCGCTG
TCTGGGATCT GGGACCTGCA CTGTGACGGC GACCGACTGC TCGTGGCCGG CCCGGCAAAC
GGTGGGCGCG ACTCGGTCCG TGGCTTTGGC TACGTCGACG TGTCCGATCC CGCCGATCCC
GAACTTCTCG CCGAACACGA GGTCGACTTC TACACCCACA ACTGCGTGCT CGCGGACGGC
GTCGGCTACT TCACCGGCGG CGGTCTGGAC GGCTCGCCCC TGGTCGTCGC CGATCCCGAG
AGTGGCACGG AACTGGCCCG CTGGAGCGTC GTCGACGTCG ACGACCGCTG GGCCGAGCTG
CCCTTCGGCA TGGTGAACCT CCACGACGTG TGGGTCCACG ACGACCGCGC GTATCTGGCC
TACTGGGACG CCGGCACCTG GTGTCTCGAC GTGTCCGACC CCGGCGAGCC GACGCTCGTT
TCGCGGGTGC GCGGTCGGCC ACTCGACGAG CTCCTCGATG TTACCAACAG GCGACGCGAG
CGCACGGAGC CGCCGGGCAA CGACCACTTC GTCACCGTCG ACGAGACCGG CGATCTGCTG
GGGATCGGGA CCGAATCGTG GGCGGCGGCC TCGGGCTCGA CCGGCCCGGG CGGGATCGCC
TTCTACGACG TGACCGACCC CGCCGAACCG ACGCGACTCG GGGCGATCGA CCCGCCGCCG
ACGCCCGATC CCACCCGCGG CGGCGTCTGG ACGACCGCCC ACAACTTCGA GCTCGTCGAC
GGGCGCTGTT ACGCCGCCTG GTACCAGGGC GGGGTCACCG TCCACGACGT GACCGACGCG
ACGGATCCCG TCGAGCGGTT CCACTGGCGC GACGCCGGCC GCGGGAAGTT CTGGACCGCA
CAGCTTGCCG CGCCGGGCGA GTTCTTCCTG GGGGCCAGCA TCGGCGCGTT CGGTGTCAAC
ACCGCCGCCG ACTCGCCCCT GGAGTCGGCG CTGTTCGCGT TTCCGGACCA GCGGCCGGCC
GACGGCGCGA CGACGACCGA CGGCACCCGG TCGGGACGGG CGTCGACGCC CACCAGCGAG
ACGGGAGCCG GCGCTGGTGT CGGCGCGGGT CTGCTCGGAC TGCTCGGTGC CGGCGCGTGG
TGTCGACGGC GTTCGGAGTG A
 
Protein sequence
MRRRPLLRTL GSGLALGSAG LASGHPTATS DGTPPAETPD SQPLGTVSIE NVREMVLNPD 
GTVAYVATVD GFAVVDVSDP TEMRVLARER LLADHADGPL SGIWDLHCDG DRLLVAGPAN
GGRDSVRGFG YVDVSDPADP ELLAEHEVDF YTHNCVLADG VGYFTGGGLD GSPLVVADPE
SGTELARWSV VDVDDRWAEL PFGMVNLHDV WVHDDRAYLA YWDAGTWCLD VSDPGEPTLV
SRVRGRPLDE LLDVTNRRRE RTEPPGNDHF VTVDETGDLL GIGTESWAAA SGSTGPGGIA
FYDVTDPAEP TRLGAIDPPP TPDPTRGGVW TTAHNFELVD GRCYAAWYQG GVTVHDVTDA
TDPVERFHWR DAGRGKFWTA QLAAPGEFFL GASIGAFGVN TAADSPLESA LFAFPDQRPA
DGATTTDGTR SGRASTPTSE TGAGAGVGAG LLGLLGAGAW CRRRSE
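
The sequences above are printed in space-separated blocks of ten characters, so the spacing must be removed before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation. The function names `join_blocks` and `gc_fraction` are hypothetical helpers, not part of any database tool, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGTCGCC GCCCCCTCCT TCGAACGCTC GGCAGCGGTC TCGCGCTCGG GAGCGCCGGC"
seq = join_blocks(first_line)

print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.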