Gene Hmuk_0201 details


Gene Information

Locus tag: Hmuk_0201
Symbol:
ID: 8409699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 196820
End bp: 198277
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 69%
IMG OID: 645018526
Product: hypothetical protein
Protein accession: YP_003176045
Protein GI: 257386272
COG category:
COG ID:
TIGRFAM ID:
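
The start, end, gene length, and protein length values above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both function names are hypothetical helpers, not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(196820, 198277)
print(length, protein_length_aa(length))  # prints: 1458 485
```

Under those assumptions the computed values match the reported 1458 bp and 485 aa.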


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.00798209
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATCGCG CCGCCCTGCT CGCTGCGCTG CTCGTTCTTG CCGGCTGTCA GGCTCCGTCG 
GCCGGCCCGT CGGCGGCGTC CCCCGTACAC GAGACGACGG GGACCGACGC CCCCGAGCCG
ACGGCCACGC CCACATCTGT CGACACTGAC GACGGCCCGC CGGATCCCGA GACGGATCGG
ATCGGCTGGG AGAACGGATA CTGGCACAAC GAATCGCTGG ACGTGACCAA CAGCGACGGC
CTCAACGAGA GCGAACGCGA GGCCGTCGTC GCCCGCTCGA TGGCCCGCGT CGAGTCGGTC
CGAGAACGCG AGTTCGACGA GACGGTCCCG GTCAGCGTGA TCGACCGGTC GACGTACCGG
AACCGCTCTA CCCCCGCGTC GAACGCGACG AGCGCGCGCT TCGACAACGG CAAGTTCGAG
GCTTTGTTCC TGATCGGCGA GGACAGGGAC GCGCTGGCGG CACAGAACGC CGCGCTGAGC
CAGAGCGTGC TCGGCTACTA CAGTCCGGCC CGCGACGAGA TCGTCATCGT CGCCGACAGC
GAGACGCCAC AGCTCGACGG CGAGGCGACG CTCGCCCACG AACTGGTCCA CGCCTTGCAA
GACCAGCAGT TCGACCTGAC CAACGGGACC GTGACGACCC GCGACGCCTA TCAGGGCCGC
AACGGGATCG TCGAGGGCGA CGCCTCCTTC GTCGAGCGGC GCTACACCGC CAACTGCGGC
GCCACCTGGT CCTGTCTCGA CTCGGCCGAC AGCACGCAAA GCGGCGGCGG TGACAGTCAC
TTCGGGCTGA ACTTCCTCCA GTTCTTCCCC TACAGCGACG GTCCCGGCTT CGTCGAGCAT
CGCTACGAGG CCGGCGGCTG GGCGGCCGTC GACGCCGCGT TCGCGGACCG GCCCGACGGT
GCGACGGAAG TGATCTACCC CGAACAGTAC CCCGAGTGGG AGCCGGCGAC CGTCTCCCTG
CCCGACCGCA GTAACGACGA GTGGGAGCGG GTCCGACCGC CGGGACGCGC CGACTACGCC
GTCGTCGGCC AGTCCGCCAT CGCGGCGTCG CTCGCCTACA CCGTTACCGA CGACTACGAA
GACGCCAGCG TGGTGCGGGC CCGCGACGTG ATCAACTTCG AGGCCGACGG CAAAGTCGAC
GGGGACGATC CGTACAACTA CGACGTGCCG GCCGCCCGCG GCTGGGCCGG CGGCAAACTG
TTCGTCTACG AGAACGGCGA GCGGTCGGGC TACGTCTGGC GGACCCGCTG GACCAACGAG
AGCGAGGCGG CCGAGTTCGC AAGCACCTGG AGCGCGGTCG TCCGCCACTG GGGCGGCGAG
GAAACCGAGG AGAACGTCTG GACGATCGGC GAGGACAGTC CCTTCACCGA CGCCGTCAGG
ATCGAACGCT CCGGCGCGAC GGTGACGGTC GTCGACGCGC CCGACGCGGC GGCATTGGAG
GCGGTACACG ATGCGTAG
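
The GC content reported in the record can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as a string that still contains the spaces and line breaks shown here; `gc_content` is a hypothetical helper name, and the short string in the usage line is only the first 20 bases of the gene, not the full sequence.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()  # strip spaces and newlines
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage on the first two 10-base blocks of the gene sequence.
print(gc_content("ATGTATCGCG CCGCCCTGCT"))
```

Passing the full 1458 bp sequence instead of the fragment gives the whole-gene value to compare against the reported figure.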
 
Protein sequence
MYRAALLAAL LVLAGCQAPS AGPSAASPVH ETTGTDAPEP TATPTSVDTD DGPPDPETDR 
IGWENGYWHN ESLDVTNSDG LNESEREAVV ARSMARVESV REREFDETVP VSVIDRSTYR
NRSTPASNAT SARFDNGKFE ALFLIGEDRD ALAAQNAALS QSVLGYYSPA RDEIVIVADS
ETPQLDGEAT LAHELVHALQ DQQFDLTNGT VTTRDAYQGR NGIVEGDASF VERRYTANCG
ATWSCLDSAD STQSGGGDSH FGLNFLQFFP YSDGPGFVEH RYEAGGWAAV DAAFADRPDG
ATEVIYPEQY PEWEPATVSL PDRSNDEWER VRPPGRADYA VVGQSAIAAS LAYTVTDDYE
DASVVRARDV INFEADGKVD GDDPYNYDVP AARGWAGGKL FVYENGERSG YVWRTRWTNE
SEAAEFASTW SAVVRHWGGE ETEENVWTIG EDSPFTDAVR IERSGATVTV VDAPDAAALE
AVHDA