Gene Hmuk_1011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1011 
Symbol 
ID8410528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp964792 
End bp965865 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content67% 
IMG OID645019346 
Productperiplasmic solute binding protein 
Protein accessionYP_003176846 
Protein GI257387073 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.932798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.91735 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC ACACGACACG ACGGCAGTGG CTCGGAGTAC TGGCAGGAGC GACAGCTGCG 
GGGACGGCGG GCTGTCTGGA CAGTACGGCC GGTGGGACCG ACGAAACGCT CTCGGTCGCG
GCGTCGTTTT TCGTCCTGGG AGATCTGGCG TCGAACGTCG CGGGCGATCG GGCGAGCGTC
GAGACACTGG TACCGGTCGG TCAGCACGGC CACGGCTGGC AGCCGGGTCC CGACGTGACT
CGGCGAGCCG TCGAGGCAGA CGTGTTCGTC TACATGGCCC CCGGATTCCA GCCGTGGGCC
GACGACGTGG TCACCAACAT CGAGTCCGGA GACAGCGATA CGGCGATCAT GGAGGCCAGG
GCCGGCGTCG ACCTGCTCAC GGTCCCCGAA GAGGGCGGGC ACGATCACGG GAGCGCGACC
GCCGTGCACG GAGAGAAACA CGCCGAACAC AGCGGTCACA GCGACGACCA CGCGGACCAC
GACACCGCCC ACAATGATAA CCATGACACC GCCCACGAGG ACGGCCACGA CGAGAGACCG
GTCGATCCGC ACTTCTGGCT CGATCCGCGG CGTGCGAGAA CGGCCGTCGA GACGATCGAA
GAAGGACTAC GGGCGGTCGA CGAGGGCAAC GCGGGAACGT ACGCTGACAA CGCAGACCGA
TACCGAGAGC GACTCTCGGA GCTCGACGAG ACGTTCGAGA CGGCGCTGTC CGATCGGGAG
CGCGAGACGG TCCTCGTCGC GGGCCACAAC GCGTTCCAGT ATCTCGGCCG TCGATACGGG
TTCGATGTCG TGGCACTGTC GGGGCTCTCG CCCGACGACT CGCCGACGAG CGACGACCTG
CGGCGAGCCG AACGCGTCAT CAGCGACCAC GATCTCGACC ACGTGCTCGC GCCGGTGATG
GAGTCAGAGC AGGCAGCAGC GGGTATCGTC TCCGACACGT CGGCACGCGA GCGACTCCCG
ATCACCGCGT TGCCGGGACG CCACAGCGAG TGGGCCGATC GGGGCTGGGG CTACGAGCAG
ATCATGTCTG AGGTGAACCT GCCGACCCTG GAGACGGCAC TGGGGTCACG ATGA
 
Protein sequence
MTNHTTRRQW LGVLAGATAA GTAGCLDSTA GGTDETLSVA ASFFVLGDLA SNVAGDRASV 
ETLVPVGQHG HGWQPGPDVT RRAVEADVFV YMAPGFQPWA DDVVTNIESG DSDTAIMEAR
AGVDLLTVPE EGGHDHGSAT AVHGEKHAEH SGHSDDHADH DTAHNDNHDT AHEDGHDERP
VDPHFWLDPR RARTAVETIE EGLRAVDEGN AGTYADNADR YRERLSELDE TFETALSDRE
RETVLVAGHN AFQYLGRRYG FDVVALSGLS PDDSPTSDDL RRAERVISDH DLDHVLAPVM
ESEQAAAGIV SDTSARERLP ITALPGRHSE WADRGWGYEQ IMSEVNLPTL ETALGSR