Gene Hmuk_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1810 
Symbol 
ID8411336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1731138 
End bp1732292 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content65% 
IMG OID645020140 
Productarsenical-resistance protein 
Protein accessionYP_003177631 
Protein GI257387858 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.324178 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.452859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACG GCGAACACGA CCACGGGCCG GACTGCGAGT GTGAGAGCTG TGGAGACCCA 
CGGTCGATGG ACGTGCTCGA CAAGTACCTC ACCGTCTGGA TCTTCGGCGC GATGGCGGTC
GGCGTCGGTC TCGGGTTCGT CGCTCCGTCG GTGACTCAGC CGATTCAGGA TCTCCACCTC
GTGGAGATCG GCCTGATCCT GATGATGTAC CCGCCGCTGG CCAAGGCCGA CTACTCGCAG
CTCCGGACCG TCTTCAGCAA CTGGCGCGTA CTCGGGCTGA GCCTCGTCCA GAACTGGCTG
ATCGGTCCGA CGCTGATGTT CGGACTCGCC GTGATCTTCT TCAGCGGTCT CGTCCCGGGA
CTGCCCGCTC GTCCCGAGTA CTTCCTCGGC CTGGTGTTCA TCGGGATGGC CCGCTGTATC
GCGATGGTCC TCGTCTGGAA CGAACTCGCG GAGGGGTCGA CCGAGTACGT GACGGGACTG
GTCGCGTTCA ACAGCCTCTT CCAGATCGTC ACCTACGGGG TCTACGTCTG GTTTTTCGCC
CTCTTTCTCC CGCCGGTGCT GGGCATGGAG TCGCTGGTCG CCGGCATCAC GACCTTCGAC
ATCACGCCGA TGCAGGTGTT CGAGGCGATC GTCGTCTACC TCGGGATTCC CTTTGGGGCC
GGGTTCCTGA GCCGCTACGT GGGGACCCGC GTCAAGAGCG AAGCGTGGTA CGAGGAGGAG
TTCGTCCCGA GGATCGATCC GCTGACACTG GTCGCGCTGC TCTTTACGGT CGTCGTGATG
TTCGCCACGC AGGGCGGAGC CATCGTCGCG TCGCCGGGCG ACGTGTTGCT GATCGCGGTG
CCGCTGACGA TCTACTTCGT CGTGATGTTC CTCGTGAGCT TCGGTATGGG ACGGGGCATC
GGTGCGGACT ACTCGACGAC GACGGCGATC GGGTTCACCG CCGCCTCGAA CAACTTCGAA
CTGGCGATCG CGGTCGCGGT CGCGGTGTTC GGCGTCGGCT CCGGCGTCGC GTTCGCGACC
GTCGTCGGCC CGCTCATCGA GGTGCCGGTC CTGCTCGCGC TGGTCAACGT CGCGCTGTAC
TTCCAGCGCC GGTACGACTG GGGCGGTGCC ACGACCGGAC AGCTCGACCG TTCCGGAGCG
ACAGACGACG ATTGA
 
Protein sequence
MSNGEHDHGP DCECESCGDP RSMDVLDKYL TVWIFGAMAV GVGLGFVAPS VTQPIQDLHL 
VEIGLILMMY PPLAKADYSQ LRTVFSNWRV LGLSLVQNWL IGPTLMFGLA VIFFSGLVPG
LPARPEYFLG LVFIGMARCI AMVLVWNELA EGSTEYVTGL VAFNSLFQIV TYGVYVWFFA
LFLPPVLGME SLVAGITTFD ITPMQVFEAI VVYLGIPFGA GFLSRYVGTR VKSEAWYEEE
FVPRIDPLTL VALLFTVVVM FATQGGAIVA SPGDVLLIAV PLTIYFVVMF LVSFGMGRGI
GADYSTTTAI GFTAASNNFE LAIAVAVAVF GVGSGVAFAT VVGPLIEVPV LLALVNVALY
FQRRYDWGGA TTGQLDRSGA TDDD