Gene Hmuk_1598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1598 
Symbol 
ID8411120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1523368 
End bp1525053 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content64% 
IMG OID645019924 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003177419 
Protein GI257387646 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0732805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACA CACGGCACAC GGGACGGATC GCATGGCTCT CACGGCGACA GTACGTCCGC 
GGCGTCGCCG GCAGTGCTCT CGCGGTCGCT CTGGCGGGCT GTCAGGGCGA CGGCGGTGAG
GACGATGGCG GCGACGGCGG CGCACAGCAA CCGGAGGAGA CGACGGCCGA CAGCGAGGAC
ACGGCGACGG ACACCGACGA CACAGAGGCC ACGACCGACG GCTCGATCAC GTTCGCACAG
GCGAAATCGC CCGTGGAGTT CGATCCCGTC GTCCTGAACG ACGTTCCGTC GGCCGAGGTC
GCGATGCTCG TCTTCGACTC GCTGTACACG TACGACGAGG GCACCAATCT CGTTCCGCAG
ATCGCCGCCG ACATGCCGGA GGTAGAGCGC GGAGCCCAGC GGTGGATCGT TCCGATACGG
ACCGACGCCA CTTTCCAGAA CGGCGACCCG GTCACGGCCG AAGACGTCGC TTACTCCTTC
CGGGCACCGG TCGAAGAAGA GACGGAGAAC GCCGGGGAGT TCAACATGAT CGATAGCGTC
GAAGTCGTCG ACGAGTCGAC CGTCCAGTTC GACCTGCAGT TCCAGTTCGG GGCATTCGAC
TCGTATCTCC CGTGGGAAAT CGTCGACAAG TCCGTCCGCG AATCGGACAG AGACGCCTAC
AACACGTCAA GCCCCGTCGG AGCCGGCCCG TTCACGTTCG ACGACTGGCA GGAGGGCGAG
TACGTCCGGC TCAGCCGCTG GGACGACTAC TGGGGAGAGC CACTCCCGAA CCTCGCCGAG
ATCGAATTCG TCCCGGTCGA AGAGCCGACG ACCCGGATCA CGACGCTGCG AACCGGTGAG
AACGACGTGG TAAAGAACAT TCCACCGGCA AACTGGGAGA CTGTCGAGAA CATGGGCGAG
GCCAGCATCG AGTCGGTTCT GGGAACGAGC TACTTCTATC TCGCCTTCAA CTGCAACGAG
GGACCCACCG CCGACCCCGA GGTGCGGGAG GCGATCGACT ACGCCTTCTC GATGGACGAC
GCGGTCGGCC AGTACGTCGA ACCGACCGGC GAGCGACAGT ACGCGCCGGT TCCCAGAGCG
ATCTCCGAAG ACTGGGAGTT CCCCGTCGAG GAGTGGCAGC AGATCCCCCA CGAGCCGGAT
CTGGATCGGG CCAAGTCGCT GCTCGACGAC AACGACAGCG TCCCGGACGA CTGGCAGCCC
CGGATCATCG TCCCGCCGGA CGACAAGCGC GAACAGATCG GGGTCTCCGT CTCGAACGGG
CTCAGCGAGG CCGGCTACGA CGCGACGGTC CAGCGCCTCG ACTGGGGTGC CTTCCTCGAA
CAGTACGTCA CCGGCAGCGC CGACGACTAC AACATGTTCA CGCTTGGCTG GGCCGGTTCG
CCCGATCCGG ACACGTTCAT GTACTTCCTG TTCGCCCACG ACCAGATCGG CACGAACAAC
GGCACCTACT ACCGCAACGA GTCGATGAAC GAACAGATCA TGAACGCCCG TCAGTCCAAC
GACGACGAAC AGCGCCGCGA GTGGTACGTC GACGCCATCC AGACGGTGCT CGAAGACCGG
GTCCACCTCC CGGCGTACAA CATCAAGAAC AGCTTCGGGG TCCGGAGTCA CGTCTCGGAC
TTCCGGGCCC ACCCCGTCGA CCAGTTCAGC ATCGTCTCGG CGTACAACAA CGTCTCCGTC
CAGTGA
 
Protein sequence
MKDTRHTGRI AWLSRRQYVR GVAGSALAVA LAGCQGDGGE DDGGDGGAQQ PEETTADSED 
TATDTDDTEA TTDGSITFAQ AKSPVEFDPV VLNDVPSAEV AMLVFDSLYT YDEGTNLVPQ
IAADMPEVER GAQRWIVPIR TDATFQNGDP VTAEDVAYSF RAPVEEETEN AGEFNMIDSV
EVVDESTVQF DLQFQFGAFD SYLPWEIVDK SVRESDRDAY NTSSPVGAGP FTFDDWQEGE
YVRLSRWDDY WGEPLPNLAE IEFVPVEEPT TRITTLRTGE NDVVKNIPPA NWETVENMGE
ASIESVLGTS YFYLAFNCNE GPTADPEVRE AIDYAFSMDD AVGQYVEPTG ERQYAPVPRA
ISEDWEFPVE EWQQIPHEPD LDRAKSLLDD NDSVPDDWQP RIIVPPDDKR EQIGVSVSNG
LSEAGYDATV QRLDWGAFLE QYVTGSADDY NMFTLGWAGS PDPDTFMYFL FAHDQIGTNN
GTYYRNESMN EQIMNARQSN DDEQRREWYV DAIQTVLEDR VHLPAYNIKN SFGVRSHVSD
FRAHPVDQFS IVSAYNNVSV Q