Gene Hmuk_1749 details

Gene Information

Locus tag: Hmuk_1749
Symbol:
ID: 8411273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1665269
End bp: 1666597
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 67%
IMG OID: 645020077
Product: ABC transporter substrate-binding protein
Protein accession: YP_003177570
Protein GI: 257387797
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
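
The start, end, gene length, and protein length fields can be cross-checked against each other. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is part of the gene but not of the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for Hmuk_1749.
print(cds_lengths(1665269, 1666597))  # (1329, 442)
```

Under these assumptions the result agrees with the reported gene length of 1329 bp and protein length of 442 aa.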


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.239653
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.31899
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACGG ACGACCGTGG TGGCTGCGGT TCGCGGGCGA CGCGGCGGCG TGTACTCGAA 
TCTCTGGGTG GGACCGGAGC GGTGGCGCTG GCCGGCTGTC TCGGATCTGA CGGGGCGACG
ACAGAAACCA GCACCGACGA TTCGGTGACG ATCGGACTGT CCTCGCCCGA GTCCGGTCGC
TACTCTCCGA TCGGCGATCA CGAGCGGCGA GGGTTCGAAC TCGCGGTCAC CCATCTCAAC
GAGGGCGGGG GCCTGGTCGG TGGCGATGGG TTCGCTCGTC TCTCAGGGGA CGGCGTGCTC
GGACGGACCG TCGAGACCGC CGTCGCCGAC ACGGAAGGGG ACGCCTCGAC GGCCGAAACG
AACCTCCAGC AGCGACTCGG CGACGACGAG CTGGCCATGT TCACCGGCGG TGTCGCGGGC
AGCGTCGTCA CGACACACCG GGATCTGGCG GACGAACACG AGACGCCGTA CTTCGTCGGC
ACGTCGACGC TGAACCAGCT CACCGGCGAG AACTGCTCGC CACACGTCTA CCGCGAGCTG
TTCAACTCTC GCACGCTGGC CCGTGCCCTC GTCCCGGAAG TCGTCTCCGA CATCGACGGT
GCCCAGTTCT TCTTCCAGGT CACTACCGAC ACGATCGAAG GACGCGACCT CAAGCGGAGC
ATCAACCGCT ACGCGACCGC GAGCGACCTC GACTTGCGGC CCATCGGCAC GACGACGGTC
CGGTCGGGCT CGACCGACTT CGAGCGAGCG CTCTCGGAAG CGTCGAGCAA CAGCGTCGAC
ATCGTCTTCC TCGATCTGTT CGGCCTCGAC GCCGTCAACG CGATCCAGCA GGCCAAAGAG
ATCCTCTCAG AGGATGCAGT CATCGTCGTT CCGTTGCTCA CCCAGTCGGT CTCGGATTCG
CTGGGCGAGC GGGTCGAGGG AGTCTACGGC ACCGTCGGCT GGCACGAGAA CCTCGATACG
CCGCTGTCGA GCAAGTTCGG CGACGCCTAC CAGAACGAGT ACTCGGGAAC GATGAGCACC
ACGGCCCTCG TCCCACCGGG CCCGGCCCAG AACACGTACG GACAGGTCCT GTTGTGGGCC
AGCGCGGCCG AGGCCGCCGG CACGTTCGAC GCGGACGCGG TCCGGTCCGA ACTCGAAGGG
GTCGAGTACG CACTCGGTGC GGGTTCAGAG ACGATGCGGG CCTGCGACCA CCAGGCGATG
CGAGCCGTCC CGGTCGTTCG TGGCAGTACC GGGACGGACT CGGTCGGGAA CTACTTCGAG
CTGTTGAACG GGCGGCGAAA CGTGGAACCC GGCTGTGACG AACCGCCCGC GTCGGCGTGT
GAGCTGTAA
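
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be stripped before any analysis. The following minimal sketch shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the code assumes the sequence contains only unambiguous bases.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGACAACGG ACGACCGTGG TGGCTGCGGT TCGCGGGCGA CGCGGCGGCG TGTACTCGAA"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full 1329 bp sequence to `gc_percent` gives a value that can be compared with the reported GC content.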
 
Protein sequence
MTTDDRGGCG SRATRRRVLE SLGGTGAVAL AGCLGSDGAT TETSTDDSVT IGLSSPESGR 
YSPIGDHERR GFELAVTHLN EGGGLVGGDG FARLSGDGVL GRTVETAVAD TEGDASTAET
NLQQRLGDDE LAMFTGGVAG SVVTTHRDLA DEHETPYFVG TSTLNQLTGE NCSPHVYREL
FNSRTLARAL VPEVVSDIDG AQFFFQVTTD TIEGRDLKRS INRYATASDL DLRPIGTTTV
RSGSTDFERA LSEASSNSVD IVFLDLFGLD AVNAIQQAKE ILSEDAVIVV PLLTQSVSDS
LGERVEGVYG TVGWHENLDT PLSSKFGDAY QNEYSGTMST TALVPPGPAQ NTYGQVLLWA
SAAEAAGTFD ADAVRSELEG VEYALGAGSE TMRACDHQAM RAVPVVRGST GTDSVGNYFE
LLNGRRNVEP GCDEPPASAC EL
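
The protein sequence can be checked against the gene sequence by translating it. The sketch below is a minimal, dependency-free illustration, not the method used by the database. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the two differ only in which start codons are permitted). The `translate` function and the table construction are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace from blocked listings is ignored; a trailing partial
    codon, if any, is dropped.
    """
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGACAACGG ACGACCGTGG TGGCTGCGGT"))  # MTTDDRGGCG
print(translate("GAGCTGTAA"))  # EL
```

The two fragments match the first ten residues and the last two residues of the protein sequence listed in the record.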