Gene Hmuk_0050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0050 
Symbol 
ID8409547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp46154 
End bp48040 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content66% 
IMG OID645018388 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003175908 
Protein GI257386135 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.600017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.630489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAATA ATCATACCGA TCACGAGCAC CAGTCGATAC TATCTCGCCG CCGTTTCGTC 
AGCGGCGTCA GCCTGGCCGG CATCGCCGGC CTCGCGGGCT GTGGCGGACA GCAGGCCGAG
CAGACGGCGA CCGAGACGTC CGGCGACGGC GGCGACGAGA CCGACGCGGG CGATACTGAC
ACCGAGACCG AAACGGAGGT CCAGGCCACT AGCGAGGCCC AGCGCAAGAT TCAGGAGCTG
GCCTACATCA CGAACCAGAC GCTCCCGGTG TTGCCGGTCA TGGAGAAACT CGCCCAGTCG
TTCCAGTCGA CCGACGACTG GAACGTGCCC GGGACCGACA GCGATGCGGT CCAGACCTAC
TGGCCCACAG AGTGGCTCCC GCGGGAGGGA CAGTGGACGG CGACGGACGA CTCCGACGAC
GACCGACTGA CCTTCGCACA GTGGGCCGTT CCCCAGGACT CACAGTACAA CCCCTGGAAC
GGGCAAAACT ACGGCGAAGC CCGTCGTCTG ATGTTCGATC GGTTCATGAA GTACAACCTC
GCGACCCAGG AGTACACCGG CTACGCCATC CAGGACTGGG AGGTCGGCGA GGAGACGGTC
TCGCTCACCG TCCGCGAGGG GCTGACCTGG CACAACGGCG ACGCCGTCAC CGCGACGGAC
GTGGCCAACC AGGTCAAACT CGACATCTAC AACGGCGGTT CGCTGGGTAA CTTCGTCGCG
CCCGAGGACG TGGGCGCGGT GTCGGACCGG GTCACGGCGG TCGACGAGTC GACGGTCGAG
ATCACGCTGG CCGAACCCGC CAGCGAGACG ATCCTGCTGG CGTACCTCCA GCCAAAGCGG
CTCACCGCCC ACGACGGCTC CTACGGGGAG TTCGTCACCG CACTCGACGA GGCCGCCGGC
GAGGACGAGC GCGCGTCGGC GCTCAGCGAT CTCACCAACG ACACGACCCC CGAACCGGTC
GGCTGTGGCC CCTTCCAGTT CGAGGACGCC GACAGCCAGC GCACGCTGCT GACCAAGTAC
GAGGACCACC CGGACGCGGA CAACATCAAC GTCCCCGAGG CCGAGTACCT CTACAAGCCA
CAGAACCAGG GGCGCTGGAA CTCGCTGATC AACAACGAGA CCGACGGGTC CGCGACGCTG
TTCATGCCCC AGAACCGGCT CAACCAGCTG CCCGACTCGA TGCAGGTCAG TCTCATCCCG
CGTCACTGGG GGATGGGGCT GATGTTCAAC TTCGAGGAAG AGCCCGTCGA CGACGTTCGG
GTCCGCAAAG CGATCGCCCA CGTCGTCAAC CGCGAGAACG CGGCGCTCAA CTCCGGTGCG
GGCACCCAGT CCAAGCTCGC GGTCACCTAC CCCAGCGGAC TGACCGGCGA GTTCAACGAC
CAGATCGAGG GCGGCTGGCT CGACGGCGTC GCAGACGAGT TCGAGACCTA CGGCCCGGGC
GAGTCCCAGA CCGAAGCGGC CGCGTCGCTG CTTCGCGACG CCGGCTACGA GAAACAGAAC
GGCACCTGGC AGAAGGACGG CGAGCCCCTC GAACTCCCGA TCAAGGGGCC GTCGGGCTTC
TCGGACTGGG TGACCGGCGT CGAGACGATC GTCTCGAACC TCAAGGACTT CGGCATCGAG
GCCGAGTCAG TCATGCTCGA CAACTCCACG TACTGGGGGA GTGACTACTC CAACGGCGAC
TTCGTCGTCG GGCTCCAGGG GTGGGCCTCC TACGACCACT CGTACCCGTA CTTCCACTTC
GACTGGATCT TCAACAGCTG GGACGCCAAG AACGCCTGGA ACCTCCCAAG CGAGTTCGAA
TCGCCGATCC TCCACGACGA TGAGCGCGAC GGCGAGACCG TCACGCCCGT CGACATCGTC
GACGAGCTGT CGACGGCCAA CCAGTAG
 
Protein sequence
MANNHTDHEH QSILSRRRFV SGVSLAGIAG LAGCGGQQAE QTATETSGDG GDETDAGDTD 
TETETEVQAT SEAQRKIQEL AYITNQTLPV LPVMEKLAQS FQSTDDWNVP GTDSDAVQTY
WPTEWLPREG QWTATDDSDD DRLTFAQWAV PQDSQYNPWN GQNYGEARRL MFDRFMKYNL
ATQEYTGYAI QDWEVGEETV SLTVREGLTW HNGDAVTATD VANQVKLDIY NGGSLGNFVA
PEDVGAVSDR VTAVDESTVE ITLAEPASET ILLAYLQPKR LTAHDGSYGE FVTALDEAAG
EDERASALSD LTNDTTPEPV GCGPFQFEDA DSQRTLLTKY EDHPDADNIN VPEAEYLYKP
QNQGRWNSLI NNETDGSATL FMPQNRLNQL PDSMQVSLIP RHWGMGLMFN FEEEPVDDVR
VRKAIAHVVN RENAALNSGA GTQSKLAVTY PSGLTGEFND QIEGGWLDGV ADEFETYGPG
ESQTEAAASL LRDAGYEKQN GTWQKDGEPL ELPIKGPSGF SDWVTGVETI VSNLKDFGIE
AESVMLDNST YWGSDYSNGD FVVGLQGWAS YDHSYPYFHF DWIFNSWDAK NAWNLPSEFE
SPILHDDERD GETVTPVDIV DELSTANQ