Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0050 |
Symbol | |
ID | 8409547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 46154 |
End bp | 48040 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645018388 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003175908 |
Protein GI | 257386135 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.600017 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.630489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAATA ATCATACCGA TCACGAGCAC CAGTCGATAC TATCTCGCCG CCGTTTCGTC AGCGGCGTCA GCCTGGCCGG CATCGCCGGC CTCGCGGGCT GTGGCGGACA GCAGGCCGAG CAGACGGCGA CCGAGACGTC CGGCGACGGC GGCGACGAGA CCGACGCGGG CGATACTGAC ACCGAGACCG AAACGGAGGT CCAGGCCACT AGCGAGGCCC AGCGCAAGAT TCAGGAGCTG GCCTACATCA CGAACCAGAC GCTCCCGGTG TTGCCGGTCA TGGAGAAACT CGCCCAGTCG TTCCAGTCGA CCGACGACTG GAACGTGCCC GGGACCGACA GCGATGCGGT CCAGACCTAC TGGCCCACAG AGTGGCTCCC GCGGGAGGGA CAGTGGACGG CGACGGACGA CTCCGACGAC GACCGACTGA CCTTCGCACA GTGGGCCGTT CCCCAGGACT CACAGTACAA CCCCTGGAAC GGGCAAAACT ACGGCGAAGC CCGTCGTCTG ATGTTCGATC GGTTCATGAA GTACAACCTC GCGACCCAGG AGTACACCGG CTACGCCATC CAGGACTGGG AGGTCGGCGA GGAGACGGTC TCGCTCACCG TCCGCGAGGG GCTGACCTGG CACAACGGCG ACGCCGTCAC CGCGACGGAC GTGGCCAACC AGGTCAAACT CGACATCTAC AACGGCGGTT CGCTGGGTAA CTTCGTCGCG CCCGAGGACG TGGGCGCGGT GTCGGACCGG GTCACGGCGG TCGACGAGTC GACGGTCGAG ATCACGCTGG CCGAACCCGC CAGCGAGACG ATCCTGCTGG CGTACCTCCA GCCAAAGCGG CTCACCGCCC ACGACGGCTC CTACGGGGAG TTCGTCACCG CACTCGACGA GGCCGCCGGC GAGGACGAGC GCGCGTCGGC GCTCAGCGAT CTCACCAACG ACACGACCCC CGAACCGGTC GGCTGTGGCC CCTTCCAGTT CGAGGACGCC GACAGCCAGC GCACGCTGCT GACCAAGTAC GAGGACCACC CGGACGCGGA CAACATCAAC GTCCCCGAGG CCGAGTACCT CTACAAGCCA CAGAACCAGG GGCGCTGGAA CTCGCTGATC AACAACGAGA CCGACGGGTC CGCGACGCTG TTCATGCCCC AGAACCGGCT CAACCAGCTG CCCGACTCGA TGCAGGTCAG TCTCATCCCG CGTCACTGGG GGATGGGGCT GATGTTCAAC TTCGAGGAAG AGCCCGTCGA CGACGTTCGG GTCCGCAAAG CGATCGCCCA CGTCGTCAAC CGCGAGAACG CGGCGCTCAA CTCCGGTGCG GGCACCCAGT CCAAGCTCGC GGTCACCTAC CCCAGCGGAC TGACCGGCGA GTTCAACGAC CAGATCGAGG GCGGCTGGCT CGACGGCGTC GCAGACGAGT TCGAGACCTA CGGCCCGGGC GAGTCCCAGA CCGAAGCGGC CGCGTCGCTG CTTCGCGACG CCGGCTACGA GAAACAGAAC GGCACCTGGC AGAAGGACGG CGAGCCCCTC GAACTCCCGA TCAAGGGGCC GTCGGGCTTC TCGGACTGGG TGACCGGCGT CGAGACGATC GTCTCGAACC TCAAGGACTT CGGCATCGAG GCCGAGTCAG TCATGCTCGA CAACTCCACG TACTGGGGGA GTGACTACTC CAACGGCGAC TTCGTCGTCG GGCTCCAGGG GTGGGCCTCC TACGACCACT CGTACCCGTA CTTCCACTTC GACTGGATCT TCAACAGCTG GGACGCCAAG AACGCCTGGA ACCTCCCAAG CGAGTTCGAA TCGCCGATCC TCCACGACGA TGAGCGCGAC GGCGAGACCG TCACGCCCGT CGACATCGTC GACGAGCTGT CGACGGCCAA CCAGTAG
|
Protein sequence | MANNHTDHEH QSILSRRRFV SGVSLAGIAG LAGCGGQQAE QTATETSGDG GDETDAGDTD TETETEVQAT SEAQRKIQEL AYITNQTLPV LPVMEKLAQS FQSTDDWNVP GTDSDAVQTY WPTEWLPREG QWTATDDSDD DRLTFAQWAV PQDSQYNPWN GQNYGEARRL MFDRFMKYNL ATQEYTGYAI QDWEVGEETV SLTVREGLTW HNGDAVTATD VANQVKLDIY NGGSLGNFVA PEDVGAVSDR VTAVDESTVE ITLAEPASET ILLAYLQPKR LTAHDGSYGE FVTALDEAAG EDERASALSD LTNDTTPEPV GCGPFQFEDA DSQRTLLTKY EDHPDADNIN VPEAEYLYKP QNQGRWNSLI NNETDGSATL FMPQNRLNQL PDSMQVSLIP RHWGMGLMFN FEEEPVDDVR VRKAIAHVVN RENAALNSGA GTQSKLAVTY PSGLTGEFND QIEGGWLDGV ADEFETYGPG ESQTEAAASL LRDAGYEKQN GTWQKDGEPL ELPIKGPSGF SDWVTGVETI VSNLKDFGIE AESVMLDNST YWGSDYSNGD FVVGLQGWAS YDHSYPYFHF DWIFNSWDAK NAWNLPSEFE SPILHDDERD GETVTPVDIV DELSTANQ
|
| |