Gene Information |
Locus tag | Hmuk_1824 |
Symbol | |
ID | 8411350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1743503 |
End bp | 1744852 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645020154 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003177645 |
Protein GI | 257387872 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
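The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Hmuk_1824.
length = gene_length_bp(1743503, 1744852)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed gene length (1350 bp) and protein length (449 aa).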
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTGACG ATAGTGACCG ACTTTCGCGG CGGCGGTTCG TTGAAGCGAC AGGAGCGGCA ACACTGATCG GCCTCGCTGG CTGTAGCGGG GACGGCGGTG GTGACGGCGG CGACGGCGGC GACGGCGGCA GTGAGAACAC CGACGGCAGT GACGGCAGTG ACGGCGGCGA CAGCAGCGGG GACCCACTCG AAGTGCTTCA CGGCTGGACC GGCGGCGACG GTGCAGCAGC GGCCGAAGCA CTCGTCGAGG CGTTCGAAGA GGAGTACCCC GACATGGAAC ACGAGTTCAA CCCCATCGGT GGGGGCGGGA ACCAGAACCT CGACGCAGTC GTCGCCAATC GGCTGCAGAA CAACAACCCG CCGAGTTCCT TCGCCAACTG GCCCGGCAAG AACCTCCAGC GCTACGAGGG CGTGCTGGGC GAGGCCGACA GCGTGTGGGA CGAAGAGGGC TTCGAGGACG TGATGGTCCA GGAGGCAGTC GATCTCCACC AGTACAACGG TGCGTTCCGA GCCGTCCCGC TGGGTTCCCA CCGACTGAAC TGCCTGTTCT ACAACACCTC GGTCGTCGAG GAGGCGGGCG TCGATCCCGA TTCGCTGACC AGCGTCTCGG CCCTGATCGA CGCACTGGAG ACGGTCGCGA CCGAGACCGA CGCGGTGCCG ATGACCCACG GTATGAGCGG GACCTGGACG ACGACGCAGC TGTGGGCGTC GACCATGCTC GGTAAGGAAG GATACGACGC CTACATGAAC TTCATCGAGG GCAGTCCCGA CGAGGCCGCC GTCCAGTCGG CCTTCGAGTC GGTCGCCGAG ATCCTCGAGA ACTACATCAA CGACGACGCG TCCTCGATCG GTCTGACGGA GTCCAACCAG AACATCATCG AGGGCAACGC CGCCTTCATC CACCAGGGCA ACTGGGCGGC CGGTGCCTTC CGTAACGCCG AGGACTTCGA GTACGACGAG GACTGGGGCT TCAAGACGTA CCCCGGCTCC GAGGGGATGT ACATGCTCCA CTTCGACTCG TTCCTCTACC CGTCGAACAA CCCGACCCCG GAGAAGACGG ACAAGTTCAT GGCCTTCGTC GGGAGCGAGG CCGCACAGGT CGCGTTCAAC CAGTACAAGG GATCGATCCC GACCCGGACG GACGTGGACA TGAGCGCGTT CGGTCCGTAC CTTCAGGAGA CCCAGGAGGA CTTCGCCAAC GCCGAGGAGC GGCCCCCGAA CCTCCAGCAC GGTCTGGGTG TCGACTCCGA GACGATGTCG ACGCTGAACG ACGTGATCTC CAGCGAGTTC TCGGGACCGT ACAACGTCGA GGCGGCGACC AGCGGCTTCA TCGACGCGGT CTCCAACTGA
Protein sequence | MTDDSDRLSR RRFVEATGAA TLIGLAGCSG DGGGDGGDGG DGGSENTDGS DGSDGGDSSG DPLEVLHGWT GGDGAAAAEA LVEAFEEEYP DMEHEFNPIG GGGNQNLDAV VANRLQNNNP PSSFANWPGK NLQRYEGVLG EADSVWDEEG FEDVMVQEAV DLHQYNGAFR AVPLGSHRLN CLFYNTSVVE EAGVDPDSLT SVSALIDALE TVATETDAVP MTHGMSGTWT TTQLWASTML GKEGYDAYMN FIEGSPDEAA VQSAFESVAE ILENYINDDA SSIGLTESNQ NIIEGNAAFI HQGNWAAGAF RNAEDFEYDE DWGFKTYPGS EGMYMLHFDS FLYPSNNPTP EKTDKFMAFV GSEAAQVAFN QYKGSIPTRT DVDMSAFGPY LQETQEDFAN AEERPPNLQH GLGVDSETMS TLNDVISSEF SGPYNVEAAT SGFIDAVSN
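The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. This example assumes the gene sequence is shown 5' to 3' in coding orientation (it begins with ATG and ends with TGA). It builds the codon table from the standard 64-codon layout, whose amino-acid assignments translation table 11 shares. The helper names `clean`, `translate`, and `gc_percent` are illustrative and not part of any database API.

```python
from itertools import product

# Standard codon layout in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGACTGACG ATAGTGACCG ACTTTCGCGG"
print(translate(first_blocks))  # MTDDSDRLSR, the first ten residues listed
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way gives values that can be compared against the listed protein sequence and GC content.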