Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2277 |
Symbol | |
ID | 8411818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 2199690 |
End bp | 2200883 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645020620 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003178096 |
Protein GI | 257388323 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGAAC ACAACGCTGA CGACGCCGAC CGATCGGATC GAGATCGCTC GCGTCCGTCG TCACGACGCG GATTCCTGGC CGCCAGCGGC ACGCTCGCTG CCGGCGCGCT CGCCGGATGC ACGGATATGC TCCCCGGTGG AAGCGACGGC ACGAGCGCCA GCGAGCTATC GCTCGGTGAC TTCCGTGGCT CTGGTCCGCT CGTCGAGCAA CGCTCCGCCC CGGAGGGGAC CAGCATCGAC GACCTCCCGG ACCTCTCGGG CGAACTGACG ATGTATCTCG GCGGCGGCGA GAGCGGTCTC TACCTCGACC TGGTCGATCT CCTCGAACAG ATCTACCCCG ATTTCAGTGT GAGCGCACAG CAGGAGTCGG CGAGCAACCT CGCCAACCGG ATCGCAGAAG AGAACCGCGC AGGCAGCACG CCGGCCGACG TGTTCATGGC CGTCGACGCC GGTTCGCTGG GGTCGGTCGC CGAGGACGGT GCCGCCGTGT CGATGTCCGC CGAAGTGACA GATCCCGTTC GCGACGCCTT CAAGGACAGC GAGGACCGCT GGGTCGGCTT CGCCGGCCGC GCCCGCGCGA TCCCGTACAA CACGAACGAA CTCTCGGCCA GTGACGTGCC GAGTACGGTC GCGGCGTTCC CCGAGACGAG CGCCTTCGGA GACTCCTTCG GCTGGGCACC CAGTTACAGC GCCTTCCAGT CGTTCGTGAC CGCCATGCGT GTCATCGAGG ACGACGAGAC GACTCGTCAG TGGCTCCAGT CCATGCAGGA CCTCGGGGTC ACGACCTACG ACAACGAGTT CGCCGTCTCG AACCGCGTGG CCGACGGCGA GATCTCGGCC GGTTTCGCGA ACCACTACTA CTCGCTGCGG GTCCAGAGCG ACCGATCCTC GGCCCCGATC GACCTCGCGT TCACCGAGGG CGACGCCGGT GCGCTGGTCA ACGTCTCCGG TGCCCAGATC CTCGACGGCA CCGAGAACAA GGCGCTCGCG GAGAACTTCC TCCGACACGT CCTCTCGGCG GAGGCCCAGG AGTTCTTCGC CACCCGGACC TACGCCTACC CGATGATCTC GGGCGTCGAG CCGGTCGGCG ACCTCCCGAC GATCGACGAG CTGAATCCGC CGGACATCGA TCTCGGCGAG CTATCGGATC TGGGCGGGAC GGTCGACATG CTGCGTGAGG TCGGCGTCCT CTGA
|
Protein sequence | MMEHNADDAD RSDRDRSRPS SRRGFLAASG TLAAGALAGC TDMLPGGSDG TSASELSLGD FRGSGPLVEQ RSAPEGTSID DLPDLSGELT MYLGGGESGL YLDLVDLLEQ IYPDFSVSAQ QESASNLANR IAEENRAGST PADVFMAVDA GSLGSVAEDG AAVSMSAEVT DPVRDAFKDS EDRWVGFAGR ARAIPYNTNE LSASDVPSTV AAFPETSAFG DSFGWAPSYS AFQSFVTAMR VIEDDETTRQ WLQSMQDLGV TTYDNEFAVS NRVADGEISA GFANHYYSLR VQSDRSSAPI DLAFTEGDAG ALVNVSGAQI LDGTENKALA ENFLRHVLSA EAQEFFATRT YAYPMISGVE PVGDLPTIDE LNPPDIDLGE LSDLGGTVDM LREVGVL
|
| |