Gene Hmuk_2277 details

Gene Information

Locus tag: Hmuk_2277
Symbol:
ID: 8411818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2199690
End bp: 2200883
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 67%
IMG OID: 645020620
Product: extracellular solute-binding protein family 1
Protein accession: YP_003178096
Protein GI: 257388323
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
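
The gene and protein lengths listed above can be recovered from the start and end coordinates. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon (neither is stated in the record); the function names are hypothetical:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2199690, 2200883)
print(length, protein_length(length))  # prints: 1194 397
```

Under these assumptions, the result agrees with the 1194 bp and 397 aa given in the record.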


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGAAC ACAACGCTGA CGACGCCGAC CGATCGGATC GAGATCGCTC GCGTCCGTCG 
TCACGACGCG GATTCCTGGC CGCCAGCGGC ACGCTCGCTG CCGGCGCGCT CGCCGGATGC
ACGGATATGC TCCCCGGTGG AAGCGACGGC ACGAGCGCCA GCGAGCTATC GCTCGGTGAC
TTCCGTGGCT CTGGTCCGCT CGTCGAGCAA CGCTCCGCCC CGGAGGGGAC CAGCATCGAC
GACCTCCCGG ACCTCTCGGG CGAACTGACG ATGTATCTCG GCGGCGGCGA GAGCGGTCTC
TACCTCGACC TGGTCGATCT CCTCGAACAG ATCTACCCCG ATTTCAGTGT GAGCGCACAG
CAGGAGTCGG CGAGCAACCT CGCCAACCGG ATCGCAGAAG AGAACCGCGC AGGCAGCACG
CCGGCCGACG TGTTCATGGC CGTCGACGCC GGTTCGCTGG GGTCGGTCGC CGAGGACGGT
GCCGCCGTGT CGATGTCCGC CGAAGTGACA GATCCCGTTC GCGACGCCTT CAAGGACAGC
GAGGACCGCT GGGTCGGCTT CGCCGGCCGC GCCCGCGCGA TCCCGTACAA CACGAACGAA
CTCTCGGCCA GTGACGTGCC GAGTACGGTC GCGGCGTTCC CCGAGACGAG CGCCTTCGGA
GACTCCTTCG GCTGGGCACC CAGTTACAGC GCCTTCCAGT CGTTCGTGAC CGCCATGCGT
GTCATCGAGG ACGACGAGAC GACTCGTCAG TGGCTCCAGT CCATGCAGGA CCTCGGGGTC
ACGACCTACG ACAACGAGTT CGCCGTCTCG AACCGCGTGG CCGACGGCGA GATCTCGGCC
GGTTTCGCGA ACCACTACTA CTCGCTGCGG GTCCAGAGCG ACCGATCCTC GGCCCCGATC
GACCTCGCGT TCACCGAGGG CGACGCCGGT GCGCTGGTCA ACGTCTCCGG TGCCCAGATC
CTCGACGGCA CCGAGAACAA GGCGCTCGCG GAGAACTTCC TCCGACACGT CCTCTCGGCG
GAGGCCCAGG AGTTCTTCGC CACCCGGACC TACGCCTACC CGATGATCTC GGGCGTCGAG
CCGGTCGGCG ACCTCCCGAC GATCGACGAG CTGAATCCGC CGGACATCGA TCTCGGCGAG
CTATCGGATC TGGGCGGGAC GGTCGACATG CTGCGTGAGG TCGGCGTCCT CTGA
 
Protein sequence
MMEHNADDAD RSDRDRSRPS SRRGFLAASG TLAAGALAGC TDMLPGGSDG TSASELSLGD 
FRGSGPLVEQ RSAPEGTSID DLPDLSGELT MYLGGGESGL YLDLVDLLEQ IYPDFSVSAQ
QESASNLANR IAEENRAGST PADVFMAVDA GSLGSVAEDG AAVSMSAEVT DPVRDAFKDS
EDRWVGFAGR ARAIPYNTNE LSASDVPSTV AAFPETSAFG DSFGWAPSYS AFQSFVTAMR
VIEDDETTRQ WLQSMQDLGV TTYDNEFAVS NRVADGEISA GFANHYYSLR VQSDRSSAPI
DLAFTEGDAG ALVNVSGAQI LDGTENKALA ENFLRHVLSA EAQEFFATRT YAYPMISGVE
PVGDLPTIDE LNPPDIDLGE LSDLGGTVDM LREVGVL
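
The sequences above are printed in blocks of ten characters. The following is a hedged sketch of how such a listing could be joined, checked for GC content, and translated. It assumes the standard codon assignments listed for NCBI translation table 11 and does not handle that table's alternative start codons, which is sufficient here because this gene starts with ATG. The helper names are hypothetical, and only the first 30 bases of the gene are used:

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block_text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


fragment = clean("ATGATGGAAC ACAACGCTGA CGACGCCGAC")
print(translate(fragment))  # prints: MMEHNADDAD
```

The translated fragment matches the first ten residues of the protein sequence listed above.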