Gene Hmuk_2440 details

Gene Information

Locus tag: Hmuk_2440
Symbol:
ID: 8411984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2341880
End bp: 2343148
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 52%
IMG OID: 645020783
Product: hypothetical protein
Protein accession: YP_003178257
Protein GI: 257388484
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
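
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2341880
end_bp = 2343148

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the terminal stop codon encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1269
print(protein_length)  # 422
```

Both results agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields.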


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00184061
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTCGAGAG ACGATACGAC ACACGAGGCA CCGACGCGCA GAGACTACGT GAAGTACGGC 
GGGGCGGTCA TTGGCGGTGG GCTACTGGCC GGCTGTACGG GAAACAGCGC TGATGGCGAC
AACGCTGATA CCGACAGCCC GAAGATCGAA GAATCAACCG GCGGTGAGAC GGACAGTCCT
GAAGAAACCA AAACTCCCGA AGACACGTCG TACTCGGTGA CGATGGAACC AGTAGGAGAG
GTTACATTTG AGGGAGTTCC AACTACCTGG CTCAGTACAG ACTCGGAATT GACTGACATG
GCGTTCTGTC TCGGTCAAAT CGATGGATGG ATTCCGACAC CACTGCGGAG TTTCAACTAC
TTCGAACATC TCGGTGTAGA CCTTCGCAGT CGTTACCCCA ATTCTGACCC ATACACGTGG
AAGGAAGGAG AAGCGGACTA CGACGGAAAA GAGTACGTAT ACGAAGTCGA GCCCGATCTC
CTGATGTTCT TTCCGCAACG ACGCTTGGTG TATAACAAAG CGTGGGACGA GGGTGATTTG
GAAGAAATCT CGGAGAACGT AGCCCCAATT TTCGGCACGA ACGTTTATCG GAACAATTCA
AGCCTGAACT ACGACCAACC ATCCATCTAC GAGGCCTTTG AAAAGCTCGC ACGGGTTTTC
AAGGAGGAAG CCAGGTACGA GGCGTTCGTC GAGGTTCACG ACCAGATGAT CCGGACCGTG
AACTCGAAAC TACCACCCGA GGAGGAACGA CCGACTATCG CGTATCTCAG CGATGCGAGT
GACCCTAACA GAGGGAATAT TTATCCGGTC GGCTTCGACG GAGAATTAGG ACAGTCGAAG
CACTTCCGAG ACCTTGAAGC CGTCAACGCA TTTGAAATCT CCGAGCAGAC CGACTACGAG
GGGCTTCTAC AAGCCGATCC CGACGTAATC ATTATTCAGG GAGCTCTAGA ATACACCGGA
AATCTTGTCT CCGACGAGAA CGGTGAAGCA GTATTTGATC TTGACCAATT CCGCAAAAAT
TACGTTCTGC CAATGGAGGA TAATTCGGTC GGTCAGAAGG TGACTGCTGT TCAGGAAAAC
AGGATCGAGC CGGGCGGAGT GAGTCGACAG GGACCGTTGA CGAATCTCTA CAATACGGAG
ATACTTGCTC AGCAGGTATA TCCCAAGCAG TTTGGAGAGT TTGATCCAGA AAATCCGTTC
GACAGTGCTG AGGAACACCA GCTATTTGAT CGAGAACGCG TCCGAGACAT TATCAACGGC
GACCTCTGA
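
A GC content figure like the one listed for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the database's own method: the helper name `gc_percent` is hypothetical, and it simply strips the spaces and line breaks used in the listing before counting G and C bases.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (the 10-base grouping and line breaks used in the
    listing) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Usage on the first 10-base group of the gene sequence above.
print(gc_percent("ATGTCGAGAG"))  # 50.0
```

Passing the full gene sequence instead of a single group gives the value for the whole gene.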
 
Protein sequence
MSRDDTTHEA PTRRDYVKYG GAVIGGGLLA GCTGNSADGD NADTDSPKIE ESTGGETDSP 
EETKTPEDTS YSVTMEPVGE VTFEGVPTTW LSTDSELTDM AFCLGQIDGW IPTPLRSFNY
FEHLGVDLRS RYPNSDPYTW KEGEADYDGK EYVYEVEPDL LMFFPQRRLV YNKAWDEGDL
EEISENVAPI FGTNVYRNNS SLNYDQPSIY EAFEKLARVF KEEARYEAFV EVHDQMIRTV
NSKLPPEEER PTIAYLSDAS DPNRGNIYPV GFDGELGQSK HFRDLEAVNA FEISEQTDYE
GLLQADPDVI IIQGALEYTG NLVSDENGEA VFDLDQFRKN YVLPMEDNSV GQKVTAVQEN
RIEPGGVSRQ GPLTNLYNTE ILAQQVYPKQ FGEFDPENPF DSAEEHQLFD RERVRDIING
DL
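
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions: the codon table is built from the standard base ordering (T, C, A, G), whose codon assignments are the same as those of translation table 11 (table 11 differs only in its permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string codon by codon, stopping at a stop codon.

    Whitespace in the input (10-base groups, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTCGAGAG ACGATACGAC ACACGAGGCA"))  # MSRDDTTHEA
# The final 9 bases end in the stop codon TGA.
print(translate("GACCTCTGA"))  # DL
```

The two outputs match the first ten residues and the last two residues of the protein sequence listed above.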