Gene Hmuk_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1541 
Symbol 
ID8411062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1470526 
End bp1472100 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content69% 
IMG OID645019867 
ProductNa+/solute symporter 
Protein accessionYP_003177363 
Protein GI257387590 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.69558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACA CGACGATCCA GCTGGGCATC GTCGGCGGCT ACATGGTACT GGCGGCGGCC 
ATCGGCGTCG TCGCCTACCG CCTGACCGAT CGCACGGCGG AAGACTACTA CCTCGCCAGC
CGGACATTCG GCACGGTCGT ACTCCTGTTC ACGACGTTCG CGACGCTGCT GTCGGCCTTT
ACCTTCTTTG GCGGCCCGAA CCTCACCTTC GCGCAGGGAC CCGAGTGGCT GCTGGTGATG
GGGCTGATGG ACGGGATCAT CTTCGCCGTC CTCTGGTACG TGCTGGGGTA CAAGCAGTGG
CTCGTCGGCC AGCGCCACGG CTACGTCACG CTCGGGGAGA TGCTGGGCGA TCGCTTCGGC
TCGCGTCTCC TCCGCGGGCT CGTGGCGGCG ATCAGTCTCT TCTGGCTCTT TCCCTACGTG
ATGCTCCAGC AGAAGGGTGC CGGCCAGGCG GTCGTCGGAC TGACCGACGG CGCGGTCCCG
TTCTGGGTCG GTGCCGGCGG CATCACGCTC TTTATGATCG TCTACGTCGC CGTCTCGGGG
ATGCGCGGGG TCGCCTGGAC CGACACGCTC CAGGGGATCG TCATGCTCGG GCTGATCTGG
GCGGCCGTCG CCTGGATCCT CTCGGCGGTC GGCGGCCCGG CGGCGGCGAC GGACCGACTG
GCCGAGACCA ACCCCGAGTT CCTCGCACTG GGCGGCGGGC TGTACACGCC GGAGTACGTC
CTCTCGACGG CGATTTCCAT CGCCTTCGGC GTGACGATGT TCCCCCAGAT CAACCAGCGC
TTCTTCGTCG CTCGCTCCCA GAAAGTGCTC AAGCGGACGC TGGCGCTGTG GCCCGTACTG
GTGGTCCTGC TGTTCGTCCC CGCGTTCATG CTCGGCGCGT GGGCCGCCGG CCTGGGCGTC
ACCGTTCCGG AGAACGGCAA CGTGATCCCG GCGGTGCTCA ACGAGTACAC CGCCGGGTGG
TTCACGGCGG CCGTCGTCGC CGCGGCGCTG GCCGCCATGA TGTCCTCCAG CGACTCGATG
CTGCTCTCGG GCGCCTCCTA CCTCACTCGC GACCTCTACC GACCGGTGAC CGACCTCGCC
GAGGAGGAGC CGACGCTGCC GGACCGCGCG TCGCTGGTGA ACCGCGTCCG CCGATCCCTG
CTCGCAGTCG CCGTCTCCGT CGGTCGCACG CTTCACTCCG ACCGCGACCG CGAGACGCTG
CTCGCCCGCG CTGGCGTGGT CGTCTTCGCG ACGGTCTCGT TCGTCGCCAG CCTCTACGCG
CCGGGAACGC TCGTCCAGAT CGGCGACACC GCGTTCGGCG GCTTCGCCCA GCTGGCCCTG
CCCGTCATCG TCGCGCTGTA CTGGCCCCGG ACGACCCGCT GGGGGATGTA CGCCGGCGTC
GGCGGCTCGC AGCTGTTCTA CCTCGCCAGC GTCTTCCTCC CGTTCGTGCC CGGCAGCTAC
CTCGGTGGCT GGTCGGCCAG CGTCGTCTGC ATGGCGCTGG GACTGGTCCT GACCGTCGGC
GTCTCGCTCG TGACGAGCGC GTCCCCCGGC GAGGACGCCG GCCTGTACAG CGTCTCGGGT
GTCGACGGCG ACTGA
 
Protein sequence
MADTTIQLGI VGGYMVLAAA IGVVAYRLTD RTAEDYYLAS RTFGTVVLLF TTFATLLSAF 
TFFGGPNLTF AQGPEWLLVM GLMDGIIFAV LWYVLGYKQW LVGQRHGYVT LGEMLGDRFG
SRLLRGLVAA ISLFWLFPYV MLQQKGAGQA VVGLTDGAVP FWVGAGGITL FMIVYVAVSG
MRGVAWTDTL QGIVMLGLIW AAVAWILSAV GGPAAATDRL AETNPEFLAL GGGLYTPEYV
LSTAISIAFG VTMFPQINQR FFVARSQKVL KRTLALWPVL VVLLFVPAFM LGAWAAGLGV
TVPENGNVIP AVLNEYTAGW FTAAVVAAAL AAMMSSSDSM LLSGASYLTR DLYRPVTDLA
EEEPTLPDRA SLVNRVRRSL LAVAVSVGRT LHSDRDRETL LARAGVVVFA TVSFVASLYA
PGTLVQIGDT AFGGFAQLAL PVIVALYWPR TTRWGMYAGV GGSQLFYLAS VFLPFVPGSY
LGGWSASVVC MALGLVLTVG VSLVTSASPG EDAGLYSVSG VDGD