Gene Hmuk_1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1223 
Symbol 
ID8410743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1163556 
End bp1165244 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content66% 
IMG OID645019555 
ProductNa+/solute symporter 
Protein accessionYP_003177052 
Protein GI257387279 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR03648] probable sodium:solute symporter, VC_2705 subfamily 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.704891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGC TCCTCCTCCA GTCGGGCCTG CTCCCGGAGG GGCTGGAGAT TTCCTTCAAG 
CTCATCCCCG GGATCATGGT CACGGCCATG CTCTTGCTCT TTCTGGCGGT CGGCTACGTG
TTCAAGGTGG CAGACACCGA GGGCATGTGG GTCGCCGGCC GCTCGATCGG CAACGTCGAG
AACGGGATGG CGATCGGCGC GAACTGGATG TCTGCGGCCT CCTACCTCGG GATGGCCGCG
ACGATCGCGC TGTTTGGCTT CTACGGACTG GCCTTCGTCG TCGGCTGGTC GACGGGCTAC
TTCATCCTCC TGATCTTCAT GGCCGCCCAG ATGCGCCGGT TCGGGAAGTA CACCGCGCCG
GACTTCGTCG GCGACCGGTT CAACTCCGAC AGCGCACGCG CGATGGCGGC CGTGACGACG
TTCCTCATCG GGTTCGTCTA CGCGATCGGT CAGGCCCGCG GAATGGGGCT GGTCGGACTG
TACATCTTCG GCGACATCGG CATCCCCGGG CTCTCGGGGT ACCAGTCGAT GGTCGTGTTG
ATGATGGCGA TCACGGTCGG CTACCTCACG CTGTCGGGCA TGCTGGGCGC GACCAAGAAC
ATGACCGTCC AGTTCGTCAT CCTCATCGTC GCGTTCCTGG CCGGCCTCTA CGCGGTCGGG
TACACCCAGG GGTACTCGAC GGTGTTGCCC CAGCTCGAGT ACGGTCGACT GATCGGCGCG
CTCAGCGCCG AGTTCAGCGA GCCGTTCACC ACCGAGAGCT ACTACACGTG GATCGCGACG
GCCTTCACGC TCGTCGTCGG GACCTGTGGC TTGCCCCACG TACTGGTGCG GTTCTACACG
GTCGAGAGCG AGCGGACGGC CCGCTGGTCG ACGGTCTGGG GCCTCTTTTT CATCTGCCTG
CTGTACCTGA GCGCGCCGGC CTTCGCCGCC TTCGGGACCG ACCTCTACGC CAACGAGATC
GGTGCCGTCT ACGGCGATCC CGGGATGTCC AGTGCCGCGG GCGACGTGAT CGTCGTGCTG
GCGACCCAGC TGGCGGGACT GCCAGAGTGG TTCGTCGGCC TCGTCGCTTC CGGCGGGATC
GCCGCCGCGA TCGCGACGAC CGCCGGCCTC TTTATCGCCG GCTCCTCGGC GATCTCTCAC
GACATCTACA AGGGACTGAT CAACCCCGAC GCGACTCAGC GCCAGCAGGT GTTGGTCGGT
CGCCTGAGCA TCGTCGCGCT GGGCGTCATC ACGACGCTCG CGGCGCTGGA TCCCGCAGCA
CCGATTGCCG CGCTGGTGAC CTACGCGTTC TCGCTCGCGG GCTCCGTGCT CTTCCCGATG
TTCTTCCTCG GGCTCTGGTG GGAGAACACG AACCGACAGG GCGCACTGGC CGGGATGTCG
ACCGGCCTCG TCGTCTGGCT CATCCCCATG GTCAACGAGG TGGTCCCGAG CTACGGGCTC
CTCGCGGGTG CAGCCGGCTC TGACGGCGTG CTCTCGGCCA CCCTCGCACA GTGGCTCCCG
GCGATCGGCT CGGCGCTCGT CGCCGCACCG CTGGTGTTCG TCGTCACGAT CGCCGTCTCG
ATGGCCACAG AGGAACCGCC ACTGGAGACC AAGCGGATGG TTCGCCAGTG TCACAGTCCG
GAACCGATGG GACAACAGCA GACGGCCGAA GAGGTCGTCA GCGGTGCGGA AACGCCGGGT
GATGACTGA
 
Protein sequence
MSGLLLQSGL LPEGLEISFK LIPGIMVTAM LLLFLAVGYV FKVADTEGMW VAGRSIGNVE 
NGMAIGANWM SAASYLGMAA TIALFGFYGL AFVVGWSTGY FILLIFMAAQ MRRFGKYTAP
DFVGDRFNSD SARAMAAVTT FLIGFVYAIG QARGMGLVGL YIFGDIGIPG LSGYQSMVVL
MMAITVGYLT LSGMLGATKN MTVQFVILIV AFLAGLYAVG YTQGYSTVLP QLEYGRLIGA
LSAEFSEPFT TESYYTWIAT AFTLVVGTCG LPHVLVRFYT VESERTARWS TVWGLFFICL
LYLSAPAFAA FGTDLYANEI GAVYGDPGMS SAAGDVIVVL ATQLAGLPEW FVGLVASGGI
AAAIATTAGL FIAGSSAISH DIYKGLINPD ATQRQQVLVG RLSIVALGVI TTLAALDPAA
PIAALVTYAF SLAGSVLFPM FFLGLWWENT NRQGALAGMS TGLVVWLIPM VNEVVPSYGL
LAGAAGSDGV LSATLAQWLP AIGSALVAAP LVFVVTIAVS MATEEPPLET KRMVRQCHSP
EPMGQQQTAE EVVSGAETPG DD