Gene Hmuk_2949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2949 
Symbol 
ID8412502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2832660 
End bp2833865 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content69% 
IMG OID645021296 
Productmajor facilitator transporter 
Protein accessionYP_003178761 
Protein GI257388988 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.163593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.476505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACC CGCCAGTCGC CGAGTCGGAG CGACACCGTC CCGAGACCGA CCAGCAGTTC 
GAGATCGTCG ACGGTGTGCG CGTCGACCGC ACGCTGACGG ATCTCTTCGC GGCCGTCGCA
TCGGTGGCGA CCTTCGACGG AGCACTGTAC GTCGTTGTTC GATTTCTCCC GGAGTACCTC
AGCGTGCGTG GCGTCGGTCC GGTCGCGATC GGTGCGTTCG GCACGTTCTG GCTCGCGATC
ACTGCGTTGG ATCGCCGAGA CACCGGTAGG CTTCCAGTGC TCGTGGCTGG TGCGACCGCC
GGAGTGCTGG CCTGGCTGGT CGCTCCGACG GTCACGGACA GTACGCTAGC TCTCTGGGTG
GCCGTCACAG CCGGTGGGCT CGGTCCGGCG GCCTGGTACT GGTCTCGCAC CGCGATCGGA
GTCCCGCTCG CGGACGCCTG GCCACCTGCG TCGAGATCGA CTCGTCGACC CGGCGGACCC
GTCTGGGTGC TCGGCCCACT CGTGCTCTCG ATCATCGTGT TAGTGCTCTC GGCGTCGTTT
TCCGCGGGCT TTCGCATCGT CCTCGCACTG ACTGTCGCGC TGGGTGGGAC GGCGGGCGCG
CTCTGGCTGT CGCTCGACGA CACCGGATCG CAGCCCTCGT CACGGCGTGT CCGCACTGTC
CCCCACCTCG GGGATTCCCT GCTGGAGGTG GCGATGGGCG TGGTCTCCGT GTTCGTCGTC
TTGGTCGTCA CGAGGGTTCT CGACGTCGAG CTGGCCGTTC TCGGCATCCA GTTGGGATCG
GCGGCGACCT TCGGGCTCCT CTTGCTCGTC GAGATCGTCG CCGGAGCACT GGCCCGCACC
GCTGGCCCCC GACTCGTCGG TTCCATCGGG TCGAGACCAC TCCTCGTCTA CGGGAGCCTC
GTCGTCGCGG CGTTCCCCCT CGTCCTCGTG AGTGTGCCAC CGACCCCGCT GGCCGTGGGC
GGTCTCTTCG CGATCTACGG CACTCGCTCC CTCGCGGGCG TCGCCCGTCG TGCCGGTGGT
GCCATTTGCC GACCGGCCAG CGACCGTCGT CGAACCGTCG TCGTCGCCGC TGGACCGCTA
CTTGGTGGCG TCCTCTTCGC CGTCGACCCC GTCCTCGCGT TCGGGTCCGC GACTGCGATC
GGTGCCGTCG GTGTCTGGGA ACTCGCGCGG ACACACGTCA CGGGAGCCGG GTGGCGAGAC
CGATGA
 
Protein sequence
MDDPPVAESE RHRPETDQQF EIVDGVRVDR TLTDLFAAVA SVATFDGALY VVVRFLPEYL 
SVRGVGPVAI GAFGTFWLAI TALDRRDTGR LPVLVAGATA GVLAWLVAPT VTDSTLALWV
AVTAGGLGPA AWYWSRTAIG VPLADAWPPA SRSTRRPGGP VWVLGPLVLS IIVLVLSASF
SAGFRIVLAL TVALGGTAGA LWLSLDDTGS QPSSRRVRTV PHLGDSLLEV AMGVVSVFVV
LVVTRVLDVE LAVLGIQLGS AATFGLLLLV EIVAGALART AGPRLVGSIG SRPLLVYGSL
VVAAFPLVLV SVPPTPLAVG GLFAIYGTRS LAGVARRAGG AICRPASDRR RTVVVAAGPL
LGGVLFAVDP VLAFGSATAI GAVGVWELAR THVTGAGWRD R