Gene Hmuk_1654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1654 
Symbol 
ID8411177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1576754 
End bp1578004 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID645019981 
Producthypothetical protein 
Protein accessionYP_003177475 
Protein GI257387702 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTCG ACCCGGTGCA CGTCGACGGG ATCGCGCGCC TCGCCGGGCA GATCGAGCAG 
GGGGTCGACG AACAGGACCA CCGCGCGTTC GCCGAGACCG TCTGGCAGGA GTTTCTGGAC
CCGCTCGTGG TCGACGGCCG GGCGGTGCTG GAGCCGATCG ACGAGCAGCG CCGCCGCAAG
GTCGACATGC AGGACGCCGC GCTCCAGGCG ACGCCGTTTC CCACCCAGCA CGGGCTGGAC
TCGGGGACGA TCAACCCAAC GACGTTCAAA AACGGCGTCG TACTGGACGT TGCCCAGGCC
GCGATGAGCG CCGTCCCCTC CGACCTGGAG TTGCACCGCG GCCGGACGAT CGTCATGACC
GCCCACTCGA ACGACGCCAC GGCCAGCCTG ACCGAGGACT GGCGGATGGA CGACCAGGGG
TACGCCCGCC AGCGAGTCCT CCAGACGCCA CGGGTCGACC GGTACGAACA GGACGTGGTC
CACGCGCTCG CGCTGTACCT CGCCGAGAGC GAGCACGCCC TCCAGCAGGC CGACGTGGTC
GAGGACCTGC TCGTCCTCGA CGGTCCGATA TACCCGACCG GGCTCCTCCA GTGGGCCGAC
CACGATCCCG AACTCGCCGA GTTGCTCGTC AAGGAGGACG TGCTCGCCGT CGTCGAGAAC
TACGTCACCC TCGTCGAGAC GTTCCTCGAT CGAGATGTCC CCCTGATCGG CTTCGTCAAG
TCGACCGGCT CGAAGGCGCT GAGCCGCGCC GTCCGCAAGA ACCAGGGGCA GGCTCCGTGG
GCCAACGACG CCGCCTTCTT CGAGAAGCTT CTCGCGCGGC GCGACGACGA CGGCGAACTC
CTGACCGACG CACTGACCTG CACGAACTGG TTCCGGTCGC GGATCGGCAC CGACCGCCCG
CTCTCGACGG ACGGCGACGC GCTGGGGATC GACCGCGAAC TCGATCCGGC CGCCTACGAG
GTGACGTTCT TCGTCGTCTA CGATCCCCGC GAGGACGTGC TGTACCGCGT CGAAGCCCCC
TACGGCGTCA CCCGCGACGC GGAGACGCGG GACGCGATCA CCCGCCAGGT GCTCGCCGAC
GTGGCCGCCG AGCGCGGCCC GCCGCTGTCG GTCGCCAAGG CCGACGAACT CGCCCGGATC
GGGCGCGACG AGAAGGCGGC GCTCCGGGAG AAAATCGAGA GCCAGCTCGA CAGCGACCGC
CAGCGCGAGT ACAACGACGT GCGCTGGGGG GCCGACTTCG AGGAGCTGTA G
 
Protein sequence
MTLDPVHVDG IARLAGQIEQ GVDEQDHRAF AETVWQEFLD PLVVDGRAVL EPIDEQRRRK 
VDMQDAALQA TPFPTQHGLD SGTINPTTFK NGVVLDVAQA AMSAVPSDLE LHRGRTIVMT
AHSNDATASL TEDWRMDDQG YARQRVLQTP RVDRYEQDVV HALALYLAES EHALQQADVV
EDLLVLDGPI YPTGLLQWAD HDPELAELLV KEDVLAVVEN YVTLVETFLD RDVPLIGFVK
STGSKALSRA VRKNQGQAPW ANDAAFFEKL LARRDDDGEL LTDALTCTNW FRSRIGTDRP
LSTDGDALGI DRELDPAAYE VTFFVVYDPR EDVLYRVEAP YGVTRDAETR DAITRQVLAD
VAAERGPPLS VAKADELARI GRDEKAALRE KIESQLDSDR QREYNDVRWG ADFEEL