Gene Hmuk_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2233 
Symbol 
ID8411773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2150175 
End bp2151500 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content66% 
IMG OID645020576 
Producthypothetical protein 
Protein accessionYP_003178053 
Protein GI257388280 
COG category[S] Function unknown 
COG ID[COG2718] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.960863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.309285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTGA GAGACGACCT CGAACGGTAC CGTGAAGTCG GCGAGCAACG CCGCCAGGAC 
CTCGCCGAGT TCATCCAGTA CGGCGACCTC GGCCAGAGCC GACAGGACTC GGTGCAGATC
CCGATCAAGA TCGTCGACCT CCCCGAGTTC GAGTACGACC AGCGCGACAT GGGCGGCGTC
GGTCAGGGCC AGGACGGCCA GCCCCAGCCC GGCGATCCGG TGGGCCAGCC CCAGCCAGAC
GACGACGGCG ACGAGGACGG CGAGCCCGGC GAGCCCGGCG AGGACGGCGG CGACCACGAG
TACTACGAGA TGGATCCCGA GGAGTTCGCC CAGGAACTGG ACGAGCAACT CGGACTCGAC
CTCGAACCGA AGGGCAAGCA GGTGATCGAG GAGGTCGAGG GCGACTTCAC CGACATCACC
AGGTCCGGCC CCTCCTCGAC GCTGGACTTC GAGCGGCTGT TCAAGCAGGG CCTCAAGCGA
AAGCTCGCGA TGGACTTCGA CGAGGCGTTC GTCCGCGAGG CGCTGAAAGT CGACGGCTGG
GGGCCGGCGA CCGTCTTCGA GTGGGCCCGC GAGCAACACA TCCCCGTCTC GAAGGCCTGG
ATCGAGGAGG CCTACGCGGA GTTGCCGGCC GACGAGAAGG CCGTCTGGGA CAGCATCGAC
GAGATGACCG ACGCGGTCGA CCGAGAGAGC ACCGCCAACC GGATCCGACG GGAGGGCGTC
GACCAGATTC CGTTCCGACG GGAAGACGAG CGATACCGCT ACCCGGAGAT CGAGGAGGAG
CGCGAGAAGA ACGTCGTCGT CGTCAACATC CGCGACGTGT CCGGGTCGAT GCGCCAGAAG
AAACGAGAGC TCGTCGAGCG GACGTTCACG CCGCTGGACT GGTACCTCCA GGGCAAGTAC
GACCACGCCG AGTTCGTCTA CATCGCCCAC GACGCCGACG CCTGGGAGGT CGATCGCGAC
GAGTTCTTCG GCATCCGCTC TGGCGGCGGG ACCCGCATCT CCAGCGCGTA CGAACTCGCG
CTGGCACGGC TCGAAGAGGC CTACCCCTGG GCCGACTGGA ACCGCTACGT GTTCGCGGCC
GGCGACTCGG AGAACTCCTC GAACGACACC GAAGAACACG TCATCCCACT GATGGAGGAG
ATCCCGGCGA ACCTCCACGC CTACGTCGAG ACCCAGCCGT CGGGCAACGC GATCAACGCG
ACCCACGCCG AGGAAGTGGA GCGCCACTTC CGCGAGACCG ACGACGTGGC CGTGGCCTAC
GTCTCCAGCC CGGAGGACGT GACCGACGCG ATCTACGAGA TCCTGAGCAC GGAGGACGAA
TCATGA
 
Protein sequence
MGLRDDLERY REVGEQRRQD LAEFIQYGDL GQSRQDSVQI PIKIVDLPEF EYDQRDMGGV 
GQGQDGQPQP GDPVGQPQPD DDGDEDGEPG EPGEDGGDHE YYEMDPEEFA QELDEQLGLD
LEPKGKQVIE EVEGDFTDIT RSGPSSTLDF ERLFKQGLKR KLAMDFDEAF VREALKVDGW
GPATVFEWAR EQHIPVSKAW IEEAYAELPA DEKAVWDSID EMTDAVDRES TANRIRREGV
DQIPFRREDE RYRYPEIEEE REKNVVVVNI RDVSGSMRQK KRELVERTFT PLDWYLQGKY
DHAEFVYIAH DADAWEVDRD EFFGIRSGGG TRISSAYELA LARLEEAYPW ADWNRYVFAA
GDSENSSNDT EEHVIPLMEE IPANLHAYVE TQPSGNAINA THAEEVERHF RETDDVAVAY
VSSPEDVTDA IYEILSTEDE S