Gene Hmuk_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2023 
Symbol 
ID8411554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1927166 
End bp1928305 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content70% 
IMG OID645020357 
Productprotein of unknown function DUF354 
Protein accessionYP_003177843 
Protein GI257388070 
COG category[S] Function unknown 
COG ID[COG1817] Uncharacterized protein conserved in archaea 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0244531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.923484 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGGA GACACGCCTC GGCCGTAACA GCCACAGACA CGCCACGGAA CGCTACCGTC 
CCCCCCGATC AAAACGGAGT CGGGACGCCA CCAGCGGAAC AGACAGAGCG GCTTCGGGTC
CTGATCACGA TTCAGCACCC CGCCCACGTC CACTTCTATC GCCCCGTCAT CGAGGAGCTG
ACCGCCGACG GGCACGAGGT ACGCGTCTGT ACGCGCGAGA AGGACGTGGC CACCGAGTTG
CTGACCGCCT TCGACATCGA GCACAGCGTG CTCGCCGGTT CGGGCCAGTC GACGCTGGGA
CGGATCGGCG TGCAGGCGCT GTACGAGCTG CGCTTGCTGG CGACGGCGCG GCGCTTCCAG
CCCGACGTGG TCACCGCGAT CGGCGGGGTC GCAGCGGCCC ACGTGGCGAC CGCCCTCGGA
GCCAAGAGCG TCGTCCTGAC CGACAGCGAG CCGGCGGCGC TGACCAACCG CCTCGCCTTC
CCCTTCGCGG ACGTGGTCTG TACACCGGCG TGGTACGACG GAGAGGTCGA CGCGCGCCAC
GTGCGCTACG AGGGGCTCCA CGAACTTGCC TACCTCCACC CCGAGCGGTT CGAGCCCGAC
CGGGAAGTGG TCAGGGCAGC CGGAGTGGAC CCGTCGGCCG ACTACTCGCT GGTCCGGTTC
GTCTCCTGGG ACGCACAGCA CGACGTGGGC GAGGGGGGCC TCTCGCGGGC CGGCAAGGAG
GCGATCGTCG AGACGCTCGC CGAGGCCGGC GAGGTGTACA TCACGTCCGA GGAGCCACTC
GACGCGCCCC TCGACCAGTA CCGACTCCCG GTCGCGCCGG CCGACCTCCA CCACCTGCTG
GCGTTCGCGC AGTTGTACGT CGGCGACTCC CAGACGATGG CGACGGAGGC GGCGTTGCTC
GGGACGCCGG CCGTGCGCTG TAACTCCTTT GCGGGCGGCG ACGACATGGC GAACTTCCGG
CGACTCGCAG AGTACGGGCT CGTGTTCTCG ACCGACGACG AGGCGGAGGC CCACGACCGA
ATCGAGACGC TGCTCGCGGA CGACGACGGC GACTGGGACC GCCGCCGCGG CGAGTTCGTC
GCAGAGACCA TCGACGTGGC CTCGTTCGTC CGGGACGTGC TCGTGGCGGT GGGAACATGA
 
Protein sequence
MRRRHASAVT ATDTPRNATV PPDQNGVGTP PAEQTERLRV LITIQHPAHV HFYRPVIEEL 
TADGHEVRVC TREKDVATEL LTAFDIEHSV LAGSGQSTLG RIGVQALYEL RLLATARRFQ
PDVVTAIGGV AAAHVATALG AKSVVLTDSE PAALTNRLAF PFADVVCTPA WYDGEVDARH
VRYEGLHELA YLHPERFEPD REVVRAAGVD PSADYSLVRF VSWDAQHDVG EGGLSRAGKE
AIVETLAEAG EVYITSEEPL DAPLDQYRLP VAPADLHHLL AFAQLYVGDS QTMATEAALL
GTPAVRCNSF AGGDDMANFR RLAEYGLVFS TDDEAEAHDR IETLLADDDG DWDRRRGEFV
AETIDVASFV RDVLVAVGT