Gene Hmuk_1667 details


Gene Information

Locus tag: Hmuk_1667
Symbol:
ID: 8411190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1591570
End bp: 1593498
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 66%
IMG OID: 645019994
Product: hypothetical protein
Protein accession: YP_003177488
Protein GI: 257387715
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1361] S-layer domain
TIGRFAM ID: [TIGR01451] conserved repeat domain
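
The start, end, gene length, and protein length values above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, but both match its figures. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1591570, 1593498)
print(length)                     # 1929
print(protein_length_aa(length))  # 642
```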


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGAC GCACCCTTCT TCCGATAGCC GTCGTTGTAT TGCTCGTCAC GTCGGGACTC 
GCCACGGCAG CGGTCACCGG CAGTCCCGAT ATCAGCGTTC ACGTGGCCGA CGACACCCTC
GCACCCGGCG AGGAGTCGAC GCTGGACGTC GTCCTCGTCA ACAGCGGCGA CCTCGACTCG
GGGTCGGCGC GAAACCCCGC GCTCAACAGC GAGGTGACGA CCGCTCGCGG AACGACCGTC
AGCGTCGACA ACGGAAACGC TCCGCTGACC GTCGAGACGG GTGAGCAGGC CGTCGGGTCG
CTTCCGGAGG GATCGACGCG CGAGCCGCTC CCGTTCAAAG TCTCCGTTCC GGACGACGCC
GAACCCGGAA CCTACGACGT GCAGGTCACC GTCGAGTACG ACTACTACAG CTACGTCTCG
GAGGCGTCGG GTTCGCGGGA CCGCGAGTCC GAGAGCCGGA CCTACGACGT GCAGGTGAAA
ATCGAGGAGT CGGCCCGACT CGACGTCACC GACGTAGACA CGACGACGCG GGTCGACGGG
ACCGGCACCG TCGAACTCAC CGTCGAAAAC AACGGGTCGG CGACGGCCCG AGACGCGACC
CTCTCGCTGG CTTCCCCCAG CGCCGACCTG ACGCTCGGCC AGTCCGCGAG CGCAGAACGA
TACGTCGAAC AGTGGGAACC CGACGAACAG CGGACCTTCA CCTACCGGCT GAGTGCCGCC
CAGACCGCCG AGTCCGAACC GTACCCGTTC ACGCTGACGG CGGACTACGA ACTCGGGAGC
GGCGAGTCAC GCACGAGCCA GCCGGTCACC GTCGAAGTGA CCCCGGAGCC AGAACAGGAG
TTCACCGTCA GCTCGATCGA CAGCACCGTT CCGGTCGGTG ACAGCGGTGA CTACACCGTC
ACGCTTCGCA ACGACGGATC GGAGACGCTC ACGGACGCCT CGGTCCAGCT GACCTCACAG
AACGCAGACA TCACGTTCGG CCAGTCGAAC AGCGCCAGCC AGTACGTCGA GGAGTGGGAA
CCGGGCGCAG AACGGACTCT CACCTTCGAC GCTCGGGCCG GTGACGATGC CGAGCGGCGC
AACTACACGG TCGACGCGAC GGTCGAGTAC GATCGACCGG ACGACTCCGC CACCCATCGA
CAGGACGTGT CCCTCTCGCT GCGGCCGGTC GCCGAGCAGA CCTTCGCCGT CGACACCGTC
GACGCGGACC TGGAAGTCGG TGACGACGGG ACGCTGTCGG CCGAGTTGAC CAACACCGGC
CCCCGCACCG CCGAAGACGT CGTCGTGGTG TGGGCCAGCG AGCAGCGCAA CGTCGATCCG
ATCGAAACGG AGTACTCGAT CGGGAACCTC GACGCGGGCG AGTCCGCGTC GTTCGACTTC
GACGTGGACA TCAGCGACAG CGCGCGCAGC GGCCCCCGAC AGTTCACGCT CCAGACGCGC
TACACCAACG ACGACGACGA ACAGCGCACC GGTGACTCGA TGAACGTCCG TGCCGACGTC
GCGCCAGAGC GCGACGAGTT CGACGTAGCC ATCCGGAGTG CAAACGTCTC TGCCGGCGAC
GGGACGGAGC TCACCGTCGA GATCACGAAC GCCAAGAACC AGACGCTCAG CGACATCAAG
GCCAAGATCT TCGCCGACTC GCCGATCTCG GCCAACGACG ACGAGGCGTT CGTGGACGAA
CTCTCACCGG GCGAGTCCCG GGAGATCACC TTCTCGATCA GCGCGGGCAG CAGTGCGCTC
TCGAAGCCCT ATCCCGTCTC GATGGACTTC CAGTACGACG AAGCCGACGG TGACACGGTC
ACCTCGGACA CGTACAACAT CCCGGTCGAG GTCGAAGAGT CGAGCGGCGG CAGTTCACCG
CTCGTGTTGA TCGGTGTGGT GGTCGTCCTG ATCGTCGCTG CCGTGGGCGG CTACGTTCGG
TTCCGATAG
 
Protein sequence
MNGRTLLPIA VVVLLVTSGL ATAAVTGSPD ISVHVADDTL APGEESTLDV VLVNSGDLDS 
GSARNPALNS EVTTARGTTV SVDNGNAPLT VETGEQAVGS LPEGSTREPL PFKVSVPDDA
EPGTYDVQVT VEYDYYSYVS EASGSRDRES ESRTYDVQVK IEESARLDVT DVDTTTRVDG
TGTVELTVEN NGSATARDAT LSLASPSADL TLGQSASAER YVEQWEPDEQ RTFTYRLSAA
QTAESEPYPF TLTADYELGS GESRTSQPVT VEVTPEPEQE FTVSSIDSTV PVGDSGDYTV
TLRNDGSETL TDASVQLTSQ NADITFGQSN SASQYVEEWE PGAERTLTFD ARAGDDAERR
NYTVDATVEY DRPDDSATHR QDVSLSLRPV AEQTFAVDTV DADLEVGDDG TLSAELTNTG
PRTAEDVVVV WASEQRNVDP IETEYSIGNL DAGESASFDF DVDISDSARS GPRQFTLQTR
YTNDDDEQRT GDSMNVRADV APERDEFDVA IRSANVSAGD GTELTVEITN AKNQTLSDIK
AKIFADSPIS ANDDEAFVDE LSPGESREIT FSISAGSSAL SKPYPVSMDF QYDEADGDTV
TSDTYNIPVE VEESSGGSSP LVLIGVVVVL IVAAVGGYVR FR
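
The gene and protein sequences above are printed in blocks of ten characters, so any check against the reported GC content or protein sequence first has to strip the whitespace. The sketch below is one way to do that using only the Python standard library. It is not the database's own pipeline, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the amino-acid assignment of the standard genetic code, which translation table 11 shares; table 11 differs in which codons may act as start codons, and that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in TCAG order: amino-acid assignments of the standard code,
# which NCBI translation table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon; stop codons appear as '*'."""
    seq = clean(seq_text)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks of the gene sequence above.
print(translate("ATGAACGGAC GCACCCTTCT TCCGATAGCC"))  # MNGRTLLPIA
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should give a string that can be compared with the protein sequence, and `gc_fraction` on the same input can be compared with the reported GC content.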