Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2023 |
Symbol | |
ID | 8411554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1927166 |
End bp | 1928305 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645020357 |
Product | protein of unknown function DUF354 |
Protein accession | YP_003177843 |
Protein GI | 257388070 |
COG category | [S] Function unknown |
COG ID | [COG1817] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0244531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.923484 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGGA GACACGCCTC GGCCGTAACA GCCACAGACA CGCCACGGAA CGCTACCGTC CCCCCCGATC AAAACGGAGT CGGGACGCCA CCAGCGGAAC AGACAGAGCG GCTTCGGGTC CTGATCACGA TTCAGCACCC CGCCCACGTC CACTTCTATC GCCCCGTCAT CGAGGAGCTG ACCGCCGACG GGCACGAGGT ACGCGTCTGT ACGCGCGAGA AGGACGTGGC CACCGAGTTG CTGACCGCCT TCGACATCGA GCACAGCGTG CTCGCCGGTT CGGGCCAGTC GACGCTGGGA CGGATCGGCG TGCAGGCGCT GTACGAGCTG CGCTTGCTGG CGACGGCGCG GCGCTTCCAG CCCGACGTGG TCACCGCGAT CGGCGGGGTC GCAGCGGCCC ACGTGGCGAC CGCCCTCGGA GCCAAGAGCG TCGTCCTGAC CGACAGCGAG CCGGCGGCGC TGACCAACCG CCTCGCCTTC CCCTTCGCGG ACGTGGTCTG TACACCGGCG TGGTACGACG GAGAGGTCGA CGCGCGCCAC GTGCGCTACG AGGGGCTCCA CGAACTTGCC TACCTCCACC CCGAGCGGTT CGAGCCCGAC CGGGAAGTGG TCAGGGCAGC CGGAGTGGAC CCGTCGGCCG ACTACTCGCT GGTCCGGTTC GTCTCCTGGG ACGCACAGCA CGACGTGGGC GAGGGGGGCC TCTCGCGGGC CGGCAAGGAG GCGATCGTCG AGACGCTCGC CGAGGCCGGC GAGGTGTACA TCACGTCCGA GGAGCCACTC GACGCGCCCC TCGACCAGTA CCGACTCCCG GTCGCGCCGG CCGACCTCCA CCACCTGCTG GCGTTCGCGC AGTTGTACGT CGGCGACTCC CAGACGATGG CGACGGAGGC GGCGTTGCTC GGGACGCCGG CCGTGCGCTG TAACTCCTTT GCGGGCGGCG ACGACATGGC GAACTTCCGG CGACTCGCAG AGTACGGGCT CGTGTTCTCG ACCGACGACG AGGCGGAGGC CCACGACCGA ATCGAGACGC TGCTCGCGGA CGACGACGGC GACTGGGACC GCCGCCGCGG CGAGTTCGTC GCAGAGACCA TCGACGTGGC CTCGTTCGTC CGGGACGTGC TCGTGGCGGT GGGAACATGA
|
Protein sequence | MRRRHASAVT ATDTPRNATV PPDQNGVGTP PAEQTERLRV LITIQHPAHV HFYRPVIEEL TADGHEVRVC TREKDVATEL LTAFDIEHSV LAGSGQSTLG RIGVQALYEL RLLATARRFQ PDVVTAIGGV AAAHVATALG AKSVVLTDSE PAALTNRLAF PFADVVCTPA WYDGEVDARH VRYEGLHELA YLHPERFEPD REVVRAAGVD PSADYSLVRF VSWDAQHDVG EGGLSRAGKE AIVETLAEAG EVYITSEEPL DAPLDQYRLP VAPADLHHLL AFAQLYVGDS QTMATEAALL GTPAVRCNSF AGGDDMANFR RLAEYGLVFS TDDEAEAHDR IETLLADDDG DWDRRRGEFV AETIDVASFV RDVLVAVGT
|
| |