Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2934 |
Symbol | |
ID | 8412487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2821701 |
End bp | 2823062 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645021281 |
Product | Protein of unknown function DUF516 |
Protein accession | YP_003178746 |
Protein GI | 257388973 |
COG category | [S] Function unknown |
COG ID | [COG1650] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0227101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGCCG TCGTCGTCTC GCGGTCGGAC TCGGCCTCCG AACACGTGGG AGAACGACTG CTCGATCTTG TCGAGTGGAC GGAGACCGTC GACGAGGAGC GGCCGGACGG CGACGGTGGC GGCACCGTCT ACCGTCGGGA CGAGATCGAA CTCCGAACGT TCGACTCGAT CCACCTCGAC CTCGAATCGG TCGCGACGGC CTTCGACGAC CCCGACCTGC TCGTCTTCGC GTCGCGCCAC GCCGGTGAGA CGGGGCCGCT CCTGACCGCA CATCACACGG GAAACTTCGG GCCGGCGGAG TTCGGCGGCG CGGACGGCGT CCTCGCCCGT GCCTGTCCGA ACGCACACCG CAGAGTCGTC GAGGCTCTCG AATCGTTCGC GCCCGAGGGC TACGAGGTTG GGATGGAGGC GACCCACCAC GGCCCCAGCG TCGTCGGCGC TCCCTCGATG TTCGTCGAAG TCGGGAGCGA CGAGCCCCAG TGGGACGACG CCGACGCCGC GCGGGCAGTC GCGCGAGCGA TCCTCGCGCT CGAAGACACC GAGCCCGACG CGTCACGTGA GAACGGAACG CGTCGCCACC TCGTGGGCTT TGGCGGCGGC CACTACGCGC CCAGGTTCGA GCGCGTCCGC CGCGAGACCG ACTGGGCGAT CGGCCACGTC GGTGCCGACT GGGCGCTGGA CGCGATGGGC GATCCCCGGG ACAACCCGGA CGTGATCGAG CGGGCCTTCG AGCAGAGCCG TGCCGACTAC GCACTGCTAG AGGCCGAGCG GCCGGCACTG CGCGAGACGA TCGAAGGACT GGGTTACCGC GTCGTCGACG AGACGTGGGT TCGCGAGACC AGCGGCGTCT CGCTGGGACT GGTCGATCAA CTGGAAGCCG CGATCGGACC GATCGACGAC GGCCTCCGGT GTGGCGAGCC CGCGAGAAAC CACGACGGCG AGTTCGTCGT CTGGGACCCG TCTGGGGAGT TGCTGGCCAG GGCGAGCGGG ATCGACCGGG AACGGACCCG AGCGGTCGTC GCTCGGACGG CGCTGGCGTT CGACACCGAA CAGAACGGGA CCGAAGTCGT CGGCCCCGTG GCGCTGCCAG CGCCCGACGA CCGCGACCCC ATCGTCGAGG GACTCACCGC GGTCCTGGCA CCGTCGTACG ACGAGGTCGT TCGGGACGGC GACTGTCTCC GGGCGCGCGA AACGGCCTTC GATCCGTCGC TGGCCCGCGA GCACGGCGTT CCGGAAGGCC CGAAGTTCGG ACAGCTATCG GCCGGCCGCT CGGTCGAGGT CGACGGCGAG ACCGTCGATC CCGCGGACGT CGTTCGAGAG CGCGTCGACG AGTTCGAACT CGCAGAAGTA CGGTCTCCGT AA
|
Protein sequence | MLAVVVSRSD SASEHVGERL LDLVEWTETV DEERPDGDGG GTVYRRDEIE LRTFDSIHLD LESVATAFDD PDLLVFASRH AGETGPLLTA HHTGNFGPAE FGGADGVLAR ACPNAHRRVV EALESFAPEG YEVGMEATHH GPSVVGAPSM FVEVGSDEPQ WDDADAARAV ARAILALEDT EPDASRENGT RRHLVGFGGG HYAPRFERVR RETDWAIGHV GADWALDAMG DPRDNPDVIE RAFEQSRADY ALLEAERPAL RETIEGLGYR VVDETWVRET SGVSLGLVDQ LEAAIGPIDD GLRCGEPARN HDGEFVVWDP SGELLARASG IDRERTRAVV ARTALAFDTE QNGTEVVGPV ALPAPDDRDP IVEGLTAVLA PSYDEVVRDG DCLRARETAF DPSLAREHGV PEGPKFGQLS AGRSVEVDGE TVDPADVVRE RVDEFELAEV RSP
|
| |