Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0108 |
Symbol | |
ID | 8409605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 105925 |
End bp | 107850 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 645018433 |
Product | conserved repeat domain protein |
Protein accession | YP_003175953 |
Protein GI | 257386180 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCGCG GGCGTCTCCT CGCGGCCGTC GGGCTCGTGG CCTTCGCGGT CGGCGTCGGA GCGCTCGCCG CGCCGGGGGT GTTCGGGCTC GGCGTCCAGC GCTACGCCGT CGTCGTGATC GGTCTGCTGG CGGTCGCCGC GGCGATTCGC GTCGTGCAGG GACGACGCCA CAGCCCCAGA CGGCGCGGCG AGACGGCGAC GCCGGAGGAG CTGCCCGCGG TCGCCTCGCC GGGCGAGGAG CTGGACGCCG TGCTGGCGGC GTTCGACCCC ACGCGCTACG GGACGGCCGA CCGCCGCCGG AAGCAACTCC GGCGCGTGGC CACCGAAGTG CTGACGCGCT ACCGGGGCGA CAGCGAAGCG ACGGCCAGCG AGGCGATCGA GCGGGGCACC TGGACCGACG ATCCAGTCGC CGCCGAGTTC CTCGCAGACG AGCGCTCGCG GCTCCCGCTC ACCGATCGAC TCCGCACGCG CCTGGGCGGG CAGTCGGCCT ACTGGCAGGG CATCGAGCAC ACCGCTCGCG CGATCGCCGA GACGGCCGGC GTCGACACCG ACGACCGCGA CGGCTCCCCC CTCCCGGGAC TGGACGGGGT CGTCGGATCG ATCGACGGCC GGGACGCGGG GCTCGACCGG GCCAGCGCCG TCGCGGATCC GACACCGACG ACGCGCTCGA CGGGACACTG GCGAGGGATC AGCGCGGCCG CGCTGGCGGC GCTCGGCGTC GGCGTGCTCG CCGAACGGGC GGGCGTCGTC CTCGTCGCCG TCGTCGGGAT CGCCTACGCG GCTGCCGCCC GCCAGCGGAC GCTCTCCGCG CCGACGCTCT CGGTCGAGCG CACCGTCGAG CCGGCGGATC CGGAGCCCGA CGAGCCCGTC ACGGTGACGC TGACCGTCAC CAACGAGGGC GAGGAGGCGG TGTGGGATCT GCGCCTCGTC GACGGCGTTC CGCCCGCCCT CTCCGTGGTC GAGGGGTCCC CGCGGCTCGG GACGGCGCTG TGGCCGGGGG CCAGTGCGAC GGTCACCTAC GAGGTGGCGG CCGAGCGCGG TCGCCACGAG TTCGGTCCCG CGCAGGTCCT CGTCCGAACG CTGTCGGGCA GCGTCGAACG CGAACAGACC GTCGAGCCCG CGACGACGAC GGAAATCACC TGCGTCCCGC CGCTCCGGCC GATCGACGAA CCGGTCCCGC TGCGCCGCCA GCCCACTCGC CACACCGGGC GCGTCGAGAC CGCCACCGGC GGCGAGGGCG TCGAGTTCTT CGCGACGCGC GCGTACCGCT CGGGGGACCC GCTGTCGCGG ATCGACTGGA ACCGTCACGC CAGAACCGGT GAGCTGGCGA CCCTGGAGTT TCGCGAGGAG CGATCCGCGA CGGTAGTGCT GGTGATCGAC CGCGACGAGG CGGCCGCGGT CGGTCCCTCA CAGCGGGGCC AGACCGCCGT CCAGCGCTCC GTCGACGCCG CGAGCCGACT GTTCGCGACG CTGCTCGACG ACGGCAACCG CGTCGGCATC GCCGCGCTCG GGGCTCGCGA CTGCTGGCTC GCGCCGGGTG CGGGCGACGC CCACCGCGTT CGGGGCCGGG AGCTGCTCGC GACCCATCCC GCGCTCGCTC CCGGCGAGAC GCCCGAGAGC GTCTTGCCGT TCGGCTGGCT CGCCTCCCTG CGGAGTCAAC TGCCCGGCGA CGCGCAGGTC GTCCTGTTCT CGCCGCTGTG CTCGCCCGCC GTCGCCACCG TCGCCCGCCA GCTCGACGCC GGGGGTCACC TCGTGACCGT CGTCAGCCCG AACCCGACGA CGACGGCCTC CGGGGCCGGC CAGCTCGCGA CGGTGGCGCG GCGCTTCCGG ATCGCCGACC TCCGGGCCGC CGGGATCCCC GTCGTCGACT GGCCGTGGGC GCAGTCGCTG CCGGTGGCCC TGGCCCGGAC GAGGTGGTCG AAATGA
|
Protein sequence | MSRGRLLAAV GLVAFAVGVG ALAAPGVFGL GVQRYAVVVI GLLAVAAAIR VVQGRRHSPR RRGETATPEE LPAVASPGEE LDAVLAAFDP TRYGTADRRR KQLRRVATEV LTRYRGDSEA TASEAIERGT WTDDPVAAEF LADERSRLPL TDRLRTRLGG QSAYWQGIEH TARAIAETAG VDTDDRDGSP LPGLDGVVGS IDGRDAGLDR ASAVADPTPT TRSTGHWRGI SAAALAALGV GVLAERAGVV LVAVVGIAYA AAARQRTLSA PTLSVERTVE PADPEPDEPV TVTLTVTNEG EEAVWDLRLV DGVPPALSVV EGSPRLGTAL WPGASATVTY EVAAERGRHE FGPAQVLVRT LSGSVEREQT VEPATTTEIT CVPPLRPIDE PVPLRRQPTR HTGRVETATG GEGVEFFATR AYRSGDPLSR IDWNRHARTG ELATLEFREE RSATVVLVID RDEAAAVGPS QRGQTAVQRS VDAASRLFAT LLDDGNRVGI AALGARDCWL APGAGDAHRV RGRELLATHP ALAPGETPES VLPFGWLASL RSQLPGDAQV VLFSPLCSPA VATVARQLDA GGHLVTVVSP NPTTTASGAG QLATVARRFR IADLRAAGIP VVDWPWAQSL PVALARTRWS K
|
| |