Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0034 |
Symbol | |
ID | 8409531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 29842 |
End bp | 31197 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645018372 |
Product | conserved repeat domain protein |
Protein accession | YP_003175892 |
Protein GI | 257386119 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.559435 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTCA GCTCGATTCC GGCGGCGGCC ACGCGCTCTG CCGACGTGAC CGACGAAGCG GCCGAGTTCG AGATCGAGCC GGGCACGGTC GTCGACCGCC GGACGAGGCG GTGGTACGCC GTCACCGTCT TCGCGCTCCT GGCGCTCGGA GCGGGGGTCC TGACCCGCGA GTCCGGCCTC CTGCTGACCA GTGCCTTCGG GATCGCCTTC GCGGGCTACG GGCAGGTCAC CTCGCCGCCG CCGGTCGAGG TGAGCGTCGA GCGCTCGATC AGCGACGACG CTCCCGACAC CGACGACACC GTCGCGGTGA CCGTGACGGT GCGCAACGAG AGCGACCGGA CGATGCCGGA CCTGCGGCTC GTCGACGGCG TCCCGCCGAA GATGACCGTG GCCGAGGGGT CACCCCGCCT GGCGACCGCG CTCCGACCGG GCGAGGAGAC GACGTTCGCC TACTCGCTGC GGGCTCGGCG GGGTCGCCAC GAGTTCGAGC CGACGACGAT CCTCACTCGG GACGCGTCCG GAGCCACGGA GCGACGCGGC ACGGTCGACG CGCCAGACAC CGTCGACTGC GAGGCGTCGC TGCCGAGCCA GAGCGTCTCC TTTCCGCTGC GGTCACAGAC CACCCGTCAC ACGGGACGGT TCCCGGCGGA CACCGGCGGG CCCGGCGTGG AGTTCTACGC GACCCGCGAG TACCGGCCGG GCGACCCGCT GAACCGGGTC GACTGGAACC GCACCGCCCG GACTGGTGAC CTCACCACCG TCCAGTACCG GGTCGAACGC AGCGTCTCGG TCGTGCTGGT GGTCGACGCC CGACAGGCAG CGTACGCCGC CCCAGCGCCA CAGGCCCGGA CGGCGCTCGA CGCGGCCGTC GACGCGGCGG GTCACGCCTA CGTCTCGCTG ACCGACGCGG GCCACGACGT GGGGCTGACG GCGCTGTCGC CGACGGAGTG TTGGCTCTCG CCGGGCAACG GGGACGAACA CCGCGTTCGA GCACGCGAGT TCCTCTCGAC GGAGCCGGCG CTCTCTCCGT CTGGGCCCGA CGCGGAGACC TCCCTCTACG CCGCGGTCCA GCGGATCAGG CGTCGCGCGC CGACGGACGC CCAGATCGTC GTGTTCTCGC CGCTGACCGA CGATCGCGTG GCCGGCTCTG CGATCCGACT CGACGCCAAC GGCCACCGGA CGACGGTGAT CTCGCCGGAC CCGACGGCCG ACGACTCCGT CGGTCACCGA CTGGCCGGAG TCAGGCGGTC GCTGCGCATC GCCGACCTCC GGCAGCGCAA CATCCCGGTC GTCGACTGGG ACGGGACGGA ACCGTTCCCG CACGCGCTCG CCCGCTGGGA CGGGGGGTCG CGATGA
|
Protein sequence | MSFSSIPAAA TRSADVTDEA AEFEIEPGTV VDRRTRRWYA VTVFALLALG AGVLTRESGL LLTSAFGIAF AGYGQVTSPP PVEVSVERSI SDDAPDTDDT VAVTVTVRNE SDRTMPDLRL VDGVPPKMTV AEGSPRLATA LRPGEETTFA YSLRARRGRH EFEPTTILTR DASGATERRG TVDAPDTVDC EASLPSQSVS FPLRSQTTRH TGRFPADTGG PGVEFYATRE YRPGDPLNRV DWNRTARTGD LTTVQYRVER SVSVVLVVDA RQAAYAAPAP QARTALDAAV DAAGHAYVSL TDAGHDVGLT ALSPTECWLS PGNGDEHRVR AREFLSTEPA LSPSGPDAET SLYAAVQRIR RRAPTDAQIV VFSPLTDDRV AGSAIRLDAN GHRTTVISPD PTADDSVGHR LAGVRRSLRI ADLRQRNIPV VDWDGTEPFP HALARWDGGS R
|
| |