Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_3253 |
Symbol | |
ID | 8409331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013201 |
Strand | + |
Start bp | 48403 |
End bp | 50307 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645018190 |
Product | von Willebrand factor type A |
Protein accession | YP_003175711 |
Protein GI | 257372937 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0206278 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACTGA CAGCACAGAC CGAGCCGCCG GACCTCGCCG CCCTCTCGAC GCCAGAGCGG GCCTCCAGCG AGCGCCGCCG AGAGCTCGAA CGCCTCGCGT CGCTGTGTAC GGACAGGGAG AGGACGATCT CGATCGCCTT CGACGAGGAG CGAGCGTTCG CTCGGCCGGC CGAGGGGGCC GGCCCGGACG CCTACGAGAT CGTGCTGCCG ACGGAGAAAT ACGAGCAGCC CGGCACGGAG CTGCCGCCGG GGCTGTGGGA TCGCTCGATC CAGGTCGCCT TTCTCTTCCA CGAACTGGGC CACGTCTACT ACTCCGACTT CGATCGCTTC GGCGACTGTT TAGAGCGGGT CGACGGGCGC TGGCGCGACC TGTTTCGGAT GGTGTACAAC ACCGCCGAGG ACGGCGTCGT CGAGACTCAG ATCGCCAACG AGTTCTCAGT CACCGACGAC TTCGTCCTCC TCAACGACGT GCTCGTCTCG CGGGCCGACG AGCGACACCG CGCGTACGTC GACCTGTTCG ACCTCGCGAC CGCCGACGGC GAGCCGGTCC AGTCCTACAC CGTCTTCGAA GCCCTCGCAG TCGGGCTGCT CGACCGTGGC TTCGTCGACA GCGGTCGCTT CGCCGCCATC GTCGACCCGG ACGACGACCG GCGTGTCGTC TACGACGGCC AGCGGGAGGC GCTCGTCGAC CTCGTCCCGG CGATGGACGA GTTCGTCGCC GACGTGCTCT CGGAGCCCGA CGGCACCCGG CGGGTCGACC GGGCCCACGA CTTCTTTGAG ACCGCCAGAG ACACCCTCGC TGGGCTCCCG CCGCGACAGA ACGGTCGGCT CCAGACCGCG CCGGTCCGGC CCAGCGACGC CCGGGCCCGC GCCGGCTGGA CCGCAGACGC GGCGGACCGG CTGCCCGACG ACGGGGCGGC CAGCGCGCAC GTCGCCCGAG ATCGCGCTGC TGACGACCGG TCCGCCTCCG GAGCCAGCGG TGCGGGAAGC GTCGCCGATC CGGACGAGGA CCGGCGGCCG CCGGGCCGCG TCGAGGACGA CACGGTCAGA CGGGTCCGGC GACGGAGCGT GCGCGCCCAG TCGAACCGGG CGGGCTCGGG CCGGTCACCC CTCGAACGCG AGGCCCGACA CCTCCTCGAT GTCGTCGACG ACGAGTCGAC GGCTCTCGAA GAGGTGATCG TCGTCGACCC CGCCGAGGAC GGTGGCGACC GGGACCGCTG GGACGACGCG GTCGGGCGTT CGAAGCAACT CCAGCGGGAC CTCTCGACGC AGCTTCGCCG CGAGCGCCGT CCCAGAGATG AGCCTGGTCA TCGCACCGGC CGACTCGACG GGCGGCGACT CGTCGGCGCG AGCCACGGGG CCCAGCGGGT CTTCACCCGC CGGGAGTCCG GGACGGCCAA AGACCACTCC TGTCTGGTCG TGCTGGACCG GTCGGGATCG ATGGACGGCG AACCGATCCG GACGGCCGAG ACTGCGACGG CCCAGCTCGT CCACGCGCTG TTCGCGGTCG GGGTCGACGC GTCCGTGCTG TCGATCTGGG AGGGGTATCC GTGTCTGGAA CTCCCCTTCG GGGGTCGCCC GTCCGAGCAC GTCGACCGGC TGATGACCGA ACGTGCGGAC TGGGGGACGC CGCTCTCGAC GGCCGTCGCC GTCGCCCGTG AGCGCCTCGA CGACGGCCGG GGATCTCACC CGTTCGTCGT CGTCGTGACC GACGGCGCGC CGGACCACCC CGACCGCTAC CAGTCGCAGC TGGCGGCGTG TACCGTTCCC GTGTTCGGCG TCTACATCGG GTCGGAACCG GGCACTCACA CCGAGTACTT CGACCGGATC GTCCACGCCG AGACCGACAC GCTCGCACGA ACGATGCAGC GCCTCGTCAG GGCGCTGTTC TCGACGGAGG CCTGA
|
Protein sequence | MRLTAQTEPP DLAALSTPER ASSERRRELE RLASLCTDRE RTISIAFDEE RAFARPAEGA GPDAYEIVLP TEKYEQPGTE LPPGLWDRSI QVAFLFHELG HVYYSDFDRF GDCLERVDGR WRDLFRMVYN TAEDGVVETQ IANEFSVTDD FVLLNDVLVS RADERHRAYV DLFDLATADG EPVQSYTVFE ALAVGLLDRG FVDSGRFAAI VDPDDDRRVV YDGQREALVD LVPAMDEFVA DVLSEPDGTR RVDRAHDFFE TARDTLAGLP PRQNGRLQTA PVRPSDARAR AGWTADAADR LPDDGAASAH VARDRAADDR SASGASGAGS VADPDEDRRP PGRVEDDTVR RVRRRSVRAQ SNRAGSGRSP LEREARHLLD VVDDESTALE EVIVVDPAED GGDRDRWDDA VGRSKQLQRD LSTQLRRERR PRDEPGHRTG RLDGRRLVGA SHGAQRVFTR RESGTAKDHS CLVVLDRSGS MDGEPIRTAE TATAQLVHAL FAVGVDASVL SIWEGYPCLE LPFGGRPSEH VDRLMTERAD WGTPLSTAVA VARERLDDGR GSHPFVVVVT DGAPDHPDRY QSQLAACTVP VFGVYIGSEP GTHTEYFDRI VHAETDTLAR TMQRLVRALF STEA
|
| |