Gene Hmuk_3253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3253 
Symbol 
ID8409331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013201 
Strand
Start bp48403 
End bp50307 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content71% 
IMG OID645018190 
Productvon Willebrand factor type A 
Protein accessionYP_003175711 
Protein GI257372937 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0206278 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTGA CAGCACAGAC CGAGCCGCCG GACCTCGCCG CCCTCTCGAC GCCAGAGCGG 
GCCTCCAGCG AGCGCCGCCG AGAGCTCGAA CGCCTCGCGT CGCTGTGTAC GGACAGGGAG
AGGACGATCT CGATCGCCTT CGACGAGGAG CGAGCGTTCG CTCGGCCGGC CGAGGGGGCC
GGCCCGGACG CCTACGAGAT CGTGCTGCCG ACGGAGAAAT ACGAGCAGCC CGGCACGGAG
CTGCCGCCGG GGCTGTGGGA TCGCTCGATC CAGGTCGCCT TTCTCTTCCA CGAACTGGGC
CACGTCTACT ACTCCGACTT CGATCGCTTC GGCGACTGTT TAGAGCGGGT CGACGGGCGC
TGGCGCGACC TGTTTCGGAT GGTGTACAAC ACCGCCGAGG ACGGCGTCGT CGAGACTCAG
ATCGCCAACG AGTTCTCAGT CACCGACGAC TTCGTCCTCC TCAACGACGT GCTCGTCTCG
CGGGCCGACG AGCGACACCG CGCGTACGTC GACCTGTTCG ACCTCGCGAC CGCCGACGGC
GAGCCGGTCC AGTCCTACAC CGTCTTCGAA GCCCTCGCAG TCGGGCTGCT CGACCGTGGC
TTCGTCGACA GCGGTCGCTT CGCCGCCATC GTCGACCCGG ACGACGACCG GCGTGTCGTC
TACGACGGCC AGCGGGAGGC GCTCGTCGAC CTCGTCCCGG CGATGGACGA GTTCGTCGCC
GACGTGCTCT CGGAGCCCGA CGGCACCCGG CGGGTCGACC GGGCCCACGA CTTCTTTGAG
ACCGCCAGAG ACACCCTCGC TGGGCTCCCG CCGCGACAGA ACGGTCGGCT CCAGACCGCG
CCGGTCCGGC CCAGCGACGC CCGGGCCCGC GCCGGCTGGA CCGCAGACGC GGCGGACCGG
CTGCCCGACG ACGGGGCGGC CAGCGCGCAC GTCGCCCGAG ATCGCGCTGC TGACGACCGG
TCCGCCTCCG GAGCCAGCGG TGCGGGAAGC GTCGCCGATC CGGACGAGGA CCGGCGGCCG
CCGGGCCGCG TCGAGGACGA CACGGTCAGA CGGGTCCGGC GACGGAGCGT GCGCGCCCAG
TCGAACCGGG CGGGCTCGGG CCGGTCACCC CTCGAACGCG AGGCCCGACA CCTCCTCGAT
GTCGTCGACG ACGAGTCGAC GGCTCTCGAA GAGGTGATCG TCGTCGACCC CGCCGAGGAC
GGTGGCGACC GGGACCGCTG GGACGACGCG GTCGGGCGTT CGAAGCAACT CCAGCGGGAC
CTCTCGACGC AGCTTCGCCG CGAGCGCCGT CCCAGAGATG AGCCTGGTCA TCGCACCGGC
CGACTCGACG GGCGGCGACT CGTCGGCGCG AGCCACGGGG CCCAGCGGGT CTTCACCCGC
CGGGAGTCCG GGACGGCCAA AGACCACTCC TGTCTGGTCG TGCTGGACCG GTCGGGATCG
ATGGACGGCG AACCGATCCG GACGGCCGAG ACTGCGACGG CCCAGCTCGT CCACGCGCTG
TTCGCGGTCG GGGTCGACGC GTCCGTGCTG TCGATCTGGG AGGGGTATCC GTGTCTGGAA
CTCCCCTTCG GGGGTCGCCC GTCCGAGCAC GTCGACCGGC TGATGACCGA ACGTGCGGAC
TGGGGGACGC CGCTCTCGAC GGCCGTCGCC GTCGCCCGTG AGCGCCTCGA CGACGGCCGG
GGATCTCACC CGTTCGTCGT CGTCGTGACC GACGGCGCGC CGGACCACCC CGACCGCTAC
CAGTCGCAGC TGGCGGCGTG TACCGTTCCC GTGTTCGGCG TCTACATCGG GTCGGAACCG
GGCACTCACA CCGAGTACTT CGACCGGATC GTCCACGCCG AGACCGACAC GCTCGCACGA
ACGATGCAGC GCCTCGTCAG GGCGCTGTTC TCGACGGAGG CCTGA
 
Protein sequence
MRLTAQTEPP DLAALSTPER ASSERRRELE RLASLCTDRE RTISIAFDEE RAFARPAEGA 
GPDAYEIVLP TEKYEQPGTE LPPGLWDRSI QVAFLFHELG HVYYSDFDRF GDCLERVDGR
WRDLFRMVYN TAEDGVVETQ IANEFSVTDD FVLLNDVLVS RADERHRAYV DLFDLATADG
EPVQSYTVFE ALAVGLLDRG FVDSGRFAAI VDPDDDRRVV YDGQREALVD LVPAMDEFVA
DVLSEPDGTR RVDRAHDFFE TARDTLAGLP PRQNGRLQTA PVRPSDARAR AGWTADAADR
LPDDGAASAH VARDRAADDR SASGASGAGS VADPDEDRRP PGRVEDDTVR RVRRRSVRAQ
SNRAGSGRSP LEREARHLLD VVDDESTALE EVIVVDPAED GGDRDRWDDA VGRSKQLQRD
LSTQLRRERR PRDEPGHRTG RLDGRRLVGA SHGAQRVFTR RESGTAKDHS CLVVLDRSGS
MDGEPIRTAE TATAQLVHAL FAVGVDASVL SIWEGYPCLE LPFGGRPSEH VDRLMTERAD
WGTPLSTAVA VARERLDDGR GSHPFVVVVT DGAPDHPDRY QSQLAACTVP VFGVYIGSEP
GTHTEYFDRI VHAETDTLAR TMQRLVRALF STEA