Gene Hmuk_1531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1531 
Symbol 
ID8411052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1456658 
End bp1458496 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content74% 
IMG OID645019857 
ProductTrkA-C domain protein 
Protein accessionYP_003177353 
Protein GI257387580 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCTCC AGCTCGTCAG CGGCGTGACG GGGGCCCTCG TCGACACGAC CGCCCGAATC 
GCCGGCACGG CGGTGCTCGC CGGCTTCCTC GCGGCGGCGG TCGCGCTGTT CTACCGGTGG
TACGTCCGCG AGCGGGTGCC GGTCGGACTG GCGTTGCTGG TCGGGCTCTC CGGCGTCGCC
GCCGTCGCCG GGGCGACCTC GCTGCTGGCC CAGCAGCTCG TGCCGGGCGG TGCCGACCCG
GACGCCCTCG TGGCGCTGCT CAACGTCGTC ACGTTCCTCG TCGGCGGCCT CGGGGCGTAC
GCAGGCATGC GGGTCGGCGA CACACTCGGG ATCGACCTGT TCGCGGCGAC GGGGGGCAAC
GGGATCGACG GCGAGGTCAG CGAACTCGTC AAGGCCGTCG GTCGCGTCAT CGCCGTCGAG
ATCCCCGAGG AGATCGACGA CATCGTCGGC TACGATCCGG TCGCGGCCGC GACCCGGGAG
ACCCTCGCCG GTCGGACGTT CCTCTTCCCG CGGCGGCTCA CGCAGGCGGA GCTGCGCGAG
CGGTTCGTCT CGCGGCTGAA GACCGACTAC GGCGTCGGCC ACGTCGACGT GGAGTTCGAC
GAGGCCGGCA ACATCGAGTA CCTGGCGCTG GGCTCGCGGG CCGCCGGGAT CGGTCCGACG
CTCCCGCCGG CCACCAACGC GGTCGCGATC CGGGCCGACC CGGCAAACGC CGCCAGCGCG
GGCGATCTCG TCCAGGTCTG GGAGACCGAT CCCCCGCGGC GCGTGCTCAC GGGAGAGCTC
CGCGGCGTCG CCGACGAGAT CGTCACGGTC GCCATCGACG CTGCCGACAC GCCCAAACTC
GACCCCACGA GCCGGTACAA GCTCGTGACG CTGCCGGTCC AGGACCGGCC GGACCGGGAG
TTCGCGTCCC TCCTCCGGGC CGCAGACGAG ACCCTCGCCA CGGTCACCGT CGCGGCGGGC
TCCCCGATCG ACGGGACCCC CGTGGGTTCG CTGTCGGTCA CCGTCGCTGC CGTCACACGC
GACGACGAGC GGCCCGTCCC GCTTCCGTCA CGAGACCGGG TCCTCAGGGC TGGCGACGTA
CTCTACGCCA TCGCGACGCC GGACGCGCTC CGGAAGCTGG AGGCCGCCAC CGCGGCCCCC
GAGGGCGCGG TCGCGGAGCC GGCCGACGGG ACGGACGACG AACCGGCGAC GGCCGCCGAG
GACGCGACAG CCCCGGAACC CGCGGCGAGC GACCGCCCAC CAGCGGACCA CCGGCCCGAC
CAGTCGGAAA GCGAGCGACG AGCGGACGCG TCCGACGGGC AACCGGACGC CGGAGACGAG
GTGGTGGGCG ACGACGCGGA CGACACATCG ACGACAGAAA GCGAGGCCGA TGCGTCGGGC
GACGACGGCG CGGACGATGC CGCCCTCTCT GGCGACGCGT TCCCCGACAG CGACGACCTC
CCGGGCATCG ATCCCGAGGA CGCGACGGCG TCCCCGTCCG ACGGCGGAGA CGACGACACC
GAGGCGATCC CCGACACCGC CGCGTTTCCC GACACCGACG ACCTCCCCGG ATCGGAGCTG
ACAGACGAGG GAGACGCCCG AGCGGATCGG GAGGACGACG GGGACGGAAC CGACGGCGCG
ACGGCGCGGA GCGCGCCGTC GGACGAGGAC GAAGCCGCGG GCGACTTCGC CGACCTGCTC
GACACCGATG TCGGCGCGGG CGACGACCTG GACGACCTCC TCGACGAGGA CACCGGCGAC
ACGGTCGATC TCGACGACGA AACGGCGGAC GGATCGGAAG CGGACGACCG CCTCGACGAG
GACACCGACG AGACGGACGA CCGCGACCGG ACCGCCTGA
 
Protein sequence
MTLQLVSGVT GALVDTTARI AGTAVLAGFL AAAVALFYRW YVRERVPVGL ALLVGLSGVA 
AVAGATSLLA QQLVPGGADP DALVALLNVV TFLVGGLGAY AGMRVGDTLG IDLFAATGGN
GIDGEVSELV KAVGRVIAVE IPEEIDDIVG YDPVAAATRE TLAGRTFLFP RRLTQAELRE
RFVSRLKTDY GVGHVDVEFD EAGNIEYLAL GSRAAGIGPT LPPATNAVAI RADPANAASA
GDLVQVWETD PPRRVLTGEL RGVADEIVTV AIDAADTPKL DPTSRYKLVT LPVQDRPDRE
FASLLRAADE TLATVTVAAG SPIDGTPVGS LSVTVAAVTR DDERPVPLPS RDRVLRAGDV
LYAIATPDAL RKLEAATAAP EGAVAEPADG TDDEPATAAE DATAPEPAAS DRPPADHRPD
QSESERRADA SDGQPDAGDE VVGDDADDTS TTESEADASG DDGADDAALS GDAFPDSDDL
PGIDPEDATA SPSDGGDDDT EAIPDTAAFP DTDDLPGSEL TDEGDARADR EDDGDGTDGA
TARSAPSDED EAAGDFADLL DTDVGAGDDL DDLLDEDTGD TVDLDDETAD GSEADDRLDE
DTDETDDRDR TA