Gene Hmuk_0138 details

Gene Information

Locus tag: Hmuk_0138
Symbol:
ID: 8409635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 138809
End bp: 141622
Gene Length: 2814 bp
Protein Length: 937 aa
Translation table: 11
GC content: 68%
IMG OID: 645018463
Product: FG-GAP repeat protein
Protein accession: YP_003175983
Protein GI: 257386210
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
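
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(138809, 141622)
print(length_bp, expected_protein_length(length_bp))
```

Under these assumptions the values agree with the record: 2814 bp and 937 aa.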


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.247842
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAACA CTTCCCAGAT CCGACTGCGG ATCGGACGGG CGACGGTCGT CGTCTTGATC 
GCCGCGGTCG TCGCGGCGAG TCCCATCGTC GCGAGCGTGA CAGCACTCGG TGACGACGCG
AGCATCTCAC AGCGAACGGA CGATATCGTC GCCCAGCAGT CAGCGCCCAG TGGGTTCTCC
GGCGAGACGA ACCTCTCGGA GGCGGAAACG AAGTACGTCG GCACTGCCGA AAACGATACC
GCGGGCTGGT CGGTCGCGAA CGCGGGCGAC GTGAACGGTG ACGGAATCGA CGATCTCGTC
GTCGGCGCAC CGGAGAACGA CACCGGCGGG ACCAACGCGG GGGCGGCCTA CGTCTTCTTC
GGCCCGGCCG ATCCTGGCAC GGTCTCGCTG GCCGACGCCG ACGTGACCCT CGTCGGCGCG
GCGGCCTCCG ACCGCGCTGG CTACGACGTG TCGTACGCGG GTGACGTGAA CGACGACGGC
TACGCAGACG TGATCGTGGG CGCACCGGGC AACGACAGCA CGGCCAGCAA CGCTGGCGCG
GCCTACGTCG TCTACGGCGG CGACACGATG GCCGACCGGA TAAGTCTCGC CGACGCGGAC
GTGACGCTGA TCGGCGACTC GCCGGGCGAC CGTGCCGGGT GGTCGGTCTC GAACGCCAGC
GGACTCGACG GTCCCGACGG GGTCGCCGTC GGCGCACCCT TCGCCAACGA CAGCGCCGGT
GGGGCCTACC TCGTCTCCGG CGAGCAGCTG TTTGGCACCG TCGACCTCGG GGCGGAGTCG
ACCGCTACGC TGACCGGCGA GTCGCCGGGC GACCAGGCCG GCTGGTCGAT CTCGCACGCT
GGAGACGTGA ACGCCGACGG TACGGCCGAC GTGATCGTCG GTGCGAACAA CTACACGGCC
GCCGACGGAC CGGCCGGGAG CGGGGCTGCA TACGTCGTCT ACGGCGCGGT CGGCGGCGAG
CGGGATCTCG GCGACGCCGA CCTGCGACTC CGTGGCGTCG ACGGTGCGGA CCGCGCGGGC
TGGTCGGTCT CGTACGCGGG TGACGTGAAC AACGACAGCA CTGCCGACGT GATCGTCGGC
GCACCGTTTA CCGATCCCAA CGGAACGGTC GCGGCGGGAT CGGCGTACGT CGTCTACGGC
GAGCCAGACA GGTCTGGCGA CGTGTCGCTG GCCGACGCCG ACGTGCGTCT GACCGGCGAA
GGTGACCGCG ACCGGGCGGG CGTTGCGGTC TCGTCGGCTG GCTCCGGTGA CGTGACCTGT
GACGGTGTCG ACGACGTGCT CGTCGGCGCG CCGCAGAACG ACTCGAACGG GAACGCCTCC
GGAGCGGCCT ACGTCGTCGC CGGCAGCGAA TCGTTCTCGG GTAACATCTC GCTGAGTGAC
GCCGACGCGA TCTTCCGCGG CGAAGCGGCC GGCGATCGAG CGGGCCGTGC GGTCGACGAC
GTTGGTGATC TCGACGACGA CAGCTTCGAC GACATCGCGG TTGGTGCGCC ACGGAACGAC
AGTAGCGCAA CTGACGCCGG AGCGGCCTAC GTGCTGAACA GCGACTGCGC AGTGCTCGAA
ACGCCGACTG CGACGCCGAC CGAAACTCCG ACCGATACGC CGACTGACAC CCCGACTGAC
ACTCCGACTG ACACCCCGAC TGACACGCCG ACGGATACCC CCACGGATAC GCCAACCGAC
ACTCCGACCG ATACGCCAAC TGACACGCCG ACCGACACCC CGACTGACAC GCCAACCGAC
ACCCCGACTG ACACTCCGAC CGACACGCCA ACCGACACCC CGACTGACAC TCCGACCGAC
ACGCCAACCG ACACCCCGAC TGACACGCCG ACCGACACTC CGACCGATAC GCCAACTGAC
ACGCCGACCG ACACTCCGAC CGATACGCCA ACTGACACGC CGACCGACAC TCCGACGGAC
ACCCCGACGG ATACCCCCAC GGATACGCCA ACCGACACCC CCACTGACAC GCCGACGGAC
ACGCCAACCG ATACGCCGAC GGACACCCCC ACCGACACGC CGACCGACAC TCCGACTGAC
ACGCCGACCG ACACTCCGAC TGACACGCCA ACCGACACGC CGACTGACAC TCCGACTGAC
ACGCCGACCG ATACCCCGAC CGATACGCCG ACCGATACGC CAACTGACAC GCCAACCGAC
ACCCCGACTG ACACTCCGAC CGACACGCCG ACTGACACGC CAACTGACAC GCCGACCGAC
ACGCCAACCG ATACGCCGAC GGACACCCCT ACCGACACGC CGACCGACAC TCCGACTGAC
ACCCCGACGG ATACCCCCAC GGATACGCCG ACCGACACTC CGACTGACAC TCCGACCGAC
ACTCCGACTG ACACGCCGAC TGACACGCCA ACCGACACGC CACAGAATCT CGCGGCGATC
AGCTTCGTCG CGTTCTGTGT CCCGGGCGAA CAGGGATCGG GCAACGATCC GTGTCCCGAA
GGCGAGCGGC TCCTGGTCAA ATTCGAGGAC CAGGGCGACG GGTCCTTCGC GCCAGAGGGC
GGTGACGCGA TGGGCGTGAC TGTGACCCCA TCCGAGTTCA AGGACAACGA TCCGTCGGAA
GTCGTCGCCG TCCAGTGGAC CTCCGGACAG TCGATCTCGA CGGTCGTCGT CAAGTCCTCG
ACTGACGAGT GTAACTACCC CGGTGGGAGT TCGGGCACCG CAGAATCCTG CGGACCGCCG
TCGGGCCAGA GTTCCCAGTC GGAACCCGGT GGCGGTGGCT CGGGGCCACT GCCGCCGATC
TTCCTCGCCG GACTGGCAGC GACTTCGCTG GTGGCTGTCG GGCGGCGCGA CTGA
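
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the GC content could be recomputed from the displayed text. The helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text):
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, used only as sample input.
first_line = "ATGAGTAACA CTTCCCAGAT CCGACTGCGG ATCGGACGGG CGACGGTCGT CGTCTTGATC"
print(f"{gc_content(first_line):.1%}")
```

Applied to the full gene sequence, the result can be compared with the GC content listed in the Gene Information table.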
 
Protein sequence
MSNTSQIRLR IGRATVVVLI AAVVAASPIV ASVTALGDDA SISQRTDDIV AQQSAPSGFS 
GETNLSEAET KYVGTAENDT AGWSVANAGD VNGDGIDDLV VGAPENDTGG TNAGAAYVFF
GPADPGTVSL ADADVTLVGA AASDRAGYDV SYAGDVNDDG YADVIVGAPG NDSTASNAGA
AYVVYGGDTM ADRISLADAD VTLIGDSPGD RAGWSVSNAS GLDGPDGVAV GAPFANDSAG
GAYLVSGEQL FGTVDLGAES TATLTGESPG DQAGWSISHA GDVNADGTAD VIVGANNYTA
ADGPAGSGAA YVVYGAVGGE RDLGDADLRL RGVDGADRAG WSVSYAGDVN NDSTADVIVG
APFTDPNGTV AAGSAYVVYG EPDRSGDVSL ADADVRLTGE GDRDRAGVAV SSAGSGDVTC
DGVDDVLVGA PQNDSNGNAS GAAYVVAGSE SFSGNISLSD ADAIFRGEAA GDRAGRAVDD
VGDLDDDSFD DIAVGAPRND SSATDAGAAY VLNSDCAVLE TPTATPTETP TDTPTDTPTD
TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD
TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD
TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD
TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD
TPTDTPTDTP TDTPQNLAAI SFVAFCVPGE QGSGNDPCPE GERLLVKFED QGDGSFAPEG
GDAMGVTVTP SEFKDNDPSE VVAVQWTSGQ SISTVVVKSS TDECNYPGGS SGTAESCGPP
SGQSSQSEPG GGGSGPLPPI FLAGLAATSL VAVGRRD
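
As a consistency check, the protein sequence can be compared against a translation of the gene sequence. The sketch below builds the codon table by hand rather than relying on a library. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, which this sketch does not handle. The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a block-formatted coding sequence, dropping a final stop."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()  # the terminal stop codon encodes no residue
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGAGTAACA CTTCCCAGAT CCGACTGCGG"))  # MSNTSQIRLR
```

The output matches the first 10 residues of the protein sequence above.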