Gene Hmuk_0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0054 
Symbol 
ID8409551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp53190 
End bp54239 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content66% 
IMG OID645018392 
Productcell surface glycoprotein 
Protein accessionYP_003175912 
Protein GI257386139 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.521003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGTT GGTTCCTACT GTGCAGCGTA GTGCTCGGTG CCCTCCTCGG TGCCGGTGTA 
CTCGGTGCGC AATCGACCGC TGACTACTCG CTCTCTGCAG ACGCCAGTGT CGACATCCCG
ACACAGGAGG TCTCTTACGA CGGCGATACG TTCGAGATCT CACAGACCGG CGTCGCAGAC
CCAGAAACGA CCTTTATCGC CACTGCGAGT GCGCCCGAGG ACGCGGACTA CGTGGTCTAC
GTCGTCGACA AAGGGGAGAG CATTCGGCAG TCTAACGCTG GCTCGGGCAC CGGCGAAGTG
CCGCTCGATC TCGGCACGCT CGATCCGGGG ACGTACGCGG TCACGATCAA ACAGGACAGC
ACCGTCGCGG CCATGCCGCT GGTCGTTCGA GGCTACGATC TCTCACAGGA CGTTCCCGAC
GAAACGACGG CCGGTACGGA GACGGAGGTC TCGATCACGG CCGACGCTAT CCACGAGGAC
GCCGAGTTCG AGCAGATCGT CCTGACGCTC TGGGACGGCT CGGAGGAACA CGACGTTACC
GCGAGCAGAG CGGACGGACA GCGATACACC GCGACGATCC CCGCTGGAAC GCTCGACGAG
GGCACGTATC GGGTCGTCAG CCGTGCCGAA ACGGGTGCCA CGGCGTTCGA CCACAACGAA
CTGGTCGGGA TCAGTGAGAC GACGACCATC TCGGTGACCG AGTCGCCGAC GACGACGAAG
TCCGGTGCTG GTGGTGGCGG TGGTCAGGTC TCGACCGCGA CTGAGACGGC GACGCCGATG
GCGACGCGGA CGAACGCCAC TGCCTCGCCG ACGGTGACGG CGACGCCGTC GTCGAACCGG
ACGGCGACGC CGGTATCGAC GACGGAGCGC TCGACCCGGA CGCCGGACAC GGCGACCACG
CCGTCGGAAA CCGCGACGAC GGAGCCGACT GCATCGGACG CTGTCACTCC GAGTCCGACG
ACCGGAGAGG GCGGGAGCTT CTCGAAGCTC GTCTTCGCAG CGCTGCTTGC CGGTGTCGTC
GGTAGCCTGA CGATGCGACG CCGGTCCTGA
 
Protein sequence
MRRWFLLCSV VLGALLGAGV LGAQSTADYS LSADASVDIP TQEVSYDGDT FEISQTGVAD 
PETTFIATAS APEDADYVVY VVDKGESIRQ SNAGSGTGEV PLDLGTLDPG TYAVTIKQDS
TVAAMPLVVR GYDLSQDVPD ETTAGTETEV SITADAIHED AEFEQIVLTL WDGSEEHDVT
ASRADGQRYT ATIPAGTLDE GTYRVVSRAE TGATAFDHNE LVGISETTTI SVTESPTTTK
SGAGGGGGQV STATETATPM ATRTNATASP TVTATPSSNR TATPVSTTER STRTPDTATT
PSETATTEPT ASDAVTPSPT TGEGGSFSKL VFAALLAGVV GSLTMRRRS