Gene Hmuk_0881 details

Gene Information

Locus tag: Hmuk_0881
Symbol:
ID: 8410396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 848806
End bp: 849846
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 67%
IMG OID: 645019216
Product: TRAP transporter solute receptor, TAXI family
Protein accession: YP_003176718
Protein GI: 257386945
COG category: [R] General function prediction only
COG ID: [COG2358] TRAP-type uncharacterized transport system, periplasmic component
TIGRFAM ID: [TIGR02122] TRAP transporter solute receptor, TAXI family
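
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; neither is stated in the record, although the numbers are consistent with both. The function names are hypothetical.

```python
# Coordinates copied from the record.
START_BP = 848806
END_BP = 849846


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1041 346
```

Both results match the Gene Length and Protein Length fields of the record.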


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.000456072
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0650778
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGATC ACACCAGACG ACGATTCATC CGCGCGACTG GACTCGTGAG CGTTACCGCA 
CTCGCCGGCT GTGCCGGCGA CGACGGCGGC GGCGGTGCGG AAGACTCACC GACGGAGACG
GGGATGGACG GCGGCGACGA GACGGCCACC GACACCGAAT CCTCTGACGG CGGCGACGGC
CAGGCCGACA CCCGCCTCTC GTGGCACGCC GGTGGGACCG GCGGCACATA CTTCCCGCTC
TCGAACGAGT TCAAGACGGT CGTCGAGGAC AACACCGACT TCACGCTGAA CGTCCAGTCG
ACGGGGGCCA GCGTCGAGAA CGTCGGCAGC CTCGCCAACG GGAACGCCGA CTTCGCGCTG
ATCCAAAACG ACATCGCGTA CTTCGCGAAG AACGGCACCG GCATCGACGC GTTCCAGGAC
AACGCCGTCG AGAACCTGCG CGGCGTCGCG ACGCTGTACC CCGAGACGAT CACGGTCGTG
ACCCTCTCCG ACACCGGCAT CTCGCAGCTG TCTGACCTCT CGGGAGCGAC GATCAACACG
GGCGATCTGG GCAGCGGCAC GCAGGTCAAC GCCAACCAGA TCCTCGAAGC CGTCGGCATC
ACCGACTTCG AGGAGCAAAA CGCCTCGTTC TCGCAGGCGG CCGACCAGCT CCGGAACGGC
GACATCGACG CCGCCTTCGT CGTCGGGGGC TGGCCGGTCG GGGCGATCGA GGATCTGGCG
ACGACCAACG ACGTCGCGAT CGTCCCCATC GACGGCGACA ACCGCACGGC GGTCAAGGAC
GCGGCGTCGT GGTTCGCGGA CGACACCATC CCGGGCGGGA CGTACTCGGG CGTCTCCGAA
GACGTGGAGA CGGTCGCCGT CCAGGCGATG ATCGCGACCA ACGCCGACGT GCCCGAAGCG
ACGGTCGAGA CGGTGACGGC CGCGATCTTC GACAACGTCG ACGCGCTGAC GATCAAGACG
GAGTTCATCG GCGTCGACTC GGCGCAGGAC GGGATGTCGA TCGAGCTCCA TCCCGGCGCT
GCGGCCTACT TCGACGCCTG A
 
Protein sequence
MGDHTRRRFI RATGLVSVTA LAGCAGDDGG GGAEDSPTET GMDGGDETAT DTESSDGGDG 
QADTRLSWHA GGTGGTYFPL SNEFKTVVED NTDFTLNVQS TGASVENVGS LANGNADFAL
IQNDIAYFAK NGTGIDAFQD NAVENLRGVA TLYPETITVV TLSDTGISQL SDLSGATINT
GDLGSGTQVN ANQILEAVGI TDFEEQNASF SQAADQLRNG DIDAAFVVGG WPVGAIEDLA
TTNDVAIVPI DGDNRTAVKD AASWFADDTI PGGTYSGVSE DVETVAVQAM IATNADVPEA
TVETVTAAIF DNVDALTIKT EFIGVDSAQD GMSIELHPGA AAYFDA
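
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. As a minimal sketch, the helpers below strip the whitespace and compute a GC fraction, which could be compared against the GC content field of the record. The function names are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence block, copied from the record.
first_line = "ATGGGCGATC ACACCAGACG ACGATTCATC CGCGCGACTG GACTCGTGAG CGTTACCGCA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence block instead of `first_line` gives the value for the whole gene.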