Gene Hmuk_1821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1821 
Symbol 
ID8411347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1740271 
End bp1741419 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content65% 
IMG OID645020151 
ProductABC transporter related 
Protein accessionYP_003177642 
Protein GI257387869 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACGC TCGAACTCGA CCGACTGACG AAGGTGTTCC ATGACGGCGA AGAAGGCGAG 
ATCGTCGCAG TCGACAACGT CGACATCCAG ATGGACGACG GCGAGTTCAT CGTCGTCGTC
GGCCCCTCCG GCTGTGGCAA GTCGACGACC CTGCGGATGG TCGCCGGGCT GGAGACGGTG
ACCAGCGGCA ACATCCGCCT TGACGGACGG GTCGTCAACG ACGAGAAACC CCAGAATCGC
GACATCGCGA TGGTGTTCCA GTCGTACGCG CTGTACCCCC ACATGACCGT GGCGGAGAAC
ATGGCGTTCG GCCTGGAGGA GTCGACGACG CTCCCGGACG ACGAGATCGA AGAGCGAGTC
CACGACGCCG CAGAGACGAT GGGTATCGCG GAGCTGCTGG ATCGCAAACC CTCGGAGCTC
TCTGGTGGCC AGCAACAGCG CGTCGCGCTC GGCCGAGCGA TCGTCCGAGA TCCGGAGGTG
TTCCTGATGG ACGAGCCGCT CAGCAACCTC GACGCCAAGC TGCGCTCCCA GATGCGGACG
GAGCTCCAGC GCTTGCAGGC CGAACTGGAC GTGACGACGA TGTACGTCAC CCACGACCAG
ACCGAGGCCA TGACGATGGG CGACCGCATC GCCATCCTCA ACGACGGGAA GCTCCAGCAG
GTGGCGACGC CGCTTGAGTG TTACCACGAG CCCGCCAACC AGTTCGTCGC CGGCTTCATC
GGCGATCCGT CGATGAACTT CTTCGACATG GAGCGGGACG GCGACACGCT CGTCGGTTCG
CGGTTCGAGT ATCCCCTCTC TCAGTCGACG CTCGACGACG TGGGCGAGAC CCGAAACGTC
ACGCTCGGCG TCCGCCCCGA GGACGTCGAA GTCGGCACCG ACGAATCGGG CAGCCACACC
TACTCGGCGA TCGTCGAGGT AGTGGAGCCG ATGGGCGACG AGAACACGGT GTATCTCCGG
TTCGAGAGCG CACCGGAGGG CGAGACGTTC ATCGCGACGA TCGACGGCCT CCAGCAGGCC
GCCGTCGGTG ATCGTGTTAC CGTCTCGATT CCCGAAGAGA CGATCCACCT CTTCGACGGA
CGGTCGGGCG AAGCGGTCCA CAACCGCCGA CTCGACATGA GCGGCGAGAT CTCCAGCCCG
CCGACCTGA
 
Protein sequence
MGTLELDRLT KVFHDGEEGE IVAVDNVDIQ MDDGEFIVVV GPSGCGKSTT LRMVAGLETV 
TSGNIRLDGR VVNDEKPQNR DIAMVFQSYA LYPHMTVAEN MAFGLEESTT LPDDEIEERV
HDAAETMGIA ELLDRKPSEL SGGQQQRVAL GRAIVRDPEV FLMDEPLSNL DAKLRSQMRT
ELQRLQAELD VTTMYVTHDQ TEAMTMGDRI AILNDGKLQQ VATPLECYHE PANQFVAGFI
GDPSMNFFDM ERDGDTLVGS RFEYPLSQST LDDVGETRNV TLGVRPEDVE VGTDESGSHT
YSAIVEVVEP MGDENTVYLR FESAPEGETF IATIDGLQQA AVGDRVTVSI PEETIHLFDG
RSGEAVHNRR LDMSGEISSP PT