Gene Hmuk_0195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0195 
Symbol 
ID8409693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp188333 
End bp190570 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content64% 
IMG OID645018520 
Productamino acid permease-associated region 
Protein accessionYP_003176039 
Protein GI257386266 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGG AAGAACTCGC CAAAGACCTC GGACCGCTGG CCGCCCTGAC GATCGGCGTC 
GGGACGATGA TCGGCGCGGG CATCTTCGTG CTCCCCGGAG AGGCGATTCT CAAGTCCGGG
TCGCTCGCGC CGGTCGCGTT CGTTTTGGGC GGTGTCATCG CGATGTTTAC GGCGCTGTCG
GCGAGTGAAC TCGGCACCGC GATGCCCCGA TCCGGCGGGG CCTACTACTA CGTCAACCAC
GCCCTCGGTC CGCTGTTCGG CTCGGTCGCC GGCTGGGCGA ACTGGCTCGG GCTCGCCTTC
GCCAGCGCGT TCTACATGGT CGGCTTCGGG CGGTACATCG CTCGCATCTT CGGACTCTCG
GGCAGCGTCG GCGTCGGTCC GGTCTCGATC ACCGTCGTCA AGCTGACCGC GCTGGCCGGT
GGTGCGTTCT TCATCCTGAT CAACTACGTC GGTGCCAAGG AGACCGGCAG GCTACAGAAC
GTCATCGTCG TCTTGCTCAT CGGAATCCTC ACCGTGTTCA CGTTTCTGGG AACGCTCCGG
GCCGAGCCGT CGAATCTCCC GGCCGCGACC GACGTGGTCA CCACACTGGA GACGACGGGT
CTCATCTTCG TCTCGTATCT CGGCTTCGTC CAGATCACCA GCGTGGCCGA GGAGATCAAA
GACCCCGGAA AGAACCTTCC CCGGGCAGTC ATCGGCAGCG TCGTCATCGT GACCGTCATC
TATGCACTGG TGTTGGTGAT CATGAGCGCG GCCGTCCCAC AGGGGTTCAT CGCGGACATC
ATCAGCTCCG ACGCCGAGAA TCCCATCGCC GTCGTCGAGG TCGGCAACTA CATTCAGGGG
GCCCTGATGG GCGGGGCACT GCTGTTCGGT GGCCTGCTCG CGACCGCCTC CAGCGCGAAC
GCGTCGATCC TCGCGTCGTC GCGTATCAAC TTCGCCATGG GCCGTGATCG AATCGTCACG
CCGGCACTCA ACGAGATACA CCCACGGTAC GGAACGCCAT ACAGGGCGAT CAGCATCACC
GGGGGACTCA TTCTGCTGTT CATCGTGATC GGCGACATAA CGCTGCTGTC GGGTGCCGCG
TCCGGACTGC ACCTCATCAT CTACGGACTG CTGAACCTCG CGCTGATCGT GATGCGCTAC
GTGAATCCAG AAGAGTACAC CCCGGAGTTC GTGGTGCCGC TGTACCCCCT CTTACCGATC
CTCGGTGTCG TGTTCTCCTT TGCGTTGCTG GTGTTCGTCG CCGAGGACGC GCTGTTGCTC
TCCTTTGGCA TCGCCGCGGC AGCGGTCCTG TGGTACGGGC TCTACGCCCG TTCACGCACG
GAAAAGCAGG GGATACTCTC GAAGCACATC ATTTCGCGCT CCGACGAGAT GCCCGACGCG
GCAGTCAGTG CAGCCGTCGG GGTCCAACCC GACGGTGGCC AGTACCGCGT GATGGTGCCC
CTGGCCAATC CCGAGAACGA GCAAGACCTC ATCACCCTCG CGAGCGCGAT CGCAAAGCAG
CGCGGGGGCA CCGTGGTTGC CACGCACATC GTTACCGTTC CCAGCCAGAC GGCGCTCGCG
GCCGCTGCCG ACCGGTCCGA CGAGATCGAC AAGACATCGG AGCGTCTGCT CGCAAACGCT
CGGGAGGACG CCGAGACGTT CGGCGTCGAC GTCGAGACCA ACACGATCGT CTCGCACAAG
TCCTACGAGG CTATCTTCGA CGCCGCTCGC TCACAGACCG CGGATCTCGT CGTGATGGGA
TGGGGCCCGG ACGCACACGG TTCGCCGGGG CGGGCCGAGT CAGCCATGGA CGAACTCACC
GAGTCGGTCC CCTGTGACTT CCTGGTCTTC CGTGACCGCG GGTTCGATCC GTCGCGCATT
CTGCTCCCGA CAGCTGGCGG TCCGGACTCC GAGCTGTCGG CGACCGTCGC AAAGTTGCTG
CAGGCGGAGT ACGACTCCGA AGTGACGTTG CTCAACGTCG ACGAAAATCG GGAAGCGGGA
GCGCAGTTCC TCGAAGAGTG GGCAGTCGAA CACGGGTTGA CGGACGCCGA ACGCCTCGTC
AAATCCGGCG ACATCGAGAC GGCCATCCGC AACGCTGCCG ACGACGCGAC GCTCCTCCTC
ATCGGTGCGA CCGAGGAAGG CCTACTGCGT CGGCTCGTCT CCAAGTCACT CGTGCTGGAC
GTTGTCGACG ACGTGGAGTG TTCGGTCCTC CTCGCGGAGA CCCACCGGGA CCGGGGGCTG
CTCGAACGGC TGTTCTAA
 
Protein sequence
MSEEELAKDL GPLAALTIGV GTMIGAGIFV LPGEAILKSG SLAPVAFVLG GVIAMFTALS 
ASELGTAMPR SGGAYYYVNH ALGPLFGSVA GWANWLGLAF ASAFYMVGFG RYIARIFGLS
GSVGVGPVSI TVVKLTALAG GAFFILINYV GAKETGRLQN VIVVLLIGIL TVFTFLGTLR
AEPSNLPAAT DVVTTLETTG LIFVSYLGFV QITSVAEEIK DPGKNLPRAV IGSVVIVTVI
YALVLVIMSA AVPQGFIADI ISSDAENPIA VVEVGNYIQG ALMGGALLFG GLLATASSAN
ASILASSRIN FAMGRDRIVT PALNEIHPRY GTPYRAISIT GGLILLFIVI GDITLLSGAA
SGLHLIIYGL LNLALIVMRY VNPEEYTPEF VVPLYPLLPI LGVVFSFALL VFVAEDALLL
SFGIAAAAVL WYGLYARSRT EKQGILSKHI ISRSDEMPDA AVSAAVGVQP DGGQYRVMVP
LANPENEQDL ITLASAIAKQ RGGTVVATHI VTVPSQTALA AAADRSDEID KTSERLLANA
REDAETFGVD VETNTIVSHK SYEAIFDAAR SQTADLVVMG WGPDAHGSPG RAESAMDELT
ESVPCDFLVF RDRGFDPSRI LLPTAGGPDS ELSATVAKLL QAEYDSEVTL LNVDENREAG
AQFLEEWAVE HGLTDAERLV KSGDIETAIR NAADDATLLL IGATEEGLLR RLVSKSLVLD
VVDDVECSVL LAETHRDRGL LERLF