Gene Information |
Locus tag | Hmuk_0195 |
Symbol | |
ID | 8409693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 188333 |
End bp | 190570 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645018520 |
Product | amino acid permease-associated region |
Protein accession | YP_003176039 |
Protein GI | 257386266 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
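The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
START_BP = 188333
END_BP = 190570


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2238 745
```

Under these assumptions the computed values agree with the Gene Length (2238 bp) and Protein Length (745 aa) fields in the record.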
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGG AAGAACTCGC CAAAGACCTC GGACCGCTGG CCGCCCTGAC GATCGGCGTC GGGACGATGA TCGGCGCGGG CATCTTCGTG CTCCCCGGAG AGGCGATTCT CAAGTCCGGG TCGCTCGCGC CGGTCGCGTT CGTTTTGGGC GGTGTCATCG CGATGTTTAC GGCGCTGTCG GCGAGTGAAC TCGGCACCGC GATGCCCCGA TCCGGCGGGG CCTACTACTA CGTCAACCAC GCCCTCGGTC CGCTGTTCGG CTCGGTCGCC GGCTGGGCGA ACTGGCTCGG GCTCGCCTTC GCCAGCGCGT TCTACATGGT CGGCTTCGGG CGGTACATCG CTCGCATCTT CGGACTCTCG GGCAGCGTCG GCGTCGGTCC GGTCTCGATC ACCGTCGTCA AGCTGACCGC GCTGGCCGGT GGTGCGTTCT TCATCCTGAT CAACTACGTC GGTGCCAAGG AGACCGGCAG GCTACAGAAC GTCATCGTCG TCTTGCTCAT CGGAATCCTC ACCGTGTTCA CGTTTCTGGG AACGCTCCGG GCCGAGCCGT CGAATCTCCC GGCCGCGACC GACGTGGTCA CCACACTGGA GACGACGGGT CTCATCTTCG TCTCGTATCT CGGCTTCGTC CAGATCACCA GCGTGGCCGA GGAGATCAAA GACCCCGGAA AGAACCTTCC CCGGGCAGTC ATCGGCAGCG TCGTCATCGT GACCGTCATC TATGCACTGG TGTTGGTGAT CATGAGCGCG GCCGTCCCAC AGGGGTTCAT CGCGGACATC ATCAGCTCCG ACGCCGAGAA TCCCATCGCC GTCGTCGAGG TCGGCAACTA CATTCAGGGG GCCCTGATGG GCGGGGCACT GCTGTTCGGT GGCCTGCTCG CGACCGCCTC CAGCGCGAAC GCGTCGATCC TCGCGTCGTC GCGTATCAAC TTCGCCATGG GCCGTGATCG AATCGTCACG CCGGCACTCA ACGAGATACA CCCACGGTAC GGAACGCCAT ACAGGGCGAT CAGCATCACC GGGGGACTCA TTCTGCTGTT CATCGTGATC GGCGACATAA CGCTGCTGTC GGGTGCCGCG TCCGGACTGC ACCTCATCAT CTACGGACTG CTGAACCTCG CGCTGATCGT GATGCGCTAC GTGAATCCAG AAGAGTACAC CCCGGAGTTC GTGGTGCCGC TGTACCCCCT CTTACCGATC CTCGGTGTCG TGTTCTCCTT TGCGTTGCTG GTGTTCGTCG CCGAGGACGC GCTGTTGCTC TCCTTTGGCA TCGCCGCGGC AGCGGTCCTG TGGTACGGGC TCTACGCCCG TTCACGCACG GAAAAGCAGG GGATACTCTC GAAGCACATC ATTTCGCGCT CCGACGAGAT GCCCGACGCG GCAGTCAGTG CAGCCGTCGG GGTCCAACCC GACGGTGGCC AGTACCGCGT GATGGTGCCC CTGGCCAATC CCGAGAACGA GCAAGACCTC ATCACCCTCG CGAGCGCGAT CGCAAAGCAG CGCGGGGGCA CCGTGGTTGC CACGCACATC GTTACCGTTC CCAGCCAGAC GGCGCTCGCG GCCGCTGCCG ACCGGTCCGA CGAGATCGAC AAGACATCGG AGCGTCTGCT CGCAAACGCT CGGGAGGACG CCGAGACGTT CGGCGTCGAC GTCGAGACCA ACACGATCGT CTCGCACAAG TCCTACGAGG CTATCTTCGA CGCCGCTCGC TCACAGACCG CGGATCTCGT CGTGATGGGA TGGGGCCCGG ACGCACACGG TTCGCCGGGG CGGGCCGAGT CAGCCATGGA CGAACTCACC 
GAGTCGGTCC CCTGTGACTT CCTGGTCTTC CGTGACCGCG GGTTCGATCC GTCGCGCATT CTGCTCCCGA CAGCTGGCGG TCCGGACTCC GAGCTGTCGG CGACCGTCGC AAAGTTGCTG CAGGCGGAGT ACGACTCCGA AGTGACGTTG CTCAACGTCG ACGAAAATCG GGAAGCGGGA GCGCAGTTCC TCGAAGAGTG GGCAGTCGAA CACGGGTTGA CGGACGCCGA ACGCCTCGTC AAATCCGGCG ACATCGAGAC GGCCATCCGC AACGCTGCCG ACGACGCGAC GCTCCTCCTC ATCGGTGCGA CCGAGGAAGG CCTACTGCGT CGGCTCGTCT CCAAGTCACT CGTGCTGGAC GTTGTCGACG ACGTGGAGTG TTCGGTCCTC CTCGCGGAGA CCCACCGGGA CCGGGGGCTG CTCGAACGGC TGTTCTAA
|
Protein sequence | MSEEELAKDL GPLAALTIGV GTMIGAGIFV LPGEAILKSG SLAPVAFVLG GVIAMFTALS ASELGTAMPR SGGAYYYVNH ALGPLFGSVA GWANWLGLAF ASAFYMVGFG RYIARIFGLS GSVGVGPVSI TVVKLTALAG GAFFILINYV GAKETGRLQN VIVVLLIGIL TVFTFLGTLR AEPSNLPAAT DVVTTLETTG LIFVSYLGFV QITSVAEEIK DPGKNLPRAV IGSVVIVTVI YALVLVIMSA AVPQGFIADI ISSDAENPIA VVEVGNYIQG ALMGGALLFG GLLATASSAN ASILASSRIN FAMGRDRIVT PALNEIHPRY GTPYRAISIT GGLILLFIVI GDITLLSGAA SGLHLIIYGL LNLALIVMRY VNPEEYTPEF VVPLYPLLPI LGVVFSFALL VFVAEDALLL SFGIAAAAVL WYGLYARSRT EKQGILSKHI ISRSDEMPDA AVSAAVGVQP DGGQYRVMVP LANPENEQDL ITLASAIAKQ RGGTVVATHI VTVPSQTALA AAADRSDEID KTSERLLANA REDAETFGVD VETNTIVSHK SYEAIFDAAR SQTADLVVMG WGPDAHGSPG RAESAMDELT ESVPCDFLVF RDRGFDPSRI LLPTAGGPDS ELSATVAKLL QAEYDSEVTL LNVDENREAG AQFLEEWAVE HGLTDAERLV KSGDIETAIR NAADDATLLL IGATEEGLLR RLVSKSLVLD VVDDVECSVL LAETHRDRGL LERLF