Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2259 |
Symbol | |
ID | 8411800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2178828 |
End bp | 2179925 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645020602 |
Product | hypothetical protein |
Protein accession | YP_003178078 |
Protein GI | 257388305 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0091596 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.541812 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACG TGACCGTCGC ACCGCTGCTG GATCGGCTGG ACGACTACCT CGAACGGGAG GGACTCGAAG CGGTCTGGTT CGCGCGGCCG AACTCCTTCG CGTGGCTCAC CGGCGGCGGG AACAACGTCG TCGATCGCGC GACCGACATC GGGGTCGCCG CGGTGGGCTA CGACGGCGAC GAGCGGACCG TCGTGACGAA CAACATCGAG GGCCAGCGGC TGATCGACGA GGAGGTCGGT GACGTGCCCG TCGAGACCGT CCAGTGGTAC GAGGACGGCG TCGCCGAGGC GATCCGCGCA CACAGCCCGA CCCCCGCGGC CGCCGACTTC GAGGTCGAGG GGTTCGACAC CGTCGAAGCG CGCGATCTGC GACAGCCCCT GACGGCCGAC CAGATCGAGC AGTACCGTTC GCTGGCCGTC GACGCGACGA GGGCGTTCGA GAGCGTCCTG CGCGAGGCCA GTAGCGACGC GACCGAACGG GCGGTCACGG CCGACATCCA CCGCGAACTC GAAGCGGAGG GCATCTGGGC TCCCGTCGTG CTCGTCGGTG GGAGCGAGCG CGCACCGGCG TACCGACACT ACACGTCGAC GGACACGGAA CTGGGTGACT ACGCGCTGGC GAGCATCACG GCGATGCGAG ACGGCCTGTT CGTCAGCACC TCCCGGACGA TCGCGTTCGA CGCGCCGGAA TGGCTGGCCG AACGGACCGA AGACGCCGCC CGCGTCGAGA CCACGGCGCT GGCGGCGACC CGGCGCGTCG GCCGCGAGAG TGGGACTGCC GGCGACGTGT TCGCGGCCGT CCAGGACGCC TACGCCCAGC TTGGCTGGGC AGGCGAGTGG GAGAACCACC ACCAGGGCGG TGCGGCCGGC TTCGCCGGTC GCGAGTGGAT CGCGACGCCC ACCCACGACG CACCGGTCCA CCTGCCGATG GCGTACTCGT GGAACCCGAC GGTCCAGGGA GCAAAGAGCG AGGACACGCA TCTGGTGACC GACGAGAGCG TCGAGCCCAT CTCGGTGACC GGTGAGTGGC CCCGGACGAC GGTCTCGGCC GTCGGGACCG ACCTGACGCT CGACCGCCAC GACGTGCTCC ACTTGTAG
|
Protein sequence | MTDVTVAPLL DRLDDYLERE GLEAVWFARP NSFAWLTGGG NNVVDRATDI GVAAVGYDGD ERTVVTNNIE GQRLIDEEVG DVPVETVQWY EDGVAEAIRA HSPTPAAADF EVEGFDTVEA RDLRQPLTAD QIEQYRSLAV DATRAFESVL REASSDATER AVTADIHREL EAEGIWAPVV LVGGSERAPA YRHYTSTDTE LGDYALASIT AMRDGLFVST SRTIAFDAPE WLAERTEDAA RVETTALAAT RRVGRESGTA GDVFAAVQDA YAQLGWAGEW ENHHQGGAAG FAGREWIATP THDAPVHLPM AYSWNPTVQG AKSEDTHLVT DESVEPISVT GEWPRTTVSA VGTDLTLDRH DVLHL
|
| |