Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0653 |
Symbol | |
ID | 7401788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 669661 |
End bp | 670977 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643707719 |
Product | peptidase M28 |
Protein accession | YP_002565325 |
Protein GI | 222479088 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.602384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.117412 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT GGATCGGCGA CACCTTCACG AGCGACGCCG GCTGGGATCA CCTCGAATCA CTCGTCGATA TCGATCACCG CATGGCCGGC TCCGACGGCG AGCGCGCCGG GCTCGAACTG ACCCGCGACG CGCTGTCCGA CGCCGGCGCG CGTGACGCGC GGATCGAGGA ATTCGAGATT CAGGGCTGGG AGCGCGGCGA CAGCGAGATC CGACGCGACG AAGAGGTTGT CGCGAGCGGG CAAAACGCCT GCATCGCGCT CCCGCGGAGC CCGAGCGGCG AGGCGACCGG CGAGTTCGTC GATCTGGGGT ACGGCGTCCC CGAGGACTTC GACGACGATC TGACGGGGAA GGTCGTGATG GTCTCGTCGG ACACCCCCGA CTCGGTCGAC CGGTTCATCC ACCGCCGCGA AAAGTACTAC CACGCGGTGG AGGCGGGTGC CGCCGCCTTC GTCTTCGCGA ACCACGTCGA GGGGACGCTG CCGCCGACCG GGAGCGTCGG CACCGCGGAT GCGCCGATCG GCGATATCCC GGCGGTCGGC GTCTCGAAGG AGACCGGCGC GAGCCTCGCG CGCCGACGCG AGGGCGAGGA CCTCACCGTC GCAGTCAACT GCGAGACGCC CGACGCGACG AGCGGGAACG CGGTCGCCGA CCTCGGTCCC GACACCGACG AGTACCTCGT CGTCTCCTGC CACGTCGACG CCCACGACCT CGCGGAGGGG GCGATGGACA ACGGCGCCGG CACCGCGACG ATCGTCGAGG TCGCCAACGC TCTCGCGGCC CGCGAAGAGG AGCTCGACAC GAGAGTGCGG TTCGTCGGCT TCGGTGCCGA AGAGGTCGGG CTGGTCGGCT CTTCCCAATT TGCCGCGGGC GTCGACCCCG ACCACGTCAA GGCCGTCGTC AACGTCGATA GCAACGTGTT CGGTCGTACC CTGAAGCTCG ATCACCACGG CTTCGACCCG CTGGAGGCGG CCGGCGAGCG CGTGAGCGAC CGGTTCGATC ACCCGATCGC GCTCGGCGAG GAGCAGGTCC CCCACAGCGA CCACTGGCCG TTCGTCGAGC GCGGGATCCC CGGCTATATG GTCTCCGGTG AGACGGAGGG GCGCGGCCGG GGCTGGGGAC ACACGGGTGC GGACACGCTC GACAAGCTGG AGTCTCGGAA CCTCCGCGAG CAGGCGATCC TCCTGACGGC GCTCGTCGTC GACCTCGCCG GCGACGACGT GTCGACCGCG CGGAAGCCAA CCGACGAGAT CGCGAGCGCG CTCGAACAAG AGGGGAAGGC GACGGGGATG AAGATAACCG GCGACTGGCC GTTCTAG
|
Protein sequence | MTDWIGDTFT SDAGWDHLES LVDIDHRMAG SDGERAGLEL TRDALSDAGA RDARIEEFEI QGWERGDSEI RRDEEVVASG QNACIALPRS PSGEATGEFV DLGYGVPEDF DDDLTGKVVM VSSDTPDSVD RFIHRREKYY HAVEAGAAAF VFANHVEGTL PPTGSVGTAD APIGDIPAVG VSKETGASLA RRREGEDLTV AVNCETPDAT SGNAVADLGP DTDEYLVVSC HVDAHDLAEG AMDNGAGTAT IVEVANALAA REEELDTRVR FVGFGAEEVG LVGSSQFAAG VDPDHVKAVV NVDSNVFGRT LKLDHHGFDP LEAAGERVSD RFDHPIALGE EQVPHSDHWP FVERGIPGYM VSGETEGRGR GWGHTGADTL DKLESRNLRE QAILLTALVV DLAGDDVSTA RKPTDEIASA LEQEGKATGM KITGDWPF
|
| |