Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_1341 |
Symbol | |
ID | 4795485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 1365868 |
End bp | 1366686 |
Gene Length | 819 bp |
Protein Length | 272 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640100023 |
Product | proteasome subunit alpha |
Protein accession | YP_001030775 |
Protein GI | 124486159 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0638] 20S proteasome, alpha and beta subunits |
TIGRFAM ID | [TIGR03633] proteasome endopeptidase complex, archaeal, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.20981 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00320062 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGCCAC AACAATATCA AATGGGCGGG TACGATCGTG CCATCACGAT GTTCTCCCCC GACGGACGTC TGTATCAGGT TGAATATGCA CGTGAAGCGG TAAAACGGGG AACGACCGCG GTTGGTATCA AATGTAAGAC GGGAGTCGTA CTTCTCGTTG ACAAACGGGT CAACTCCCGT CTTCTGGAAC CATCGTCGAT CGAGAAGATC TTTCGGATCG ACGAACACAT CGGTGTCGCC TCCTCCGGCC TTGTCGGCGA TGCCCGGATC CTTGTCGACC GTGCCCGGAT CGAGGCTCAG ATCAACCGGG TAAGCTACGG CGAACCGGTC GATGTGGAGA CGCTGGCAAA GAAACTCTGC GACCACATGC AGAGCTACAC CCTGTTCGGC GGGGCACGCC CTTACGGTAC GGCGCTTCTG ATCGCCGGTG CCGAGTCGTC ACCGACCGGT ACAAAGTATC ATCTCTTCGA GACCGATCCC TCAGGAACCC TTCTCGAGTA CTCCGCCACC GGTATTGGTA TCGGCAGACC TGCCGTCATC AAACTCTTCG AACAGGAGTA CAAAGAAACC TGCTCGGCAG AAGAAGCCGT CCTGCTTGGC CTTAAAGCCC TGCACACCGC GACCGAAGGC AAGTTCGATA TGAACACCGT CGAGATCGGT ATAGCAGGCG AGTATTCCAA GAAACACTCC TCCAAAAAGG AGAGCGAGAC CGCGAACGGA ACAAAAATAT CGACCATCGC ATTTAGAAAA CTGAATGTGG ATGAAGTAAA GGCAGCTGTT GCCAAATTTT CCAAGACGAC TCCTGCGAAA AAGGAGTGA
|
Protein sequence | MQPQQYQMGG YDRAITMFSP DGRLYQVEYA REAVKRGTTA VGIKCKTGVV LLVDKRVNSR LLEPSSIEKI FRIDEHIGVA SSGLVGDARI LVDRARIEAQ INRVSYGEPV DVETLAKKLC DHMQSYTLFG GARPYGTALL IAGAESSPTG TKYHLFETDP SGTLLEYSAT GIGIGRPAVI KLFEQEYKET CSAEEAVLLG LKALHTATEG KFDMNTVEIG IAGEYSKKHS SKKESETANG TKISTIAFRK LNVDEVKAAV AKFSKTTPAK KE
|
| |