Gene Mboo_1458 details


Gene Information

Locus tag: Mboo_1458
Symbol:
ID: 5411403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1495866
End bp: 1497035
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 56%
IMG OID: 640868693
Product: proteasome-activating nucleotidase
Protein accession: YP_001404619
Protein GI: 154151001
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1222] ATP-dependent 26S proteasome regulatory subunit
TIGRFAM ID: [TIGR01242] 26S proteasome subunit P45 family
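
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are inclusive 1-based coordinates and that the CDS ends with a single stop codon (the record does not state either point explicitly). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1495866, 1497035)
print(length_bp)                  # 1170
print(protein_length(length_bp))  # 389
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields.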


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.264354
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGTA TGGAAGAGAC TGCGGTGGAC AATGCCCCGC CCTCAAACGA GCTCAAATAC 
CAGATCCAGA TAAACGAGCT TGAGGCGGCG CTTCTGGAAC AGAAGGTCAG AGCCGAAGAT
CTCCAGAAAG AAAATGCGCA GCTGAAACGC GAGAACAACC AGCTCAAGCG CATGCCGCTC
TTTGTAGCGG TGATCATCGA TATTCTGGAG AACAACGAGA TCTATCTCCG CCAGCAGGGA
AACAACCAGG AGTACCTGAC CCATGCCTCG GATGAGCTCA GGCCCCTGTT AAAGCCCGGC
ACCAAGGTGG CAGTCAACAA CGCACTCTCG ATTGTCAAGG TGATCGGGAA CGTTTACGAC
TCCCGGGTAC GGGTGATGGA GCTCGAGGAG TCCCCATCGA TATCCTATAC GCAGATCGGG
GGACTTACTG ACGAGATAAA AGAAGTACGG GAAGCAGTGG AGTACCCGCT CACCAAGCCC
GAGATCTTCC GGCGGATAGG AGTGGAACCA CCCAAGGGGA TCCTGCTCTA CGGTTCACCC
GGGACCGGAA AGACCCTGAT CGCAAAGGCG GTCGCCCACG AAGCAAAGGC AACATTCATC
CGCATGTCGG GAAGCGAACT GGTCCACAAG TTTATCGGCG AAGGAGCCGG GCTTGTCCGT
GAGCTCTTCA CCCTTGCCCG GGAGCGGGCA CCCGCGATTG TCTTTATCGA CGAAATCGAT
GCTGTGGGTT CCATGCGCAC CAATGATGGC ACAAGCGGCA GCGCCGAGGT GCAGCGCACG
CTCATGCAGC TCCTTGCCGA GATGGACGGA TTTGATAACC GGGGCGAGAT CCGGATCATG
GCTGCCACCA ACCGTGTTGA CATGCTTGAC CCGGCACTCC TGCGGCCCGG CCGGTTCGAC
CGGCTCATCG AGATCCCGCT TCCCGATCGT AATGCACGCC TTGAGATCCT GAAGATCCAC
TCAAAAAAGA TGCACCTTAA CGATGTCAGG CTCGATATGA TCGCCTCTAT GACTGACGGG
GCTACCGGAG CTGAACTCCA GGCAGTCTGC CGCGAAGCCG GCATGATGGC AGTCCGCCGT
GATGCTTCCG CAATAGAGAT GAAGGATTTT ACTGATGCAG TCAGGAAAGT CAGGACCGAG
ACTATCACCG ATACCCGGAT GTATACATAA
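
The GC content reported for this gene can be recomputed from the sequence above. The sketch below is one way to do it in Python, assuming the sequence is pasted as plain text in the 10-base groups shown here. The helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste the full sequence
# to compare against the GC content field of the record.
first_line = "ATGAGTAGTA TGGAAGAGAC TGCGGTGGAC AATGCCCCGC CCTCAAACGA GCTCAAATAC"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_content(first_line):.1f}%")
```

The value for a single line will differ from the whole-gene figure, since GC content varies along the sequence.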
 
Protein sequence
MSSMEETAVD NAPPSNELKY QIQINELEAA LLEQKVRAED LQKENAQLKR ENNQLKRMPL 
FVAVIIDILE NNEIYLRQQG NNQEYLTHAS DELRPLLKPG TKVAVNNALS IVKVIGNVYD
SRVRVMELEE SPSISYTQIG GLTDEIKEVR EAVEYPLTKP EIFRRIGVEP PKGILLYGSP
GTGKTLIAKA VAHEAKATFI RMSGSELVHK FIGEGAGLVR ELFTLARERA PAIVFIDEID
AVGSMRTNDG TSGSAEVQRT LMQLLAEMDG FDNRGEIRIM AATNRVDMLD PALLRPGRFD
RLIEIPLPDR NARLEILKIH SKKMHLNDVR LDMIASMTDG ATGAELQAVC REAGMMAVRR
DASAIEMKDF TDAVRKVRTE TITDTRMYT
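
The protein sequence above corresponds to the gene sequence read codon by codon up to the stop codon. A minimal sketch of that translation in Python follows. It assumes the codon assignments of translation table 11, which match the standard genetic code for every codon (table 11 differs only in which codons may act as start codons, and this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence in this record.
print(translate("ATGAGTAGTA TGGAAGAGAC TGCGGTGGAC"))  # MSSMEETAVD
print(translate("ACTATCACCG ATACCCGGAT GTATACATAA"))  # TITDTRMYT
```

Both fragments are 30 bases long, so each starts in frame, and the outputs match the first and last residues of the listed protein sequence.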