Gene Msed_0210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0210 
Symbol 
ID5104076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp171221 
End bp173527 
Gene Length2307 bp 
Protein Length768 aa 
Translation table11 
GC content48% 
IMG OID640506115 
ProductAAA family ATPase, CDC48 subfamily protein 
Protein accessionYP_001190311 
Protein GI146302995 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID[TIGR01242] 26S proteasome subunit P45 family
[TIGR01243] AAA family ATPase, CDC48 subfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.798334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0460927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTGCTG GTTCTAGCCC AGAGCAAAGG TCGCCAAGAC GAGAGTTGTC TCTGAAGGTG 
ATGGAAGCAA GGCAAAAGGA TGTTGGTAGA GGAAAGGTAA GAATAGACGT TGAGATGCTT
GCACAAATCG ATGTTAGTCC AGGTGATGTA GTGGAGATTG AGGGTACTAG AAAGACGGCA
GCAATAGCGT GGCCACTGTC CCCAGATGAC GCCACTAGCG AGAGAGATAT AATCAGAATG
GATGGCATAA CTAGGAAGAA CGCAGGTGTC TCCATTGGAG ACAAGGTGAT AGTAAGGAAG
GCCTCTGTGA AGCAAGCTGC ATCCATCAAA CTTGCCCCTT CCAACTTTTC GATTACCGTA
GATCCAGGTT TCGTGGCATA CGTCAAGAAG AAGCTTAAGG AGTTCCCACT GGTTGAAGGT
GACACTGTTC TTATCCCAGT GCTGGGTCAG GCTATTCCCT TCACGGTAAT ACAGGTCAGA
CCGGCGTCGA TTGTGATGGT TGTCGACGAG ACCAGTATTT CCATCTCTGA CAAGCCCATA
GAGCAGACCA GGTATCCCAG GGTCACGTAC GAGGATATAG GAGGAATGAA AAACGTTATA
CAGAAGATCA GAGAACTTGT GGAGTTACCG TTGAGACATC CAGAACTCTT CAAGAGGTTG
GGAATCGAGC CACCAAAGGG GATCATGTTA TACGGTCCTC CAGGTGTGGG TAAGACCCTG
CTGGCCAAGG CTGTTGCCAA TGAAACTGAA TCCTATTTCA CCTCAATAAA CGGGCCAGAG
ATAATGAGCA AGTTCTACGG CGAGAGCGAG CAGAGGCTCA GGGAGATCTT CGAAGATGCT
AAGAAACATG CCCCAGCCAT AATATTCATT GATGAAGTAG ACGCCATAGC TCCCAAGAGG
GATGAGGTTA TAGGAGAGGT AGAGAGACGT GTTGTCGCGC AGTTGCTGAC ACTCATGGAT
GGTCTAGAGA GTAGGGGAAA TGTAATAGTC ATTGCAGCTA CCAACAGGCC AAATGCAGTA
GATCCTGCAC TGAGAAGGCC TGGGAGATTT GACCGTGAAA TAGAGATACC CCTACCAGAT
AAGCAGGGCA GACTGGAAAT ACTTCAGATC CACACTAGGA ACATGCCACT TTCAAAGGAC
GTGGAACTGG AGAAATTGGC TGATATAAGT CATGGCTATA CTGGGGCCGA CCTTTCTGCC
CTAGTCAGGG AGGCTGCAAT GAACGCCCTG AGAAGATATT TGCCCATGAT AGATATCAGT
CAGGACAAGA TCCCGCCAGA GATCCTAGAG AGAATGGAAG TCAAGATGGA GGACTTCATG
AATGCATTCA AGGAAATTGT GCCCAGCGGC ATGAGAGAGA TCTACATTGA GGTTCCAGAG
GTAAAGTGGG ATGACATAGG CGGTCTCAAC GAGATTAAGG AAGAGCTTAG AGAGGTAGCA
GAGTATCCGC TGAAGTTCCC AGACTACTAT GAAACCGCAG GAGTGGAACC ACCAAAGGGA
ATACTCCTGT TTGGACCTCC AGGCACAGGT AAGACCATGC TGGCAAAGGC TGTAGCAACC
GAGAGCGGAG CGAACTTTAT TGCTGTGAGA GGTCCTGAAG TTCTCTCTAA GTGGGTAGGG
GAGAGTGAGA GGGCAATCAG GGAGATATTT AGAAAGGCGA GAATGTATGC ACCATCGGTG
ATATTCTTCG ACGAAATAGA TGCCATAGCT CCCATGAGGG GAATCTCCTC TGACTCTGGT
GTAACTGAGA GACTAGTTAA CCAACTTCTA GCTGAGATGG ATGGCATAGA GAACCTAGAC
AACGTTGTTA TTGTGGCAGC TACCAATAGG CCAGATATAC TAGATCCAGC CCTACTGAGG
CCTGGAAGGT TCGAGAAACT TATGTACGTA CCCCCACCTG ACAAGAACGC TAGATATGAC
ATACTAAAGG TTCACACCAA GAAGGTTGCT CTCTCGGACG AAGTTAACCT AGAGGAACTA
GCTGAGAGGA CAGAGGGATA TACAGGCGCC GATCTGGCAG CCTTGGTTAG GGAGGCTGCA
ATGAGGGCCA TCAGAGAAGG GATGAGGGAA TGCGTAAATA GGGTCAGTGC AGCTTGTCCA
CCTAATGATA AGGATTGTCG CGACGCGAAA ATGAGAGATT GCATGAAGGG CGCAACAATT
AAGGTGGAGA ACAGGCATTT CAACGAAGCC TTAACTAAGG TTAAGCCATC GCTCAGCCAG
GAGATGATAC AATTCTATCA GACATGGATT GATAAGGCAA GACAGCAACT ACCAAGACAG
ACTGTGAAGC CCAGTACGTT TACGTGA
 
Protein sequence
MSAGSSPEQR SPRRELSLKV MEARQKDVGR GKVRIDVEML AQIDVSPGDV VEIEGTRKTA 
AIAWPLSPDD ATSERDIIRM DGITRKNAGV SIGDKVIVRK ASVKQAASIK LAPSNFSITV
DPGFVAYVKK KLKEFPLVEG DTVLIPVLGQ AIPFTVIQVR PASIVMVVDE TSISISDKPI
EQTRYPRVTY EDIGGMKNVI QKIRELVELP LRHPELFKRL GIEPPKGIML YGPPGVGKTL
LAKAVANETE SYFTSINGPE IMSKFYGESE QRLREIFEDA KKHAPAIIFI DEVDAIAPKR
DEVIGEVERR VVAQLLTLMD GLESRGNVIV IAATNRPNAV DPALRRPGRF DREIEIPLPD
KQGRLEILQI HTRNMPLSKD VELEKLADIS HGYTGADLSA LVREAAMNAL RRYLPMIDIS
QDKIPPEILE RMEVKMEDFM NAFKEIVPSG MREIYIEVPE VKWDDIGGLN EIKEELREVA
EYPLKFPDYY ETAGVEPPKG ILLFGPPGTG KTMLAKAVAT ESGANFIAVR GPEVLSKWVG
ESERAIREIF RKARMYAPSV IFFDEIDAIA PMRGISSDSG VTERLVNQLL AEMDGIENLD
NVVIVAATNR PDILDPALLR PGRFEKLMYV PPPDKNARYD ILKVHTKKVA LSDEVNLEEL
AERTEGYTGA DLAALVREAA MRAIREGMRE CVNRVSAACP PNDKDCRDAK MRDCMKGATI
KVENRHFNEA LTKVKPSLSQ EMIQFYQTWI DKARQQLPRQ TVKPSTFT