Gene Msed_0634 details

Gene Information

Locus tag: Msed_0634
Symbol:
ID: 5103794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 579589
End bp: 581388
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 48%
IMG OID: 640506538
Product: ATPase central domain-containing protein
Protein accession: YP_001190733
Protein GI: 146303417
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0464] ATPases of the AAA+ class
TIGRFAM ID:
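
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon that is not translated. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 579589
END_BP = 581388
GENE_LENGTH_BP = 1800
PROTEIN_LENGTH_AA = 599


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final (stop) codon contributes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both checks agree with the record: the coordinate span gives 1800 bp, and 1800 bp corresponds to 600 codons, that is, 599 residues plus the stop codon.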


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.527117
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.730431
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTACA TCGAGACAAT GCTAATTCTT TTTATAGCAC TTGGTATTAT AGTCTACATA 
CTTATGGGTC TATTTAAAAG ATTTACTACC AATTTCACCA CCAGCGACAG GGCTTCCCAG
GTTCAGGTGA AGTCTAAGAA GAAAGAGGAG GAGGAGAGAA AGGCTTCCTG GGACGACATT
GGAGGTTACG AGGATGTAAA GAAGGAGATC AGGGAGTACA TAGAGTTTCC CCTGAAGAAC
AAGGAGATAG CCAAGACCTA TGGGTTAAGG CCTCCAAAGG GAGTATTACT TTTCGGTCCC
CCCGGTTGTG GAAAGACTCT AATGATGAGG GCTTTGGCTG GCGAGGCTAA ACTGAACTTC
ATTTACGTGA ATGTCAGCGA TATCATGAGC AAATGGTACG GGGAAAGCGA GGCCAGGCTC
AAGGAGTTGT TCGCTAACGC CAGGAAGAAT GCCCCATGTA TCCTGTTCTT CGATGAGATA
GATACAATTG GTGTAAGGAG GGAAACCCAC AGTGGTGATT CAGTTACTCC CAGGTTACTT
TCCCTCATGT TGTCGGAGAT TGATGGCCTG CACAGCGATG ATGGAGTAAT AATAGTTGGT
TCCACTAACG TGCCCCAGAC GTTGGATAAG GCATTACTTA GGGCAGGGAG ATTCGATAAG
TTGATCTTCA TAGGTCCTCC GAATAAGCAG GCCAGGCTGG AGATACTGAA GGTTCACTGT
GCAGGCAAGC CCTTGGCCCC AGACGTGGAT CTATCCAAGA TAGCAGAGAT GACGGAGAGG
TACAGTGGAG CGGATCTGGC AAACATATGC CAGGAAGCGG CGAGGAAGGT GGCTGTGGAG
GCGTTGGAGA GCAAGACCGA GAGGAAGATA ACTATGCAAG ACTTTATGGA GATCATTCAG
AGATACAAAC CAAGTATCAC TTTACAGATG CTTGAGGAGT TCGAAAAGTT CAGGCTAGAT
TACGAGAGAA GATCTAGGAA ATCTGAGGAT GCTAAGGAGG GAGAGGACAA GATAACTCTG
GATGACATAG GTGGCTACAC GAAGGTTAAG CAGGAACTAA AGGAGCTCCT AGAGCTTCAG
CTAAAATACG CTAGACTCAT GGAGCAGATG AAGGTTCCAC CAATTAGGGG GCTCCTACTT
TATGGCCCTC CAGGAGTTGG TAAGACCATG ATGGCTAAGG CTCTCGCTAA GACTCTCGAT
GTCAAGTTGA TTTCAGTGAG CGTGGCGGAG ATAATGTACA AGGGATATGA GGGAGCAGTT
GCCACAATAA AGGAAGTGTT TAACAGAGCG AGGGAGAACA GGCCCGCAAT AATTCTCTTG
GACGAGCTAG ATGCTATTGC ATCCAAGAGA ACCCAGAGGG GTAACGGCGA ATCCTCAAAA
ATTGTGAACC AGCTTTTGAC AGAGATGGAC GGAATAAGAA ACCTAAAGGA GGTAGTGGTT
ATTGGTACCA CAAACAGGAT CAAGGTTATA GACCCCGCGC TACTCAGGCC CGGAAGATTT
GACATAGTGA TAAAGATGGG TCTTCCCAAC CTCGAGGAAA GGCTGGACAT TCTGCAAAAG
TACCTTGGGG TGGAGAATTG TCAGGAGGTG GACTGCAGGA AGATTGCAGA GCTCACCGAG
AATTACACAG GGGCCGATCT GGCAGCGGTT GCTAGGGAGG CGAAGATAAG GGTGCTCAAG
GACATTATAA GGGGACAAAC TGACAGGAAG TTAACCAAGG AGGATATGAT GGAGTCACTC
AAGAAGGTAA GGCCTTCCAC GCTGATAAAA TCCGTTGAGA AAGCGAAGAG TCAGGAGTAA
 
Protein sequence
MSYIETMLIL FIALGIIVYI LMGLFKRFTT NFTTSDRASQ VQVKSKKKEE EERKASWDDI 
GGYEDVKKEI REYIEFPLKN KEIAKTYGLR PPKGVLLFGP PGCGKTLMMR ALAGEAKLNF
IYVNVSDIMS KWYGESEARL KELFANARKN APCILFFDEI DTIGVRRETH SGDSVTPRLL
SLMLSEIDGL HSDDGVIIVG STNVPQTLDK ALLRAGRFDK LIFIGPPNKQ ARLEILKVHC
AGKPLAPDVD LSKIAEMTER YSGADLANIC QEAARKVAVE ALESKTERKI TMQDFMEIIQ
RYKPSITLQM LEEFEKFRLD YERRSRKSED AKEGEDKITL DDIGGYTKVK QELKELLELQ
LKYARLMEQM KVPPIRGLLL YGPPGVGKTM MAKALAKTLD VKLISVSVAE IMYKGYEGAV
ATIKEVFNRA RENRPAIILL DELDAIASKR TQRGNGESSK IVNQLLTEMD GIRNLKEVVV
IGTTNRIKVI DPALLRPGRF DIVIKMGLPN LEERLDILQK YLGVENCQEV DCRKIAELTE
NYTGADLAAV AREAKIRVLK DIIRGQTDRK LTKEDMMESL KKVRPSTLIK SVEKAKSQE