Gene Msed_0246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0246 
Symbol 
ID5104112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp205638 
End bp207926 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content47% 
IMG OID640506152 
Productprotein of unknown function DUF699, ATPase putative 
Protein accessionYP_001190347 
Protein GI146303031 
COG category[R] General function prediction only 
COG ID[COG1444] Predicted P-loop ATPase fused to an acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.588146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.529293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGTG AGGAGTTCTT TTCAGCACTG AGGGAGGCGT TGATAGATTC AAGGAAGCGG 
TTCTATAGGA ACTTGGTATA CATAGAGAAG ACAGATTACC TAGAGGACCT GAGGCGCGTT
CTTTCCCTGT TCAACGAAAC TCAAGGGGGT AGTAGAAGGG CCTATGCCTT TCATCCTTGG
GCTACTGGTT CCAAGGAAAG GCTTTCAGCG TTGAAGGACC TCCTGGGGCA GGTAGATGAC
ATAGATTACT CCAGTTCTGA GTACTACCTA GGAAGGACAT ATGACCTTGT AGTCCTTGAC
CTGGTGGATA ACTTCCAGCC AAACTATGTG GGCAGGCTTA CCGATCTTAC CAGCGGAGGC
GGTCTGGTGG TAATGTACAC TGATAATCTA ACGCAAAATA AAATATTTCG AAATTCCATA
GCGAGGAAGG GAATAGTCCA CGATTACTAT GAGCAGAGGT TCAGGAGGAA ACTCAATGAA
CATGAAGGCA TGTTTAGAAT AGATTCTAGT TATGAGGCTA GGCCCTTCAA GGGAGAGGTC
AAGCCCACAA CCGAGAAAAA GCGTCTCAAG AGCATGTATT TCCCTAAGGA ACTTCACGAT
CTCTGCCTCA CCGATGAACA GGACAAGGTT CTAGAGGAGT TCAGGATCCT GTACAGGGGA
GGTAAAAGGA TCCTCGTGAT AACTGCGCCC AGAGGGAGGG GAAAAAGCGC TGTCACTGGG
CTAGGCATAG CGGCCTTGAT AGCTGATTCC AATAGGGAGA GAACTAGGGT AGTGATAACA
GCTCCGTCCC TCGCCTCAGC TTCGCAGATC ATGGAGTTCG CCAAGAGGGG ACTAGATACG
CTTCAGGTCC CCAATGAGGC AGAGATGTCT GATATAGGAA TAGTGAGGGC CATTAGAGGG
GATAATTTCT CAGTTGTTTA TGTCTCACCC GAGACGGCTG TGGGTGAAGA CGGCACTTTC
CTGGTTGTGG ATGAGGCAGC TGCAATTGGG ATAAATCTTC TAGCCCAGTA TGTTAACAGA
TGGAGAAAGG TAGTCTTTGT CTCCACGGTA TATGGCTATG AGGGATCAGG AAAGGCCTTC
CTGAGATATC TTAAGAACAT TCTAGAGGAG AAGAAAGCCT GGACACGTTG GCTAACAATG
AGTAAACCAC TTCGTTACGC AGAAGGCGAC CCAGTGGAGA AATGGCTCTA TGATGCCCTA
CTACTCAACC CTGAGCCCGC TAAACCTAAG AGCTTGGAAT CGGTTGAATT CGTAACGTTA
GACAAGGAGA CACTCTTTCA CGACGACGTA CAATTGTCGC AGGCCTACGG CATACTGGTC
TCAGCACACT ATCGGAACAA CCCCGACGAC CTAATGATAA TGGGGGATGG GCCACATCAC
ATTCTGAAGG CAATTAGGGC TGAGGATGGG TTCATCTCAG TCTCCCAAAT CTCCGAGGAG
GGGAGCCTTT CGGATTCAAT GATAGATCTT GCCCTCAAGG GAGGTACCTT TGATGGTGAT
CTTATTCCAG ATAGACTATT GAAGCATGTT AGAATTAAGG AGTTTGGGAA GCTTTCAGGC
TGGAGAATAG TGAGGATAGC CACGGTCCCT GAGCTTCAGG ATAAGGGTTT TGGAAGCCAG
CTTCTTCAAA TGATACTTGA AGACGCTAAG TTACAGGGGG TGGACTGGGT TGGCTCCTCC
TTCATGGGAG ACCCCAAGGT TTTAAGGTTC TGGATAAGGA ATGGCTTCAT CCCGGTTCAC
GTCTCCCCCA AGAGGAATGA AAAGTTTGGG GACTTTCCCG TGGTCGTGAT CTATCCAATA
TCTAATGTAT CCAAAAGGAT CGTGGGCATA GCCTCCCACG TATTCAAGGA GAAGTTGCTG
AACACCATAC ATGACGTCTA CTTTAACATG ACACCTGACA TGGCCATGCT ACTTCTTCAG
GGCAGTAAAG CTCACTTGGA CGTTAGTGTA AGTAAGGTTT ATCTAGCAAA GCTTGTAGCA
TTTCTACAGG GTACCAGTCC CTACGAGTCC TCTGCTGACG CCATTCACGT TCTCGTAATG
AAGTACTTCT GGGACGGAAA GAGGGACTGG AAGCTAGATG ATAACCTAGA GAAGGTTCTC
CTGGCTAAGG TTCTTCAGGG AATGCCTTGG TCATATCTAA ATGTTGTTAT TGGAAAAGGG
AGAACGAATT CCACGGAGGC AATACATGAG GCTGTGAGCA TCTTAGCAAA AAGATATTAT
AACTTAGATG AGGAAGGAGA AATTGCAGTT TCGTTACAGG ATTTAGGTGA TGAGTTCACA
ACACGATGA
 
Protein sequence
MDSEEFFSAL REALIDSRKR FYRNLVYIEK TDYLEDLRRV LSLFNETQGG SRRAYAFHPW 
ATGSKERLSA LKDLLGQVDD IDYSSSEYYL GRTYDLVVLD LVDNFQPNYV GRLTDLTSGG
GLVVMYTDNL TQNKIFRNSI ARKGIVHDYY EQRFRRKLNE HEGMFRIDSS YEARPFKGEV
KPTTEKKRLK SMYFPKELHD LCLTDEQDKV LEEFRILYRG GKRILVITAP RGRGKSAVTG
LGIAALIADS NRERTRVVIT APSLASASQI MEFAKRGLDT LQVPNEAEMS DIGIVRAIRG
DNFSVVYVSP ETAVGEDGTF LVVDEAAAIG INLLAQYVNR WRKVVFVSTV YGYEGSGKAF
LRYLKNILEE KKAWTRWLTM SKPLRYAEGD PVEKWLYDAL LLNPEPAKPK SLESVEFVTL
DKETLFHDDV QLSQAYGILV SAHYRNNPDD LMIMGDGPHH ILKAIRAEDG FISVSQISEE
GSLSDSMIDL ALKGGTFDGD LIPDRLLKHV RIKEFGKLSG WRIVRIATVP ELQDKGFGSQ
LLQMILEDAK LQGVDWVGSS FMGDPKVLRF WIRNGFIPVH VSPKRNEKFG DFPVVVIYPI
SNVSKRIVGI ASHVFKEKLL NTIHDVYFNM TPDMAMLLLQ GSKAHLDVSV SKVYLAKLVA
FLQGTSPYES SADAIHVLVM KYFWDGKRDW KLDDNLEKVL LAKVLQGMPW SYLNVVIGKG
RTNSTEAIHE AVSILAKRYY NLDEEGEIAV SLQDLGDEFT TR