Gene Msed_0701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0701 
Symbol 
ID5105307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp638445 
End bp639677 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content48% 
IMG OID640506605 
ProductAAA ATPase 
Protein accessionYP_001190800 
Protein GI146303484 
COG category[L] Replication, recombination and repair
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1474] Cdc6-related protein, AAA superfamily ATPase 
TIGRFAM ID[TIGR02928] orc1/cdc6 family replication initiation protein 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00101638 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCACAAA GTGAAACTCA ATTTAACTAT GGAAACTTAG GAAACTCTGT GATCAGAGAG 
GCACTAAAGG GGGGTAAGGG TGAGGTAATT AAGAACCCTA AGGTCTTCAT TGACCCCCTG
TCTGTCTTCA CGGATATCCC TTTTAGGGAG GATATAATTA GGGAGACCGC GATAGCGGTC
AGGTACTTCG TGAAGAACGA CGTTAAGTTT TCGAACCTAT TTCTAGGTTT AACTGGGACC
GGGAAAACCT TTGTGGTCAA GTACATCTAT AACGAGATTG AGGAGGTCAA GAAGGAGGAT
CCAGATTACA GTGGGGTAAA ACAGGTTTAC CTGAACTCAA GGGAGGTTGG GGGAACCCCT
CAAGCGGTCC TTTCAGTGAT AGCTGAAAAG CTCACAAATA GCCTCGTTCC AAGACACGGA
GTTAATCTGG GCGAGTATAT TGACAAGATA AAGAGCGTCC TAAGAGGAAA GAAGGCCATC
ATCTACCTCG ACGAGGTTGA TACCCTCGTG AAAAGAAGGG GAGGAGATAT TGTGCTCTAT
CAGCTGTTGA GGGCAGACGC GGACGTGTCA GTGATCATGA TCAGTAACGA TATCAACGTC
AGGGATTACA TGGAGCCCAG GGTTTTATCG TCGCTAGGTC CGTCGGTCAT TTTCAAGCCC
TATGACGCTG TACAGCTAAA GGAGATCCTC GCGAAGTATG CGGAGTATGG CTTGATTGAC
GGAACTTTCA ATGACGAGAT CCTCTCCTAT ATTGCTGCTA TATCGGCGAG GGAACATGGG
GATGCGAGAA AGGCGGTCAA TCTTCTCTTC AGATCGGCCC AATTAGCCTC AGGGATAGGT
TTCATTAAGA AGGAGCATGT GGATAAGGCC ATCGTAGAGT ATGAGCAGGA GAGGTTGTTT
GAGGCCGTCA AGTCGCTCCC CTTCCACTAT AAGCTCGCAT TAAGGGCCAT AGTTACGACG
GAGGATGTGG TCACGGCGCA CAAGGTTTAC TCTAAGTATT GCGACAAGCT CAAGCAGAAG
CCCCTATCCT ACAGGAGGTT CTCGGACATT GTGTCGGAGC TTGATATGTT TGGAATAGTG
AAGATAAAGA TCATGAACAG GGGAAGGGCA GGGGGAATAA GGAAGTACGT GGAGGTTCAT
GACAAGGAAA AGATAATGAA AGCACTTGAC GAGAACCTAG CGGAAGAGAT GGGTTATGAG
TACGACGAAG GGTCCGATGT GGAAACGAGT TAA
 
Protein sequence
MPQSETQFNY GNLGNSVIRE ALKGGKGEVI KNPKVFIDPL SVFTDIPFRE DIIRETAIAV 
RYFVKNDVKF SNLFLGLTGT GKTFVVKYIY NEIEEVKKED PDYSGVKQVY LNSREVGGTP
QAVLSVIAEK LTNSLVPRHG VNLGEYIDKI KSVLRGKKAI IYLDEVDTLV KRRGGDIVLY
QLLRADADVS VIMISNDINV RDYMEPRVLS SLGPSVIFKP YDAVQLKEIL AKYAEYGLID
GTFNDEILSY IAAISAREHG DARKAVNLLF RSAQLASGIG FIKKEHVDKA IVEYEQERLF
EAVKSLPFHY KLALRAIVTT EDVVTAHKVY SKYCDKLKQK PLSYRRFSDI VSELDMFGIV
KIKIMNRGRA GGIRKYVEVH DKEKIMKALD ENLAEEMGYE YDEGSDVETS