Gene Msed_0001 details

Gene Information

Locus tag: Msed_0001
Symbol:
ID: 5105029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 331
End bp: 1521
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 49%
IMG OID: 640505894
Product: ORC complex protein Cdc6/Orc1
Protein accession: YP_001190102
Protein GI: 146302786
COG category: [L] Replication, recombination and repair; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1474] Cdc6-related protein, AAA superfamily ATPase
TIGRFAM ID: [TIGR02928] orc1/cdc6 family replication initiation protein
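
As a quick consistency check on the coordinates and lengths listed in this record, the minimal Python sketch below recomputes the gene length from the start and end positions and the protein length from the gene length. It assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(331, 1521)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both values agree with the Gene Length (1191 bp) and Protein Length (396 aa) fields.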


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.265686
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACA TTATCGACGA AGTGTTATCC TCAGTTAAGA ACTCAGCCAT CTTCAAGAAC 
AGGGAATATC TCCTCCCCGA CTACATCCCA GAGGAGTTGC CTCACCGTGA AAATGAGATA
AAGAAGCTTG CAAGCATTCT CGTTCAGTTG TACAGGGGGG AGAGACCCAG TAACATCTTC
ATTTACGGTC TCACAGGTAC TGGAAAGACC GCAGTAACCA AGTATGTTCT GAGTAATCTG
CAAAGGAAGC TCAATAACTT CGAGTACGTG TACATAAACG CCAGACAGAC CGACACCCCC
TACCGGATCC TGGCAGATAT AATTGAGATC CTAGGGGATA AGGTTCCCTT CACGGGCCTT
TCCACGGCGG AGCTGTACAG GAGGATGGTC AAGGTTTTGG AGAGGTCAGA AAGGGTTATG
ATTATCGTGC TGGATGAGAT TGATGCACTG GTCAAGAAGC ACGGTGATGA TATACTCTAC
AAGTTAACCA GGGTGAATTA CGACGTTCAT AAGAGTAAGA TCTCCATCGT AGGAATAACC
AATGACGTAA AGTTCATAGA TGGGCTCGAT CCCAGGGTTA GGAGTAGCCT TGGAGAGGAG
GAGTTGGTGT TTCCCCCATA CAACGCTGAA CAACTGGAGG ATATCCTCAA GAAGAGGGCA
GTCCTGGCCT TCAGGGAGGG AGTGGTATCG GAGTCCATCA TCAAGTTATG CGCAGCCATA
GCTGCCAGGG ATCACGGAGA TGCCAGGAGG GCCCTAGATT TGCTTAGGGT TGCCGGGGAG
ATCACGGAAA GGGAGAGGAA AAACCAGGTA GGCGAGGAAG AAGTTGAGAA GGCCAGGGTA
GAGATAGAGA GGGATCGCGT GTATGAGGTA ATCGCGACCT TACCCTTCCA CTCTAAGCTG
GTCCTGTTAT CCATCATTAA GGGTCTAACC AAAAATACCA GGCTTACCAC GGGGGAAATT
TACGACCTTT ACAGGAACAT TGCCACCTCG ATGGGATCCG AATTTGTGAC CCAGAGGAGG
GCAAGTGACA TAATAAACGA ACTGGACATG ATGGGGATAA TCTCAGCTAG GGTGGTGAAC
AGGGGAAGAT ATGGTAAGAC AAAGGAAGTT GTTCTGGCAG TCGACTCCGG AATAGTCCTG
AAAGCCCTCC TGGAGAGTGA CGAAAGGTTT GCTGATTTCT GGAGTGGATG A
 
Protein sequence
MSDIIDEVLS SVKNSAIFKN REYLLPDYIP EELPHRENEI KKLASILVQL YRGERPSNIF 
IYGLTGTGKT AVTKYVLSNL QRKLNNFEYV YINARQTDTP YRILADIIEI LGDKVPFTGL
STAELYRRMV KVLERSERVM IIVLDEIDAL VKKHGDDILY KLTRVNYDVH KSKISIVGIT
NDVKFIDGLD PRVRSSLGEE ELVFPPYNAE QLEDILKKRA VLAFREGVVS ESIIKLCAAI
AARDHGDARR ALDLLRVAGE ITERERKNQV GEEEVEKARV EIERDRVYEV IATLPFHSKL
VLLSIIKGLT KNTRLTTGEI YDLYRNIATS MGSEFVTQRR ASDIINELDM MGIISARVVN
RGRYGKTKEV VLAVDSGIVL KALLESDERF ADFWSG
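
The gene and protein sequences above can be cross-checked with a short script. The sketch below removes the 10-character grouping, computes GC content, and translates codons using the amino acid assignments of NCBI translation table 11, which match the standard code. The function names are illustrative, and only the first 30 bases of the gene sequence are used here for brevity.

```python
from itertools import product

# Codon order follows the usual NCBI layout: bases T, C, A, G,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_groups = "ATGAGTGACA TTATCGACGA AGTGTTATCC"
print(translate(first_groups))  # MSDIIDEVLS
```

Passing the full gene sequence to `translate` and `gc_content` would allow comparison with the listed protein sequence and the GC content field.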