Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0001 |
Symbol | |
ID | 5105029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 331 |
End bp | 1521 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640505894 |
Product | ORC complex protein Cdc6/Orc1 |
Protein accession | YP_001190102 |
Protein GI | 146302786 |
COG category | [L] Replication, recombination and repair [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1474] Cdc6-related protein, AAA superfamily ATPase |
TIGRFAM ID | [TIGR02928] orc1/cdc6 family replication initiation protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.265686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACA TTATCGACGA AGTGTTATCC TCAGTTAAGA ACTCAGCCAT CTTCAAGAAC AGGGAATATC TCCTCCCCGA CTACATCCCA GAGGAGTTGC CTCACCGTGA AAATGAGATA AAGAAGCTTG CAAGCATTCT CGTTCAGTTG TACAGGGGGG AGAGACCCAG TAACATCTTC ATTTACGGTC TCACAGGTAC TGGAAAGACC GCAGTAACCA AGTATGTTCT GAGTAATCTG CAAAGGAAGC TCAATAACTT CGAGTACGTG TACATAAACG CCAGACAGAC CGACACCCCC TACCGGATCC TGGCAGATAT AATTGAGATC CTAGGGGATA AGGTTCCCTT CACGGGCCTT TCCACGGCGG AGCTGTACAG GAGGATGGTC AAGGTTTTGG AGAGGTCAGA AAGGGTTATG ATTATCGTGC TGGATGAGAT TGATGCACTG GTCAAGAAGC ACGGTGATGA TATACTCTAC AAGTTAACCA GGGTGAATTA CGACGTTCAT AAGAGTAAGA TCTCCATCGT AGGAATAACC AATGACGTAA AGTTCATAGA TGGGCTCGAT CCCAGGGTTA GGAGTAGCCT TGGAGAGGAG GAGTTGGTGT TTCCCCCATA CAACGCTGAA CAACTGGAGG ATATCCTCAA GAAGAGGGCA GTCCTGGCCT TCAGGGAGGG AGTGGTATCG GAGTCCATCA TCAAGTTATG CGCAGCCATA GCTGCCAGGG ATCACGGAGA TGCCAGGAGG GCCCTAGATT TGCTTAGGGT TGCCGGGGAG ATCACGGAAA GGGAGAGGAA AAACCAGGTA GGCGAGGAAG AAGTTGAGAA GGCCAGGGTA GAGATAGAGA GGGATCGCGT GTATGAGGTA ATCGCGACCT TACCCTTCCA CTCTAAGCTG GTCCTGTTAT CCATCATTAA GGGTCTAACC AAAAATACCA GGCTTACCAC GGGGGAAATT TACGACCTTT ACAGGAACAT TGCCACCTCG ATGGGATCCG AATTTGTGAC CCAGAGGAGG GCAAGTGACA TAATAAACGA ACTGGACATG ATGGGGATAA TCTCAGCTAG GGTGGTGAAC AGGGGAAGAT ATGGTAAGAC AAAGGAAGTT GTTCTGGCAG TCGACTCCGG AATAGTCCTG AAAGCCCTCC TGGAGAGTGA CGAAAGGTTT GCTGATTTCT GGAGTGGATG A
|
Protein sequence | MSDIIDEVLS SVKNSAIFKN REYLLPDYIP EELPHRENEI KKLASILVQL YRGERPSNIF IYGLTGTGKT AVTKYVLSNL QRKLNNFEYV YINARQTDTP YRILADIIEI LGDKVPFTGL STAELYRRMV KVLERSERVM IIVLDEIDAL VKKHGDDILY KLTRVNYDVH KSKISIVGIT NDVKFIDGLD PRVRSSLGEE ELVFPPYNAE QLEDILKKRA VLAFREGVVS ESIIKLCAAI AARDHGDARR ALDLLRVAGE ITERERKNQV GEEEVEKARV EIERDRVYEV IATLPFHSKL VLLSIIKGLT KNTRLTTGEI YDLYRNIATS MGSEFVTQRR ASDIINELDM MGIISARVVN RGRYGKTKEV VLAVDSGIVL KALLESDERF ADFWSG
|
| |