Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_18622 |
Symbol | MCM2 |
ID | 7203986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 708929 |
End bp | 711850 |
Gene Length | 2922 bp |
Protein Length | 808 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186401 |
Protein GI | 219113635 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCGACGACC CTGACGAGCG TAGTGTGGAC GACGTCGTCG CCGAGAATGA CGACGACGAC GTAGACTACG TGACGGGACA AGCCGCCGCC GAAGAAGAGG AAGAAGACGG GGAAGACTTG TTGGAGCACG CCGAACGTGA TTACCAAGCT ATTCCAGCTT TGGATACGTA CGGAGCCGAA GGTTTGGACG ATCGCGATTA CGAAAATCTT ACTGTCGATG CCCGTCGAGC GGTGGAACAA CAGTTGGCCC GACGGGATCG GGACAAGGCC GGAAGGAACG ATGGCTTCTA CACTCTACTC GACGATATGG CGGACGAAGA CGAAGAAGCG CGACAAGCCC GTCGCGGAGC CTTTGATCGT CGTCAATTGC ACAAGGACGG ACGGGGCGAC GAAGACGACG ACGACGCCGC GGATACTCGG GGACCCGCGG CGGACGACGC CGACGATATT ACGGAAGGTG ATTTGGACCG TGACGATCAA GTCAATCTAG AAGCCTTTGA CGTGCCGCTG CGGGAGTGGA TTGCGCAAGA ACAGACGCGC CGCGAAATAC AACGCAAGTT TCGGGTCTTT CTGCGACACT ACACGGGACC CTCGGCCGTT CCGGAATCCC GGCGCCGCCG CGGGAACGGA CTCTACGAAC AAAAGATCCG TACCATGTGC GCCTCCAACA AATCCACATT ACAGGTCTCC TACATTCATC TCATGGACGC CGAACCCATT CTGGCGTACT GGTTGGCGGA TGCTCCCAAG GACATGCTCC TCGTACTCAA CGAAGCTGCC ACCCGCCACA CTCTCATGCT CTTCCCCTCC TACAACGCCA TCAAGTCCGA AATTCACGTG CGTATTTCGG AGGTGCCCAT TCTGGATAGT CTCCGGGATT TGCGGCGGTC GCATCTAGAT TGTCTCGTCA AGGTGCACGG TGTCGTTACC CGGCGCTCCT CCGTCTATCC CCAACTGCAA ATGGCCTACT ACACCTGCCT TTCCTGCAAG GCCATTCAGG GGCCCTTCCG TACTGAAGGC GTCGGAGCCA ACTTGGCCAA CGTCCATACC CCTAGCGAAT GCGTGCAGTG CGAAGTTTCC GCCTTTCGTC TGCACCCCAC CATGTCCTCC TACCGCAACA TCCAACGTGT CAATCTACAA GAGACACCCG GATCGGTTCC ACCCGGCCGC GTCCCCCGCA CCAAAGAAGT CCTCGTCGCC GATGACCTTA TTGACGTCGC TCGACCCGGG GAAGAAATCG AAGTCACCGG TGTGTACGAA CACACCTTTG ACTCCTCACT GACGCTCAAA TCCGGTTTTC CCGTCTTTTC AACTTTTCTG CACGCCAATC ACGTTCTCAA ACGCGAAGAC GCCTCCAGCG CCTCCAATTT GAGTGAACAA GACATTCGCG ATATTCTGCA GCTCGCCCGG GATCCCAACA TTGGGGCCCG CATCGTTCAG TCCATCGCCC CGAGTATCTA CGGCCACGAC AATTGCAAAA TGGCCCTCGC CATGAGTTTG TTCGGTGGCG TCGCCAAGAA CATCAACGAC AAACATCGTA TTCGTGGCGA CGTGAACGTG CTCTTGTTGG GCGACCCCGG GACGGCCAAG TCGCAGCTCC TCAAGTACGC CGAACAGACC GCACCCCGGG CCGTTTACTC TACCGGGAAG GGTGCGTCGG CCGTGGGATT GACCGCTAGT GTGCATAAGG ATCCGATTAC GAGGGAATGG ACGCTCGAGG GTGGGGCATT GGTGCTCGCC GACAAGGGCG TCTGCCTCAT TGACGAATTC GACAAAATGA ACGAACAGGA TCGCACGTCA ATCCATGAAG CCATGGAACA ACAGAGTATC TCCATTTCTA AAGCCGGCAT CGTCACCAGT TTGCAGGCGC GGTGTTCCGT CATTGCGGCG GCCAACCCGA TCGGTGGTCG TTACGACAGT AGCAATACTT TAGCGGATAA CGTGGAGTTG ACGGACCCGA TTCTGCAGCG ATTCGACTGC CTTTGTGTAT TGCAGGATGT GGTGGATCCG GTCGCCGATG AACGGCTCGC TCAGTTCGTC ACTAGTAGTC ACATGCGGTC CGTACCCACG CGGGAATACG TGCCGAACGA AAGCGACCTA GCCGACAACA ACGCGGAACG CCCCGGTCTC ATTCGGCAAG ATCTGTTGCG CAAGTATATT CAGTACGCCC GCTTCAACGT ACGGCCCATT CTGCGTGGCA ACGCGCTGGA CCAGGAAAAA GTGTCGTCGC TGTACGTGGC GCTGCGTCGA GAGTCCGCCG CATCGGGTGG CGTGCCCATT GCGGTGCGCC ACGTGGAATC CATTATGCGC ATGTCAGAAG CTCACGCCAA AATGCACCTG CGTGACTACG TTCGGGACGA TGATATGGAC GCCAGTATCC GCATGATGCT GGAGAGCTTT ATCATGGCGC AAAAGTTTAG CGTCCAACGT GCGCTCCGAC GGTCGTTCGC CAAGTTTATT ACGTCCGGAG AAGACCGGGC TTACCTGCTC CTGCACATTT TGCAGGACAT GTTCCGCAAG GAACAAATGT ACCAGGTCAT CCGTTTGCGA CAACGCAATC AGACCGAGGA CGATCTTGAA ACGCTAGACG TGCCGCTGGA CGAGCTGGAA GCCAGGGCGC GGGAGCGACG GATCTACGAC GTTTCCGAGT TCTGCCGAAG CGAAGCCTTT ACCGAAGCGG GCTACGTCTT GGACGAACGT CGTCGGGTTG TTTCCCGTAA TTTGGTTGTA TGATTGGAAA GCTACGAGTA GAATGATTAT GTGTGCTCAC GAGCTGATTT AGTGTTCAAC AATGAAAAAA GTAAATCATC GTCCCTTGAA TGCTTTTGCA GCGATCGGGA CGAATGTCAC ACATCAGTCA ATCCGAAAGG GAACATTGCG TTCTATAGGC TGCACCATAT AGTCTGAAAT ATTACCAAGC GC
|
Protein sequence | MADEDEEARQ ARRGAFDRRQ LHKDGRGDED DDDAADTRGP AADDADDITE GDLDRDDQVN LEAFDVPLRE WIAQEQTRRE IQRKFRVFLR HYTGPSAVPE SRRRRGNGLY EQKIRTMCAS NKSTLQVSYI HLMDAEPILA YWLADAPKDM LLVLNEAATR HTLMLFPSYN AIKSEIHVRI SEVPILDSLR DLRRSHLDCL VKVHGVVTRR SSVYPQLQMA YYTCLSCKAI QGPFRTEGVG ANLANVHTPS ECVQCEVSAF RLHPTMSSYR NIQRVNLQET PGSVPPGRVP RTKEVLVADD LIDVARPGEE IEVTGVYEHT FDSSLTLKSG FPVFSTFLHA NHVLKREDAS SASNLSEQDI RDILQLARDP NIGARIVQSI APSIYGHDNC KMALAMSLFG GVAKNINDKH RIRGDVNVLL LGDPGTAKSQ LLKYAEQTAP RAVYSTGKGA SAVGLTASVH KDPITREWTL EGGALVLADK GVCLIDEFDK MNEQDRTSIH EAMEQQSISI SKAGIVTSLQ ARCSVIAAAN PIGGRYDSSN TLADNVELTD PILQRFDCLC VLQDVVDPVA DERLAQFVTS SHMRSVPTRE YVPNESDLAD NNAERPGLIR QDLLRKYIQY ARFNVRPILR GNALDQEKVS SLYVALRRES AASGGVPIAV RHVESIMRMS EAHAKMHLRD YVRDDDMDAS IRMMLESFIM AQKFSVQRAL RRSFAKFITS GEDRAYLLLH ILQDMFRKEQ MYQVIRLRQR NQTEDDLETL DVPLDELEAR ARERRIYDVS EFCRSEAFTE AGYVLDERRR VVSRNLVV
|
| |