Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51412 |
Symbol | MCM4 |
ID | 7196534 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1459687 |
End bp | 1462139 |
Gene Length | 2453 bp |
Protein Length | 791 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176792 |
Protein GI | 219110080 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACCGC AACGGGACGA TCCTTCGGTG GCTCCACTGG ACGACGATGG TTTGTTGGGA CAGGAAACGG CACACATTCG GGGAACGGAC GTGCACGTTC CTACGGCGGC AGCGACCTTT ACCAGCTTTC TCCGTAGCTT TCAATCGTTA CAGATCTCCC AACGTGTGCA GGAGCGAGCC GCGAGGGTAG AGGCCGACGA CGACGATTCC GTTGGGGACG AAGTGGACGT CGACGGGGAC GATCCGCCAC CGTTGTACCG TACGCGACTC GAATCGCTCC TCACTCGCGG TGTCACCCAG GGATCGACTA CCACCAGTCT CGATATTGAT GCCATGCATC TGTACTACCA CAACGCAGCC TGTCAGCGAC TCTATCATCA GATTGTGGCC TATCCGATGG AGCTGGTCCC CCTCATGGAC TTGTGCGTCC AGCGCGAACT AGAGCACCTC GCCAACACGC TACCCGACGT TGTGGATCCG GACACTCTGC CGCGGGCGCA GGTTCGACCT TTCAACCTCA AGCTCGTCTC CAACCTGCGG TGTCTCGATC CCGTCGCTAT GGATACGCTC CTTTCCGTCA AGGGCATGAT TGTGCGCTCG TCGCCCATCA TTCCGGATCT CAAAATTGCC CACTTTGGTT GTTGCGTCTG CGGACATGTT GTACAGGTAG CCATTGATCG CGGCAAGATT GCCGAACCTA CCGCACGCTG TCCCCAGTGC AACACCGCCG CATCCTACCA GCTGGTACAC AATCGCTGCG TCTTTGCCGA CAAGCAGCTC GTGCGGTTGC AGGAAACGCC CGACGAGGTG CCGGCCGGAC AAACACCCGC ATCCGTTACC TGCTTCAGTT TCGATGATCT CGTCGATGCC GTGCAACCCG GTGATAAGGT CGAAGTCACC GGTGTGCTCC GCGCCCAACC ACTCCGCGTA CACCCCAAAA TATCCAAACT CAAGACCGTT TACAAAACGT ACCTGGACGT CATTCATTTC CGGACCATTG CTGGTATAGA TAACGACCAG ACCAAGCACG ATTTGGCCGC CACGGCGGGA CACCAGTCCA ACCGTTCTCG GTGGTCCGAC GATCGTGTCC GACAGCTACG AACGCTCGCC CGGGACCCAC TAATATACGA AAAACTGACC GCCTCCCTCG CACCGTCCAT CTGGGAACTC GACAACGTCA AGAAGGGCAT TCTCTGTATG CTCTTTGGGG GCAACCACGT ACGGGTCAAA CAACAACAGC AACAGACCAC GGAAGAACAA AATGCCCCCG ACGCTTGGTT GGACGAGGAA GAGGAGGGTA CCGGCGCTAC CTCCAAACTC AACAAACGTG GTGACGTGAA CATTCTGTTG TGCGGCGATC CCGGAACCTC CAAGTCCCAA TTACTCTCCT ACGTACATAA ATTGACCACC CGGGGGGTCT ACACCTCTGG TAAGGGCTCT TCCGCGGTTG GTTTGACGGC GTCGGTCGTG CGCGATCCGG AAACCCGGGA TTTGGTGCTC GAAAGCGGAG CACTCGTGCT TAGCGATCAG GGTATTTGTT GCATTGACGA ATTTGACAAA ATGACCGACA CGACACGCAG TGTACTACAC GAAGCCATGG AACAACAGAC GGTGTCGATC GCCAAGGCGG GAATCCTGGC CACGTTACAC GCCCGGACCA GTGTACTGGC GTCCGCGAAT CCCACGGAAT CGCGGTACAA TCCCAACCGC TCGGTGGTGG ACAACATTCA GTTACCACCC ACGCTTTTGT CACGGTTTGA TTTGATCTAT TTGATTCTGG ATAGTCCAAA CATGGAACAG GACCGTCGAC TGGCCCAGCA CTTGGTCGGA CTGTACTACG AGACTCCCAA CGTTGTCCAA CCTCCGTTGG ACCAGGCTTT ATTACGTGAC TACATTGCGT ACGCTCGCGA CAACATCCAC CCCGAAATTT CCGACGAGGC CGCCGACGAG TTGGTATCGA GTTATTTGAC CATGAGGAAT CCGCCCGGTG GTGGGGCCGC GGCAGCAGGT ACCCGCACCA TTTCCGCCAC ACCCCGGCAA CTCGAGTCGC TGATTCGGCT TTCCGAAGGC CTTGCCAAAA TGCGCTACAG TAGTATCGTG TCCCGGGCCG ATACGTTGGA AGCGGTACGA CTCATGAAGG TCGCGACACA AGCGGCGGCC ACGGATCCGC GAACGGGACG TATCGACATG GACATGATTA CGACGGGAAA GTCGTCGGCC GATCGACAAC TCGAAGAACA AATTACCTTG ACGCTACAAG AACTGTTCGC GGAACGTCGG GGGACCCGCA TGGCGGTTCG TGATGTCACC AAGCAGCTCG CGGAAATCAC CAACGCAACG ATTCCGCATG ACGAAGTAGT CAAAGCCCTG CGTCAAATGG AAGCGGACGG CGTGATCCAG TACCAAGAGA GGGCGCAAAC TATTTTTGTT CGTACCGGAA TTGTGCGATC GTG
|
Protein sequence | MPPQRDDPSV APLDDDGLLG QETAHIRGTD VHVPTAAATF TSFLRSFQSL QISQRVQERA ARVEADDDDS VGDEVDVDGD DPPPLYRTRL ESLLTRGVTQ GSTTTSLDID AMHLYYHNAA CQRLYHQIVA YPMELVPLMD LCVQRELEHL ANTLPDVVDP DTLPRAQVRP FNLKLVSNLR CLDPVAMDTL LSVKGMIVRS SPIIPDLKIA HFGCCVCGHV VQVAIDRGKI AEPTARCPQC NTAASYQLVH NRCVFADKQL VRLQETPDEV PAGQTPASVT CFSFDDLVDA VQPGDKVEVT GVLRAQPLRV HPKISKLKTV YKTYLDVIHF RTIAGIDNDQ TKHDLAATAG HQSNRSRWSD DRVRQLRTLA RDPLIYEKLT ASLAPSIWEL DNVKKGILCM LFGGNHGTGA TSKLNKRGDV NILLCGDPGT SKSQLLSYVH KLTTRGVYTS GKGSSAVGLT ASVVRDPETR DLVLESGALV LSDQGICCID EFDKMTDTTR SVLHEAMEQQ TVSIAKAGIL ATLHARTSVL ASANPTESRY NPNRSVVDNI QLPPTLLSRF DLIYLILDSP NMEQDRRLAQ HLVGLYYETP NVVQPPLDQA LLRDYIAYAR DNIHPEISDE AADELVSSYL TMRNPPGGGA AAAGTRTISA TPRQLESLIR LSEGLAKMRY SSIVSRADTL EAVRLMKVAT QAAATDPRTG RIDMDMITTG KSSADRQLEE QITLTLQELF AERRGTRMAV RDVTKQLAEI TNATIPHDEV VKALRQMEAD GVIQYQERAQ TIFVRTGIVR S
|
| |