Gene Information |
Locus tag | PHATRDRAFT_8773 |
Symbol | MARK3 |
ID | 7196841 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1876123 |
End bp | 1878387 |
Gene Length | 2265 bp |
Protein Length | 511 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176859 |
Protein GI | 219110215 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.527877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGATCAGG CTCCCGTACA AATAGGCCAG TTTATACTGG GCAAGAATTT AGGAATCGGT GCCTTTGGAA AGGTAAGTTG TACGGAACAC ACGAACGATA TTCCCATGGT GACACTCCAG AAGATCCTTG TCGTCGGACA GCTACATTTT TCAAGATGTC ATGTACTTTG ATTTAATTTA TTGATTTTGA CAATCTAATT GTCGCTACGA CATATTATTT TGGGGATGGT CGAAGAGACC AGTTGCGTTC GTATCGTTTA GGGACCATGG GATTGCCCCA ACGATACTGC CCTCTCACGT GTATACTTCT ATCACCTTTT CTATCAGGTA AAATTGGCAA CACATGCCGT AACCGGACAC AAGGTAGCGG TGAAAATCCT GAACAAGAAC AAAATCAAGC AGCTGGGTAT GGAGGAAAAA GTTCATCGGG AAATCAATAT TCTGCATTTG TGCACGCACC CACACATTAT TCGCCTGTAT GAAGTTATTG ACACCCCAAC TGATATATTT CTTGTGAATG AGTACGTCTC TGGCGGCGAG CTTTTTGACT ACATTGTTTC CAAAGGCCGA TTGTCCGCGG ACGAGGCCCG TAATTTCTTT CATCAAATTA TTTCTGGAGT GGAATATTGT CATTTTCAAA AGATTGTTCA TCGTGATCTC AAGCCGGAGA ACCTTCTCTT GGACGCCAAT CTGAATATCA AGATTGCCGA TTTTGGATTG TCGAATCTCA TGAGAGACGG TGACTTCTTG CGTACGTCAT GTGGATCCCC AAATTACGCC GCACCAGAAG TTATCAGTGG CCATCTGTAC GCCGGACCAG AAGTCGATGT CTGGTCCTGT GGTGTCATTC TTTACGCTCT TTTGTGTGGA TCTCTTCCGT TCGATGATGA ATCGATTCCG AACTTGTTCA AAAAGATCAA AAGTGGAATG TACAGCTTAC CAACACATCT TTCCCAGTTG GCGAAGAACT TGATTCCGCG CATGTTGGAA GTTGATCCAA TGAAAAGAAT TACTATTGCC GAAATTCGTT TGCATCCGTG GTTCCAGCAT AAGCTTCCTC CTTATTTGAG GCACCCTCCA GAGCTGATGG AGAAGCAGGA GCGAATCGTC GATCAAGAAG TTATTGATGA AGTGATGAAG CTACCGTTTC ACAAGGCTTA TGGCAACACG AAAGGTCTTG CGAACGGTAC TCTCAATGTG CCGCAACATC AGTTCCTCAC TAATATTGTC ACAAGGGAAC TAGTTGAGAC AGCAGCGGCC TTGGAAGACA GTCGCGACTC CGACGCGAAA AAATTGCTGA AGGATTTGCG ATGCGCGTAC GAGCTAATCC TCGATCACAA GCACACTCGT CTTCGCGTTA TGGAAGTCGC CCGCGCCATT CAAGAGGCGG CGAGCGCGAC GCCACCAGCG TTCTCCCCTG GAGGATCTCG AGGGACAACT CCCGGTGGTC ACTATGGAAC CGGCGGTAGT CGATACGGTG GTAGTGTCGG CAACAGCTAT GATGGTGGGC GAACATATTC TGCAAGTGTG TCTTCCAATT CCCATTCTCC GACTGCTTCG CAATCATCAC CCGCACAACA ATCAAGACTT GCAGAAGAAG CGACTCGTGC ATTAATGCAA CCTGGTAGTA CGACCAGCAG CCACAGCTCA TCGCATACTC CCCCGGGATC CGTATCGGCA ATGTCTGGTC ATGGAATTGT GCAAATGACT TCGTCAATTC CAGGAAATAC GGGTATGATT GCCCAACATC AACACGGACG GCGAACTCGC CGATGGTATC TTGGTATTCA ATCAAAGAAG 
GATCCTGCAC ACGTCATGAC GGAAGTCTAT AAAGCGCTCA TGTCGCTTGG TTGTGAATGG TTACAGCTAT CATCCTACCG AATCAAGTGC AAATGGCGTC CAAATACTGG AGGGAGCGGT TCGAGCTCTA CAATTCCTTT GGCTGGGGGT GAATCCCCAC AGGCAGCGTG GATTTCGAAT CCTTCTCGAG GATTGAGCGA CGCATCGATG GATGTGGACA TCGATGGCAA GGAGAGACAT ACCAGCCCGA CGTTTGGACT CAATGCAATG CAAGTAATTG CTGGTGAAGA TGGGCATAGT TTGCGTGTAC CGAATCTCTC AACCTCAGAA TATTCAATCA AAATTGGTCT AACTCTCTAC AAGGTGCAAC AAAATATTTA TCTTCTCGAT TTTCAGAAAA TGACCGGCGA CGCCTTTTCT TTCATGACTC TGTGTGCGAA TATCATAACG GAGTTGAAGA GTCTG
Protein sequence | MDQAPVQIGQ FILGKNLGIG AFGKVKLATH AVTGHKVAVK ILNKNKIKQL GMEEKVHREI NILHLCTHPH IIRLYEVIDT PTDIFLVNEY VSGGELFDYI VSKGRLSADE ARNFFHQIIS GVEYCHFQKI VHRDLKPENL LLDANLNIKI ADFGLSNLMR DGDFLRTSCG SPNYAAPEVI SGHLYAGPEV DVWSCGVILY ALLCGSLPFD DESIPNLFKK IKSGMYSLPT HLSQLAKNLI PRMLEVDPMK RITIAEIRLH PWFQHKLPPY LRHPPELMEK QERIVDQEVI DEVMKLPFHK AYGNTKGLAN GTLNDLRCAY ELILDHKHTR LRVMEVARAI QEAASATPPA FSPGGSRGTT PGGHYGTGGS RYGNTGMIAQ HQHGRRTRRW YLGIQSKKDP AHVMTEVYKA LMSLGCEWLQ LSSYRIKCKW RPNTGGSGSS STIPLAGVIA GEDGHSLRVP NLSTSEYSIK IGLTLYKVQQ NIYLLDFQKM TGDAFSFMTL CANIITELKS L
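The gene length in this record is consistent with 1-based, inclusive coordinates (End bp minus Start bp plus one), and the sequences are displayed in whitespace-separated blocks of ten characters. The following is a minimal sketch of how one might check those fields when working with the record. The function names `gene_length`, `join_blocks`, and `gc_percent` are hypothetical helpers for illustration, not part of any database API, and the inclusive-coordinate convention is an assumption inferred from the values above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def join_blocks(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(1876123, 1878387))  # 2265

    # First three display blocks of the gene sequence, joined into one string.
    fragment = join_blocks("ATGGATCAGG CTCCCGTACA AATAGGCCAG")
    print(len(fragment))  # 30
    print(gc_percent(fragment))
```

Applying `join_blocks` to the full Gene sequence field and passing the result to `gc_percent` gives a value that can be compared against the GC content field; the fragment above is only the first 30 bases and is not expected to match the gene-wide figure.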