Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50515 |
Symbol | |
ID | 7199239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 282089 |
End bp | 285425 |
Gene Length | 3337 bp |
Protein Length | 1010 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185410 |
Protein GI | 219130517 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGAAC AGTCTCGGGT ACCGACGAGC TCCGAGAGAC GCAGGAAAAG GGCTTCGTCA AAGAGTCTCC GTTCCAATCG GAGACAGGCT TTCCCGGATC GCGTACGCTC CGAGAGGAGC TGCCGCGATC TCTACAACAG CGACTGCGAC TGCCTTTTCC GCGCTAACGT GAGATCCGTC CGACGTAACG TGCGCTCGCC AAGAGTAATC GCCGGAAAAC AACGACAAAT ACGGAGGACA CTGTTGTTGG TGTATTGTCT CTGGCTCTCT GCCGACTGTC GTGCGTTTGT TGCTGGTCCA CCAGGAGCAA ATCATCCTCA TCCACATCAG CGTTTACATC CGCATCCGTA TTTTCAGCAA ATCCAGTATC TTTGCTCACG TTCAGAGACG ACGAGACGCG ATGCTAAAAG TAACAACCGT CCCTTTACAG ATACGGACTC GTCGGCAAAC CCCGACAATT CGAGAAGACC GCGTCCTCAA TACTCCAACG CTCAACACCG CCCCAAACGG AGGCAAAACA TCAAGTCGCA TCATCGGTAC AACAACCAGG CGACGGTCAT TTTCAATCAA CAGCTGCTCG ATATATGTCG TAAGAAGAAG GAAGCTCCCG CCAGTCGGAT TAAGGTCGCT GTACATGCGC AACAACTGTT GGAGGAACAA ATGAATGCAG GCCTTCACTA CGATGTGGTT AGTTTTAATA TTGTTATGCA AGCCTGGGCC CGACAGCACT CCTGGACGGC CGCGCAAAAC GCCGAAAATC TCTTGTCCAG GCTACTTTGC CATTCTACGC TCCAGGCGGA TTCTTATTCG TACGCCGCCG TGCTACACGC CTACGCCAAG TCCGGTGGTC AAGTTCCTGC CGCCCAACGG GCTCAGGCCT TACTTTCCCA ACTGTTGACC CAGAGCACTG CCGGTGCGGT CACTTTGACC ACGGACATTT GTCACAACGC CGTGATGGAC GCGTGGGCAG TCTCCGGACA TCCCCAAGCC GGCGAAAAGG CCGTTCGTCT GCTACAGCAA CTCGAACAGC ATCCCGTGAT CCAACCCACT CGCATATCCT ACAATGCCTG TATTAAAGCC TACGCCAAGA GCGGTCAAGC ACCACAGGCA CAAGCATTAC TGGAACGGAT GCGGACCCTA TCGGCGCTTC GAGATCCACC GAACCGCTAT ACTCATTTGG CACCCGACAA GGTTTCCTTG ACAACCTGCA TTGATGCCTG GGCTAAAGCT CCACCCAGCG CTACCGCGGC AGCTAGTGCT GAAGCGTTGC TCTCCGATAT GGAAGAATGC TACCAACGCA CTGGCGATAC GTCGATCCGT CCCGACATCG TGTCCTACAC TTCCGTCCTA GCGGCGGCCG CACGCAACGG GATGCCAAGC CACATGGCAC TGGAATTACT CTCACGCATG GAGCGCTGCG CTCGCGAGAG GCCCAACGCG GCCTTTTTAA ACACTTGGAT TCACTTGTTG GCCAAAACAA ACACCGGCGT CGAATCCCTA CAGGCCGCCG AAGACATTTT AGCATATATG AAACGAGCCT ATCGAAACGG GAATGATCTC GTCCAACCTT GTAAGATCAC GTACACGGCG GTCATCGCCG TATTGGCACA AACAGGGACG ATTCCGGCGG CCCAACGTGC GTCCGAATTG CTGGATGAAT TACAAGGCTT ATGGGAGGAG GCGCATTTCG ATAAGGCGTA CTTGCCAACC GCCAAGACTT TTGCAAGCGT ATTGCGCGCA TGGGCAACCA GCAACGCACC AGACTCGTGG ACTAGAGCGA AATCCTTATT GGATCGGATG GATCACTTGT ATGCAGCTAC CAATTCAGAC GAGCTGAAAC CTAGCTCGAT TGTTTTCGCT CAGTTATTTC AGATCTTGTC GCGAAGTCGA GACTCCCAAA CTGCAGCCAG ACAGGCACGG GACCTGTTGC AGCGCATGAG TGAATTGCAA AAATCCGGTA ACCATCAGGA CGTACAACCA GATGCTTCTA CAATGGCGTA TTTTTTGAAC ACATTAACTA AGGCGGGCGT AGACAATGTC GTCGAATTGG CAACCGTAGT GCTCAATGAA GTTGAAGATG GTTACGCGGC AGGGATGGGC CATTTGAAAC CAACGAGTCT ACTTTATTCG GCAGTATTGC AGGCGTACGC GAAGAGCGCT TCGAAGGAGG GTGCCGAGCT AGCCGAGGCG TTGTTGCACC GAACTCGAGA ACTGTACCGG CAGGGAACAT TGTACGCGAA GCCCAACGTG TTGTTCTACA ATGCAGTGAT TGATGCGCAT GCACGATCCA ATGGTGGGAG TGCTGCAGCA GAACGCGCCG AACTTTTGCT TGACGAAATG GAAACGCGGT CCCGAGCAGG AGACTTGTCG CTCCGGCCAA CTACGCGAAG TTTCAATGCT GCTATTTTCG CTTGGAAAAA GAGCAGTGCC GCTGAAGCAC CGCAGCGTGC CGAAGCTTTA CTCAAACGCA TGAATAAGAG ATATGAAGCT GGCGATGAAC GCTGTCGGCC GGATCGTATC ACATTAAATT CGATTATTGG TGTATGGGCC AAAAGCAGAC AAGAAGGAGC GGCAAAACAG GCGGAAGAAT ATTTAGGATT TATGGAGAGT TTGTACGAGG GCGGCGATGA AACGTTTAAA GCGGACTTGT ACAGTTTCAA CTCTTGCATC GATGCGTTTG CCCGACAAGG TGATGTGAAA AGAGCTACTG CGTTGTTTGA TCGTATAAAG AGTAGATATG AAGACGGAGA CACGTCGTTG AAACCGAATA CCATTACCCT TACTTCTCTG AGAAATGCAT GGAGCAACAG TCAAGATAGT GAAGACAAAC AAAGGGAGCT CAATAGGATC GACAAACTAC TACTGGCACA GCAAGGAGAC GGTTGGCGAA GGAGATCACA AGCTGTCAAC AAGGCTTCCG CAGTGGTCGA CTTGGATTCC TCCGCTAGAG CCGTGAGATC CAACGAACTG GGCTTGGAAG CACTGTCGAT GTTCGTATCA CTACGCCGTA AGCACACCGG CAATCCTTCG TAGAAAGACA ATGTCTTTCT GAAATTGAAA TTCAATTCTT CACTGAGAAC GTATAGGGAG TTTCTTCCTC TCCCGCACAA ATTTTGCTCG GTGTTGCGTC GGCCAACTGT CGGGTGGCTA CTAAACTTTC GGAATCAAAA AGAATCGCAT CCTACTCACA TCTTTGATGC CTTGACTGAT GAGTAATCTC ACCAAAACAG CGCACCCTTC CAATACATCT TTTTCAATTA TATCGACGGC GTAGATTTTT TGATTGTCTT ACTAGTGTAA AAGTAAACTT GTTGATTATA AATACTGTCG AAACGAC
|
Protein sequence | MEEQSRVPTS SERRRKRASS KSLRSNRRQA FPDRVRSERS CRDLYNSDCD CLFRANVRSV RRNVRSPRVI AGKQRQIRRT LLLVYCLWLS ADCRAFVAGP PGANHPHPHQ RLHPHPYFQQ IQYLCSRSET TRRDAKSNNR PFTDTDSSAN PDNSRRPRPQ YSNAQHRPKR RQNIKSHHRY NNQATVIFNQ QLLDICRKKK EAPASRIKVA VHAQQLLEEQ MNAGLHYDVV SFNIVMQAWA RQHSWTAAQN AENLLSRLLC HSTLQADSYS YAAVLHAYAK SGGQVPAAQR AQALLSQLLT QSTAGAVTLT TDICHNAVMD AWAVSGHPQA GEKAVRLLQQ LEQHPVIQPT RISYNACIKA YAKSGQAPQA QALLERMRTL SALRDPPNRY THLAPDKVSL TTCIDAWAKA PPSATAAASA EALLSDMEEC YQRTGDTSIR PDIVSYTSVL AAAARNGMPS HMALELLSRM ERCARERPNA AFLNTWIHLL AKTNTGVESL QAAEDILAYM KRAYRNGNDL VQPCKITYTA VIAVLAQTGT IPAAQRASEL LDELQGLWEE AHFDKAYLPT AKTFASVLRA WATSNAPDSW TRAKSLLDRM DHLYAATNSD ELKPSSIVFA QLFQILSRSR DSQTAARQAR DLLQRMSELQ KSGNHQDVQP DASTMAYFLN TLTKAGVDNV VELATVVLNE VEDGYAAGMG HLKPTSLLYS AVLQAYAKSA SKEGAELAEA LLHRTRELYR QGTLYAKPNV LFYNAVIDAH ARSNGGSAAA ERAELLLDEM ETRSRAGDLS LRPTTRSFNA AIFAWKKSSA AEAPQRAEAL LKRMNKRYEA GDERCRPDRI TLNSIIGVWA KSRQEGAAKQ AEEYLGFMES LYEGGDETFK ADLYSFNSCI DAFARQGDVK RATALFDRIK SRYEDGDTSL KPNTITLTSL RNAWSNSQDS EDKQRELNRI DKLLLAQQGD GWRRRSQAVN KASAVVDLDS SARAVRSNEL GLEALSMFVS LRRKHTGNPS
|
| |