Gene Information |
Locus tag | PHATRDRAFT_50255 |
Symbol | |
ID | 7199025 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 114396 |
End bp | 117238 |
Gene Length | 2843 bp |
Protein Length | 834 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185128 |
Protein GI | 219129927 |
COG category | |
COG ID | |
TIGRFAM ID | |
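
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the reported gene length of 2843 bp agrees with that convention). The function names `gene_length` and `min_coding_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein: 3 per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values taken from the record above.
length = gene_length(114396, 117238)   # 2843, matches "Gene Length"
coding = min_coding_length(834)        # 2505 bases for 834 aa plus a stop codon
non_coding = length - coding           # 338 bases left over
```

The 338 bases that are not needed for the coding sequence are consistent with the gene being marked as spliced. How they divide between introns and untranslated regions is not stated in the record.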
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACAAAAGGA AAGCTTTCAG GGCATACAAA TAGATCCCAC CAACTTGCAC TGTCTTGTTA CTGAATATGG AACGCACTGA AGACGCTGGC GATCGGGCCG AGTCTCCTGG ACAGCCTAGC GAAGGTCCTG CGAATGCCCC AACCGAGGGC GTTGACGAGA TCAACGACTT TGAGCAGGAC AATTTGACCG CGCATAGTTT GGAAGAATCG CACTGGACCG GCACCGAAAC TGTAGTTTCC CAGGACGACG TACCGGACGC ACTTTTGCTC AACTTGTTTT GTGCCCGCGC CAAGGCTCCG AAAGAAACGG AAGACAGCCT GATCGAAGCG GAAGAATCTT GGCAGCCGGT ACGCGAATGG CTCGGCTCGC ACGACGCCGA GCAAGTGCGG GCGGCGGCGG AACAACGCGG TGAAAATGGA CTCACAGCAA TTCACTTCGC CTGCCGAAAC GTGCCGCCTT TGGATGTGAT CGACGTCTTC TTGTCCATTG CGGCCGACAC GGTACAGTGG CCCGACGGCT TTGGTTGGTT GCCCATTCAC TACGCGTGCG CCAGTGGATC GGAAGAAGCC GTAATCAAAG CCCTGGCGGA ACATTTCCCG GAAAGTAAGA CGGCAACCGA TGGCAGAGGT CGGACGCCCT TGCATTTTGC CTTGGGAGAT AAGCCGGCGA GTCCGGATGT CATCTTTTTG TTGAGCTCTT CCGGCGCGGC GTCGTTTCCA GACGATATTG GCATGTTGGT GCGTCCTGGA TTAATGTGAA CCGTTTGTGT TGCTCGTTGA ACTCTTCTCA CCGTGACTGT ATCATCATTG CGTTACTTCT TCAGCCGTTA CACTATGCCT GCGCTTTCGG CGCGTCCGAA GAAGTCCTGT ACGTCCTTAC GGATGCCTAT CCCGAGGCCA TCACGTCACG TGATAAGCGC CAGCGAACAC CGTTGCATTT TGCTCTGTCC AACGCCGGTC GCAAAACTGT ACCTGCAGCT GTTCGTCTCC TTTTGTCTCT GGATCGACGA ATTGTCAATT CTGTCGAGGG TGGACCGTTG CCGCTCAGCG TCTTGGCAGC GTACGCAGCC ATAGTCCGGA ACGAAGACGA AAATCGGGAC AAGCGAGAAT CGGTCCAACG CTGTTTGGAA CATCTACTCC ACGCTGAACC GGAACCCACC GCTGATTTCT TTACGGCGCT ACAGTCTTTA CCTGATTGGC TGTCAGAACG GGCTGTGGTC ATGCCTGTGG TGCAGATTTT ATTAAACGAA AAGATTTCGC AACGGTTTCC CACCGCAGTC CTTATGATGG ATTTTTACGT ATGTACGTGC ACAACTGTAA TGCTCACTCG GTACCAGAAT ATCGGGTAGC AATGACGCTT ATGTTGAGAT TCCTTACACA TCTCAATTGT TCGTTCTTTT GTTCCGTTTG ACGATTGCAG TGACTATGGT TATCATTTCC TATTCACATA ACGTGGTGAA TTCCATTCAA CGTCGGTTTG ATGATGACGA CACAAACGAT ACAATTGCAA CCAAATCGTT GATTCCTCTC TACTTGGGCG TCGCATACTT TGCACTGCGT GAAGTCGTCC AAATCATGTC GCTGTTGAGC CTCAAGGTCT TCAAGCTATG GCTTTACGAT CCAAGTAATT ATCTCAACGT GGCGTTCATC GCTTTGGTTT TGAGTTGGAC GGTTGTCATG GATAGCGGAG CTGGCGATCG CGACACTTTT CGAGTCGGCG CTGCCGTGTC GGTCACCATT TTATGGGTCA AGCTCCTCGC ATATCTACGC AATATGCTGA TTGACTTCGC CGTCTTTGTT 
GGTGGGGTGT TTTACGTTGT ACGACGTCTA GCAGCCTTTC TCTTGGCTTT AGGACTTATT CTGGTTGCGT TCGCGCAAAT GTTCTACACG GTTTTTCAAC AGACAGACTA CTGTCGAAAT CAACCGCAAA ATGATTTGGA CTACGACCTG ATACTGGCAG AAACTCGGTG CGATGCCAGC ACGCTACGTC CGTACTGTGG CTTTTGGACT TCTTTTCTGA GCGTATACAC CATGTTGCTA GGAGAAGTAG ACGAGGACGA CTTCGAATCG TCTGGGGTGG CCATGGCACT ATTCGTCATC TTTATGTTCT TGGTCGTAAT TTTGCTTGCA AACGTGTTAA TTGCTATCGT TACGGACAGT TACAAAGTGA TTCAATATCA GCGTGCTGCC ATAGTGTTTT GGACAAATCG CTTGGATTTC GTCGCAGAGA TGGATGCCAT AGCAAATGGA CCCTGGAAGA GCCGCGTCAA GAAATCCTTG GGCATGTTGG ATCAAGATGA AGATGGAGCG CAACAACGGG ATGTCTTTGG AAAGGATCTA TGGAAACAGA TTATGGATTT GTTTGAAGAC GACTCGTTTG ACGGGATGTC AACGGTGGAC TTTATTGCTT TCGCGCTCCT TCGGGTGGTT GCTTGCGTCT TTATCATTCC TATGTGGCTG CTACTGGGCA TTGTTACTGT TGGGTGGTTT TGGCCTCCAC AAGTTCGGGA AGCGGTCTTC ACAAGTAAGG TGTCAAAGCA CTCCTCTGAC ACTGCAAAAG AGGACGAATT ACGTCGAACA CAAGTCAAGC AGCTTCAACA AGACGTGATT GAGATGCGGG ACGACCTCTT ACAAGAACTT GCTTTGGATC GGACCCAAAT TGTCCAGATG AAATCGCAAG TAGCTGAGCG AAAACTGGAG ATAGCCAACG AAATGAAGCA AGTGAAGAGG CTCGTCACTA TGTTGTTTGA GAGGCAAATT GCCTTCTCAG CATAAAATAA ATTGCCTTTT TGTTTTAAGT CGTAACAATG AACAAGTAAA AATAAATTTT CTATACCATT TGC
|
Protein sequence | MERTEDAGDR AESPGQPSEG PANAPTEGVD EINDFEQDNL TAHSLEESHW TGTETVVSQD DVPDALLLNL FCARAKAPKE TEDSLIEAEE SWQPVREWLG SHDAEQVRAA AEQRGENGLT AIHFACRNVP PLDVIDVFLS IAADTVQWPD GFGWLPIHYA CASGSEEAVI KALAEHFPES KTATDGRGRT PLHFALGDKP ASPDVIFLLS SSGAASFPDD IGMLPLHYAC AFGASEEVLY VLTDAYPEAI TSRDKRQRTP LHFALSNAGR KTVPAAVRLL LSLDRRIVNS VEGGPLPLSV LAAYAAIVRN EDENRDKRES VQRCLEHLLH AEPEPTADFF TALQSLPDWL SERAVVMPVV QILLNEKISQ RFPTAVLMMD FYVLTMVIIS YSHNVVNSIQ RRFDDDDTND TIATKSLIPL YLGVAYFALR EVVQIMSLLS LKVFKLWLYD PSNYLNVAFI ALVLSWTVVM DSGAGDRDTF RVGAAVSVTI LWVKLLAYLR NMLIDFAVFV GGVFYVVRRL AAFLLALGLI LVAFAQMFYT VFQQTDYCRN QPQNDLDYDL ILAETRCDAS TLRPYCGFWT SFLSVYTMLL GEVDEDDFES SGVAMALFVI FMFLVVILLA NVLIAIVTDS YKVIQYQRAA IVFWTNRLDF VAEMDAIANG PWKSRVKKSL GMLDQDEDGA QQRDVFGKDL WKQIMDLFED DSFDGMSTVD FIAFALLRVV ACVFIIPMWL LLGIVTVGWF WPPQVREAVF TSKVSKHSSD TAKEDELRRT QVKQLQQDVI EMRDDLLQEL ALDRTQIVQM KSQVAERKLE IANEMKQVKR LVTMLFERQI AFSA
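
The GC content field reported in the gene information can, in principle, be recomputed from the gene sequence. Below is a minimal sketch, assuming the sequence is given in space-separated blocks of ten bases as it is displayed here. The helper name `gc_percent` is hypothetical. Whether the database computed its 49% figure over exactly this span is an assumption that the record does not confirm.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the block separators used in the display) is ignored,
    and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence shown above: 7 of 20 bases are G or C.
print(gc_percent("AACAAAAGGA AAGCTTTCAG"))  # 35.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the rounded figure in the record.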