Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49566 |
Symbol | |
ID | 7198189 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 64902 |
End bp | 66452 |
Gene Length | 1551 bp |
Protein Length | 424 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184296 |
Protein GI | 219128179 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.088371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTATTGTCA TTCCTCCTAC TCACGCGCAA ACTCCTTCGC ACATTCCTCT TACATTAATC TGCCCATTCA AATTAGCAGA GACTGAGATT GGCGTCATGC ATTATAGCAG CGATGGACAT CCAAAGACGA ACTTTATTGA CGATGTTGAG CAGGGAGTGC AAACGGATTC GGTCGTCAGC ATTGCAAAAT CAGATTGCTG TTCTTCAATT CGAAGCAAGA AAAAATATTG TATGGACGGT AGTATTGATG CTGCACCAAA TACGGTTTCG TCCCTGTTCC GTACGTCAAC TGCTCGCATA ACTCTGGCAT CCACAGGTGA ATTAAGCTGT ATGGGAGATA GCGAAATGTT CCAAACAATC AGCTCCGGTC AAGTCGCCAA ACCGCTGCAT AATAGATTCG GTCGCGCTTT TGTGTCACAT GAGTACGAGG ACAACTACCG CGAAGCCATC AATCATCGAA ACGATGAGTC TTCAACCCAC GGTACGCCGA AGAAAATTTA TTTTCGTGGT GGAACAGCCA TGCATTTTCC TGAACGTCTC TTTGAGATGT TGCAGCAGGT CGAAGAGCTC GGAATCTCCC ATATTGTCTC TTGGCAGCCT CATGGACGCT CTTTCCTTGT ACATCGTCCT CGAGAATTTG TATCGCAAGT TATGCCAAAG TATGTATTGC CGACATGGCA ACATTGCCGA CGTAAACTTA CCTTTACAAC CAAACGTCTC ACCCCTGAAT TGTTCACACC AATTTAATGA ATTCAGATTT TATCGGCAGA CTAGATTCAC GTCCTTTCAG CGCCAACTCA ATCTATACGG TTTTACTCGT TTGAGCACAG GGCGAGACTG CGGTAGTTAC TACAACGCAA ACTTCCTCAG GGGTTGTCCT CTACTTTGCC GTCGTATTGT CCGTCGACGC ATCAAGGGCA ATGGTGTCAA GCCAGTCCCT TCGCCAACCA CAGAACCTGA CTTTTACAAC ATGGAATGGT GCGAGGACTC CGGTCCACGG CCAACCTTTC ACGAGAAGCC ATCTTTCGGA ATCTGTGGTG GTACTGCTCC TCAAACCTCG TGCTTCCAAC AAATTTTAAA TTCAAGCGCT GCTTCTTACG ATCCTTGGAA CATAACAAGC CCATATCATG AGCAGCCAGG GTACACCACG CAGGTAGCCT CGCCTGAAAT CGCAATGAGC CACCTTCAGA TTCCTGAAAG TCTTCTTTAT TCGCAGCAGA TGGCTCAATG CTGTCGACGC AGCAATCTCC CTACAGCCTC TAGCTCTAGC ATCTACCCTT GGACTTTAGG CAGGTCTACC ACCACCGAAA ATGGTTCGAA CGAGGATATG GTAGAAGGAT TACGCCAATA CCTTCCGGAC CATTTTGTGG AAAATGACCA AGCGTTGATA CTCCTTAGTA GCATTTGTGA TACAGAGGAA GATTCATTGT ATGCCCCAGT TGATGGTGAT GTTTTCCGCT TTCTCTAGTG GCTTTTCCAG TAAAACGTCG GATGATGAAA CCTTGATTTT ATCTTACTGT TAAAAAGAGC TAATGTAACG AGTTGCACTG G
|
Protein sequence | MHYSSDGHPK TNFIDDVEQG VQTDSVVSIA KSDCCSSIRS KKKYCMDGSI DAAPNTVSSL FRTSTARITL ASTGELSCMG DSEMFQTISS GQVAKPLHNR FGRAFVSHEY EDNYREAINH RNDESSTHGT PKKIYFRGGT AMHFPERLFE MLQQVEELGI SHIVSWQPHG RSFLVHRPRE FVSQVMPKFY RQTRFTSFQR QLNLYGFTRL STGRDCGSYY NANFLRGCPL LCRRIVRRRI KGNGVKPVPS PTTEPDFYNM EWCEDSGPRP TFHEKPSFGI CGGTAPQTSC FQQILNSSAA SYDPWNITSP YHEQPGYTTQ VASPEIAMSH LQIPESLLYS QQMAQCCRRS NLPTASSSSI YPWTLGRSTT TENGSNEDMV EGLRQYLPDH FVENDQALIL LSSICDTEED SLYAPVDGDV FRFL
|
| |