Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49469 |
Symbol | |
ID | 7195821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 328262 |
End bp | 329853 |
Gene Length | 1592 bp |
Protein Length | 522 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184231 |
Protein GI | 219128040 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.141082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCAACAACG CCAACAATCG ACCATGGGAT TTTCGACTGC TGAGCGCAAT CGCCGAAAGA GAGAGCGCAA AAAGCGCGAG CGCGAAGACC AGCGTAAAGA GGAAGAAGCC GAAAACCAAA AAAGCGCGGA AGTGGCGAAA GCAGATGAGG ACGATGAAGA CGAGGTGGAG GTCGAATACG TTGCCGAGCC CATCTTGCCA GCCGATGAGG CGTTTGACGC TTTGCGAAAG TTTCAGGAGA GAGCTGCGGC TGTGGTTGTT TCGGATGACG ATCGTGGTGA ATATTCCGAA GCGAAAAGTG TAGATATCGA GGGCGGACAC GAGGAGTCCA ATGAAGATGA GGAAGATGGT GGTGCGGTCA GCAAGCGAAA ACTTCGCGAA TTGTTGCGTC CATCAGTTGC CGAACTCAAG AGGCGTGTAC TTCGCCCTGA CTTGGTAGAA GCACATGACG TGACCGCCGC CGATCCGGAT TTTCTCATCG AGCTCAAGGC AGCTGCTGGC ACAGTACCTG TCCCGCGACA TTGGGGTCGG AAGCGTAAAT ACCTGCAGGG CAAGCGGGGT TTTGAAAAGC CGCCATTTCA ACTCCCTGAT TTCATTATCA AGACCGGTAT TACCGAAATT CGTGATACAG TCATGGAGGC TGAATCAGAC ATGTCTGCGA AACAGAAAAA TCGTTCGCGC GTTGCTCCCA AAATGGGTGC CATTGATGTC GATTACAAAA CTCTACACGA TGCCTTTTTC AAGCATCAAA CTAAACCGGC AAATCTCACA AAGTTTGGAG ATACCTACTA CGAAGGAAAA GAGCTCGAGG TACAGGCAAA GGTGCAGCCT GGAGGGCCGT TGAGCCAGAA ACTCCGGGAT GCGTTGGGCA TGGCTTCTGA ATCCTCGCCG CCACCGTGGC TGCTGAATAT GCAACGTTAC GGACCACCAC CGAGCTATCC GTCTCTCAAG ATTCCTGGTT TAACTGCCCC GTTGCCAACA CAAGAGTGTC AGTATGGCTA CCACCCAGGT GGCTGGGGAA AGCCACCGAT TGATGCCTAC GGTCGCCCAC TGTACGGCGG CAATCCTTTT GACGCCCCCG GTACCGGATC TCGCAAGGAT ACTACCAACA GTGCCCTTGT GACAAGCGAC GGCAAAACGA TTGCCAAAGC ACAATGGGGA GCGCTACCAA CAGGCTTTGT CGACGATGCC GAAGCGTCGG AAGAAGAGTC GAGCGACGAA GACATGGAAG AATCAAGTGA GGAGGAAGAA GAATCAGAGG TTACGGTTGT CGATGGTACA GATTCAGTAT TGTCTCCACC TCCGAGCTTG ACATCCTCAG GACCCATGGA TCTACGCAAA CAGCATGGGA ATGAAACACC TATGGATCCT TCGGCGCCGA AGCAACTCTA TCAGATTATT GACCAAACCA AGGCCGTTAC TTCTCAGGGA ACTGTCTTTG CTTCTGAAAT GTCCTATCTG GTTCCAGGAC TACAATCAGC TATACCTGAG GGTGCCGAAA GCGTTTTATC CAAGGCCTTA CCAGCTGGTG AATCGTCCAA GCGGAAAGGC AAAGATGAAG ACGATGATAT TGGCAAGAAC TTTAAATTCT AG
|
Protein sequence | MGFSTAERNR RKRERKKRER EDQRKEEEAE NQKSAEVAKA DEDDEDEVEV EYVAEPILPA DEAFDALRKF QERAAAVVVS DDDRGEYSEA KSVDIEGGHE ESNEDEEDGG AVSKRKLREL LRPSVAELKR RVLRPDLVEA HDVTAADPDF LIELKAAAGT VPVPRHWGRK RKYLQGKRGF EKPPFQLPDF IIKTGITEIR DTVMEAESDM SAKQKNRSRV APKMGAIDVD YKTLHDAFFK HQTKPANLTK FGDTYYEGKE LEVQAKVQPG GPLSQKLRDA LGMASESSPP PWLLNMQRYG PPPSYPSLKI PGLTAPLPTQ ECQYGYHPGG WGKPPIDAYG RPLYGGNPFD APGTGSRKDT TNSALVTSDG KTIAKAQWGA LPTGFVDDAE ASEEESSDED MEESSEEEEE SEVTVVDGTD SVLSPPPSLT SSGPMDLRKQ HGNETPMDPS APKQLYQIID QTKAVTSQGT VFASEMSYLV PGLQSAIPEG AESVLSKALP AGESSKRKGK DEDDDIGKNF KF
|
| |