Gene Information |
Locus tag | PHATRDRAFT_20318 |
Symbol | |
ID | 7201016 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 722475 |
End bp | 724093 |
Gene Length | 1619 bp |
Protein Length | 460 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180104 |
Protein GI | 219118672 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGGAAACAA CTGTAAATGC ACTTGACTGA ACGATTTTCA AGGATTGGAT TTGTGATCTC GTAGTTGGTG TCCTGCTCTC AAGAACACAA AGTTTGTTAT ATCGTTTGCT GCGTTCCCTT CAAAGTCGTT CGGTCGCCAG AATTCTTTAG TCAAGATGAT GACATCGTTT GCGAAACTTT TCAGTATCTT TCTGGCTTTA AATTCAATTT CGACGTATCG GGTTGCTCGG GGCTTCGCTC CGATGAGGAT TGAAGCACTC TCTTATCATC AGCGAACACC GAGCTCAACA AGAACAGCTT CTATCACTGA GGACTTGTCT CACGAAGAAA TCTCCCGCTA TTCCCGCCAT TTAGTTTTGT CTGATGTCGG TGTACGCGGC CAGAAAGCAC TCAAAAATGC TTCTGTGCTA GTGATTGGAG CCGGAGGTCT CGGTGCCCCG TGTCTCATGT ACTTGGCCGC AGCTGGCATC GGCCATATTG GGATTGTCGA CGGCGATGTC GTGGATGAAT CTAACTTACA GCGACAAATC ATTCACGGGA CGAGCACGGT GGGTCTTTCC AAATGCGAGT CAGCGGCACG GCGTATCCAG GATATCAATC CACACGTGAA TGTAAGACAG TACGAGGAAG AGTTTACCTC CGAAACGGCA CTCCGCATTC TCGGAGATGG GTTCTCCGAT AAACGACGGT ACGATGTTGT GATAGATGGA AGCGATAATT TTCCAACCAA ATATCTCATC AAGTAAGTGT AAGAAAGTGA GCAGATCGAT CTTGTTCTTT TGGATTTTAA TTAATTCCAG CACGACTCAC ACTTTCCTAT CAGCGACGCG TGCACCATTA CGAATACTCC TTGGGTCTAT TCTGCTATTT TGGCCTTTGA AGGACAAATG TCCGTCTTTA ATTTGGACAA CGGACCCGAC TACCGTGACT TGTTGCCTAC TCCACCACCA CCAGGAGACG TTCCCTCCTG TGCCGAGGGA GGAGTGTTGG GTGTTTTACC GGGTACCATG GGATGTCTGC AGGCGACTGA GGTAATAAAA ATTGTTCTTG GTCGAACGGA GGGGTGCATG TCGGGACGCG TACTGATTTT TGATGCCATG CGTATGAAGT TTAGCGAAGT GGGGTTGAAG CGAGAGACCG ATCGAGAGAA TATAACTGCT TTAATTGATT ACAAAGGATT TTGTGGTGGT CCACAGGCAA AATCGGAACA AGCAAAAGGG CTCATGAATG CCAACGGTCG TACTATGGAC GAAGCAGAAT CGGGTATCGA GGCGTCTTCA ACCGGTGACA GTCCCTCATT CCACAACATA GACCCTCAAA AAGCCTTGGC GAAGTTAAGT AGTGGTTGGT CGCCATGGGT GGTGGACGTG CGATTGCAAA CAGAAAACGA CATAGTGGCC TTACCATTTA CCGATCGAGT CGTACCGCAC AGAACAATAC GAGTCAAAGA CATCCCAGAG GATGGTGACG TGTTGGTGTA TTGCAAAGCG GGTATTCGCG GCAAGAAAGC TTGCTCAAGC CTTGTTGAAC AAGGCGTGGA CCCGGGCCGA TTGTTCAATC TAGACGGAGG CATCATGAAA TGGCAGAAGG AATTGGATCC ATCTATGCCA CGCTATTGA
|
Protein sequence | MMTSFAKLFS IFLALNSIST YRVARGFAPM RIEALSYHQR TPSSTRTASI TEDLSHEEIS RYSRHLVLSD VGVRGQKALK NASVLVIGAG GLGAPCLMYL AAAGIGHIGI VDGDVVDESN LQRQIIHGTS TVGLSKCESA ARRIQDINPH VNVRQYEEEF TSETALRILG DGFSDKRRYD VVIDGSDNFP TKYLINDACT ITNTPWVYSA ILAFEGQMSV FNLDNGPDYR DLLPTPPPPG DVPSCAEGGV LGVLPGTMGC LQATEVIKIV LGRTEGCMSG RVLIFDAMRM KFSEVGLKRE TDRENITALI DYKGFCGGPQ AKSEQAKGLM NANGRTMDEA ESGIEASSTG DSPSFHNIDP QKALAKLSSG WSPWVVDVRL QTENDIVALP FTDRVVPHRT IRVKDIPEDG DVLVYCKAGI RGKKACSSLV EQGVDPGRLF NLDGGIMKWQ KELDPSMPRY
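The length and composition fields in this record can be cross-checked against the coordinates and sequences with a few lines of Python. The sketch below is illustrative only: the helper names (`span_length`, `ungroup`, `gc_percent`, `coding_bp`) are hypothetical and not part of any database tooling, and it assumes the start and end coordinates are 1-based and inclusive, which is consistent with the reported gene length of 1619 bp.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def ungroup(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    bases = ungroup(dna)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


def coding_bp(protein_length_aa: int) -> int:
    """Base pairs needed to encode a protein, counting one stop codon."""
    return 3 * (protein_length_aa + 1)


if __name__ == "__main__":
    # Coordinates and protein length taken from the record above.
    print(span_length(722475, 724093))
    print(coding_bp(460))
    # First two blocks of the gene sequence, used as a small sample.
    print(gc_percent("CAGGAAACAA CTGTAAATGC"))
```

A 460 aa protein needs 1383 bp of coding sequence including the stop codon, which is shorter than the 1619 bp gene span. That is consistent with the record marking the gene as spliced, although the difference may also include untranslated sequence. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.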