Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49750 |
Symbol | |
ID | 7198338 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 156430 |
End bp | 157969 |
Gene Length | 1540 bp |
Protein Length | 419 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184497 |
Protein GI | 219128601 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACTCGAACG TTAGATTGTC GTAAACGCCC TAGTAAGTGG TGCAATACTA CTGCTAAAAC AACACGCTTC CGCTGCGCAG ACACAGCAAG CAATACCCAT GGTCAGCACT AGATCAGTCA GGGTACAACC CCCCTCAGAG GGAGGTCCCG TCGGAGCCGA CGATCCGTCC GTAGGAGATC CACCTCCTTC AGACGACGAT GGCGAAGACG ACGACGACGC AGGAACGGAA CCCAGCAACG GCTCCAGTCT TCCAGCTCAA GTGGTTTGCG TCCATCAAAC TCCACGCTTT TCCATCTCAC CCAGTCTCTC GCAGGGACGT CGGGCGATAT TGGACTACAG CATTCCGAAG ATCGCGAAGC TTTATGTTTC GGCAACGAAG CCGCTCTCGA AAACTGAATT CGACATCAAG GCAGGTGCTC TCACACCTTT ACTTTCGAGT CTGGCACTCA GGGCTCGAGA GCACGGATGG CATGGACACG GAAGTAACGG AATTCTCAAA ATTCCAAATA ACATCACTAT TCCGAACGGC GCCAGCAAGT CGCTCATCAA GGAATACGGT CAAATCTCCC TTCCCCACAT CCGCAACTAT GTCGGTACAT TTGCGAACAC CAAAACTCGT GAAGTACAAG ATGACGAGGC ACTCTACCAG TGCTTGAAAG TATCGCTCAC GCACGAAGCA ATGGCGAAAA TCAATCTCTA CGAAACTGAA TGGACGGTTG CAGGTGAGCC ATCTGGAGTC GCAATGCTCA AGGTAATCAT TCGACACGCA TACGTTGATA CAAATGCGAC GACAATGCAT GTATGACGAA ACTTAGCAAG CTTGACTCAT ACATGGAGTC TCTCGCGGAA CACAATGTTA CTCTTTTCAA TGAGTATGTT TATGAGCAGC TCCACGCTCT GACAGCTCGC GGCGAACAAA CTCTTGATTT ACTTCCGAAT CTTTTCAAAG GATACGAAGC GGCCAAGGAT ACGCAATTTT TGGAATACAT CCGCAAGAAG AAAGCCGAAT TTGAGGAGGG AACAGTCTTT TTGGAGCCGG AAATTTTAAT GTCGCAAGCA TCTATCAAAT ATCGAACTTT GGTGGAGAAA GGAGAGTGGG ATGCCCCATC CGAAAGCGAA GCGAAGATTC TAGCTCTAAC TACCCAAGTC AAGGAACTTC AGGCAAAGAA GTCAGAGAAA CCTAAGTCTA AAGCTAAGGG TGACTCAAAA AGGAAGAAGA AGAAAGGGAA GAAATCCGAT AAACCAAAAA CGGACAAGTA TGCTTCACTA AAGAAGCCAA GCGCGTCAGA ACCTCACACA AAGACTTTTG ACGGAGACAA GATTAAGTTC TGTACTAACC ATCAAGCTTG GGGCACGCAT TTGGCAAGCG AATGCAAGGG ATACGGACTC GAAAAGGATT CCAATGGAAA ACCAATTCCA AAAGGATCTG AACCTAATGC CACAGACAAG AAAGGCCCGC CCTCAAAGTC ACACGCCGCA ATCATGCGGA TGAGCAAGGC CCTAACAACC GAAATCGAGA AGGCCGAGAC CGAAGAATGA
|
Protein sequence | MVSTRSVRVQ PPSEGGPVGA DDPSVGDPPP SDDDGEDDDD AGTEPSNGSS LPAQVVCVHQ TPRFSISPSL SQGRRAILDY SIPKIAKLYV SATKPLSKTE FDIKAGALTP LLSSLALRAR EHGWHGHGSN GILKIPNNIT IPNGASKSLI KEYGQISLPH IRNYVGTFAN TKTREVQDDE ALYQCLKVSL THEAMAKINL YETEWTVAAR GEQTLDLLPN LFKGYEAAKD TQFLEYIRKK KAEFEEGTVF LEPEILMSQA SIKYRTLVEK GEWDAPSESE AKILALTTQV KELQAKKSEK PKSKAKGDSK RKKKKGKKSD KPKTDKYASL KKPSASEPHT KTFDGDKIKF CTNHQAWGTH LASECKGYGL EKDSNGKPIP KGSEPNATDK KGPPSKSHAA IMRMSKALTT EIEKAETEE
|
| |