Gene Information |
Locus tag | PHATRDRAFT_48028 |
Symbol | |
ID | 7203021 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 759746 |
End bp | 761641 |
Gene Length | 1896 bp |
Protein Length | 522 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182450 |
Protein GI | 219124310 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAATGGCATC ATCGCCAAAA ACCGAGGTGC AAAGGTCCAA AGACTTTCCC TCTCCCAAAT GCATTGCGAC AATATCAATC ATCCATCCAA TGAGTGCGCG GACTACCACA ACTTTGATCT TTTCAACTGT AACGCGACTG CACCTTTAGG ATTCTGTTAC CAACATAACT CTGTTCAACA GCTTCTCAAT TTTACCGTTG ACTGCTGCGT TCGCCAGAGC GTTGGAGCGT TACTCCATCT GTGCACCTCT CCTGGAAAAA AGGCCACTGG AATTGGATTG ACGGCGATTC TCGGTATTTT AAGAATATGA GAGAATCTGT GTACAGAAAG CATTGGGGAA GTCGAAGTAA CGGCAGTGAA ACCGATCATT CCGGGCATCT TCGAAAATGC TCATACACAC CGCCACCTGT ACCGACGGTA GTAATCACTC AACAGCGACA ACATTCACCT ATCATAGATC GACACGAGCC AGCAAGCTCC CCACAAATAT CCTTTAGAAG AATCCGACGC CCTTCTCCCC TGATACAATC TCGAGACGAG GCGCCACCCA GGCGAGGATT GAGTTCCGAC AGGGAGTATG AACAAGGCGA CGAAGAGTTG TCTGTGGTAC CTAGTGGCAA CTTGACACCG CTTTCTTCTT ACTCGCCCCT AAGAACATTC GCTAATTGGA ACCAGACAAG TGCCGACAGC GCTAGCTATC ATTTGGCTTC TCCGGATGGT CAGATTGCCA TCGATGTTCG ATTACCTTCG CCGAAACATC GTACGAAACT GAAACGGTGG TACGAGAAAC AAGATGCAGC CTCGGCCACC ACCAGCAATC TCAAAGGGCG CCCCCTGCAA AGCCATTCGC AGCGCATTCA TCAACAAGGT ACCATTCATT TGGAGAAAAC TGATCATTTG CTATATGACG AAAAACTAGA ACTCCCTGTA GACGAAATGT TGTTCTCGAA TGAAGACACG TTCGACCAGA CGGCTGACCA AACCTACGAT GAGACTTACG CTACGAATCC AAAGACGATG TCAACGTTCG CGGATGACCA CACGCTGGAC TCAGCATGTG CTGGCCGTAG CGTTGGTCAC GGTGACGACT TGAGGACGAA ACAACACGTT CCGTTTGTCA GCATCATCGT TATGGCTGTT CAACTTTTGA TACTAATTAC ACAACTCGCC ATGTGCGGAG TTGCGAGTCT TGATGTCAAT CCAATGATTG GACCATATCC GGACGCATTT TCAGAATGGG GCGGCAAGAA TGCATACTTG ATGACTGAAG AGAATCAGTG GTGGAGGCTA CTGACATCGT CGTTTTTGCA CGTTGGCGTC CTTCACTTGC TAGCAAACGC TTTGTGCGTG ATCTGGTCTG TTGCTGTCTT TGAACAAGAG TGGGGATCTT GTAGATGGTT GCTCGTTTTT CTCGTCAGTT CCGTTGGATG TACGGCCTGT GCTTCACTTG GCGACGCCGA CACGATTGGC GTTGGAAGTT CCGGGACCTT AATGGGTCTC TACGCTGCAA AACTTGCACA AGTGATGAGC TGCACTTGCT TTGAAGTACA TAAATCTTTG GATGGGAACA TTCATTATGA CCGAATGTGC GGCGTTTTGG TTGGGATTGC CATCCTTTCC ATGTTGAGTG CATGTACGTA CATTGATTGG TCAGGCCATG TTGGCGGACT GGTGACAGGG TTTTTAGTGG GGATTTTGAT ATTCAGCACC TCTATCAGAC ACTGCTGCAC TCGACTCCTT TGGGCTTTAC TAGGTCTTCT CGGGGTGTCC GGATTCCTAG GCTTTGCACT GTACTCTGTG 
GCAGTGTATA TTGAACCAGA CGAACAGATC GCGGACACCT GCGAATACTT TCGAAACCTT TTCCCCGAAG ACTACACTTG TGAATGTGCT TGGTGA
|
Protein sequence | MRESVYRKHW GSRSNGSETD HSGHLRKCSY TPPPVPTVVI TQQRQHSPII DRHEPASSPQ ISFRRIRRPS PLIQSRDEAP PRRGLSSDRE YEQGDEELSV VPSGNLTPLS SYSPLRTFAN WNQTSADSAS YHLASPDGQI AIDVRLPSPK HRTKLKRWYE KQDAASATTS NLKGRPLQSH SQRIHQQGTI HLEKTDHLLY DEKLELPVDE MLFSNEDTFD QTADQTYDET YATNPKTMST FADDHTLDSA CAGRSVGHGD DLRTKQHVPF VSIIVMAVQL LILITQLAMC GVASLDVNPM IGPYPDAFSE WGGKNAYLMT EENQWWRLLT SSFLHVGVLH LLANALCVIW SVAVFEQEWG SCRWLLVFLV SSVGCTACAS LGDADTIGVG SSGTLMGLYA AKLAQVMSCT CFEVHKSLDG NIHYDRMCGV LVGIAILSML SACHVGGLVT GFLVGILIFS TSIRHCCTRL LWALLGLLGV SGFLGFALYS VAVYIEPDEQ IADTCEYFRN LFPEDYTCEC AW