Gene Information |
Locus tag | PHATRDRAFT_39996 |
Symbol | |
ID | 7195482 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 556129 |
End bp | 557336 |
Gene Length | 1208 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183895 |
Protein GI | 219127340 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTGCT CGGTCGAGGT ACGAATTATC CGCCATTTCA TCTTACGTAC TTTTCGATAA CTGGTCTGAA CATGCATGCA GTTCGGACTC TACGCTGTTG ATTTCGGCTT CTTGAATGAA ACAAAACGTC AGACGAATTC ACACGTGCTC CCTTTGTGCT GTTCGACAGG TGGGAGAAAT AACCAGCATC CCGGGAACGA CAAACTTCGT CTTATGGCAA AGCACTGCAC TCGCGTGTAC CACGTCGCGC AAAAGAAGCA GAAGGCGGCG ATCGCGAAAC TTCTCGTAAA ACAAATACAC AACCTTTGTC CGTCGGGACG GTAAGCATTC GCCAAGCAAT TGTGACCATT CCATGGAAAA AGAAAAATAC GTGGCGCACT TCGTTTCTCA CTAATCAATT TTTGCCTGCT ATTAGCTTTC TAAAGCAGGA ATATAGGGAA TGGCGAATAG TTTCGGAGAG CGTAGCTCGA GAAAAGGCAT GCCAGTGCCT TCGCGACGCT ATTGCGGTTA TGAGAAACGA AGTGGTAAGT TCACCACTAA GCAAGTATTG CCAAAGCCAC AGCAAAATTC GTGCTCCTCT TGAGGACAAG ACTTATCTAG TAAAAGAGGG GCATTTTCAC ACAAATGGAG GGGCCGCGCT TTCTATGACT ACGAAGCCAC TAAATTGTGT CGCTGACAGT AAGTACCCGG ATTGCCTAGC TCCTACCCCC GTTTGTTTGG AGCAGGGAAA TGGACCTAGC ATGCCATTTC GTGCCGTTCC AGCCCCCCAG CTGTATCAGG CGACAAAGAC GAGTTTCATG GACACTCAAC TATCAAATTA CAAGAATCAT CAGGAAACAG TGAAATGGTC TCGTTCTATG GAACTTCGAA CAATTGCGGG CTCTCCAATA ACAGGCAAAG TAAACAGCCG AAACCGTGCA ACAAAGGAAT TGCGCGCATC GCTCTCGAAC CACATTCCGC CACCCTCCAA CCCGTCCAAT ACATTTTGCT CCAACAAATT CCAGAATATA GAGCCGATCC CTCTCAGTCC ACGCTCCAAC TGCTGTGCCA GTCACTTTCC CAAGTCCTCG GATACTCCAA ATTACGAAAG GTATACCGGA GAAGCTGGCA CCTGCGCTTA CACAGTATTG GACACGACTG TTTTTGATGT AGAACGGCTT CCGGCCTATC TAGATCCCCA CAAGTCGTCA TATTTCTACC GAAAATAA
|
Protein sequence | MSCSVEFGLY AVDFGFLNET KRQTNSHVLP LCCSTGGRNN QHPGNDKLRL MAKHCTRVYH VAQKKQKAAI AKLLVKQIHN LCPSGRFLKQ EYREWRIVSE SVAREKACQC LRDAIAVMRN EVVSSPLSKY CQSHSKIRAP LEDKTYLVKE GHFHTNGGAA LSMTTKPLNC VADTPTPVCL EQGNGPSMPF RAVPAPQLYQ ATKTSFMDTQ LSNYKNHQET VKWSRSMELR TIAGSPITGK VNSRNRATKE LRASLSNHIP PPSNPSNTFC SNKFQNIEPI PLSPRSNCCA SHFPKSSDTP NYERYTGEAG TCAYTVLDTT VFDVERLPAY LDPHKSSYFY RK
|
| |
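The record lists the gene as spliced, with a 1208 bp span and a 342 aa protein. The following minimal sketch checks how those numbers relate. It assumes the Start bp and End bp coordinates are 1-based and inclusive (which matches the listed Gene Length), and that the gene span runs from the start codon through the stop codon (the listed sequence begins with ATG and ends with TAA). The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table in this record.
START_BP = 556129
END_BP = 557336
PROTEIN_LENGTH_AA = 342


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)

# Under the stated assumptions, the remainder is sequence removed by splicing.
non_coding_length = gene_length - cds_length

print(f"Gene span:        {gene_length} bp")
print(f"Coding sequence:  {cds_length} bp")
print(f"Non-coding total: {non_coding_length} bp")
```

Under these assumptions the span matches the listed Gene Length, and the difference between the span and the coding length is consistent with the "Is gene spliced: Yes" field. The sketch does not locate the intron boundaries; it only accounts for their combined length.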