Gene Information |
Locus tag | PHATRDRAFT_40647 |
Symbol | |
ID | 7198665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 60881 |
End bp | 61981 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184637 |
Protein GI | 219128895 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCTT CGTCAATCTA CGACCCTCAC TGGGCTTTGT ACCCCAAAGC TTATGTAGCC ACTAGATCTA TAGGACCTGT TGAGATTAAT GGCGACTTGG AAAAACCTGT TTGGAATTCT GTCCCTTGGA GTGACTTATT TGAGGATATT CAGGGAAATG ATGCACCCGC CAACGCAATT GGCCCAGCGA AAACGGCCTT CAAGGCCATC TACGATGATG AGCATCTCTA CGTTGGCGCT CTGTTGCATC CCTCCCCTGT CTTTCGAACG GAAGCACACT TCACCACGCG AAATTCACCA ATTTATCAAA CAGATTCCGA TTTCGAAGTT TTTTTTGACT TGAACGGAAG CAACCACGGT TATAAAGAAT TTGAAGTAAA TGCACTCAAT ACAGTCTGGA ATCTCATGCT CGACAAACCC TACGATGATG GTGGACACGA GCACTCGGGA CGGATAGCCC ACCGTGGCGA TTCTGCGTTC TACGATGTAA ACAGCCAAAC TACCGCGGTA CAAGTGCTGG AAGGCCGCTT AAACGATGCC TCCAACCAAG GGGCACTGTG GTCGGTAGAA ATGGCCTTTG CCTTTGAGGA CTTGGCGGCT CACCTCCCCA GTCCGGTACC GCGACCCTCA CCTGGTGAAT TTTGGAGAAT CAATATTTCT CGAGTGGAGT TGAAAGGAGA AGTCAATTGG ACGTGGCAAC CACAAAAAGT TTGGGATCCA CTTTTGAGAA AGCATCATGG TAAAGTTGCC ATGCACATGC CGGACTCGTG GGGTTACCTT ATATTCGGAG ACACCGTTCT GGAGTTAGAC AATTCTTCCT GGCGAGACCC TTTATGGCCC ACGAAATTAG CGGCGATGAA TATTTACTAC GCGCAATATT TCTATAAAAG CCTAAATGGT TTCTATACAG ACTGTATGGA AGAGCTCGCT GGTTATCTGG ACTTGGAAAT CACTTCTCCT TTCAAAGTTG ATCTAGTGGC CGACACCGAT CGCTTTCTCG CAACAGTGTC AGCTTCTGAC GAAGACCAAG CCATCTCTAT ATTAGACGAT CGCCTCCTAC AAGTAATCCC ATCCGCGAGA ATGTCGTCTA CAGCGGTGTG A
|
Protein sequence | MTPSSIYDPH WALYPKAYVA TRSIGPVEIN GDLEKPVWNS VPWSDLFEDI QGNDAPANAI GPAKTAFKAI YDDEHLYVGA LLHPSPVFRT EAHFTTRNSP IYQTDSDFEV FFDLNGSNHG YKEFEVNALN TVWNLMLDKP YDDGGHEHSG RIAHRGDSAF YDVNSQTTAV QVLEGRLNDA SNQGALWSVE MAFAFEDLAA HLPSPVPRPS PGEFWRINIS RVELKGEVNW TWQPQKVWDP LLRKHHGKVA MHMPDSWGYL IFGDTVLELD NSSWRDPLWP TKLAAMNIYY AQYFYKSLNG FYTDCMEELA GYLDLEITSP FKVDLVADTD RFLATVSASD EDQAISILDD RLLQVIPSAR MSSTAV
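The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS includes its stop codon (the gene sequence above ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(60881, 61981)   # Start bp and End bp from the record
    print(length)                        # 1101, matches Gene Length
    print(protein_length(length))        # 366, matches Protein Length
```

Under these assumptions the computed values agree with the Gene Length (1101 bp) and Protein Length (366 aa) fields reported above.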