Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49708 |
Symbol | |
ID | 7198394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 16575 |
End bp | 18119 |
Gene Length | 1545 bp |
Protein Length | 504 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184474 |
Protein GI | 219128553 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCGAATATC CGAGGAACGA ACCAAGAGTC ATGGATGTCT TTGGCAGCAA ATTGGTTGAA GCGCTCTCCG TCTATCATGC TTCCTGGCTG CTTGGTTTAA GCGTCACAGT AGCGGTAGCG ATCGCAATCA AAATGTCCTC CAATCAAAAA AGTCGCTCGC CTCTGCACCG AAAGTTTTCT TTCACATCCG TCGGAATGGC CATCGGAATC TTCCCCGAAT CGGTCAAAGC TCCCACAACA ATCATCAACG CGGCAATCTA CTTTTCAACA TGTCCCGCGG AGAAGGATCT CATTGAACTG GCGGTAAAAC CTATGCTTGC TTTCACGCGA CTGTCAACGA TTCCTGTCCC GGAAACGGCC AACTGCCGAC CTTCCACGCG GTCTTTTGCG CCATCGGAAC TCATTCGGAA GGTTGAAATA TCAGGTAAAT GCATCAAGTC GACAAATGAT GTCATATTTA AGCACCTGCA AGAGTCGCTC TCGACAGAGC GAGACGATTT GCCGTGGTGG GAGTTTCTAG TGGTCGAAAA CGTTGGCGAG GGCGAGTCTG CCGTCGTTCT ACGGATGCAC CACGCCCTAG CGGATGGTAT TTCGCTAGTA CACGTTTTTG AAAAGTTCAT AACCTACGAA GATGGTTCGC CGGTTTTGTC CATTATTCTG TCCAACATGG CGCAGAAGAG CAAAGTCGAG AAAACGCACA AAACAAATCC CTTCCGCCTT GCTTGGATGC TTGTCCGAGA TGCTACCAAG GTCCTCACGT TGGGTCTTTC GCGTTCGGAC GATCCCACTA TCTTTACCGA ACCGAATCAG ACGTATGTGC ATTCGCAGCA TCGAGAATGT GTGGTTTTCC CAACGTTTTC ATTGGCCTTC GTTAAGCGGC TGAAAACAGC AGCCAACGTG ACCGTTAACG ATATTCTCAT GACCGCGGTC AGCCAAGCGG TACACGAGTA CTGCCGAGCT GAATCCTGCT CGGTCTTGAT GGGAAAAGGA GCATCGCTTC AGTCACGTGC ATTATTGCCG ATAGCGTTGC CGCGATCCGC GTCAGACTTG GAACATCCTT CCACGGCTTT GCGCAACAAG TGGTGTCTTG TTTCGGCAAA TATGAGCATT GGCTGTGTCG ACCTAGTGGA TCGTCTTAAT TCGATCCACC AGACTACTGT TCACTTAAAA GGAAGCCCAA TTGCCATGGT CCAACTCAGT CTGCAAAACA AATTGGCAAG TCGATTGCCT AAAATAGTCG CTCGACAAAC CATGCTGGAC ATTTTTCGAA GGCATTCGCT TGTCTTTTCC AACGTTCCCG GCCCAGATCG TCCGTGTCAA TTGGCCGGGC AAACAGCCAC TGGAGTACAA ATGTTCTATA GCAACCTGAT TCCTCAAGTT GGATTGCTGT CGTACGCCGG GAACATTTAC GGTAATATAG TCCTAGACAC TGGTGCCGTG CCCAACGCTG AATCTTTGGC TGGCCATTAC GCAAAGGCGC TTGTCGACAT GGCGACCCTG CTCAACGTCG ACAAAATTCC AACGAATTTA CAGTCGTACT TCTAA
|
Protein sequence | MDVFGSKLVE ALSVYHASWL LGLSVTVAVA IAIKMSSNQK SRSPLHRKFS FTSVGMAIGI FPESVKAPTT IINAAIYFST CPAEKDLIEL AVKPMLAFTR LSTIPVPETA NCRPSTRSFA PSELIRKVEI SGKCIKSTND VIFKHLQESL STERDDLPWW EFLVVENVGE GESAVVLRMH HALADGISLV HVFEKFITYE DGSPVLSIIL SNMAQKSKVE KTHKTNPFRL AWMLVRDATK VLTLGLSRSD DPTIFTEPNQ TYVHSQHREC VVFPTFSLAF VKRLKTAANV TVNDILMTAV SQAVHEYCRA ESCSVLMGKG ASLQSRALLP IALPRSASDL EHPSTALRNK WCLVSANMSI GCVDLVDRLN SIHQTTVHLK GSPIAMVQLS LQNKLASRLP KIVARQTMLD IFRRHSLVFS NVPGPDRPCQ LAGQTATGVQ MFYSNLIPQV GLLSYAGNIY GNIVLDTGAV PNAESLAGHY AKALVDMATL LNVDKIPTNL QSYF
|
| |