Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49199 |
Symbol | |
ID | 7195511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 199862 |
End bp | 201237 |
Gene Length | 1376 bp |
Protein Length | 418 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183946 |
Protein GI | 219127446 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00518745 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAGAACTCG GCGATGATCA TCAAACTCGT CCTAGTCATC GGCCATGTCG TCTTCTGGGT GAGCGCAGTG GCCTCCCCAT CCAGCATTTT GAATTCCGTA AAGACCGCCG AATCTTCTTA TCAGCAGGAG AGGGAGCTTC AAAGTTTGGA CGAATCCTTG TACAACATGA CTTTCGTAGG TGTTAGTAAC GCGAACAACA CCATTACGGT GACGTTTGAC AATGGTATAG GCAACGGCGA CAGCGGAACC AATCCGGATG GTCGCTTCGA CATTTACACA GCCCCAGATG ATGTCGACGT CACGGCAGAC AAGACCCTAT CATGCTATAC CGAAGGAGGT GAGCTGTTTG ATCATGCAGC AACTGGTTCG GGCATTTCCG GACAAATAGT ATCGATTGGA GCCAACACGT TTCCAACTCC GTCAACCTTT GAGTTTACCT TCGATGAGAA TGTCACGGAA TCAAGTCCCT TTTACAATTA CGATTCCGGT ACGACAACGA AGCAGATTGA ATTCATCTTT TGTGTTAAGT TCACTCTTAC TCGCGAGATT GAAAATAAGG GTTCCGACCC AATCACTACC AGCACGGTAG ATATTAACTT TCGCGAAGTA GCGATTGTGG TCACGGTTAC CCTTGATGGC AATCTCAATG CTAACAGCGT GGATGCATTT AATGTCGCTG CTGCCCCCAT CAATTTCGAC CTTGACAACG AAATCGTATA TACGGCAAGT GTTGGTCTCT GTGAAACATA CAACCTCGAC GTCAAAACAC CTCAACAGGG CGATGTCGTT CCTATTTGCA TTATGTCGGA CGACTTCCCG TTGGCTCGAA TCATTTCAGT ACAAGACCTG ACCTTCACTT CGGATTCTCT GACGCAACAA ATCCGCGTGG ACGGATTAGA CGCTCCTGGG GCTACTGGTT TGTACGGCCG AGCGAGTGCC GACACCCATT GCGTTACCAA TGAGTGCATT CAGTACGACG TGATTGTCTA CGCTATTTTC GCGACCAGCG CGGAAATCAA TTTGAAAATT GATATTACTG GCAGCGTGGT CCTCGCCGTG GGTAACTATA TGACCCGGAA GTTGCGAACT CGGCTGGAGC CTACTCGTGA GCTGGCCGAA TTGTTTCAAG GGCAGTCTTT TCGGTCGACA ATTGAGCTGC CACCCTTGCC TTCGAGCGAG TCAGCTGCGT CTACCGCCTC CGCAAATGTC GTCTGTATGA AGTTTATCCC GTTTGCTATT GCCATGCTGG CACCTTTCCT AGTTTTCTGA AAAGGTCGGA GCGAACATTG TAAATATGTC TACAAATGGC GAATGCCGCC TTGCGTTCCA ACGTTAAATC ATATATAGAA AAATTAAGAA AGCATTTTTT TAAAAA
|
Protein sequence | MIIKLVLVIG HVVFWVSAVA SPSSILNSVK TAESSYQQER ELQSLDESLY NMTFVGVSNA NNTITVTFDN GIGNGDSGTN PDGRFDIYTA PDDVDVTADK TLSCYTEGGE LFDHAATGSG ISGQIVSIGA NTFPTPSTFE FTFDENVTES SPFYNYDSGT TTKQIEFIFC VKFTLTREIE NKGSDPITTS TVDINFREVA IVVTVTLDGN LNANSVDAFN VAAAPINFDL DNEIVYTASV GLCETYNLDV KTPQQGDVVP ICIMSDDFPL ARIISVQDLT FTSDSLTQQI RVDGLDAPGA TGLYGRASAD THCVTNECIQ YDVIVYAIFA TSAEINLKID ITGSVVLAVG NYMTRKLRTR LEPTRELAEL FQGQSFRSTI ELPPLPSSES AASTASANVV CMKFIPFAIA MLAPFLVF
|
| |