Gene Information |
Locus tag | PHATRDRAFT_39510 |
Symbol | |
ID | 7195345 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 22185 |
End bp | 24093 |
Gene Length | 1909 bp |
Protein Length | 591 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183655 |
Protein GI | 219126837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000156281 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACCT CGGCTCATTT CAAACTGAGC GACTTTCCTC ACAAAGTCCT TGACCCGATT GCCACCCTCA CCGTCCCACC GACCTACGCG ACCATCAAGC GTGCCCAACG CCAGCTCATG ACTAACGCCG CCGCCATTCC CACACTCAAT GGTGGTGGCG CCCACGGCCA CATGGCCTTG ACCCTGACCG CCCTTGCCTA CGCCGACATC AGCGACGTCC CGTTCGTCAT TTCCGTCGCC CCTCTGGCCA ATCCGCCTCC CGGTGCCACG CAACCGCAAA TCACCGAAAA CAACCGCATT CATCAACGCG ACGCCGACAT CGACAACCTT TATGTCGCCG TCAACAACGC GCTTCGCCAG CAACTTCTCG ACGCGGTTCC CCGCATTTAT GTCCGCGCCC TCGCCCATCC CATGTTAGAG TTTAGCAACG TCACGTGCCT TGACTTGCTC TTGCACCTCT GGACCAAATA CGGTACCATC AAGCCCCCCG AGCTCCAGAA AAATTTCCAG TCCATGTACA CCCCTTGGAA CACAACCGAG CCGATTGAAT CAGTTTTTCT CCAGCTTGAC GAGGCCATCG CTTTCTCCGT TGACGGTAAC GACCCCATCT CGGAAGCTGC TGCTGTTTGC GCAGGCTACG AAGTCATTGC GCACTTGGGC CTGCTCCCAC TGGACTGCAA AGAATGGCGC AAATTGCCTA CTGCTGCTCA CACCCTTGCC CATTTCCAGC AGCACTTTTC CCTGGCCGAC GAAGACCGGC GCCTCACGGC AACCACCGGT TCCCTTGGAT ATGCCAACGT GCTTGCTGCT GCCCCCTCTC TCGCTCCTGC CACAACCTCC GACGCTCTCA ACCTTCCTTC TCCGCGCTCT CTGTGTCCCA GACTTCTGTC TCTTCGCCGG ACATGACCTA TTGCTGGACC CATGGTACCA GCAAAAACCG GCGCCATACA AGCGCCACAT GCAAGAACAA GGCCCCTGGC CATGGTGCCA GGCCCTCGAC TCCGGCCATC TTGCGACTTT TCCAGAACTT TCCTCCCGCC AGGTCCGCAA GTATCCACCT AGTTCCCCCG CCATGGTCAA GGGCCACCTT GACCAACAAC GCGCGAACCT TCGCTCCACC AAGCTTCCCC CTGTCGGTTC CCCCATCCCG ACGGAACCAC CTGCCGCCGC TGTGCCTGAC CTTGACCCTC CCGACCCCCC CCCCCCCCCC GTCGCATGCA CACACCATGT CTTTGTTGCC CACCAAAGGG TTACCGGTCA GATCTACACG GACCAACCGG GCCGTTTCCT CACTCCCTCC AGTGCCGGCC ACAACAACAT GCTTGTTCTT TACGATTACA ATAGCAACGC CATCCATGTC GAACTCATGA GGAACAAGTC CGGACCCGAG ATTCTGGCCG CCTACAAACG TGCTCACGCT CTGTTTACCC AGCGCGGCCT GCGTCCACAA CTTCAGCGCC TCGACAACGA AGCCTCTACC GCCCTCCAAT CCTTCATGAC CTCGGAACAC GTCGACTTTC AGCTGGCACC CCCCCCCCCC CATCTGCACC GTCGTAATGC CGCCGAACGA GCCATACGCA CCTTCAAGAA TCACTTTATT GCTGGCCTCT GTACCACGAA CCCGGATTTC CCCCTGCATC TTTGGGACCG CCTCCTCCCC CAGGCCCTCA TTACCCTCAA TCTTCTTTGT CGCTCCCGCA TCAATCCCAA GTTGTCCGCC CACGCACAAC TTCACGGTGC CTTTGACTAC AAACGCACCC CGCTTGCTCC TCCCGGCACT CGTGTCTTAG TTCATGTCAA 
GCCAGCTGCT TGCAAAACCT GGGCCCCCCA TGCTGTTGAA GGTTGGTCGG CATCAATAAT TTTACAAGCA ACAAGGGCTC GATGGCTCTG GCCGTCAGGG GCTACATGA
|
Protein sequence | MSTSAHFKLS DFPHKVLDPI ATLTVPPTYA TIKRAQRQLM TNAAAIPTLN GGGAHGHMAL TLTALAYADI SDVPFVISVA PLANPPPGAT QPQITENNRI HQRDADIDNL YVAVNNALRQ QLLDAVPRIY VRALAHPMLE FSNVTCLDLL LHLWTKYGTI KPPELQKNFQ SMYTPWNTTE PIESVFLQLD EAIAFSVDGN DPISEAAAVC AGYEVIAHLG LLPLDCKEWR KLPTAAHTLA HFQQHFSLAD EDRRLTATTG SLGYANQKPA PYKRHMQEQG PWPWCQALDS GHLATFPELS SRQVRKYPPS SPAMVKGHLD QQRANLRSTK LPPVGSPIPT EPPAAAVPDL DPPDPPPPPV ACTHHVFVAH QRVTGQIYTD QPGRFLTPSS AGHNNMLVLY DYNSNAIHVE LMRNKSGPEI LAAYKRAHAL FTQRGLRPQL QRLDNEASTA LQSFMTSEHV DFQLAPPPPH LHRRNAAERA IRTFKNHFIA GLCTTNPDFP LHLWDRLLPQ ALITLNLLCR SRINPKLSAH AQLHGAFDYK RTPLAPPGTR VLVHVKPAAC KTWAPHAVEG WSASIILQAT RARWLWPSGA T
|
| |
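The coordinate fields in the Gene Information table can be cross-checked against the reported gene length. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the values in this record but is not stated by it. It also assumes the stop codon lies inside the gene span. The function name `gene_length` and the constants are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
START_BP = 22185
END_BP = 24093
PROTEIN_LENGTH_AA = 591

length_bp = gene_length(START_BP, END_BP)

# Coding bases implied by the protein: three per residue plus one stop codon
# (assumption: the stop codon is counted within the gene span).
coding_bp = 3 * PROTEIN_LENGTH_AA + 3

print(length_bp)              # 1909, matching the "Gene Length" field
print(length_bp - coding_bp)  # 133 bases not accounted for by the protein
```

The 133 bp remainder fits the record's "Is gene spliced | Yes" entry, since a spliced gene spans more genomic sequence than its coding region alone.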