Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47417 |
Symbol | |
ID | 7202552 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 576027 |
End bp | 579128 |
Gene Length | 3102 bp |
Protein Length | 1009 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181586 |
Protein GI | 219122509 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.143888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAATG GGGAAAAGAA GTCGACGACC TGGCAAACCC TCGCTTGGTG CACTCGCAAT TCTTTTCATA TGAGATTGAA GAGAGACAAG AAATTGATCT GTCGAATGCG GCAAAACAAC GTCCCAACGG TTTTAAATGT CGCCGAAAAG CCATCCGTGG CCCGGGCTTT GGCTTCTGTC TTTGCCCGAT TACCCAACGC CGTTGATCGT GGAATGCGTC GAGAGGGTAA TCAAGTGTTC AGCCACGAAA ATGTCTGTTT TCCTAGCGTG TTTTCTCAAG GAAATGGTGT ATGCGTTCAA GGACCTAGTA CGTCGCAACG AGCTGCTTTG TAGTACGTCA AGCCGCTTTC ATTTTGCCTC TCACAAACTG GTTACGCGAT ATGCTAAGTA GTAGTCCCAC ACAGTATGAT AACAACATCC GTAAGGGGTC ATTTAGCGTC CCAAGACTTT CCTCCGGCGT ACGGATGGTC CAAGTGCGAC CCAATTGCCT TGTTTGAAGC TCCGATTGAA ACGTCGTATC GCGACGACAT GCAGCCCCTT GAACGCATGT TGAAGTCATT GTCACGTCAA GCCCAGGTCC TAATTCTCTG GTTAGATTGC GATCGGGAAG GAGAAGCCAT TGGCGATGAA GTACGTACCG TGTGCTTGGG GTCAAATTCA AGGCTGCAGG TGTACCGGGC ACGGTTTTCG ACCGTGTTGC CAGTTGAAAT TGAGAGAGCC CTACAAACAC TAGGTCGAGT GAACGAGTAC ATGGTGGCAG CGGTCCAAGC CCGATCTACG CTAGATCTAC GAGTAGGTGC CGCCTTTACA CGATTTCAAA CCCTACGATT GCAACGCAAA TTTGACGGCT TTGCTGAGCA GGGTGTCATC TCCTACGGCC CATGCCAGTT TCCGACTTTG GGATTTGTTG TGGAGCGATG GGCACGGATT GAGACGTTCA TCCCAGAGGA CTTTTGGCAT TTAGAGCTAG CGATTTCTGT GGATGACACT TACCACCAGC AATCGCAAGA AGAAGACCAA AGTGCTGCTC GTGGACGTAC CCAACAAAAT CGGACAATTC ACTTTTCGTG GAAACGCGGA TACTTGTACG ATAAACTTCT GACCACCGTG CTATTCGAAG AGTGTTTGGA AGCTGGAGAA GCGGTCGTTA CCGCAATGAA TGGGCGCACT AGGAATAAGT GGCGCCCCGT CCCACTAGCA ACGGTCGAAC TACAAAAGCG GGCTTCAATG TACTTGCGAA TTGGATCCGA AACCTTGATG TCGGCCGCAG AAGAATTATA TCAACAAGGT TATATATCCT ACCCCCGTAC CGAAACAGAA CGTTTCAGAC CAGAGTTTGA GCACCGCCCA TTAATACAGC AATTTTCCTC ACTTCAAGGG GAGTTCGGGG CCTATGCTTC CAAGTTGCTG AATGAAAATG GGTTTCAAAT TCCTCGTGCG GGGAAAAGCG ATGATCAGGC CCATCCTCCC ATCACGCCTG CCAAAGCCGT TGATCCAAAT ACTATTCAGG ACCAGATACA GCGAAAAATA TATTCTTTGA TTGTAAAGCA CTACTTGGCC TGCTGCTCTC GTGATGCTGT TGGGAGAGGA ACAACGTTGA CAGTCCGAAT GGGCACCGAA GAGTTCAATG CAACTGGTTT GATGGTTATC GAGAAGAATT GGTTAGAAAT ATATTCCCCC TGGGAACGTT GGGGCTCAGG GCAGGGAGAA TTACCTCCGC TCCAAGTCGG TAGTCGCATA AGGCCGACAT CGTTCCTGAT GAAGGAGGGT CGCTCAGGCC CTCCACAACC CATTTCAGAG GTGGAGCTTA TTTCGCTCAT GGATCGCAAT GGTATTGGAA CTGATGCCAC GATTGCGCAA CACATATCCA CCATTCTCGA TCGCGAGTAT GCTAGAAAGG ACGGCAGGCA AAAATTTCTA CCAACACCTC TGGGAATCGC ACTCGTGGAA GGCTACAATT CCATGGGATA TCAACTGAAC AAGCCCGACT TGCGCCGTGA GATGGAGGCC GAATGCAACG AAGTCGCTTC TGGACGTAAA ACTAAGGAAG AAATTATGGT GCCCATTCTT GCGAAAATGA AAAGCTGCTA CGAAACGGCA AGAGCTGAAG CTCGCAAGCT GGACGAAGCT GTTGCACGAC ATTTTCCTCG ACTCGGTGCC GGTGAGAGCA CATCTCAAGT TGTGGAAGAG AGTTTCAGCG AATGCGGAGT CTGTCGCAAC AGCATGGCGT TGAAGCAAGA ACGAGAAAAT AACAACCGTA CAACAGCTCG CAACACTGTG CGGCGCAAAC TGTTGTACTG CAGCACATGC CGGGCAGGCT GGACTTTACC ACGGGGTGTA GTCCGACCAA AAACAGAACA AGAGGACAAT GGTCCTCCTG TCAAATGCCC CATATGTCAA TTTCAGGTGA TTCGGATATT GCGAGGGGAG GGCTATGAAG GCAACGGTTA TCACGTTTGC CCCAAGTGCT TTTCGGATCC ACCTTCCGAT CACGGTGGTG CCAGCAACGC TGGCGACTTC CGCTGCTTTG CTTGTCAACA TCCAACCTGT GCTCTCGCCA GCGGAACACC GGGAGGTGAC GTTGAAGTCT TTCGATGCCC CTTTTGCCAT CCATCGGCAC AACCAACTTC GACCTCTGAT TCCGGGAAAG TATGCGTACG CAAAACATCA CGCGGATACG TACTTTCTTG CAACAAGTAT GTACGAGGTC AGGACCGATG CTCGTATACA ATCTGGCTCC CCAAGGAATG CCACAAAGTC TCTGTGCTCT CGGGCGATGA AAACCAAAAC GAGATCTGTG GTCGATGTTC CTCGCCGCGT GCTGTCATTC GCAAGGTCCA TTTCGTCTGG AAACCCGGTA GCGTTCCGCC GCACTTGGGG CGTGAATGCA CCGTGTGCGT GCTATGCGAT GCCGATTTTC GTCGTGAACT CAATATTTCG TTGCCACAGA TGAACCAAGT ACAAAGTCGA CCCCGCACGA CAGCCGGTCG GGCAGGGCAT CGCGGTGGAG GTGGAACAGA GACAGGGCAG GGAGGTGCAG GAAACACTTG TTTCCACTGC GGCCAGCCCG GTCATTTTGC CAACAGCTGT CCAAATAGAT AG
|
Protein sequence | MDNGEKKSTT WQTLAWCTRN SFHMRLKRDK KLICRMRQNN VPTVLNVAEK PSVARALASV FARLPNAVDR GMRREGNQVF SHENVCFPSV FSQGNGVCVQ GPSTSQRAAL YMITTSVRGH LASQDFPPAY GWSKCDPIAL FEAPIETSYR DDMQPLERML KSLSRQAQVL ILWLDCDREG EAIGDEVRTV CLGSNSRLQV YRARFSTVLP VEIERALQTL GRVNEYMVAA VQARSTLDLR VGAAFTRFQT LRLQRKFDGF AEQGVISYGP CQFPTLGFVV ERWARIETFI PEDFWHLELA ISVDDTYHQQ SQEEDQSAAR GRTQQNRTIH FSWKRGYLYD KLLTTVLFEE CLEAGEAVVT AMNGRTRNKW RPVPLATVEL QKRASMYLRI GSETLMSAAE ELYQQGYISY PRTETERFRP EFEHRPLIQQ FSSLQGEFGA YASKLLNENG FQIPRAGKSD DQAHPPITPA KAVDPNTIQD QIQRKIYSLI VKHYLACCSR DAVGRGTTLT VRMGTEEFNA TGLMVIEKNW LEIYSPWERW GSGQGELPPL QVGSRIRPTS FLMKEGRSGP PQPISEVELI SLMDRNGIGT DATIAQHIST ILDREYARKD GRQKFLPTPL GIALVEGYNS MGYQLNKPDL RREMEAECNE VASGRKTKEE IMVPILAKMK SCYETARAEA RKLDEAVARH FPRLGAGEST SQVVEESFSE CGVCRNSMAL KQERENNNRT TARNTVRRKL LYCSTCRAGW TLPRGVVRPK TEQEDNGPPV KCPICQFQVI RILRGEGYEG NGYHVCPKCF SDPPSDHGGA SNAGDFRCFA CQHPTCALAS GTPGGDVEVF RCPFCHPSAQ PTSTSDSGKV CVRKTSRGYV LSCNKYVRGQ DRCSYTIWLP KECHKVSVLS GDENQNEICG RCSSPRAVIR KVHFVWKPGS VPPHLGRECT VCVLCDADFR RELNISLPQM NQVQSRPRTT AGRAGHRGGG GTETGQGGAG NTCFHCGQPG HFANSCPNR
|
| |