Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42555 |
Symbol | |
ID | 7196095 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 405918 |
End bp | 409852 |
Gene Length | 3935 bp |
Protein Length | 1159 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176579 |
Protein GI | 219109650 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.240204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGGGG TACCAAACGG TGAAACGACG AAACCGACCG ACGGGCCCGC CGTTCCAGAA GGGGCTTCTT CGCTGCTCGG CTTATTTGCA CCACCTCCTG GCTCGGTTGA AAAGAGTGCA CGCCCTGCGG AAAACGCTCT TTTGGGAAAT TCCAGAGCGA CGAATGATAC TTCGGCAGCT GTATCTTTGG AACGACAGCT GGAATTCCCT CGAGAGTCAA CCATACACAG GTGCAATAGT GCAGAGCTTC TGGATGCAGG GAACAACAGC ACAAGCATAC CCTTTCTACC AGATTCTACA ATTGATGAGC CTTTAGATTC ACCGAAGCAA GGGCTCAATA CTAGAGTCAT AGAATCCGAG TACGACGACA ACTTTGGCTG TCTACTGGAT GGTCCGATTC TGAATGAGAA TACCCCGCTT CTTGTTGAAA AGATGCAGCA CAGCACTTCA ATCGGAGGTC TTTTTGATCC TCTACCGGAA TCTAAGACAC CAATTCCCCC TCACAATGAG GTTACACCCA AGGCGGGACA CAGGCAGCGC AAAACTCTGT TGTCGACTAC CCGGAACGCT CGGACGGACT CGGCATTGCC TCCCATTATC GAATCGGTAC GACCTTCCGT TGACACTCAC AACGACGATT CACCGGAAAT GCACAAGCAA CAAATAAGAT CAGACTGCTG GCAAGGCTTC CTATCTAAAT TTTGGCAAGC TTACCATGAA TGTCTACAAC CCACAACCTG GGTTGGGGCT TTCATGTTCC TACTCTACCA AATCGTGTTT TGTTTGACTA TGGGTTCGGC TATAACCCGA CCGCACAGTA CTGTTTCTCT GCTAGGACTC TTGACCAAAA TGTCCGCTTT AGGCATCATC CTGGGCGCAC CAGTCTACTG GTACGGCAGT GGAACGGAAA TTCCTGCCCT CTACCCGACG GTAGATTTGT TTTCGGCACC CTTTCTCGCC GAGATTGCCG TGGTGGTCGA CAACACCTTA TTCGAAGACA AAAATGTCAC CTACCAGGAA AATGACGCCT TGTTTTTGGG CACCTTTACT TTTCTGGCTT CCGTGGCATT GTTCCTTTCG GGAACGCTTC TGGTACTCGC CAGTGTCTTT AAATTAGCGA ATCTTGGTGC CTTTTTGCCC TTTCCCGTCT TATGCGGATT CTTTGCCGCG GTTGGTGTCC TGACATGGAC ACTCGCCTTT AAAGTCGATA CAAACGGCCT AACAGTACAT GAGGTGGTCT TCTCGGGAGA TGCAGCTCTT GTACTTCACA GTTTACGTCA TCATTTGCCA AGTGTCTTCA TTGCAGCAAT TATGAAGTAT CTGGGACCAA AGAATCCTTT TTACGTTGCC GGGGTAGTGC TTGCGACAAT TTGCATGTTT TATATCTTTA TGCTTAGTTT CGGAGTATCC ATGGAACAAA TGATTGAATG TGAATGGTTC TGGGCACGCT CCGACCTTGT CTATGAATCG CTGGATGTCA AGGTATGATG TCGACACCGA AAACAAGCAC GGCATCAATG TGTGACTGTA GTTCTGACGT CTTGTTTTCT CTGCTATTGC CTAGGTTGGC TTTGCCAAAT GGGCTCCGCC TGCGCCTATG GGATGGATCA GTTCCTTTAT TTCAGGAAAT GTGCATTGGG GAGCTGTCCA AAAAGGGCTC AACCCAACTG TCGCTTTGGC TTTTCTTTAC ATGATTCGGT GTTCATTGCA TGGCGCAGCT TTAAAAAAGA ATGTGCCAAA CTTGGAGAGG ATCGTCAAAG GGAGAGCACG GCCCAAGCTA ATACGGGATC GGTCCGTCCA AGCCTCTGGA CCCCGCCGTC GTAGATTTTC TGAAGTGGTT GACATCGAAA ATCTAGCTTC TGTGATGTCG GAATTGGACG CAGATGGCCC CTCAACAATT CATCCAAAGC CTACCCACAT GTCGTTGAAG GATATTCTGA TTCAGTACGG ATATAGCCAA TATGTCTGTG GTCTCATGGG AAGTTTCGCA ATTACACCTT CAGTGGCAGC ATCGCCGACT ATGTATATGG TCAGTTTGTT AAAATTGTGC TGTAGGCCTG TCACTTTTTC GCTTCTCTCA TGCATTGTCG TTGCGAATTT TTTCAGTTGG GTGCTGAAGG TGTTGCACCA CAATTGGGTT CGGTTCTGCT CTTGTCCTTA TTTTATTTGA CTGACTTTCA AGCAGTTTCC TACATTCCGA AACCTGCTTT CTCGTCATTG CTTGTCTTGG CTTTTATCGA TATGACTTCA ACTTGGTTCG TCAAGTCTTA TTTCAAGACT AGGGAGAAAA TGGAATGGCT GGTCGTTCCT TTGATCGTGC TGCTCGCCTT CGTTGTCGGT TTACTTGGTT CAGTCTTTTT AGGGATCGCC ATGTCAACGG TACGTTGTCG TGTCCCTTTG ACAAAAAGTA TTCCGAAGAA TTTCTCATCG TTTATCTTCA TCCGGTCAGT TTCTTTTCGT AGCTGCTTTT TTTCGCAGTG GAGTTGTCAA GTATGTCGCC AATGGTATTG CAATTCGTTC AACAATTGAA AGGCCTTTAA AGACAGCAAA TTGGCTGGAC AGAAATGGTG AGCTGATACA AATTCTTGTT CTGCAGAACT ATTTGTTCTT TGGAAACGCT TCGTCAATAC TGAACTACAT ATGTTCAATG TTCGAAGATC CCGATCCTGC CCTCGACGAA GTTTTTGTGG TTCCAATCCC GAAAATTATT GTCCTGGACC TGACCCTTGT AACTGGTATC GATACATCAG CAGTCGATGT GTTTTCGGAC ATTTTCAGTA TGGTTGGGAA GCACAACTGT AAGCTCTTCC TTTCTGGTGT CTCCAACAAC CTGCGGCAAG TCATGGCAAT GGCTGGTGTG AAGCCAGAGA GTAGTGTTGA TAGAAAGAAG CGACAGTTAA GGTTTTTCTC GAATCTGGAC ACAGCAATTG GAAAGGCGGA AGACATGCTG CTTGATGACG CTGGAATTGA AGAGCAAAGT GATTTTGGCT ACACGGGTGC AAAGGGCTTC GCTCTCGCTC TGTGGCATAT TGATGACCAG GTATGTTGTT GTATTGTCTC TTCTTTGTAA AAGTCGACCG AGAAACTCAC AATACATCCA CGTTTAGCAC GACACTAAGT ATGCAAAAGA TCTGATGGCT CTAAAGGATT ACACAATTCA GATTGAGGTC GAACCCGGCG AAATGCTATA CGAAGATAAG CACTTGGACA GAGGACTTTT CTTCATCGAA CACGGAATAA TGGTTCGTTC ACATAGCTGC CAAGGAGATT GTGCCCTATT TCTGCTTGCA CTCACCAGCG CCTTTCGTTC ATAGAGAATA GAGCGCAACG CTAACTTTAC TCTGTCTCGG GTTGGCAGTA CAGATTCGCT ATCAAAGCTG GGTCAGACCT CGGGTACAAT TTCTTGTCTG AACGCAAGGT CAGCCTCAAT AGGGAGGGAA GTCGCTCGCC TTAAGATGTC GGGGGTGTCC GCACGAAACC ACATGTTTCG GGTAGCGCGG ATAGGCCCTG GCTGGGTCCT CGGATCTATC GAAGCGCTAA GTGGTGCAAT TCATCCTGGC AGTATGATTG CAGGTGAGTG TTATGCTTTG AACATATGTC TCGTGAGTCG TCGATTGTGA ACTTATGTGT CAAACGAATT GCTCAGTCAC TCAGTGCCGG CTTCACTACA TTTCTTATAA GAAGATCGAA GATATTGAAC GGAGCGACCC GTTGCTCGTG TTAACGTTAC ACAAATTGCT TTCATACTTG ATGGCAAGGC GGCAATCGGT CACAATCCAT CAACTAGCGA CTTTGCATTC AATTATGAGC TCTCCTGCCC AGAAGAAGCC TATCGGAAGA GCTGGAAGCA GCGGCTTCCA TATGTCGTAG CATAGAAATA GAATTATATA AAAGAATTGC TTTGCCAGTA AATCAGCGAG CAAGACCATT GTTTC
|
Protein sequence | MVGVPNGETT KPTDGPAVPE GASSLLGLFA PPPGSVEKSA RPAENALLGN SRATNDTSAA VSLERQLEFP RESTIHRCNS AELLDAGNNS TSIPFLPDST IDEPLDSPKQ GLNTRVIESE YDDNFGCLLD GPILNENTPL LVEKMQHSTS IGGLFDPLPE SKTPIPPHNE VTPKAGHRQR KTLLSTTRNA RTDSALPPII ESVRPSVDTH NDDSPEMHKQ QIRSDCWQGF LSKFWQAYHE CLQPTTWVGA FMFLLYQIVF CLTMGSAITR PHSTVSLLGL LTKMSALGII LGAPVYWYGS GTEIPALYPT VDLFSAPFLA EIAVVVDNTL FEDKNVTYQE NDALFLGTFT FLASVALFLS GTLLVLASVF KLANLGAFLP FPVLCGFFAA VGVLTWTLAF KVDTNGLTVH EVVFSGDAAL VLHSLRHHLP SVFIAAIMKY LGPKNPFYVA GVVLATICMF YIFMLSFGVS MEQMIECEWF WARSDLVYES LDVKVGFAKW APPAPMGWIS SFISGNVHWG AVQKGLNPTV ALAFLYMIRC SLHGAALKKN VPNLERIVKG RARPKLIRDR SVQASGPRRR RFSEVVDIEN LASVMSELDA DGPSTIHPKP THMSLKDILI QYGYSQYVCG LMGSFAITPS VAASPTMYMA CHFFASLMHC RCEFFQLGAE GVAPQLGSVL LLSLFYLTDF QAVSYIPKPA FSSLLVLAFI DMTSTWFVKS YFKTREKMEW LVVPLIVLLA FVVGLLGSVF LGIAMSTFLF VAAFFRSGVV KYVANGIAIR STIERPLKTA NWLDRNGELI QILVLQNYLF FGNASSILNY ICSMFEDPDP ALDEVFVVPI PKIIVLDLTL VTGIDTSAVD VFSDIFSMVG KHNCKLFLSG VSNNLRQVMA MAGVKPESSV DRKKRQLRFF SNLDTAIGKA EDMLLDDAGI EEQSDFGYTG AKGFALALWH IDDQHDTKYA KDLMALKDYT IQIEVEPGEM LYEDKHLDRG LFFIEHGIMR IERNANFTLS RVGSTDSLSK LGQTSGTISC LNARSASIGR EVARLKMSGV SARNHMFRVA RIGPGWVLGS IEALSGAIHP GSMIAVTQCR LHYISYKKIE DIERSDPLLV LTLHKLLSYL MARRQSVTIH QLATLHSIMS SPAQKKPIGR AGSSGFHMS
|
| |