Gene PHATRDRAFT_42555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42555 
Symbol 
ID7196095 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp405918 
End bp409852 
Gene Length3935 bp 
Protein Length1159 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176579 
Protein GI219109650 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.240204 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGGGG TACCAAACGG TGAAACGACG AAACCGACCG ACGGGCCCGC CGTTCCAGAA 
GGGGCTTCTT CGCTGCTCGG CTTATTTGCA CCACCTCCTG GCTCGGTTGA AAAGAGTGCA
CGCCCTGCGG AAAACGCTCT TTTGGGAAAT TCCAGAGCGA CGAATGATAC TTCGGCAGCT
GTATCTTTGG AACGACAGCT GGAATTCCCT CGAGAGTCAA CCATACACAG GTGCAATAGT
GCAGAGCTTC TGGATGCAGG GAACAACAGC ACAAGCATAC CCTTTCTACC AGATTCTACA
ATTGATGAGC CTTTAGATTC ACCGAAGCAA GGGCTCAATA CTAGAGTCAT AGAATCCGAG
TACGACGACA ACTTTGGCTG TCTACTGGAT GGTCCGATTC TGAATGAGAA TACCCCGCTT
CTTGTTGAAA AGATGCAGCA CAGCACTTCA ATCGGAGGTC TTTTTGATCC TCTACCGGAA
TCTAAGACAC CAATTCCCCC TCACAATGAG GTTACACCCA AGGCGGGACA CAGGCAGCGC
AAAACTCTGT TGTCGACTAC CCGGAACGCT CGGACGGACT CGGCATTGCC TCCCATTATC
GAATCGGTAC GACCTTCCGT TGACACTCAC AACGACGATT CACCGGAAAT GCACAAGCAA
CAAATAAGAT CAGACTGCTG GCAAGGCTTC CTATCTAAAT TTTGGCAAGC TTACCATGAA
TGTCTACAAC CCACAACCTG GGTTGGGGCT TTCATGTTCC TACTCTACCA AATCGTGTTT
TGTTTGACTA TGGGTTCGGC TATAACCCGA CCGCACAGTA CTGTTTCTCT GCTAGGACTC
TTGACCAAAA TGTCCGCTTT AGGCATCATC CTGGGCGCAC CAGTCTACTG GTACGGCAGT
GGAACGGAAA TTCCTGCCCT CTACCCGACG GTAGATTTGT TTTCGGCACC CTTTCTCGCC
GAGATTGCCG TGGTGGTCGA CAACACCTTA TTCGAAGACA AAAATGTCAC CTACCAGGAA
AATGACGCCT TGTTTTTGGG CACCTTTACT TTTCTGGCTT CCGTGGCATT GTTCCTTTCG
GGAACGCTTC TGGTACTCGC CAGTGTCTTT AAATTAGCGA ATCTTGGTGC CTTTTTGCCC
TTTCCCGTCT TATGCGGATT CTTTGCCGCG GTTGGTGTCC TGACATGGAC ACTCGCCTTT
AAAGTCGATA CAAACGGCCT AACAGTACAT GAGGTGGTCT TCTCGGGAGA TGCAGCTCTT
GTACTTCACA GTTTACGTCA TCATTTGCCA AGTGTCTTCA TTGCAGCAAT TATGAAGTAT
CTGGGACCAA AGAATCCTTT TTACGTTGCC GGGGTAGTGC TTGCGACAAT TTGCATGTTT
TATATCTTTA TGCTTAGTTT CGGAGTATCC ATGGAACAAA TGATTGAATG TGAATGGTTC
TGGGCACGCT CCGACCTTGT CTATGAATCG CTGGATGTCA AGGTATGATG TCGACACCGA
AAACAAGCAC GGCATCAATG TGTGACTGTA GTTCTGACGT CTTGTTTTCT CTGCTATTGC
CTAGGTTGGC TTTGCCAAAT GGGCTCCGCC TGCGCCTATG GGATGGATCA GTTCCTTTAT
TTCAGGAAAT GTGCATTGGG GAGCTGTCCA AAAAGGGCTC AACCCAACTG TCGCTTTGGC
TTTTCTTTAC ATGATTCGGT GTTCATTGCA TGGCGCAGCT TTAAAAAAGA ATGTGCCAAA
CTTGGAGAGG ATCGTCAAAG GGAGAGCACG GCCCAAGCTA ATACGGGATC GGTCCGTCCA
AGCCTCTGGA CCCCGCCGTC GTAGATTTTC TGAAGTGGTT GACATCGAAA ATCTAGCTTC
TGTGATGTCG GAATTGGACG CAGATGGCCC CTCAACAATT CATCCAAAGC CTACCCACAT
GTCGTTGAAG GATATTCTGA TTCAGTACGG ATATAGCCAA TATGTCTGTG GTCTCATGGG
AAGTTTCGCA ATTACACCTT CAGTGGCAGC ATCGCCGACT ATGTATATGG TCAGTTTGTT
AAAATTGTGC TGTAGGCCTG TCACTTTTTC GCTTCTCTCA TGCATTGTCG TTGCGAATTT
TTTCAGTTGG GTGCTGAAGG TGTTGCACCA CAATTGGGTT CGGTTCTGCT CTTGTCCTTA
TTTTATTTGA CTGACTTTCA AGCAGTTTCC TACATTCCGA AACCTGCTTT CTCGTCATTG
CTTGTCTTGG CTTTTATCGA TATGACTTCA ACTTGGTTCG TCAAGTCTTA TTTCAAGACT
AGGGAGAAAA TGGAATGGCT GGTCGTTCCT TTGATCGTGC TGCTCGCCTT CGTTGTCGGT
TTACTTGGTT CAGTCTTTTT AGGGATCGCC ATGTCAACGG TACGTTGTCG TGTCCCTTTG
ACAAAAAGTA TTCCGAAGAA TTTCTCATCG TTTATCTTCA TCCGGTCAGT TTCTTTTCGT
AGCTGCTTTT TTTCGCAGTG GAGTTGTCAA GTATGTCGCC AATGGTATTG CAATTCGTTC
AACAATTGAA AGGCCTTTAA AGACAGCAAA TTGGCTGGAC AGAAATGGTG AGCTGATACA
AATTCTTGTT CTGCAGAACT ATTTGTTCTT TGGAAACGCT TCGTCAATAC TGAACTACAT
ATGTTCAATG TTCGAAGATC CCGATCCTGC CCTCGACGAA GTTTTTGTGG TTCCAATCCC
GAAAATTATT GTCCTGGACC TGACCCTTGT AACTGGTATC GATACATCAG CAGTCGATGT
GTTTTCGGAC ATTTTCAGTA TGGTTGGGAA GCACAACTGT AAGCTCTTCC TTTCTGGTGT
CTCCAACAAC CTGCGGCAAG TCATGGCAAT GGCTGGTGTG AAGCCAGAGA GTAGTGTTGA
TAGAAAGAAG CGACAGTTAA GGTTTTTCTC GAATCTGGAC ACAGCAATTG GAAAGGCGGA
AGACATGCTG CTTGATGACG CTGGAATTGA AGAGCAAAGT GATTTTGGCT ACACGGGTGC
AAAGGGCTTC GCTCTCGCTC TGTGGCATAT TGATGACCAG GTATGTTGTT GTATTGTCTC
TTCTTTGTAA AAGTCGACCG AGAAACTCAC AATACATCCA CGTTTAGCAC GACACTAAGT
ATGCAAAAGA TCTGATGGCT CTAAAGGATT ACACAATTCA GATTGAGGTC GAACCCGGCG
AAATGCTATA CGAAGATAAG CACTTGGACA GAGGACTTTT CTTCATCGAA CACGGAATAA
TGGTTCGTTC ACATAGCTGC CAAGGAGATT GTGCCCTATT TCTGCTTGCA CTCACCAGCG
CCTTTCGTTC ATAGAGAATA GAGCGCAACG CTAACTTTAC TCTGTCTCGG GTTGGCAGTA
CAGATTCGCT ATCAAAGCTG GGTCAGACCT CGGGTACAAT TTCTTGTCTG AACGCAAGGT
CAGCCTCAAT AGGGAGGGAA GTCGCTCGCC TTAAGATGTC GGGGGTGTCC GCACGAAACC
ACATGTTTCG GGTAGCGCGG ATAGGCCCTG GCTGGGTCCT CGGATCTATC GAAGCGCTAA
GTGGTGCAAT TCATCCTGGC AGTATGATTG CAGGTGAGTG TTATGCTTTG AACATATGTC
TCGTGAGTCG TCGATTGTGA ACTTATGTGT CAAACGAATT GCTCAGTCAC TCAGTGCCGG
CTTCACTACA TTTCTTATAA GAAGATCGAA GATATTGAAC GGAGCGACCC GTTGCTCGTG
TTAACGTTAC ACAAATTGCT TTCATACTTG ATGGCAAGGC GGCAATCGGT CACAATCCAT
CAACTAGCGA CTTTGCATTC AATTATGAGC TCTCCTGCCC AGAAGAAGCC TATCGGAAGA
GCTGGAAGCA GCGGCTTCCA TATGTCGTAG CATAGAAATA GAATTATATA AAAGAATTGC
TTTGCCAGTA AATCAGCGAG CAAGACCATT GTTTC
 
Protein sequence
MVGVPNGETT KPTDGPAVPE GASSLLGLFA PPPGSVEKSA RPAENALLGN SRATNDTSAA 
VSLERQLEFP RESTIHRCNS AELLDAGNNS TSIPFLPDST IDEPLDSPKQ GLNTRVIESE
YDDNFGCLLD GPILNENTPL LVEKMQHSTS IGGLFDPLPE SKTPIPPHNE VTPKAGHRQR
KTLLSTTRNA RTDSALPPII ESVRPSVDTH NDDSPEMHKQ QIRSDCWQGF LSKFWQAYHE
CLQPTTWVGA FMFLLYQIVF CLTMGSAITR PHSTVSLLGL LTKMSALGII LGAPVYWYGS
GTEIPALYPT VDLFSAPFLA EIAVVVDNTL FEDKNVTYQE NDALFLGTFT FLASVALFLS
GTLLVLASVF KLANLGAFLP FPVLCGFFAA VGVLTWTLAF KVDTNGLTVH EVVFSGDAAL
VLHSLRHHLP SVFIAAIMKY LGPKNPFYVA GVVLATICMF YIFMLSFGVS MEQMIECEWF
WARSDLVYES LDVKVGFAKW APPAPMGWIS SFISGNVHWG AVQKGLNPTV ALAFLYMIRC
SLHGAALKKN VPNLERIVKG RARPKLIRDR SVQASGPRRR RFSEVVDIEN LASVMSELDA
DGPSTIHPKP THMSLKDILI QYGYSQYVCG LMGSFAITPS VAASPTMYMA CHFFASLMHC
RCEFFQLGAE GVAPQLGSVL LLSLFYLTDF QAVSYIPKPA FSSLLVLAFI DMTSTWFVKS
YFKTREKMEW LVVPLIVLLA FVVGLLGSVF LGIAMSTFLF VAAFFRSGVV KYVANGIAIR
STIERPLKTA NWLDRNGELI QILVLQNYLF FGNASSILNY ICSMFEDPDP ALDEVFVVPI
PKIIVLDLTL VTGIDTSAVD VFSDIFSMVG KHNCKLFLSG VSNNLRQVMA MAGVKPESSV
DRKKRQLRFF SNLDTAIGKA EDMLLDDAGI EEQSDFGYTG AKGFALALWH IDDQHDTKYA
KDLMALKDYT IQIEVEPGEM LYEDKHLDRG LFFIEHGIMR IERNANFTLS RVGSTDSLSK
LGQTSGTISC LNARSASIGR EVARLKMSGV SARNHMFRVA RIGPGWVLGS IEALSGAIHP
GSMIAVTQCR LHYISYKKIE DIERSDPLLV LTLHKLLSYL MARRQSVTIH QLATLHSIMS
SPAQKKPIGR AGSSGFHMS