Gene PHATRDRAFT_43724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43724 
Symbol 
ID7197013 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1291176 
End bp1294201 
Gene Length3026 bp 
Protein Length890 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177797 
Protein GI219112091 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCTCGGGGCA TTCAACATAA CGTGCGCAGC CTCAAGCCAG TCTAAAAGTT GCCAGGCTTA 
TATCAAAGGT GCAACACGAT GAAGAATGGT AGCCAAAATC CTCAGAAAGA TAAGGAGGAA
TTAGCCGAAG GCGAGCATCG ATCGAGGGAT TTCTCCCCGC CGAGCAGCGA CACGGAAGCT
GACGCTCGTC CTTTGTCTGA AGATGGCACC AAAAGCTCCT CCAATCTGCC AGCGGATTCT
CCTGGAACTT TATCCACTGC CCCTGAAGCG TCTGCGGCAT CTTTGAGATT AAAGCAACTC
GAGCAAGATG TCGTCTCGAA GACATATAAC AGGGGTAACG CCAAACCCTC CCAGCCTGGC
GCCGTCGCTG AAGGCGGTGC GTCCAATCTT TCGCGACTGG AACAAGAAAT CGCCGCCAAG
ACACGAGGCA CCGATCGATC GTCTAGTGGA GGGTCTGTCG GTCTGATTCA ACTGGAACAA
GACGTTGCTG CCAAGGCGCG TGGCAGGAAC ACTACCGCTG CCACTAGACC GGGTGCAGCA
CCTTCGGGCG CAGCTGGCCT CTTGGAACTG GAGCGTGACG TCATTGCCAA GTCGGTGCGT
AATTCGAATA CTTCGCAGCC TGGAGCTGTG AGTGAATTGG ATGTAGTGAA TGGTGTGGGC
GTGACATCCG CCTCGACAGC TTTGAATCAG CTAGAGCGGG ACGTCATAGC CAAGTCGAAG
TCGTATGATG CACGATCGAG CGCCAGCGTC GGCCTGCACC AGTTTGAGAA CGATATTTTG
GCAAAGAACC AGGTACGTCT ACCAAAGGAG GCTAGTGCGA GTGCCGCCCA GTCGTTAGTT
CAAATGGAGA ACAATCTAGC GGCTAAGCTG AGTGCTTCGG GAGCGGCTGT CTCGCCTAGT
AATTTCACTT ACTCGGGGAA AACGGACACA CGCTCCAGGG CACTGATGGC TCCAAACGCG
CTGACGGCGA TGCAGCAGCT GGAGCACCAA ATTGCGCAAA AATCAGAAGC GACTGGCGGT
ACCACCCCTC CCCCCGTTGG AGTCCTTAGC TCCGCTGTCC GAGGACCAAG TGTTCCATCT
ACAGGAAGCG GTCGTCCACT TCGAGCTGAG TTTACTCCTG ATTTTTCAGC CTACGATATC
CCCCCTCCAT TTTCTTATGA ATCACATCCC GTAGGAGAAC ACGCACCACC GGATATCCAC
GGCTTTCATG GCACACAGTA TAGCCAAGAT ATTCCTGGTG CAGAATCAGG CGGTATTGAA
GCATTCGTTG CTGACAACGT CGTCGAAGCA GTGGGCGTTG CGGTCATCAT GAGCGAAGAA
GAGGAGGAGG CGGTTGGCCG CAAGCGTCGG AAAAGGTACC TTTGTTTCGG TGCACTCTTC
TTGTTGTTAC TGGTTGTTGC GATTGTGGTA ACAGTGGTTA TCATTACGGG TCGGTCTAGT
ACTACTGTCC TAGATTTGCC CCCCACTGCA GCACCGTCGA GCGCTCCGTC TTCGGCACCG
TCATTTGCCC CCACAACGGA TGGTGTACAG GCTTTGATCT CGTGTCTGAC GCCTGCAACA
AGCATCGAAA CTTTTCAAGA CCGTGCATCG GCGCAATATC GAGCTGTCGA GTGGTTGACA
AACACGGATC CGTTTGTGCA GATCAATGGG CTTCAATGTG ACAGCCCCAA ATTTTTGCAG
AGATACGCAT TGGCGACCTT TTACTTCGCC CTTTCAGGAG AAGAATGGGA AATCTGTGGC
CTTCAAAATC CCGAATGTAC CAGCGACCCA GCTGACTTTG GATGGCTCTC TACTCAAGAC
GAATGCAATT GGTACAAGGT TAGATGCAAC ACTCTAGACA TGGTGGAATC CATCAATTTC
GGTATGTTAT CCTGCCGAGT CGCGATTTGT TCCCAATAAA ACGGTCTACC TAACATCGCA
ATATTCCCAT TTTCAAATAG CGGACAATAC CGCGGTCGAA CGGGCACTGA CGATCCTGAA
GGGTGCCATT CCGAAGGAAC TGCAGTACTT GACGGACATG GTTCAATTTG TTGTCGCCGA
CATGCAGATT GAAGATTCCA TTCAGGATAG TTTTTCTACT TGGTTGAAAC TAGAGCGTCT
GATCTTGAGC AGAAACAATT TTACGTCGAG CATCCCAGAC GATATTGCTA TCACTAATCC
GCTCTTGTCT GATTTCCAAG TAAGTGACAA CCAGCTTTCG GGTCGTCTTC CAGATGGATT
GGTCAGCTTA TCATTACAGG ATTTGAGACT GGATGGCAAT AGATTCACAG GAAGTCTACC
ATCATCTTTT GGGGAGAATT CCGAAAGGTT GAGTAAGTAG CGCAGCAGGT ATTGCCTCGA
CGAAAGCTTA CAGAAAATTT TGCAACATCT CATGTCGTCT CATCTATCGC AGACAACCTC
GCAGTACAAA GGAATCAATT GGGTGGTCCT CTCCCCAGTT TGCTCTGGAC GCTTCCAAAC
CTTAGAACCC TTGATCTCAG TGAAAATGCT TTCAGTGGTG AGGTACCGAC CACGATTGGA
TTGATGCAGA ACTTACGCGT ACTTCGTTTG CATAGCACGC AGCTCGGAGG AGAATTGCCA
GCTGAGTTCT TTGGTATTCC CAATTTTTCG ACACTAAACA TTGCCAATTG TCGCTTCCGC
GGAGCTCTTT CGGAGAACTT TATCAATTTT AACCAAACAC TACAGGAAGT GATCGTAGCG
TTCAATAACT TTACCGGTCC TATACCCGTT GAAGCATTCG AAGCCGCTCA ATTTCTTGGT
ACGTACGATC TGTATCACAT TCACGCTCTA GATATTGAAC GGATGCGTTT CTAATACAAC
TTTCCAATTA TAGAGGAGCT TAACCTACAG GGAAACCAGC TTTCAGGCGT TATATCCGAA
GCACTGTGCA ATACAAGGGG AACGGCTTTT GGCCAACTAG CCTTTCTAAT TGTTGACTGC
AACATTGATT GCAATTGTTG TGATCCGGTG TCGGATTGTG GCTGATGGTG CGACCTACCG
TCGCTTTGAC ACATGTTTAT GATGAT
 
Protein sequence
MKNGSQNPQK DKEELAEGEH RSRDFSPPSS DTEADARPLS EDGTKSSSNL PADSPGTLST 
APEASAASLR LKQLEQDVVS KTYNRGNAKP SQPGAVAEGG ASNLSRLEQE IAAKTRGTDR
SSSGGSVGLI QLEQDVAAKA RGRNTTAATR PGAAPSGAAG LLELERDVIA KSVRNSNTSQ
PGAVSELDVV NGVGVTSAST ALNQLERDVI AKSKSYDARS SASVGLHQFE NDILAKNQVR
LPKEASASAA QSLVQMENNL AAKLSASGAA VSPSNFTYSG KTDTRSRALM APNALTAMQQ
LEHQIAQKSE ATGGTTPPPV GVLSSAVRGP SVPSTGSGRP LRAEFTPDFS AYDIPPPFSY
ESHPVGEHAP PDIHGFHGTQ YSQDIPGAES GGIEAFVADN VVEAVGVAVI MSEEEEEAVG
RKRRKRYLCF GALFLLLLVV AIVVTVVIIT GRSSTTVLDL PPTAAPSSAP SSAPSFAPTT
DGVQALISCL TPATSIETFQ DRASAQYRAV EWLTNTDPFV QINGLQCDSP KFLQRYALAT
FYFALSGEEW EICGLQNPEC TSDPADFGWL STQDECNWYK VRCNTLDMVE SINFADNTAV
ERALTILKGA IPKELQYLTD MVQFVVADMQ IEDSIQDSFS TWLKLERLIL SRNNFTSSIP
DDIAITNPLL SDFQVSDNQL SGRLPDGLVS LSLQDLRLDG NRFTGSLPSS FGENSERLNN
LAVQRNQLGG PLPSLLWTLP NLRTLDLSEN AFSGEVPTTI GLMQNLRVLR LHSTQLGGEL
PAEFFGIPNF STLNIANCRF RGALSENFIN FNQTLQEVIV AFNNFTGPIP VEAFEAAQFL
EELNLQGNQL SGVISEALCN TRGTAFGQLA FLIVDCNIDC NCCDPVSDCG