Gene PHATRDRAFT_49227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49227 
Symbol 
ID7195693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp283081 
End bp288120 
Gene Length5040 bp 
Protein Length1675 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183848 
Protein GI219127242 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000254751 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCGA AAGTAAGGAT TATAGATCCG GAGGAATTCC CCAGCGATAC AGACGAGAAC 
TTGGAGATCG TACCGCTGAC TCCCCGAGAC TCCGCGGAAA CGACCGAGTC CTTTCATACG
CACGCCACAT CTTCGTACTT GCACTTCTTT CGATCTGATC AGCTACCTTC CGCTCGGTCG
CATCGCTCCG AAACACCACT TTGTCCTACG TCTGAACAGC GGCATAGCGT GCTCCTGGAC
GAAGAAGAAG ACGGAGAATA CGAGGATTTG GACGACATAA GTCCCAGTCC CAGGAAATAT
GTCCCCACTC AGTACATGCT CAGTGAACCC CCTCGCCACA GAACCCCAGC TTCCTGTCGC
TCGCTTTCTT CTGTATCTTC CGCATGCTCA GTAGAATCCT CCCGCCTTTC TATTTTATCT
TCCGCCTCTC ATCTTTTCGG AGAAAGTATT GTCAATTTTG ACCAACGCGA TATTGAGACA
GGCAGCGTCG CTTCGAGCTA CGTTTTGTCT CCAGGGCGCA CGGTGATTTC ACCCACATCC
CGTGGTGCCG ACAGAAATAT AATTTGGTGG AGAGAAATGT CCGACAACCT GATTACGATC
GCTCGTGGCC TTCGTGAGGA TCTTCCCCAC CTCATCGAAA GCGTCTGGCC CCGACATCAT
CAAACACGCC TTCCGCTCTC ACCGCGTTCT CGAAATCTGG CCGACATTAA CTTTTTAACT
TCCCAACATA AATCCAAGGT GCAGGCGTTG GTCTTTCCGG AGGAAGAATA TTTTGACTTT
TGCCTGGTTT TAAAGCCACA GGAAACGTAC GCCTTCTGGG CATCACTGTT GGACTTTCGG
GTCGAGATCT TGGGTCAGGA AAGGGTCGAT CAAATGAACG AAGCATTGGA ATCTTCTTCG
ACTAATAATA CTCGGTCCAA CACTCCTTCT AGTGATCGTC CATCAACGCC TCGATCTTAT
TCTGACGATG GATCTGAAGC CAGGGATGTT GTCGACTCGG TGATTGCCAC ACCGCCGACT
ACGGGAATGC ATCGTCGACG AGCCAATGTG ACGGGTAAGT CACAAACTAC CCCCGGACAA
AGTACCAGTG GTTCTCCTAT GCCTACACCT TACGAATCGG CTACCATGAC ACGATCCAAG
ATGTCTATGG TCTCCCCTGG TCTCTATAGT GTCGCTGATT CCAGTCGTGT CAGTCAAGGC
ATGACAAGGC TTTCCATGTT TGAACGTGCC ATTCACAACG GTTCGTCTAC CGCTTTTACC
CCTGAGTCAT GTCTGCGGCG CTCCAGCACA GTCGCCCTGG ATAATGCGTT GGTGGAGAGC
GCTACGCCCA ATACCGTACA TCGCCGTCGC TGGGGAAATC ATACAGCGTC GCAGACACCG
AACATGATGT CGCCTCCAAT CCGTAGCTTG ACGCGAGGAA GTAGCACAGT ACGCCGATCG
ACCACAATAC GTTCAACTGC CTTTCCGTCC AGTGGCAACG GTACAGAAAT CACACTTGGG
GTTGCTGAGA AAACGAGTCC ACGTACTGTC AACGAGATTC GCATAGAAGA CATTCCCAAC
CAAGTGATTC CTCGAGGCAT TGCGGCTCAT ACCAATGGGA TGCTTCACTT TCTTAGTGCT
CTCAAACGAG GCATCGTAGT CCGCCGTCAC CGACCCGGGA AGGAAGCCGT CTTCAGCAAG
ATTGTATCTA GCGATGGAGG AGATACCATT CAGTACATAT TTGTCGAGAA GGAAGACGGC
ATGAACGCTT TCAAGGAACA ACGGGTCCGT TACAACAACG TCTCAGCAGA CGAAGTGGAA
AATACTCAGC CCTGGAGCTA TGAGCAGCAC AGTTTAGAGT CGGACACTAC CAGACCAAAT
CACGATTTTT CAGTGCCTGA TTACGTTGCC GCTAAACAAT ACCGTGAAAA GATGAGGCGC
GAAGAAGGTC TGAGGAAGAA CGTCAAAACT CTGGCAACCA AAGTTGTGCG AAGCGGGGCG
GCCAAGGCTG CGGATATAAT TGCGGTTCAT CCTGGCCAGC ACGAAGATCC TCGGTCGTCT
GAGAGGAACC TTGGTAGCAC AAGTTTACGT CGGTCGAACT GCTCGTTCTC TGCTCCACAC
ACGTTTTCAC TGGTCCTTCG AACTTCCCAG TCTTTCGGAC GTAATCGCGA AATGTCGCTT
GACGAATGGG AGCAGAAATG GTATAGTGGT GAAGGAAATG AGTCGTTGTT CCGGTATGTT
GATATTGAGG CCGCAACCAA AGGTGAATAT TGGTTACTCT TTCGTGGCTT CTTGCTTCTT
CATCGTGATG CCGCCGTTGG GCGCTTCGCC GAGCAACGTG CAGCAGGTAT AGGTTCCCAC
TACAGTCGAC TCGAGGTCGA ACAACGTGAA CAGGCTGATT TGGAAGCGCA TAATCGATTG
CATCGAGACG AATTCCACGA GCCGGTGACG GTAGGGTGTC TCGAGAAGCT GATTGTGAAG
TGGCGACAGC TAGATACGAC ATATATGGAG GGGTTTACTA TGGCAGGAGC CTTGCCACCG
CCTTCCGACT ATTTTCTGGG ATTCAAATCG GCCGGTACCT CGATCTGGAG TCGACTTCGG
CAGGCTGGTT TGGAGACTCA ACGGGTGTAT TCGCTCGACC CGCGACGAGT CCTGATCAAA
GTGCGATGTC CTTCAGATCG TCTCATGGAC GTGGCCGAGG TCCTCAAATT GAAACTTCGA
TCCAGTGAGG GAGGGTTTGC GCCGTTTAGG GAAGATATGA TGGACATGTT TAAATCAACT
GATGACTTGA CGGAAACGCC ACACATAGAC AACGTCCATT CATTTCATTT CCGGTCCTCG
ATTCGGCAAT CAATAATTGA CTTTATTATT TCTTCGCGCA TTCGAGATTC GGGTGCTGAG
TTGGGCCAGA CAACAGATGT TGGTAAGATG ATCCAATCAC GGGTACCATT GCACATGCGA
GCCAAAGTCA ATAGTATATA TCAAACGTGG ACGCACTTTT GGAAGGAAGA AAATTGGACT
GGGCGCGATG GATGCAGTCT ATCTCATGAA AGCTTTTCAG ACACTTCGAA AGGTGTAGAG
CACGATCGCT TTTCCTTTGT CTCTAAATCG ACCTGTGATA CGGAAAGTGG CGACTCTAGC
GAGGCAGCGG TTCCGCATCT TTTCGTCCGT ATCTTCAAAG GCTGTTTCTA CCAGCCGCTC
GATTCGATTG AGCAGTATTT TGGCGAAAAG GTTGCATTTT ATTTCGCTTG GCTACAGCAT
ACAGCCGGTC ATCTTGTCTG GCTGTCGATA TTCGGGTTCA TCATGTTCCT TCTGCAAGTC
GGAAGTGGTA GCTGGGATCA CCCATTGCGA CCGTTCTACT CTGTTATGGT CATGATATGG
ACTTTCACAG TGTTGATCAA TTGGAAGAAG CGAGCCAACT ACCTGGCATA CCGATGGGGT
ACTCTAGATT ACAAGGAACA AGAGACAACG CGCCCGGAAT TCAAAGGTGA CTATATGAGA
GACGAAGTGA CAGGCGAGTG GGTAGTCACG TATCCGAAAT GGAAACGCTG GGTCAAATAC
TCTATTTCTT TTCCTTTGAC TCTTCTCTTT ACTGCCGGCT CGTTAGTCTT GATCCTTTGG
GTGCATGCCA ATCGCGATCT CACGTTGGCC CGCTATCTTG ATCAAAAGGC GAATCCTGGC
TCCGAGAAAT TCCAGTTCAA TTTCGCAATT AGTGCTATTG GAAAGGAGGC CGCGATTACT
GATGTTCAGC TAAGCAGAGA GCATATTTTG GATCCTACCT TCTGGTTTAT AACGATTGGA
ATGCCAGCAT TGCTTGGATT GTGTCAGCCT CTGCTTAATC TTCTTCTGAT GAAACTATCG
CTGATGTTGA ATGACTTTGA AAACTATCGC ACAGAATCCG AGTACAGAAC TTATCTGATT
ATCAAGGTCA TCTCGTTTCG CTTTGTCTGC TACTTTGCCC ATTTGTACTA CTATGCATTT
GTTTCAGTTG GCTCAACTCA AGCGATTGAA AATGGAATTC TTCGTGTGGG AACGGGAGTC
TTTGTCTACA CTACAGTTGC TCATTGGTGG CAAATCTTTC TACAAATATA TTTCCCGATA
TTAATTCGCA AGCTTCGCAT GTACTACCGC GATAAGCGCC TTTGCGAAGA ACTCCGTGAT
CTTGAACTCG ACGAAGAGGA GGTTAGGGAA ATGGCTTCTC GTGGACTACG TGTCAACTTG
AAAGAACGAC AGGTTCGCCT GGTAAATAAA CGGTTATTGG TAGAACAGGC GCAAGACGAC
ATTTGGTTGG AGGTCATGCT GCCCGAGCAC AACAGTTTTC CCGAGTACAT CCAAGCTGTT
GTCCTTTTTA CGTACGTCTC TTGTTTCAGT GCCGTGCTAC CTATCACACC TTTGATTGTA
CTCTTTAACT ACCTGGTGAG TATGCGGCTT GATGCTTTCA AAGTATGCAA AGGACGACGT
AGGCCGTTGG CAGAGAAGAC TGGGGGAATA GGCATTTGGG AACACGTGCT TCATATTGTT
GCGGTTATTT CTGTCTTAAC AAACTGCTGG ATGATGGGCT TTACAAACGC GTTGTTCGTC
AAAATTGGGG AGAGTATTGG AGAAGTGGGA CTGTTTGCGA TCATTGTCGT TTGGGAACAC
GTCATGCTTC TTATCAAATA CGTCATGGAA ACCTCGATAT CTCCTCTTCC CAAAATAGTC
AAGGACGCGA TCAAGCGCGA ACAGTTCGAG CTGGACCAAC AGCGTAACAC GTCCATGCGC
CTACGACAAG GTCGCCGCTC TCAACACGAT CGAGAAAGTG TCGGAGAAGA TCGTACACAA
GGTGTTTGGC GCAATGTCCC TTCTATAGGA CGGGCTTCTG CTTTACATCC CATTCACTCT
GAAGATCAGG AAAGTGTGCG CTCGGTTTCA AGAGCATTAA GTCGGGCCCC TACACTTGAT
TTGGGTGAAT CCTCGATTGA ACAGTCAATG ATCGATTCCG TGCGTACACC AAAAGTAGGG
AAAAGCGACG TTGAGCAAGG TTTGGAGAAG ACTTTGTTCA GCGCCTAGAA ATCGTATATT
 
Protein sequence
MKPKVRIIDP EEFPSDTDEN LEIVPLTPRD SAETTESFHT HATSSYLHFF RSDQLPSARS 
HRSETPLCPT SEQRHSVLLD EEEDGEYEDL DDISPSPRKY VPTQYMLSEP PRHRTPASCR
SLSSVSSACS VESSRLSILS SASHLFGESI VNFDQRDIET GSVASSYVLS PGRTVISPTS
RGADRNIIWW REMSDNLITI ARGLREDLPH LIESVWPRHH QTRLPLSPRS RNLADINFLT
SQHKSKVQAL VFPEEEYFDF CLVLKPQETY AFWASLLDFR VEILGQERVD QMNEALESSS
TNNTRSNTPS SDRPSTPRSY SDDGSEARDV VDSVIATPPT TGMHRRRANV TGKSQTTPGQ
STSGSPMPTP YESATMTRSK MSMVSPGLYS VADSSRVSQG MTRLSMFERA IHNGSSTAFT
PESCLRRSST VALDNALVES ATPNTVHRRR WGNHTASQTP NMMSPPIRSL TRGSSTVRRS
TTIRSTAFPS SGNGTEITLG VAEKTSPRTV NEIRIEDIPN QVIPRGIAAH TNGMLHFLSA
LKRGIVVRRH RPGKEAVFSK IVSSDGGDTI QYIFVEKEDG MNAFKEQRVR YNNVSADEVE
NTQPWSYEQH SLESDTTRPN HDFSVPDYVA AKQYREKMRR EEGLRKNVKT LATKVVRSGA
AKAADIIAVH PGQHEDPRSS ERNLGSTSLR RSNCSFSAPH TFSLVLRTSQ SFGRNREMSL
DEWEQKWYSG EGNESLFRYV DIEAATKGEY WLLFRGFLLL HRDAAVGRFA EQRAAGIGSH
YSRLEVEQRE QADLEAHNRL HRDEFHEPVT VGCLEKLIVK WRQLDTTYME GFTMAGALPP
PSDYFLGFKS AGTSIWSRLR QAGLETQRVY SLDPRRVLIK VRCPSDRLMD VAEVLKLKLR
SSEGGFAPFR EDMMDMFKST DDLTETPHID NVHSFHFRSS IRQSIIDFII SSRIRDSGAE
LGQTTDVGKM IQSRVPLHMR AKVNSIYQTW THFWKEENWT GRDGCSLSHE SFSDTSKGVE
HDRFSFVSKS TCDTESGDSS EAAVPHLFVR IFKGCFYQPL DSIEQYFGEK VAFYFAWLQH
TAGHLVWLSI FGFIMFLLQV GSGSWDHPLR PFYSVMVMIW TFTVLINWKK RANYLAYRWG
TLDYKEQETT RPEFKGDYMR DEVTGEWVVT YPKWKRWVKY SISFPLTLLF TAGSLVLILW
VHANRDLTLA RYLDQKANPG SEKFQFNFAI SAIGKEAAIT DVQLSREHIL DPTFWFITIG
MPALLGLCQP LLNLLLMKLS LMLNDFENYR TESEYRTYLI IKVISFRFVC YFAHLYYYAF
VSVGSTQAIE NGILRVGTGV FVYTTVAHWW QIFLQIYFPI LIRKLRMYYR DKRLCEELRD
LELDEEEVRE MASRGLRVNL KERQVRLVNK RLLVEQAQDD IWLEVMLPEH NSFPEYIQAV
VLFTYVSCFS AVLPITPLIV LFNYLVSMRL DAFKVCKGRR RPLAEKTGGI GIWEHVLHIV
AVISVLTNCW MMGFTNALFV KIGESIGEVG LFAIIVVWEH VMLLIKYVME TSISPLPKIV
KDAIKREQFE LDQQRNTSMR LRQGRRSQHD RESVGEDRTQ GVWRNVPSIG RASALHPIHS
EDQESVRSVS RALSRAPTLD LGESSIEQSM IDSVRTPKVG KSDVEQGLEK TLFSA