Gene Information |
Locus tag | PHATRDRAFT_49227 |
Symbol | |
ID | 7195693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 283081 |
End bp | 288120 |
Gene Length | 5040 bp |
Protein Length | 1675 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183848 |
Protein GI | 219127242 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000254751 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCGA AAGTAAGGAT TATAGATCCG GAGGAATTCC CCAGCGATAC AGACGAGAAC TTGGAGATCG TACCGCTGAC TCCCCGAGAC TCCGCGGAAA CGACCGAGTC CTTTCATACG CACGCCACAT CTTCGTACTT GCACTTCTTT CGATCTGATC AGCTACCTTC CGCTCGGTCG CATCGCTCCG AAACACCACT TTGTCCTACG TCTGAACAGC GGCATAGCGT GCTCCTGGAC GAAGAAGAAG ACGGAGAATA CGAGGATTTG GACGACATAA GTCCCAGTCC CAGGAAATAT GTCCCCACTC AGTACATGCT CAGTGAACCC CCTCGCCACA GAACCCCAGC TTCCTGTCGC TCGCTTTCTT CTGTATCTTC CGCATGCTCA GTAGAATCCT CCCGCCTTTC TATTTTATCT TCCGCCTCTC ATCTTTTCGG AGAAAGTATT GTCAATTTTG ACCAACGCGA TATTGAGACA GGCAGCGTCG CTTCGAGCTA CGTTTTGTCT CCAGGGCGCA CGGTGATTTC ACCCACATCC CGTGGTGCCG ACAGAAATAT AATTTGGTGG AGAGAAATGT CCGACAACCT GATTACGATC GCTCGTGGCC TTCGTGAGGA TCTTCCCCAC CTCATCGAAA GCGTCTGGCC CCGACATCAT CAAACACGCC TTCCGCTCTC ACCGCGTTCT CGAAATCTGG CCGACATTAA CTTTTTAACT TCCCAACATA AATCCAAGGT GCAGGCGTTG GTCTTTCCGG AGGAAGAATA TTTTGACTTT TGCCTGGTTT TAAAGCCACA GGAAACGTAC GCCTTCTGGG CATCACTGTT GGACTTTCGG GTCGAGATCT TGGGTCAGGA AAGGGTCGAT CAAATGAACG AAGCATTGGA ATCTTCTTCG ACTAATAATA CTCGGTCCAA CACTCCTTCT AGTGATCGTC CATCAACGCC TCGATCTTAT TCTGACGATG GATCTGAAGC CAGGGATGTT GTCGACTCGG TGATTGCCAC ACCGCCGACT ACGGGAATGC ATCGTCGACG AGCCAATGTG ACGGGTAAGT CACAAACTAC CCCCGGACAA AGTACCAGTG GTTCTCCTAT GCCTACACCT TACGAATCGG CTACCATGAC ACGATCCAAG ATGTCTATGG TCTCCCCTGG TCTCTATAGT GTCGCTGATT CCAGTCGTGT CAGTCAAGGC ATGACAAGGC TTTCCATGTT TGAACGTGCC ATTCACAACG GTTCGTCTAC CGCTTTTACC CCTGAGTCAT GTCTGCGGCG CTCCAGCACA GTCGCCCTGG ATAATGCGTT GGTGGAGAGC GCTACGCCCA ATACCGTACA TCGCCGTCGC TGGGGAAATC ATACAGCGTC GCAGACACCG AACATGATGT CGCCTCCAAT CCGTAGCTTG ACGCGAGGAA GTAGCACAGT ACGCCGATCG ACCACAATAC GTTCAACTGC CTTTCCGTCC AGTGGCAACG GTACAGAAAT CACACTTGGG GTTGCTGAGA AAACGAGTCC ACGTACTGTC AACGAGATTC GCATAGAAGA CATTCCCAAC CAAGTGATTC CTCGAGGCAT TGCGGCTCAT ACCAATGGGA TGCTTCACTT TCTTAGTGCT CTCAAACGAG GCATCGTAGT CCGCCGTCAC CGACCCGGGA AGGAAGCCGT CTTCAGCAAG ATTGTATCTA GCGATGGAGG AGATACCATT CAGTACATAT TTGTCGAGAA GGAAGACGGC ATGAACGCTT TCAAGGAACA ACGGGTCCGT TACAACAACG TCTCAGCAGA CGAAGTGGAA 
AATACTCAGC CCTGGAGCTA TGAGCAGCAC AGTTTAGAGT CGGACACTAC CAGACCAAAT CACGATTTTT CAGTGCCTGA TTACGTTGCC GCTAAACAAT ACCGTGAAAA GATGAGGCGC GAAGAAGGTC TGAGGAAGAA CGTCAAAACT CTGGCAACCA AAGTTGTGCG AAGCGGGGCG GCCAAGGCTG CGGATATAAT TGCGGTTCAT CCTGGCCAGC ACGAAGATCC TCGGTCGTCT GAGAGGAACC TTGGTAGCAC AAGTTTACGT CGGTCGAACT GCTCGTTCTC TGCTCCACAC ACGTTTTCAC TGGTCCTTCG AACTTCCCAG TCTTTCGGAC GTAATCGCGA AATGTCGCTT GACGAATGGG AGCAGAAATG GTATAGTGGT GAAGGAAATG AGTCGTTGTT CCGGTATGTT GATATTGAGG CCGCAACCAA AGGTGAATAT TGGTTACTCT TTCGTGGCTT CTTGCTTCTT CATCGTGATG CCGCCGTTGG GCGCTTCGCC GAGCAACGTG CAGCAGGTAT AGGTTCCCAC TACAGTCGAC TCGAGGTCGA ACAACGTGAA CAGGCTGATT TGGAAGCGCA TAATCGATTG CATCGAGACG AATTCCACGA GCCGGTGACG GTAGGGTGTC TCGAGAAGCT GATTGTGAAG TGGCGACAGC TAGATACGAC ATATATGGAG GGGTTTACTA TGGCAGGAGC CTTGCCACCG CCTTCCGACT ATTTTCTGGG ATTCAAATCG GCCGGTACCT CGATCTGGAG TCGACTTCGG CAGGCTGGTT TGGAGACTCA ACGGGTGTAT TCGCTCGACC CGCGACGAGT CCTGATCAAA GTGCGATGTC CTTCAGATCG TCTCATGGAC GTGGCCGAGG TCCTCAAATT GAAACTTCGA TCCAGTGAGG GAGGGTTTGC GCCGTTTAGG GAAGATATGA TGGACATGTT TAAATCAACT GATGACTTGA CGGAAACGCC ACACATAGAC AACGTCCATT CATTTCATTT CCGGTCCTCG ATTCGGCAAT CAATAATTGA CTTTATTATT TCTTCGCGCA TTCGAGATTC GGGTGCTGAG TTGGGCCAGA CAACAGATGT TGGTAAGATG ATCCAATCAC GGGTACCATT GCACATGCGA GCCAAAGTCA ATAGTATATA TCAAACGTGG ACGCACTTTT GGAAGGAAGA AAATTGGACT GGGCGCGATG GATGCAGTCT ATCTCATGAA AGCTTTTCAG ACACTTCGAA AGGTGTAGAG CACGATCGCT TTTCCTTTGT CTCTAAATCG ACCTGTGATA CGGAAAGTGG CGACTCTAGC GAGGCAGCGG TTCCGCATCT TTTCGTCCGT ATCTTCAAAG GCTGTTTCTA CCAGCCGCTC GATTCGATTG AGCAGTATTT TGGCGAAAAG GTTGCATTTT ATTTCGCTTG GCTACAGCAT ACAGCCGGTC ATCTTGTCTG GCTGTCGATA TTCGGGTTCA TCATGTTCCT TCTGCAAGTC GGAAGTGGTA GCTGGGATCA CCCATTGCGA CCGTTCTACT CTGTTATGGT CATGATATGG ACTTTCACAG TGTTGATCAA TTGGAAGAAG CGAGCCAACT ACCTGGCATA CCGATGGGGT ACTCTAGATT ACAAGGAACA AGAGACAACG CGCCCGGAAT TCAAAGGTGA CTATATGAGA GACGAAGTGA CAGGCGAGTG GGTAGTCACG TATCCGAAAT GGAAACGCTG GGTCAAATAC TCTATTTCTT TTCCTTTGAC TCTTCTCTTT ACTGCCGGCT CGTTAGTCTT GATCCTTTGG GTGCATGCCA 
ATCGCGATCT CACGTTGGCC CGCTATCTTG ATCAAAAGGC GAATCCTGGC TCCGAGAAAT TCCAGTTCAA TTTCGCAATT AGTGCTATTG GAAAGGAGGC CGCGATTACT GATGTTCAGC TAAGCAGAGA GCATATTTTG GATCCTACCT TCTGGTTTAT AACGATTGGA ATGCCAGCAT TGCTTGGATT GTGTCAGCCT CTGCTTAATC TTCTTCTGAT GAAACTATCG CTGATGTTGA ATGACTTTGA AAACTATCGC ACAGAATCCG AGTACAGAAC TTATCTGATT ATCAAGGTCA TCTCGTTTCG CTTTGTCTGC TACTTTGCCC ATTTGTACTA CTATGCATTT GTTTCAGTTG GCTCAACTCA AGCGATTGAA AATGGAATTC TTCGTGTGGG AACGGGAGTC TTTGTCTACA CTACAGTTGC TCATTGGTGG CAAATCTTTC TACAAATATA TTTCCCGATA TTAATTCGCA AGCTTCGCAT GTACTACCGC GATAAGCGCC TTTGCGAAGA ACTCCGTGAT CTTGAACTCG ACGAAGAGGA GGTTAGGGAA ATGGCTTCTC GTGGACTACG TGTCAACTTG AAAGAACGAC AGGTTCGCCT GGTAAATAAA CGGTTATTGG TAGAACAGGC GCAAGACGAC ATTTGGTTGG AGGTCATGCT GCCCGAGCAC AACAGTTTTC CCGAGTACAT CCAAGCTGTT GTCCTTTTTA CGTACGTCTC TTGTTTCAGT GCCGTGCTAC CTATCACACC TTTGATTGTA CTCTTTAACT ACCTGGTGAG TATGCGGCTT GATGCTTTCA AAGTATGCAA AGGACGACGT AGGCCGTTGG CAGAGAAGAC TGGGGGAATA GGCATTTGGG AACACGTGCT TCATATTGTT GCGGTTATTT CTGTCTTAAC AAACTGCTGG ATGATGGGCT TTACAAACGC GTTGTTCGTC AAAATTGGGG AGAGTATTGG AGAAGTGGGA CTGTTTGCGA TCATTGTCGT TTGGGAACAC GTCATGCTTC TTATCAAATA CGTCATGGAA ACCTCGATAT CTCCTCTTCC CAAAATAGTC AAGGACGCGA TCAAGCGCGA ACAGTTCGAG CTGGACCAAC AGCGTAACAC GTCCATGCGC CTACGACAAG GTCGCCGCTC TCAACACGAT CGAGAAAGTG TCGGAGAAGA TCGTACACAA GGTGTTTGGC GCAATGTCCC TTCTATAGGA CGGGCTTCTG CTTTACATCC CATTCACTCT GAAGATCAGG AAAGTGTGCG CTCGGTTTCA AGAGCATTAA GTCGGGCCCC TACACTTGAT TTGGGTGAAT CCTCGATTGA ACAGTCAATG ATCGATTCCG TGCGTACACC AAAAGTAGGG AAAAGCGACG TTGAGCAAGG TTTGGAGAAG ACTTTGTTCA GCGCCTAGAA ATCGTATATT
|
Protein sequence | MKPKVRIIDP EEFPSDTDEN LEIVPLTPRD SAETTESFHT HATSSYLHFF RSDQLPSARS HRSETPLCPT SEQRHSVLLD EEEDGEYEDL DDISPSPRKY VPTQYMLSEP PRHRTPASCR SLSSVSSACS VESSRLSILS SASHLFGESI VNFDQRDIET GSVASSYVLS PGRTVISPTS RGADRNIIWW REMSDNLITI ARGLREDLPH LIESVWPRHH QTRLPLSPRS RNLADINFLT SQHKSKVQAL VFPEEEYFDF CLVLKPQETY AFWASLLDFR VEILGQERVD QMNEALESSS TNNTRSNTPS SDRPSTPRSY SDDGSEARDV VDSVIATPPT TGMHRRRANV TGKSQTTPGQ STSGSPMPTP YESATMTRSK MSMVSPGLYS VADSSRVSQG MTRLSMFERA IHNGSSTAFT PESCLRRSST VALDNALVES ATPNTVHRRR WGNHTASQTP NMMSPPIRSL TRGSSTVRRS TTIRSTAFPS SGNGTEITLG VAEKTSPRTV NEIRIEDIPN QVIPRGIAAH TNGMLHFLSA LKRGIVVRRH RPGKEAVFSK IVSSDGGDTI QYIFVEKEDG MNAFKEQRVR YNNVSADEVE NTQPWSYEQH SLESDTTRPN HDFSVPDYVA AKQYREKMRR EEGLRKNVKT LATKVVRSGA AKAADIIAVH PGQHEDPRSS ERNLGSTSLR RSNCSFSAPH TFSLVLRTSQ SFGRNREMSL DEWEQKWYSG EGNESLFRYV DIEAATKGEY WLLFRGFLLL HRDAAVGRFA EQRAAGIGSH YSRLEVEQRE QADLEAHNRL HRDEFHEPVT VGCLEKLIVK WRQLDTTYME GFTMAGALPP PSDYFLGFKS AGTSIWSRLR QAGLETQRVY SLDPRRVLIK VRCPSDRLMD VAEVLKLKLR SSEGGFAPFR EDMMDMFKST DDLTETPHID NVHSFHFRSS IRQSIIDFII SSRIRDSGAE LGQTTDVGKM IQSRVPLHMR AKVNSIYQTW THFWKEENWT GRDGCSLSHE SFSDTSKGVE HDRFSFVSKS TCDTESGDSS EAAVPHLFVR IFKGCFYQPL DSIEQYFGEK VAFYFAWLQH TAGHLVWLSI FGFIMFLLQV GSGSWDHPLR PFYSVMVMIW TFTVLINWKK RANYLAYRWG TLDYKEQETT RPEFKGDYMR DEVTGEWVVT YPKWKRWVKY SISFPLTLLF TAGSLVLILW VHANRDLTLA RYLDQKANPG SEKFQFNFAI SAIGKEAAIT DVQLSREHIL DPTFWFITIG MPALLGLCQP LLNLLLMKLS LMLNDFENYR TESEYRTYLI IKVISFRFVC YFAHLYYYAF VSVGSTQAIE NGILRVGTGV FVYTTVAHWW QIFLQIYFPI LIRKLRMYYR DKRLCEELRD LELDEEEVRE MASRGLRVNL KERQVRLVNK RLLVEQAQDD IWLEVMLPEH NSFPEYIQAV VLFTYVSCFS AVLPITPLIV LFNYLVSMRL DAFKVCKGRR RPLAEKTGGI GIWEHVLHIV AVISVLTNCW MMGFTNALFV KIGESIGEVG LFAIIVVWEH VMLLIKYVME TSISPLPKIV KDAIKREQFE LDQQRNTSMR LRQGRRSQHD RESVGEDRTQ GVWRNVPSIG RASALHPIHS EDQESVRSVS RALSRAPTLD LGESSIEQSM IDSVRTPKVG KSDVEQGLEK TLFSA
|
| |
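The Gene Length and GC content fields above can be cross-checked against the coordinates and the sequence text. The following is a minimal sketch, not part of the original record: the helper names (`clean_sequence`, `span_length`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are both inclusive and that GC content means the share of G and C bases in the gene sequence. Whether the full pasted sequence reproduces the listed values exactly has not been verified here.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of the record.
    print(span_length(283081, 288120))  # 5040, matching the Gene Length field

    # Toy fragment: the first two ten-base blocks of the gene sequence.
    fragment = clean_sequence("ATGAAGCCGA AAGTAAGGAT")
    print(len(fragment), gc_percent(fragment))
```

To check the whole record, pass the full Gene sequence text to `clean_sequence` and compare `len(...)` and `gc_percent(...)` with the Gene Length and GC content fields.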