Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38066 |
Symbol | |
ID | 7202748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 792236 |
End bp | 796795 |
Gene Length | 4560 bp |
Protein Length | 1519 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181977 |
Protein GI | 219123325 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00336128 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAAGA AACCATCGTT TTCGGTTGAC ATTTGTGCCA CTTGCATCAC GTCCAAAGCG CCTCCTTTGC CGTTGCGAGA GGAACTCCCC TCGTTGGGAT TCGAAATCGT TTACAACGCC GGTGGTCTGG GAGCGAAGGC GAGCAAACAC GAAACCAAGA TAGCGTTGGA TCAGTATCGG AAACAATGCC AAACCAAAGA ACTCAAAAAA CGACGATTCT TTCCCAAGAG TTCGAGACGA ATCAGAGAAA CCCTGTTGGA GCGGAAAAAG ATTGAGCACC AGCTCAAGAT ACCCCATGCT CTCTCCTATG CAGCATACGA GCATAGATTG GCCCGTCTCG AGGAGAAAAG TTGGACGTCA ACTTCGACTT CCTTCACGCC CACTACGCAT GCGAGCTCCC ACAAATCGAT ATCGTTATAC AAGTACAAAG ACGGCAAGAA GCCTTTCACA AAACGATGCA AGGGAAGATG GGGTATTTCG CTGCGTAGAC GTATAGGCTC GAATCGTAAG AAAGACACGA AAAGTCTTCC TTCCGAAACA TCGGAATTTT TGCAGACTTC AAGCATCAGG GATACTATCG GTAAGGCCGC CTTCGGAGAC TTTCTTGTGC CTGTGAATTC ATCATCGCAC GAAAAGGAAA ATGGCGTAGA TTGCGGCATG TATACCAAGC GCAATCCACT GAAGTGGAAA AGGGCGTTTG ACTCAGAATC TTTCGACAGC CACAGCAATG ACTCGTCTAT CTCGAGCAAC GGAGACTGCT CCACCAACGA CATCAAGTTG CAATTTGTCG CTTCTGACGA AATGGAAAAA AAGTTACCTG TTTTCAACAC CTTCAATGAG ACGGAACGAG AGGGCGCCGC CTTTGCCAAT CAAGGGATGA ACAAGGATGT TTCGCTGATC ATGGCTGCTG GTGACTCAGC GAAATCATCA GCCAAGCTTT ATGACGATCT TGAGGTGGAG GAGGCGATTG CCGCGGCGTT TAAAGAATTT GCAAATCCAA AGCTAAACTC AGCATCTATT TCGCCTATCG CGGCTCAATC GTCAAATAAC CCGTTCAATA TCCATGAAGC ACCTGCTCCG TGGTCGGGGT CGACTAAATA TGCCATTGAA AATCAATCTG CGGAAGCAAA CGGAATCAGC TCGAAAGTTG AACCACAAAG ACCGATGGAG AGAGAGCCAA TTCGTGATCG GGACAAATCG CACATCAAGC AACAAAATTG GTTTTGGTTT GCCAAACATC TGTTTGAAGA CGTTGCAGGA GATGTCGTCA AAGCTTCTAT TACTTCAACG CCAGGTTTCA AAATAGAAGC AAAAAGTTTA GAGGTCGCGG ATGAGCCAGT GGTGGTCGAA GAAAGGCGCG CCGAAGTCAA GAAGAGTATT GCGCATCTAC CTGCTGTGAC TTTGTCGATC AAAACCAAAG CAGGAAGCGA AGGGCTAGGA GACTCCGGTG GCCCCAATCA GTGCAACACT CGTCTGCCTT TTCTTGAGAT GCTTACAGGA CTTGGTAGTT GTATGTCCGA CATTTTGGGG GCCGTTCAAG AAACTCCGGA ACTCGCTCAT GTCAAATGCC AAGACGACGG TAGATTCCCA TTGCATACTC TCTGCAATAG AGACATTATT GATCGTTCAT CGGTTGCTGA ATCACCACTG GCAGGTGTTT TGCTGGCCGA CATCTTGGAG TACAAAGAGG TCCTTAGGAA CCTGTTGGAA GCCTACCCTG CGGCAGCGAC TGTTATGGAC AAGACCGGCG ACCTACCAGT GCATTTGCTT GCCAGGACGC TCATGAAATG GGAAGCCTAC TGGTATACAA CTGTTTACGC TCAAGCTGCG AAAGTCTTCA ATCCCGATGG AAAAGATGCT AACGCAATCT CGTCGTTGTA TCATACGATG TCGACATGTG TTGAACTCGT TCTAGAACCC CTGGTTACAA AAGAGGATTT GTGTCGCCAA CCAGGTAGTG TTGGTAGAAT GTTTCCATTG CATATCGCAT CTATATTTAC ATGTTCAGTC GAATCACTCC AATCACTCCT CGAAGCATAC CCGAATGCAG CAAAAAAAAG GTGTGATCTC AACACCCTCA ATACCTTTAC CCCCGACGAT TCTTTTCCTT TAATTTTGCA TGATTCGCTT TCAACAGACT TTCCAAAATG GGAGGTTGAA ATCTTCAAGC AAAATGAATC TTATGTGGCA GGCCTTGGTT GCCACAAGTA CGGTGATGAT ACCGAGAGGT ACCTCCGCCG ATCGGACCTG CTCTTCGCCT ACAACCCAAC CATTGAGCCA TATAGGCTTG ATGGCGAACG CATAAGACGT CTCGAAAGAA AAATAATTTT TGAAGCAATT CAAGTCGTCG AAGGGAAAGA AGACTGCCTT AGCAAAGCCA TGCAGCGAAT TTGGGTATGG CTCTGTACAT TCAAAGAACA TGGAAGCAAG CGACCCACTT ACAGTCAAAG CGTGAAGAGA ATCGCAAGAA TGTTAACGCT TCGTGCTGTT CAATATCTAG CCTCAATCTC AACTCCACAT GGAAAATTAG TCTTGGACGA GGCTACTCCT GAATGTTCAA GAGTCATTCA ACTCCGGCTT GCAGAAAGCC AACCCATTGA CCTATGGGGA AAAAGGGTTC CACAATCTCG TCGTTTGACG AAGCAAAATG TGAGGCAATT TGTAAAAAAG ATCTTCAATG TTGAGCAGCA GCGCTTCCCT ACGAGCTTCA TTGTACTACC GTATCGTCTG CAGGTGAATA AAGACGGTTT TAACAGCATT GGAACACCAG AAGCGATCGA GGTTGCGGAC TTATTCACCA GTTATATGCT ACGTCTGACA GATCCAAGAG CTCTTCTTCA CTATCTTAAC GTAAAATCAC AGAAACACTA TGGTGAACCA CTTATCCAGG GAAGCAAGAC GGATGAGGCC CGACAAAATT TTCTCGATTC AATCAAGAGA GTAGAAGACA ATATGCTTTC TCTATACAAA CCAGGATCGT CGTTTTTATA TCTTCTAGAT GAAGGAACTG GCTTTCCAGT AGTTCCAGGG AGCACTGACG AGTACCCAAT CATCCTCGCT GAACCTATGA GCATCATTCC CAAAATATTC CCGCTGATGA TGCCTGGATT GGTCATGATG CGAGGCGAAC AGTATCTAAT CACATTGTCG AAGGTCCTCC TCGACAGCGA CGTAACGATG GTACCACGAA ACTGGTTGAC GACTACAGAT AATATCGAAA GAAGGCTATT ATCCGCTGAG GTTGCGGATT TGACTGACAG TGAAGCGTCG ACATTAGGTG ATCGTATATC TCAGCTTTCG TCTAGCAACA AGACTCGATG GGAGGGGATT ATGTCACCCA AGAATGGAAA TACTGAATGG GCTGCGGAAA TGTCTATTCT CAACACTCTG ATGGAAGTAA ACGACCCATC GATGAAGTTT GCTGGTCTCA AAGTTCAACG TGACGATGAT TCGTCTTTGT TTTGGTCTGT GGACTCAAAC AATGATTGCT GTAATTCGAC TCTTGCGCTA GGAGCATTGC ACATTGATGA CCTCTCCAAT AAACAAAAAG AATTGGCCAA AAAGCTGGAA CGCATGAGAG TCGACAGGAA AGATTCGGGT CCAGATTTTC GATACCCTAG AAAGAACTTC GTTGAGCCAC ATAACGAGAT GCTTAACAAA CCATTTTGGC AAAGGCTAGC TAACGCTTGC GACTACGTCA AATATCCTGA CAAAGCAGTA GACTCAGAAA GAAAAAAGCT TCCCTTGAAA ATTGCTATTG ACTGTAAATC AAAATTGGAT TCTGAAATTT CCTCAGATGA TATGAAAGGG GAGATCCGAA AGAAGTACAG CTTTCTTTTA TCCGACTTGG CAGTTAAGGA TCGCGATAAT AAGGGACAAG AAGCCTTCTG CTGTCGACAA CGAGAACGAG GTTCGAGTAG TGAAAGAATT TGGGAGGATG TGACTTCGGA ATTAGACCGT GCTGGAATGT TTTACGAGGA GAGTGCAATC TTGCACCTGA AAGTCGGCAT TGCTGAAGAA GCAAAACGAA GCGGGCTTCT CGCCAAGCGA ATCGCTTCGC TCCAGGAAGG CGGGGCTCAT GTATGTTTTA AGGCGGAAGC ACTAGACACG GAGCTGCCAT ATCACGAAAT TCAAAACGAA GTGACTGTCC TTCCAGGACT TTCTGATTCC AGGAAATTGT TGATTAGGCT GTTCGACTTG GAGGACCGTT TGTTGTGTGA CGAGATTGAT ATTCAGCATC TCGCAATTGA GTCCCTTTCT ATGTTTTACC AGATTGAAGA GGTCGACCCC GACAATCTGT TTGAAGAAAC GAAAGAAGAA TTCAATAGCC GCGAGAGCGT TTTACATCGT GCACATCCTC GTGTACACTC TGCTTCTTTT AGAGCAGATG CGAGAGGATT TTGTGGTGAG AAAACGCCTA GTCTAGCCAC GTCGTTGTCT AATGACCCTG GAAGTTTCGT CAACCCTCTT GACCTGACGA ATCTGAATGA TAATGTAATG GAAGACATCG CAGTGCCGTG GGTTGTGTAT CACGAATCGA CTGGCCAAAT TGAGTTCTGA
|
Protein sequence | MTKKPSFSVD ICATCITSKA PPLPLREELP SLGFEIVYNA GGLGAKASKH ETKIALDQYR KQCQTKELKK RRFFPKSSRR IRETLLERKK IEHQLKIPHA LSYAAYEHRL ARLEEKSWTS TSTSFTPTTH ASSHKSISLY KYKDGKKPFT KRCKGRWGIS LRRRIGSNRK KDTKSLPSET SEFLQTSSIR DTIGKAAFGD FLVPVNSSSH EKENGVDCGM YTKRNPLKWK RAFDSESFDS HSNDSSISSN GDCSTNDIKL QFVASDEMEK KLPVFNTFNE TEREGAAFAN QGMNKDVSLI MAAGDSAKSS AKLYDDLEVE EAIAAAFKEF ANPKLNSASI SPIAAQSSNN PFNIHEAPAP WSGSTKYAIE NQSAEANGIS SKVEPQRPME REPIRDRDKS HIKQQNWFWF AKHLFEDVAG DVVKASITST PGFKIEAKSL EVADEPVVVE ERRAEVKKSI AHLPAVTLSI KTKAGSEGLG DSGGPNQCNT RLPFLEMLTG LGSCMSDILG AVQETPELAH VKCQDDGRFP LHTLCNRDII DRSSVAESPL AGVLLADILE YKEVLRNLLE AYPAAATVMD KTGDLPVHLL ARTLMKWEAY WYTTVYAQAA KVFNPDGKDA NAISSLYHTM STCVELVLEP LVTKEDLCRQ PGSVGRMFPL HIASIFTCSV ESLQSLLEAY PNAAKKRCDL NTLNTFTPDD SFPLILHDSL STDFPKWEVE IFKQNESYVA GLGCHKYGDD TERYLRRSDL LFAYNPTIEP YRLDGERIRR LERKIIFEAI QVVEGKEDCL SKAMQRIWVW LCTFKEHGSK RPTYSQSVKR IARMLTLRAV QYLASISTPH GKLVLDEATP ECSRVIQLRL AESQPIDLWG KRVPQSRRLT KQNVRQFVKK IFNVEQQRFP TSFIVLPYRL QVNKDGFNSI GTPEAIEVAD LFTSYMLRLT DPRALLHYLN VKSQKHYGEP LIQGSKTDEA RQNFLDSIKR VEDNMLSLYK PGSSFLYLLD EGTGFPVVPG STDEYPIILA EPMSIIPKIF PLMMPGLVMM RGEQYLITLS KVLLDSDVTM VPRNWLTTTD NIERRLLSAE VADLTDSEAS TLGDRISQLS SSNKTRWEGI MSPKNGNTEW AAEMSILNTL MEVNDPSMKF AGLKVQRDDD SSLFWSVDSN NDCCNSTLAL GALHIDDLSN KQKELAKKLE RMRVDRKDSG PDFRYPRKNF VEPHNEMLNK PFWQRLANAC DYVKYPDKAV DSERKKLPLK IAIDCKSKLD SEISSDDMKG EIRKKYSFLL SDLAVKDRDN KGQEAFCCRQ RERGSSSERI WEDVTSELDR AGMFYEESAI LHLKVGIAEE AKRSGLLAKR IASLQEGGAH VCFKAEALDT ELPYHEIQNE VTVLPGLSDS RKLLIRLFDL EDRLLCDEID IQHLAIESLS MFYQIEEVDP DNLFEETKEE FNSRESVLHR AHPRVHSASF RADARGFCGE KTPSLATSLS NDPGSFVNPL DLTNLNDNVM EDIAVPWVVY HESTGQIEF
|
| |