Gene Information |
Locus tag | PHATR_43897 |
Symbol | |
ID | 7204306 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 358432 |
End bp | 365030 |
Gene Length | 6599 bp |
Protein Length | 2125 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186048 |
Protein GI | 219112929 |
COG category | |
COG ID | |
TIGRFAM ID | |
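The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not say so explicitly, but the reported Gene Length matches under that assumption). The function names `span_length` and `min_coding_bp` are hypothetical helpers introduced only for this illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_coding_bp(protein_aa: int) -> int:
    """Bases needed to encode protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the Gene Information table.
gene_len = span_length(358432, 365030)
coding = min_coding_bp(2125)

print(gene_len)           # 6599, matching the reported Gene Length
print(coding)             # 6378
print(gene_len - coding)  # 221 bases in the span that do not encode the protein
```

The surplus of bases over what the protein requires is consistent with the "Is gene spliced | Yes" entry, although the sketch does not determine how those bases are split between introns and any flanking sequence.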
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATAGAGAAGA GAAACATGAC TGTTAGGTTG TGTCTGCAGC TCTATTTGTG GCTACTTGGC TGCACTGCAA GGTCACTTTC ACAGAAGCTT CGTGGAAATC CATGGATGGG AGTGGTGGAA CCCTCCTTTC GAACAAATTC AGGTCGGCGT TTGGGATCCT TCGAGTGGGA GATCGATTTG CTTGATGGCT TTCCATTTCT CAACAACATC ACGGACACCA AACAAATCGA ATTTTCGTAT TGGTTCACCG GAAATGCTTC TCTACCTGGA AGATCTTTTG ACATTGTCCT CCTAGAAAAA GACTGCAGAA CCGCCAGTGT CGTATCGCCT CAAGGCAGAG GGCTATTTCT TGATAGCGAA GAAATTTATG ATCAAGGTAT AGACGTCATT GTCGGTGTTG ATCTTACATC AATTGCATCG TCTTCGTTTT ATAATAGAAC AAGTCAGCTA TCGGCAGCAA TTGAGTTCTG TCTTCGGGTT GATTATCAGA GTGCGGGCGA GTCGGTCAAC TTTCACGAGA CAACAGTATC GGTTTCTGTC GACTTAACCG CCGGTTTTCG ACTACAAAGT ATTTCTACTG AGCGGAACGA GGAAGCGGTG GTATCCGAAA GTGTCGCCTG CGAAGTCAAC GCGTATTTTT GTGACGACTC CTATACCGAC ACGGGTACGC CGAGGCTTGC GCAGGGTGAT GTTTTCCAGG CCTGTATTGC ATCGACTGGG CAATTTGTGA TCAAAGACAT TGTGTCGGCT CAGCTGGATC AAGGTCAAGA CAATCAGACA ACTGGTGTTC GAGACGACCT TGCTATCGAC TACGAAGTTG TCGATCCCCT TACAGTAATA TCCTGCAGCT TGGGACGTTG CAATATAAAA ACGCAAATGA GGACAAAATA CTTTTCAGAA AATAATCCAA TTTCGGTTCG ACTATCGGTA AGTCAAAGCA TCTGATTTCC GATGACCTCA AACGGAGTTC TCATAATTGT TTACCCAAAT TCTTCTTTCG TAGGGGTTGG CTCTGTGTGG CTTCGGATCA ACCCCAGCAG CATATTGTGC GTCATCTGTT AAGCCTGTAA TTGGAGAAAA TTCTGAGTTC TTAATTTACC CTGTTGCTCT CTCCTTGCAT GACTATGATA ATTCGGACGA GTATGAATCC GTCGTGATAA GCTGGTCGAC ACAGGGTACC AGTGCTGAAC CCTTTGTTGA GTTTGCAGTG GCACCGCCAG GTGTTTCAAT CACATTAGCC AACCGAGAGC TCACTTTGCG AGGAAATGCA CAAGACATCG AAGATGCTGT CGCAAATTTG AAGGTTCGCC CGGGACAAGA CAACGGAGAA GATATCGACA TCCGGGTGAA AGCAACAGCA ATGGACAGAA ATGGTCTAAT GGCCACCGCG GAATGCGGCT TTCGTATTCC TGTGGATCCT TTCGTTTTTG GGATTCTGAG GCTTGACTCG CCCTTTCCAT CGGATATCAG TATCCGTGCT TTCGAAGATA CAGCAATCGA GTTGAGCGGA ATATCGATTG AGTTCTCTGG GTATGACACA GACGGCTCCG AGGTGGCTTA CATAGAAATT GAAGCTTCTA GCATTCCAGA AGGGTCCGTG CTTCTTTCTA ATGGAATCGT GCAAACAACA ATTGTTGATG GATTTTTGCG GCTTCCAATG GAAGCAGGGT CGTCACTCAA AATAAAGCCA CCAAGGAATT ACAGCGGGGA AATTACGCTG TCTGTGCGTG GAGCTATTAC TGATGTAAGT CTGGCATGAG ATATCAATGA CGTCAATTAC TTGCCACTGA CTAGGCATGC 
ATACGCATTT CGCAGTACAC TGTTTCAAAT TCTGTGCTAG TAGCCACAAT TCCACAGAAA TTGTCGATTC AAGTGGAGCC TGTTGCTGAT GCAATAATGT GGACTACTGT TTATGCAATT GAGGATAATG GTGCGGTTCC GTTTGGCGCG ATTCTTGCTA ACAGACAGTC GGGGATCCGA CTTAACGACA TAGCGCAAGG GAAGGGAAAC AATCCTGAAC CAGAAACAAT TCTTCAGATA GTGATATCTG TTCCGCCCGA CTCAAACGAA GTCACCTATC TTGTGTCGGG GACGTTTGTA CCTCAGTCGG ATGGTATTGT CACAGGTGAA GGCACAGGAG TCGTATCTTT TGACCTACAA AGGCGGACCT TCACTATTCG CAGTTCGCTC CTGACGAACC GAGTCAATTT AGCAGCGGCT GATCAGGAAA CTCGAGAACA GGCCGATAAA GACATCCGCG CCACCTTAGC AGCCTTTTAT GTGGAGATTG GACCGACACA CACTGATATC GACGGAGACT TGTCGGTAAC TGTAACGACA CTTGATGTTC AAATGGGATT CTTCGATACA ACTGATGTAA CGTTTAAGCC TATCAAGTTG CAGGCTGTTG CTGACCAACC GTCGATTACA GTTCAGAGTG CGGCACAAAT TGTAAGCGAG GACGGTGACT TGGTGCCCCT TTATTTTACT GTGGGACGAA GCGCAGACGA AGATGGTTCC GAAAATCTTA AAGTGCTTAT CTCTGTGCCG TTGGATGAGC TTGGACCGAT CGGCTCTATC ACTGGTTCCT TGCCTGTAGG AGTTCTTGTC ATCAATGAGG GCCCTGGTCT GTATCTTATT TCTTCGTCAG CGACAACTCC TATCGGACAG GAGACTTCTC TAAATCAACT TTTCGATGGA CAGATCAAAT TTCATCCCAG AAACAATTTC TCCGGTCAAT ACCTCGGGTC AAACGGGATA CGTGTTGACG TCTTCATGGA AGAGAGAGCT ACAAATGATC AGCTTGCCCC TGTAGAATAT GGCTTGGATG GAGATTCGAA ATTAGCCAAA GCTACCGGAT TTATTGATAT CTTCGTCTTG CCAGACGCAG ATAAGGCAAC AGTCCTTGTG AAAGGGAATG CCGCTGGCAT GGAAGACAAA CCAATACCAG TCCCATTGAG TGTGACTCTG GGGGATCTTG ACGGATCCGA AAGCTACTCA ATGGAAATTG TTGACAATCT CCCTGCTGGT GCCAAACTTC TCGGTTCAAC CAGTCGTGAA CTTCTTTCAG AAGATGGAGT TTTTTTCTTG TTCCCAATAG ATGTAGAATC TCTCATACTG TTACCACCAC TTCACTGGTC AAGTGCAAAG CAAGGTGACA TTGTTCTCCT TACAAATACA ACAACGGTTG ATATAAGCCC TCAATCATCC TCTAGTGCGT CGGTATACCT TTCCATTCCA ATCAAGATTG TGGGCGTAGC TGACAAGCCA AATACTCGAC CAGTTGTTGT ACATTGTTTG GAAGATGGAA CTTACAACCT CGGATCATCA ATCGGGGATC TAAGTGGGGT CTTGGTTGAT GTAAGCATTC TAATCCGTAT AATCGTTGTC AAGAAAGGAT CCCCTAACCT GCTTTGCTCG TTCTCAATAC AGACCGATGG ATCCGAAGTA CTTTCTCTTG TTTTATCCGG TCTACCACTT GGAGTCGTGC CTCAAAGCGA GGAGAGTAAT ATACAATATT TAGGGAAGGG ACGATGGCAA GTGCCTGAAA CAGCGATTCC TTCTCTCACG CTCCCTGCGT TGCCGAACTT CTCCGGAGAC AATCCCTACA 
CGGGAATTAC TATCAATGCA GTAAGCCAGG AGTTGGACAG GGATCAAGCG GTCTCTGATT CTTGGCCAGT GACAATAAAG GTACGCCCTG TTGTTGATGG GTTTGCTTCT TGGTCGATGT CTTTCACCAC GACGGAGGGG ATGAACGAGG CAAACAATGG CGGTATAGAT TTCAAAAGTG TCAAAAACTT TGTTCTTGGG GATGAGGATA ATTCCGAAGC TGTAATTTCG TTTACTTTTG ACTTTTCCAA CTTAATGGTC AATGCCGGGA TCAGAACTCG CCTGAATCAG CTGGAGGGCC AAAGTGCTGG ACTCGAGGAG TTAGTGTCAA AAAGAATGGA CGGGGAATTT ACCTTCGACC AAGAAAGTGG CACGGTATTG GTACTCTCCG AAAATATCGA AAATATCAAG CTCCACTCAG AACTCTTTCT TGATTCAAAC CAAAATTTTG AAATCCCTGT CAATGCGTTA GTTCGAGATT CTGCTGTATC AGACGGAAGT AGCGTTAGGG AAGACAAGAT CAAGTCAGGC GTGTTATTTG TCATTATAAG TGGAACCGCT GATATCCCTA CGGTGTTTGC AGAGCCGACC ATCTCCGGAA GTGTTGGTGA GTCCGGAATT CCAGTCCGGC TAGGTGGGGA AACGACAGAT CGAGACGTCT CACTTGGCAG AACTCAGTCA GAAGATGTTT ACTATCTGGT CAGCACCAGC AGCAGCTCAA TTTCCAACCT GTTCACATTT TCTGGCGCTG GATTAAACAA TGGAGACGGT CACTGGACTT TAGAGAAAAG CGACTTGGCG GGATTACAGG TTTACTTTAG TCCTTCGATT GGCATGAGAA ATCAAAGCGA AGCTAGTTTT GATTTAACAG CAGTCGCTGT CGAGGATGAT GGAGACTGGG CAGTCAACAC TACGACCTTC AAAGTTATTC TTCAACCCAA CGAAGCGGGA GATACGATGA CGATCCCCCC TTTGCCACCT CTTCTCGCCA TCGGCCTTAA TGCAGGGTTA GAAGATACAA AAATATCTCT GGGTAGCATC AGTGCCGTCG TCAATCCTGA CGATCCTACC AATACAACTA TCAGTGTTGC AATCTGGAGC ATCCCTCCCA ATGCTAAAGT GTTCGGTGCT CGGCTTAACC CCACCACTGG AAAATGGATA TGCTCTGCTG CCAGTATCAA TTCAGGTGCT GTTCAGATAC TTCCGGGCCC AGACTATTCT GGTTCATTGA TGTTGACTAT TCAAGCCATC GCAACCAACC ATTACGCCTT GTCAGCTCAT AGTCCTGAAC AAAAGCTTGA ATTCATTGTA GACCCTGTGG CTGATGGTGC AGCTATTTCC ATCTCTCCTA GCTCAGGCTT GGAAGATGAA ATGGTCTCAC TACGCGTTTC ATTGACGGAG CTGGACGTTG ATGGAAGTGA GCGAATTGGT CATTTTGCAT ACATAAAGAT GACGAACGGG GCCACTCTTA TAGGCAACTA CACTGTCGTC ACAGAGTATG ACAATGACGC CTTCATTGGG ACATTTTCCA ACGCTTACTC ACTGCGCGTA CCATCAGAGG AGATTTCAGA ATTGTCACTG AAGCCGGCCG CGAACTGGCA TGGGACAATC ACTCTAGAAA TTTCAGTCCC GGTTGTAGAG CGTTTCGACG ATGAAGACGG TGATCACGTC AAGGTCGCCC GAAAGTAAGC AAAGTGGCCG AACCAATTCC AATACGCGCA ACTGCTACTC AATTTCTCAC GCAATTGACC AATTTTGCTT TTAGGACTCA TTCAATTGAG ATCAGTGCGT TAGCCGACAT GCCCGACATT 
TCCGTTCCGT CCGTCACCAC TATAGGTGAC GAAGACACAG AAATTGCTAT TGCATTTCTC GCGGCCAGTC TAACCGACAC AATAATCGCG AATGGTCGTG AAATTCTCTC TGTTGTGATC AGTAATGTTC CAGAAAGTTC AACGTTCAAC TCAGGATACA ACAACGGGGA TGGGAGTTGG GCAATTCCGA CCGCAGCCCT AGACGGACTT TCTCTCTTAC CCCCGGAACA TTTCGCCGGA ACTGTCAAAC TGACACTTAC AGCGTATGCA TTGGAACTGG AGAATGGCAG CGAAAACAGT CAGAGCGGCG ACTTTGATAT CAACGTCCGT CCTGTGGCCG ACCCTTTCCT CATGGTCGCG ACAAGTATCA GTTTGCAAGC CAGTTCGGGA ACTGTTGATA TAGCGTTGAA TTTGCGTCTG CAAGATACGC GCGGCGACAG CCCAGGAGAA ATTCCACCCG AGATTGTCCA ATTGGAGTTT CAAAATCTCC CTCGCGGTAT ATATCTTCGG GCCAGTCAGG GTGGTGGTGT TGTCACCACA CCTGCCGGTA CATTTGTGTT CGCCGGCTCG CAAGCCCAAG CCAACGCGTT GCGTCTTTTG GCCCGTAACG CTACAGAAGG CACCTACCAG ATTGCCATCA GTGGTGTCAC CGTTGATGGT GACAGTATCC TGTCACCAGC GGTGCGGGAC TCTTTTAGGC TGACCATCTC CCCGTCAAAC CAAACGTTGC CCGGCGTGGA ACTGATAGGC ACAAACGGGG CGGACAACCT GACTGGATCG ATTGGACATG ATATTTTGAT GGGCTTGGAA GGTGCTGACC TCATGCGTGG AGGCGACGGT ATGGATTGGC TGGCCGGTGG ACCTGGCGCT GATGTGTTGA CGGGAGGCAA CGGCGCTGAT GTCTTTTCGT GGAGCTCACT CGATTTGGAC GGGTCAGTGG ATCGCATAAC TGATTTTGCT GTTGGGCAAG ACCAGCTCGA CCTTGTCAGC GCTCTGAACG GATACAATCA GCAATTATCG GATATTTCGA GCTTTCTCAG TGTCACTACT AGCAACAATG ATGCAGTGAT TTGGATCGAC ATTCAGGGAG GCGGGAATTT TTCCCGATTT GTGTTGCTGG AGAACTTGGC TGGAGTGAGT CTCAATGAAC TGGTCTCGAA TGGAAGCATC TTGGCTTAA
|
Protein sequence | MTVRLCLQLY LWLLGCTARS LSQKLRGNPW MGVVEPSFRT NSGRRLGSFE WEIDLLDGFP FLNNITDTKQ IEFSYWFTGN ASLPGRSFDI VLLEKDCRTA SVVSPQGRGL FLDSEEIYDQ GIDVIVGVDL TSIASSSFYN RTSQLSAAIE FCLRVDYQSA GESVNFHETT VSVSVDLTAG FRLQSISTER NEEAVVSESV ACEVNAYFCD DSYTDTGTPR LAQGDVFQAC IASTGQFVIK DIVSAQLDQG QDNQTTGVRD DLAIDYEVVD PLTVISCSLG RCNIKTQMRT KYFSENNPIS VRLSGLALCG FGSTPAAYCA SSVKPVIGEN SEFLIYPVAL SLHDYDNSDE YESVVISWST QGTSAEPFVE FAVAPPGVSI TLANRELTLR GNAQDIEDAV ANLKVRPGQD NGEDIDIRVK ATAMDRNGLM ATAECGFRIP VDPFVFGILR LDSPFPSDIS IRAFEDTAIE LSGISIEFSG YDTDGSEVAY IEIEASSIPE GSVLLSNGIV QTTIVDGFLR LPMEAGSSLK IKPPRNYSGE ITLSVRGAIT DACIRISQYT VSNSVLVATI PQKLSIQVEP VADAIMWTTV YAIEDNGAVP FGAILANRQS GIRLNDIAQG KGNNPEPETI LQIVISVPPD SNEVTYLVSG TFVPQSDGIV TGEGTGVVSF DLQRRTFTIR SSLLTNRVNL AAADQETREQ ADKDIRATLA AFYVEIGPTH TDIDGDLSVT VTTLDVQMGF FDTTDVTFKP IKLQAVADQP SITVQSAAQI VSEDGDLVPL YFTVGRSADE DGSENLKVLI SVPLDELGPI GSITGSLPVG VLVINEGPGL YLISSSATTP IGQETSLNQL FDGQIKFHPR NNFSGQYLGS NGIRVDVFME ERATNDQLAP VEYGLDGDSK LAKATGFIDI FVLPDADKAT VLVKGNAAGM EDKPIPVPLS VTLGDLDGSE SYSMEIVDNL PAGAKLLGST SRELLSEDGV FFLFPIDVES LILLPPLHWS SAKQGDIVLL TNTTTVDISP QSSSSASVYL SIPIKIVGVA DKPNTRPVVV HCLEDGTYNL GSSIGDLSGV LVDVSILIRI IVVKKGSPNL LCSFSIQTDG SEVLSLVLSG LPLGVVPQSE ESNIQYLGKG RWQVPETAIP SLTLPALPNF SGDNPYTGIT INAVSQELDR DQAVSDSWPV TIKVRPVVDG FASWSMSFTT TEGMNEANNG GIDFKSVKNF VLGDEDNSEA VISFTFDFSN LMVNAGIRTR LNQLEGQSAG LEELVSKRMD GEFTFDQESG TVLVLSENIE NIKLHSELFL DSNQNFEIPV NALVRDSAVS DGSSVREDKI KSGVLFVIIS GTADIPTVFA EPTISGSVGE SGIPVRLGGE TTDRDVSLGR TQSEDVYYLV STSSSSISNL FTFSGAGLNN GDGHWTLEKS DLAGLQVYFS PSIGMRNQSE ASFDLTAVAV EDDGDWAVNT TTFKVILQPN EAGDTMTIPP LPPLLAIGLN AGLEDTKISL GSISAVVNPD DPTNTTISVA IWSIPPNAKV FGARLNPTTG KWICSAASIN SGAVQILPGP DYSGSLMLTI QAIATNHYAL SAHSPEQKLE FIVDPVADGA AISISPSSGL EDEMVSLRVS LTELDVDGSE RIGHFAYIKM TNGATLIGNY TVVTEYDNDA FIGTFSNAYS LRVPSEEISE LSLKPAANWH GTITLEISVP VVERFDDEDG DHVKVARKTH SIEISALADM PDISVPSVTT IGDEDTEIAI AFLAASLTDT IIANGREILS VVISNVPESS TFNSGYNNGD GSWAIPTAAL 
DGLSLLPPEH FAGTVKLTLT AYALELENGS ENSQSGDFDI NVRPVADPFL MVATSISLQA SSGTVDIALN LRLQDTRGDS PGEIPPEIVQ LEFQNLPRGI YLRASQGGGV VTTPAGTFVF AGSQAQANAL RLLARNATEG TYQIAISGVT VDGDSILSPA VRDSFRLTIS PSNQTLPGVE LIGTNGADNL TGSIGHDILM GLEGADLMRG GDGMDWLAGG PGADVLTGGN GADVFSWSSL DLDGSVDRIT DFAVGQDQLD LVSALNGYNQ QLSDISSFLS VTTSNNDAVI WIDIQGGGNF SRFVLLENLA GVSLNELVSN GSILA