Gene Information |
Locus tag | PHATRDRAFT_47896 |
Symbol | |
ID | 7203162 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 377288 |
End bp | 381995 |
Gene Length | 4708 bp |
Protein Length | 1543 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182211 |
Protein GI | 219123812 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
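The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record: the helper names `span_length` and `coding_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon but no untranslated regions.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information table.
gene_len = span_length(377288, 381995)   # Start bp, End bp
cds_len = coding_length(1543)            # Protein Length in aa
non_coding = gene_len - cds_len          # bases in the span that are not translated

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span length matches the listed Gene Length of 4708 bp, and the gap between the span and the coding length would be consistent with the "Is gene spliced | Yes" entry, since a spliced gene carries intronic bases that are not translated.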
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGGCGT CGGCCCCTAA AGTGAAGGCC TACTTGGCTA GTCTGAAAGG AAGCGTTATA GATTCCGAAA AACCAGACTT TATACTCTGT TATACAGCGC TTCTCAAACT TGAACTGCTT CATAGGGGAG TATTGCTCGA CTCAACAAAT GATCTATCCA AAACGCTGGA CTCCGTGACG GTCACGGCCG GGAAAGCACT GCTAGACTCA TCTATCGCAT CCATGCCGAA AACACTCTTA TCGGGTATTT CGGCATTCGC TGGCAGAATA GCTGAAAATG TGAATACTGC TGTGATGGAT TCAGGGACAT ATTTAAATGA GTTATCTGAG CTGGACGGGG AACGTAGTCA ATCCGCACGC CAACGTTGTC GCGATGTGGG CGATAAACGA CTTTACGCAA TCTTACTCAT CGCTAGCGAG AGAAAGCTAC TAACGAATGT CCAACTCTCT GAGCGATTAT TGTTTCTCGC AGAAGGTTAC TACTATCATG CCGCAAGTTT GCAACCAGAT GCTCCTATTA TGTGGAAAAG GGCCTGGGAT ACCATCGTTG AAAGACAAAA GATTAATGGC AAGAATGATG AAAACTCCTG GGAGGCGACA TTTTCACTTC AAAATCTACG CCGTATCGCC TCATACAAAG GGAACGGTAA GCGCGCGGCA GAATTGGGTT TAGCTCTCGT GCAGAGTCTC CTCAATTCTG TTGACTGTAC CGAAAGCAAC GTATCTCTCA AGGGCCTTCT CGAATCATTT GTGCAGCCTT ATCGGGAACT GGAACCAATG AAACGCAAGT ATGCAAACAA GATGGCTGCT GATACTGTAG AGTCCATATC TGAGGAAAGT CTATCAGAAC TGGACCTTTT TCTTCTAAAA CAACTGTCTC TAAGGGTTTG CTTGTCTGGC GAAGATTGGG CAATTGAACA AATAGCAGAG GACGAAATCA AGCAACGAGA GAATCGTAAA CCCCTTGAAG CAGTAATCAA ACCCTTATCT TTGGAAGCTG TACGTGGCCT TGTAAGAAAG GATTATTTGG CGAACCAAGA TTCTTTCTCT GAGAAGATGG ATAAGTTTGC AAATGCTCTA CGCGACGCTG TCTTACTGAA TTTTGCAGAT TCTAAAGCAC GCATTAATGG CTGTATTGAC CTATTGAAGT ATTTGTCGCG GATGAAGGAC AAGTACCAGG CTAGAGCTTA TACATTGGCA TTAGGCTCGA CAGAACGATG TGTTACATTG GAAGATATCG CGAACTTCGT TAAGCCTATC TTGTACGCCT TGCGAGAACG AGCCGCTTGG AGTGGAATCG ATGAGAGTGC GGAAGAAAGG ATACATAGGT TAGAGAAATG CTTCAATGAT GTGTCCGTAG AAGAATTGGG ATATATCGAG GCATGCGCTT TGATAGTCCC ATTTTTAGAA TGGAAATTTT CTTGCTCAGA TTCTGTTGTG ATGCTATTTG ACATGGAATC TCATTGCTTT GTTCAAGAAC TTCTTTGGAT GCTTCGTAGA AAATGGAAGG CTGTCGCCGA ACATTCCCAG ACGACTGCTA CAGTGAAGAA TCAAGTAGAT AAAACGGATC GACGCATTCA AAAGATCGAC TTGGCACTTT TGTCGACGAT GGCTCTCCTC GCTCTGGGGT TGCCTTCAGG TTCCACCTCT GATGTTCGAC GTGTGACGGA TGCGGCAATC TCTTTTTCAA TCAAAGATGT CAGCCAATAT TCAAGTGAGA GTGGTGGTCC GTTTCTCTCT TTCCTTGTCG CCTGGAATGG TCTATCTCGA TCGCCCTGGC AATTTTGCGG AGTTGCCGAA 
GCTCGCCGAA TACTAGTCGG TGCAAGAGCG TGCTTACTGA AGTCGGCAAG AGCTTGCGGT CAAACATTCT TGTCGGTTGG TTCACTGCTC CTTGACATCG CCAAGGCAGA TGCGGAAATG CTGTCTATTG GAGGAGGGTT GACTCATGAA GTGCCACTCC TTTACAGAGC CGTAATGACG GCAGCAAAGG ACACTGGGAG AATGGCGGAC AACTTAGAAC TTATACTGCA ATCACATATT TTCTCGGGTT GGGGGAGGTT ACTTATCAGA GAACCAGCCC TGCAGTTTGA GGCAGTCAAT GCGCAAGAGA ATGTAGTAGA CGTATTGAAA GGGTACTTGA TTGCTTTGCA AAGAATGGAC TGTGAAGGCT CTTTTTACTT GTGGCGCTCT CCAAGCGCAG TGAGGAACGC GGCCTGCTAT CAAATTGCGG CTGCGAGGCA GAGAATTGCT GATCTGCTGC TTCACAAGGG TGAGCTAGAG GAGGCGCAGT TTTTTCTTCA AGACGCGGTA AACGATGCGC CTGAAGATCA AAATGCATGC TTTTCTTTGG GTGTTTTTCA ACTTCGACTG ATGTGTTGTG AACAATCGCG ATTGCCTGTC GATGAAAAAG CGGCACAGAT GCACCTTCTA CGAGCTGCCA AACTAGATTC GAGTCGTCCC GATCCCTTCG CCTTGCTAGG ATATTGGTAT GAAGAGTCGA ACGATTACAA ACGGGCTGCT GGATGCTATT CGAAGGCTTT GCTTCTTGAC CCTTCGCATC CAGTAGCTGG TCGTGGTTTA TTAAGACTGA AAGCTGGAAA CCTACTGGGT GTGCTTGAGA AGGCAATTGA TAGGGGCTCC ACGCTCAGCG GATGGGCATG GTTGGCGCTT GCGACCCACA AAGCCAACGT ATTGGGCGAC GACGAGCTTG CCGTAGTGTC TCTCGTGAAT GCTCTGCGTT GCCGCGACAT TCTGAACCCT GAATCTGAGC CTCTTGCTTT CTGCTATTAT GACCCATTAG GCCCACGTGA CTCTAGCTGC AGCGATCACG CTACGGCTTT GTGCGCTTTA GGTTCTAGCT ACGAGCGCCT TGGTCGCTAT ACTGCCGCGC TTCGGTCATT TCATTCTGCT ATTGACGAAT CGTCTCTACA CGTCACCACA GCATCTCTCA TTTCGTGTGC GCAAGGTAAG CTGGGAACCT ATAGTATAGG CCGAATTTCT CCAAATTTCA TGGCTTAAAA AGTGTCTTCC TATGTTTGTA GTCGAGATTA AACTTGGGTT ATTTGAAGAC GCCGCAGAAC GACTGACAAC AGTCGTATCT ACGGAGAACA AGGACGAGCG ACTTGTTGCT GCGCAAAACC TTGGTATAGC TCTGCATGCA CTTGCCCAAC GGGATCTCCA TGATGGAAAG GCTGGTGCAG CTCTTTCGCA TATTGTCCGA GGCATAGAGT TCTTGCAGTC ACAATTGGAA TCTCATGTCT GTCTACAAAA ACTGATCGGG GATCTTTACA CCTATGCGGC TGTGTTGCCT CCCGACCTAT TTGAATCGAC CTTCAGCAAC AGCTCATCGA TTAGCCCCAA TGGTCAGTTG AGCCCGCATT ATCAATTTAT TGCCGCTGGA GAAGAATTTT ACAATATTGC AATATCAAAC GCCACGGACC TTTTCAAAGA AGGCGATGAA TTGAAGCATT TGCAGGCCAG CCTGATAAGT GACCTTGGTT GTAATATTCT TCTCCAGGCT CAATCATGCT TTTCAGCCCA TGTACACGGC GTTACATCTC CTTCAAGAAA GGAGGTCTTA CTCACCTCTG CAGCGACTAA GTTTAAAAGT 
GCTCTGGAAA TTGACCCTCT ACATTCTCCA GCTTGGTGTG GACTTGGATG TGCGTTAGCC GTAAGCGATC CCCTTCTTGC ACAGCACGCC TTTTCACGTG CAATCGAACT GGACAAGGTC AGTCCCGATG CGTACGCGAA CCTCGGTTTC TTGTACACAT CTAATGCCAG ATTAGCCGCG AGCGCCGGGG TCTCTGATGC GTTGACCGAA GTAGCAGACA CACCCATGAT GTGGATCAAC CGAGCTATAG TTCTTGAGTA CCAGGCTCGG CAATCTCACC AGCAAGAGGA AGAACAACAA TACCGTGGCT ATATTCGGGA AGCTTCAAAT GCTTACCTTG CGTCAATGCA AGTAGTGAAG CGGCCTGCTG CTGTCCTTGG TATGTCTCTG TTGTCCCGTA TCGACTGGGG GAATCCCGAG GCACCATACG CCCAGCGCGA AAGCTCCTAT TACCTCGATG AGTATTTGGC CTCCGTTGGT CCAGCCGACT TGCCTGCTAA AATACTACAA AGAGTGGCCT TTTTGGAGAA GTCAGCTTCG AGCCATCGAA GCACTTCTCC AGCTGAGCTA TCGGCGATGA TCTCCGGTGT TACAGATGCC ACCAGCGATC TCAAGGCAAT TGGTATAGGT AGTGAAGCCG GACAAGGTCT TGACCTGGAT TTGATCAGCG GAATAGGAGC AGACGTGGCG ACTGAACATT CAGAAGAAGG CACAGCCTGT GAATTGTCGT TGCCTATACC AATTGGTCGT CGCCTGATGC TGACACCACG GGATGGCTCC GTGTGGTTGG AACTTTCCAA AGAGTTGGTG TTTGCTTTGA GTAACGACTC ACCAGACTGC TCCTTTGATG CTGCCCGATC TGCTGCCGTG CAGTCACTCA ACATTCTGAT GAGAAGCGTG TTCCACACTT CCTATGTCAT GAGCAAACCA GCAAAGTACA AGGCTGCCGA CGTTTCGGAC GCTCTGTCCC TTGCCCACGT ACTGAAGAAT AAGGCTCAGA ACGAGTGGTA TTCATCAAGC GGTCACGACT TTGATCTTCA GCGAGCACTC ATTTTATGCC CCACCAACAA AGTTGCCAGA GCAGCGCTGA CCAATTAG
|
Protein sequence | MLASAPKVKA YLASLKGSVI DSEKPDFILC YTALLKLELL HRGVLLDSTN DLSKTLDSVT VTAGKALLDS SIASMPKTLL SGISAFAGRI AENVNTAVMD SGTYLNELSE LDGERSQSAR QRCRDVGDKR LYAILLIASE RKLLTNVQLS ERLLFLAEGY YYHAASLQPD APIMWKRAWD TIVERQKING KNDENSWEAT FSLQNLRRIA SYKGNGKRAA ELGLALVQSL LNSVDCTESN VSLKGLLESF VQPYRELEPM KRKYANKMAA DTVESISEES LSELDLFLLK QLSLRVCLSG EDWAIEQIAE DEIKQRENRK PLEAVIKPLS LEAVRGLVRK DYLANQDSFS EKMDKFANAL RDAVLLNFAD SKARINGCID LLKYLSRMKD KYQARAYTLA LGSTERCVTL EDIANFVKPI LYALRERAAW SGIDESAEER IHRLEKCFND VSVEELGYIE ACALIVPFLE WKFSCSDSVV MLFDMESHCF VQELLWMLRR KWKAVAEHSQ TTATVKNQVD KTDRRIQKID LALLSTMALL ALGLPSGSTS DVRRVTDAAI SFSIKDVSQY SSESGGPFLS FLVAWNGLSR SPWQFCGVAE ARRILVGARA CLLKSARACG QTFLSVGSLL LDIAKADAEM LSIGGGLTHE VPLLYRAVMT AAKDTGRMAD NLELILQSHI FSGWGRLLIR EPALQFEAVN AQENVVDVLK GYLIALQRMD CEGSFYLWRS PSAVRNAACY QIAAARQRIA DLLLHKGELE EAQFFLQDAV NDAPEDQNAC FSLGVFQLRL MCCEQSRLPV DEKAAQMHLL RAAKLDSSRP DPFALLGYWY EESNDYKRAA GCYSKALLLD PSHPVAGRGL LRLKAGNLLG VLEKAIDRGS TLSGWAWLAL ATHKANVLGD DELAVVSLVN ALRCRDILNP ESEPLAFCYY DPLGPRDSSC SDHATALCAL GSSYERLGRY TAALRSFHSA IDESSLHVTT ASLISCAQVE IKLGLFEDAA ERLTTVVSTE NKDERLVAAQ NLGIALHALA QRDLHDGKAG AALSHIVRGI EFLQSQLESH VCLQKLIGDL YTYAAVLPPD LFESTFSNSS SISPNGQLSP HYQFIAAGEE FYNIAISNAT DLFKEGDELK HLQASLISDL GCNILLQAQS CFSAHVHGVT SPSRKEVLLT SAATKFKSAL EIDPLHSPAW CGLGCALAVS DPLLAQHAFS RAIELDKVSP DAYANLGFLY TSNARLAASA GVSDALTEVA DTPMMWINRA IVLEYQARQS HQQEEEQQYR GYIREASNAY LASMQVVKRP AAVLGMSLLS RIDWGNPEAP YAQRESSYYL DEYLASVGPA DLPAKILQRV AFLEKSASSH RSTSPAELSA MISGVTDATS DLKAIGIGSE AGQGLDLDLI SGIGADVATE HSEEGTACEL SLPIPIGRRL MLTPRDGSVW LELSKELVFA LSNDSPDCSF DAARSAAVQS LNILMRSVFH TSYVMSKPAK YKAADVSDAL SLAHVLKNKA QNEWYSSSGH DFDLQRALIL CPTNKVARAA LTN