Gene Information |
Locus tag | PHATR_46867 |
Symbol | |
ID | 7204717 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 660550 |
End bp | 663828 |
Gene Length | 3279 bp |
Protein Length | 918 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185760 |
Protein GI | 219121057 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.793152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGTTCCAAA CGAACAAAAC GATTCTCTTA TGGGCCCTTG GCCATCGTGG CGCTGGATAC AGGAACACAA AATTGTTCTA GAATCTCAAA TTGTTCTAGA ATAATAAATT GTTCCAGCCG AAAGAGACGG CAGTACTTGC TCATCCTACG AGAATGATGA TGGTATCATG GTTGTCTCAC AATTTCAATC TTCGCGCTTC TTACAACTAC AAGCGCAACC GACATTCCAT TGCACAAATA CGTCATGGCA TTGGACGAGC GCAGCGACCC TGGTCTACAT GATGACGGGA ACTTGGAAGA AAACGTCGGA GGTAAGAGAA GATGTGGAAG CATGAAGATT GCTAGCGTTC CTTTCAACGA TTTCGTTAAT TCCTCCGTTT GCCTTGTTTT CTCTAGAATT TAATGAAGCC GAGCCTACCT CCTACTCTAC CAAGCCAAGG CCCACCGCGA GCAACCCGAT CGAAGAAGCC ACGGGGTTTT CTGCCAAATG GACACGTACC ATTAACACAA TTCATATCCC TGTCATTCGC GCCTTATTGT GGACGAGCAA CAAGTCGGCA ACCAACCCGC GTCGGACGGT GTCTCTCGTG ACTTTCGTGT CGGTTGCCCT CATTGTGATT GGTATATTCA CAAATTTCTC TGTGGATGTC GACGAAGATG TCTTGTGGAC TCCCAAAGGA GCACGTCCAG TGCAACACTC GGATTGGATT GACGACCGCT CCGGCTTTCC TACAACGCCC CGTACCTTCA TCATGTTCTT TCACGCCGAT ACGGCGGACG TTTTGGGACA AGCTCAAGTC TCGCGCGTCT TCCAGGCCTT GGATGCGGTC CGGACTCTAC CCGAATACGA CAGCATCTGT GCGAAAAGTA GCAGTACTTC GCAGCTTAAC GAAATAGGTG AAGTGACTTG CCCGATTTCC GGTATTACCG CCTTTTGGAA CGACACAGCT TCTATTTTTG AATCACAAGT ATCCAGCGAT GCCGATGTCA TCGAGCAGCT GTCGGCTACT GTTTACCCAG ATGGAACGCC CGTCTCCGCG GACGATATTT TTGGCAAACG CAATCGGGAT GCGAACACGG GCTTGCTTAC TAAGGCGCAA GCCTATACCG TTCTGATTGA TTTTCCGGAC ATAGATGAAG CCGAAGATTT TGAAGAACCC GCCTTAGATG CTGTCTTGGC CTTGCAAGAG CAATGGGAGG CGCAGTCGGA TACTAACTTT CGTTTGGAAG TTCAGGCAGT TCGCTCATTT TCAGACGAGT ACGTGCTCCT TTCTTATTCG TTTATTTTAG CGAACGGATT CTTCTAAACA ATGCCCTTTC ATTTCGTAAA CGACAGGTTT ACCCGTGCGA TCGTTGCGGA CATTCCCTTG GTTCCAATTG TCTTCGTGAT TATGTCAATC TTTACCTGTG CCGTATTTTT CAAGCGAGAC AAGGTCCGAT CCCGAAGTCT GTTGGGGTTC AGCGCAGTTA TTTCTGTTTT GCTCAGCATC ATGAGTGGCT ACGGTCTCAT GTTCGTCTCC GGAGTTCCTT TCACTAGCAT GACCCAAATT CTCCCTTTCA TTATATTTGG AATTGGTTTG GATGATGCCT TCATCATCTC AGGATCCTAC GAGCGTACCG ATCCTGCCAA AAGCGCCGTG GAGCGCATCC ATGACACTGT TGAAGACGTC GGCGCTAGCA TTACTCTGAC AACAGTCTCT TCAACTTTGG CGTTCGGGTT GGGCGCCACG TCCGATGTCC CTGCCGTCTT CTGGCTCTGT TACTACGCCT TTCCCACCAT CATTCTGGTG TTCCTTTACC 
AGATAACATT TTTCGTAGCT TGTATTGTTC TCGATGAGAA GCGTGTTCAG GACAACCGCC GGGATTGCTG TGTCTGCTTG GTGGTGGACG CTAGCGATGA GAGCGAACCA CAAGCTCTGT CTAATGGTCG CGGACCTACC CCTTCTGTCA TTGATTATTA CATGGGCCTG TACGCGAAAC AAATTCTTCG CCCTGTGGTT CAGATTCCGG TCGTGATTTG CTTTTGTGCT TTGCTTGGTG TATGCGCTTA TAGCGCTACT CTGTTGACGC AGGAATTCAA ATTTACAGAT GTTCTTCCAG ACGGTTCCTA CGTTGCAGAC TTTCAAACAG CATTCGATGA AAATACTGTG CGCTCGGCTG TTGCGCCTTA TGCGTACTTT CGGTTCGTTG ATCAAAGTGA TGGAGACATT CAGAGGCAGA TGGAAGCCTA CGTCGACGAA CTTGTTACCA TAGAAGCGAT CGAAGAGGAC CCCGAGTTCT TTTGGCTGCG GGACTTTAAA GAATTTGTGA ACGTTAGCGG ATCCCAAAAT TTAGAGTTCA ACAGCCAGAT TGCGGCATTT TTGTCCAACC CGGTTTATGG CGATCTATAC AATGACCACG TCGTGCGGGA CGATGCTGGT ACGATTGTTA CATCGCGAGT TCGGTTGCTC ATGGATAATG TCGATGTTGA AAACGTGAAT GAGCAGGTTG ATGCCCTTGA AGATCAAAGC AGTGTATCTG GTGGGCAAAA TATTAATCAA GGACGAGGCG AATGGGCTTT TTTTACGTAT GACGGTATCT ACAACATTTG GGAGTTTTAT GCCGCATCGG TCAATGAAGT CATCTTTACT ACTGTCCTCG GGGTGGCATC TGTTACAGGG ATTACATTGA TCTTTGTTCC ACACTGGTCT GCCGCTTTCT TTGTGCTCCC CTTGATCTGC ATTCTCTATG TTGATCTCTT GGGTGCAATG CAATGGGCAG GAGTTCACAT TAACGCCGTC AGTTACATCA ATCTTGTGAT GTCGATCGGT TTGATGGTAG ATTTCCTCCT CCATGTGCTT TTGCGCTACT ATGAGTCTCC GGGCAACCGC AAGGAAAAAA CTTTACACAC TCTGGAAACC ATGGGTGCGT CAGTCTTGGT CGGTGGTATC TCGACATTTC TCGGGACGCT TCCCTTGGCA TTTAGTTCGA GCACTATCTT TTACACCGTG TTTGTCGCGT TTATTGGCCT CGTCACACTG GGATGTGGTC ATGGACTTAT TCTACTTCCG ATTATTCTTT CTAACTTCGG GCCCGAAGAC CAAATTGAGC CTTCAAAAGT GAGCAAGACT CTGGAGCATT GTGAGACCGA AATCAGTCAA AGTCAGAGGA TTCCGGAAAG CTAGAATTTA GAAGCAATGA GTGTGATGTA CACATACATA TAGGAATTCA GTTTTACATG ACTGAGAGTG CTCGAGACTC CTCTGTAGAG CAAAATAAAA ATAATTTACT GTAAAATAG
|
Protein sequence | MALDERSDPG LHDDGNLEEN VGEFNEAEPT SYSTKPRPTA SNPIEEATGF SAKWTRTINT IHIPVIRALL WTSNKSATNP RRTVSLVTFV SVALIVIGIF TNFSVDVDED VLWTPKGARP VQHSDWIDDR SGFPTTPRTF IMFFHADTAD VLGQAQVSRV FQALDAVRTL PEYDSICAKS SSTSQLNEIG EVTCPISGIT AFWNDTASIF ESQVSSDADV IEQLSATVYP DGTPVSADDI FGKRNRDANT GLLTKAQAYT VLIDFPDIDE AEDFEEPALD AVLALQEQWE AQSDTNFRLE VQAVRSFSDE FTRAIVADIP LVPIVFVIMS IFTCAVFFKR DKVRSRSLLG FSAVISVLLS IMSGYGLMFV SGVPFTSMTQ ILPFIIFGIG LDDAFIISGS YERTDPAKSA VERIHDTVED VGASITLTTV SSTLAFGLGA TSDVPAVFWL CYYAFPTIIL VFLYQITFFV ACIVLDEKRV QDNRRDCCVC LVVDASDESE PQALSNGRGP TPSVIDYYMG LYAKQILRPV VQIPVVICFC ALLGVCAYSA TLLTQEFKFT DVLPDGSYVA DFQTAFDENT VRSAVAPYAY FRFVDQSDGD IQRQMEAYVD ELVTIEAIEE DPEFFWLRDF KEFVNVSGSQ NLEFNSQIAA FLSNPVYGDL YNDHVVRDDA GTIVTSRVRL LMDNVDVENV NEQVDALEDQ SSVSGGQNIN QGRGEWAFFT YDGIYNIWEF YAASVNEVIF TTVLGVASVT GITLIFVPHW SAAFFVLPLI CILYVDLLGA MQWAGVHINA VSYINLVMSI GLMVDFLLHV LLRYYESPGN RKEKTLHTLE TMGASVLVGG ISTFLGTLPL AFSSSTIFYT VFVAFIGLVT LGCGHGLILL PIILSNFGPE DQIEPSKVSK TLEHCETEIS QSQRIPES