Gene Information |
Locus tag | PHATRDRAFT_48886 |
Symbol | |
ID | 7194962 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 581463 |
End bp | 584387 |
Gene Length | 2925 bp |
Protein Length | 813 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183513 |
Protein GI | 219126540 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
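The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a single stop codon. The helper names `span_length` and `cds_length` are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 581463
end_bp = 584387
protein_length_aa = 813


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per residue).

    Assumption: one stop codon follows the last residue.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(start_bp, end_bp)      # genomic span of the gene
coding_len = cds_length(protein_length_aa)    # coding bases incl. stop codon
non_coding = gene_len - coding_len            # bases in the span that are not coding

print(gene_len, coding_len, non_coding)
```

Under these assumptions the span matches the listed Gene Length of 2925 bp. The coding portion is shorter than the span, which is consistent with the record marking the gene as spliced.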
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.515683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCCTCCATCA GTTTCGGATT CCCGCTGACA ATGTCTTGAC TGTGAAACGA TCTCTTTTCA TTTCCATAAT CACATATCAT CCGAACAAAG TGTAAGCAAA GCGATGCCGG ATACAATTAG ACCGTCTCTC AATGGGAGAA TTTCATCATT CAAACAGTTG CTCGTGATGA TTTTACTAAC AGTGAATGTG GGCCAAGCCG AAGAAGTCTC CTTCGCCAAC ACCACGGAAG TCGAAATGGA GCTCGAACCG GCTGAAGCCG TCCTCTTTCC ATGGTTTGTC CAGGCATTAG GAGTTCTCAC ATTCTTCCTT CTCAGTCGAT ACGTCAAGTG GCTTCCCTAC ACTGCCGTGT TGTTTCTTCT TGGAACATTC ATGGGGTTGG CGACGGCCAA ATTTCAAAAT GACAATCGAC TGTCCCAGTC CATTCTAGAG TTTTGGATAC CCATTGACAG CGAACTGCTT TTACTGGTTT TCTTGCCGGG TTTGATTTTC AAGGACGCGT CGTCCTTAAA TGTGCACTTG TTTCAAGTTT CGATCGTACA GTGTTTTGTG TTTGCCTTCC CAATGGTACT GGGAGGAGCC GTCTTGACTG CTTTGGTGGC GTACTATATA TTTCCCTATG GCTGGTCTTT TGCCTTGGCC ATGACTTTTG GAAGCATTCT GAGTGCAACG GTACGTATCG AGAATCGCCG ACCTGCTTGG CACAGAATAA AATCGCCATT CAATCCGTGG TGAAGAACAC GTCCAAAAAG GAAGCTTATT TTTTCTGTAT TTGTCGCAGG ATCCAGTCGC AGTGGCTGCT TTACTTGACT CCGTAGGCGC TCCACCACGC CTCAAACAGC ATATTAGCGG CGAATCGCTG CTAAACGACG GAAGTGCTCT CGTATTTTTT GCACTTTTTG CTGAAGTTTT CTATACCGAA CTCGGCGTTG AAGGTCTGGG AACCGACTAC AACTGGGGTT CCGGAACGGC TAAGTTTCTG CGTATGAGTG GCGGTGCGTG CGCAGCTGGA CTATTTTTTG GATTCGGCTT GATTTTGCTC CTATCAATTT TGGATAGACG ACTCAATCGA GAAGAAAATA TTGTACAAAC TGCCGCTACC ATTACCGTGG CATACTTGTG CTACTATACA GCCGATGTGG TGTGGAGTAC CAGCGGTGTC TTGGCCACAG TCGTGTGCGG TATTACGTAT CGGGCTTTTG GAGATGCCTT GATCAATGAC AATCAGCTAA TTTGTGATTT CTGGGGCTTG GTCGAGCACT TGTTGAATAC TGTCTTATTC GCGCTAGGTG GATTAGTATG GGGTAGTGTG ATCGCTAACG CAGAAGAACG TGAAGGAGAA TTTACTGGAA GAGATTGGGG TGAGTAGTAC TGATCTGCCA TTGATGCCTG ACTTGTTGAT ACAACAGGCT AACAGCAGAC TTTGTTTTCC AAGGCTATTT GATTATATTG TACATTTTGT TGATTATAAT TCGATTCGCT CTCTTTATCG GCGCGTATCC GCTCATTTCC AGGATTGGGC TCAAATCAAG CAAGCCTGAA ATGATCTTTC AGGCCTTTGG AGGCCTTCGC GGTGCTGTGG GAATCAGTTT GGCAATCGTT TTAGACAATA CAGTGCGCGA GGCGGCTGAA GAGGGAGACT TCAAGTATGT TGGTCAGACC AACAAAGTGT TTGGATTTGT TGGTGGAATT GCTTTTATGA CGCTCTGTAT TAATGCCACT GTAGCTGGTC CTCTGTTACG GCGATTAGGC CTGGCTGACA CAACAGCCAT TCGAAAAAAG ATAATTGAAA GCTACAAGCT 
CCACTTGCGC TACGAAACAA TTGAAGAGCT AATCCGTTTG CTAGCTCAGC CTCGCTTTGC CAAGATCAAT TTTGCCCTGA TTCGGGACCA TGTTGATTGG TTGAAAGATC TTAGACAGGA CGAAGTGTTA AAGGCCTACA AGGATTACCG GAATACTCAT AATCACGAGA AAAACTATCG TGATCCAAAC TTGTCTAAGG TTTCACCGTA TTTGGAAGAC AACGAAAAAG ATCTGGAAAA ACAAATGGCA GAACATCAGA AAGAAAACAT CGGCATATCA AGTGACAAAA ACTCTACCAA AGTGAGGATA GCCAAGGTCA CATCCAGTAT GTCGTTGATA GAACTTCGCA CGGTATTTCT AGAAATTTTA CGTAGCGCAT ACGCTAGACA AGTGGAACTC GGTGAGCTCT ACAACCGTCA GTTTCTCGCC TTCTCTTTGG AACAATCAAT TGATTTTGCC CTCGACTCTG TCTCAAATGG ATCCGAGTTG AATGATTGGG AGTATGTGAG CGTCGTAAAA GCACCTTGGT CAACATCGGT TTACACTTCG AAAGGAATGA AGTATTTTCG AAAATGTTTC GGAGCTTTTG TCCTGCAAGA TGTGAAGTAC GAAATGATGC GACTGAACGT GGAGCGATGT CTGGCATTTC TACATGCACA CGATACTGCG CAGAGGCTAT TGAGTCAGCA GTTCTTGGAT GAGCAATTCT CGGAAGAAGA GTCAAAGGTT ATTGCCGAGT CAAGACGTCA GTGTGTAGAG GCTGTCAAGC TATTAAAGTC GTACCACATG AGAGATGTTG AGATGATTGT GTCGCACAAC TTGTGCACGG TTCTCTTGTA CAATTCATCT CGTTGCGTGG AAAAGCTGCA TAGAAAAGGT CTTCTCAAGG GCACAGAAGC CGAGACTATA CTGGAGAAGA TTCAAGAATC GCTTCAGCGT GTTTACGCTT GCAGAGAGAG GGACCATCCA GGGGAGCTCC CTGTAGACAG CGATCTAATG TCGGAGAAAG ATGTTGACGA AATGGCACCG GCTTCTGGTT AAGGATCGCT GGTCCAGATT TGCTGCTGCT AGTGTGGTCA TATAATCTTG TTACTCGTGG ATCATCAAGC GTACCGCCTC CAATG
|
Protein sequence | MELEPAEAVL FPWFVQALGV LTFFLLSRYV KWLPYTAVLF LLGTFMGLAT AKFQNDNRLS QSILEFWIPI DSELLLLVFL PGLIFKDASS LNVHLFQVSI VQCFVFAFPM VLGGAVLTAL VAYYIFPYGW SFALAMTFGS ILSATDPVAV AALLDSVGAP PRLKQHISGE SLLNDGSALV FFALFAEVFY TELGVEGLGT DYNWGSGTAK FLRMSGGACA AGLFFGFGLI LLLSILDRRL NREENIVQTA ATITVAYLCY YTADVVWSTS GVLATVVCGI TYRAFGDALI NDNQLICDFW GLVEHLLNTV LFALGGLVWG SVIANAEERE GEFTGRDWAD FVFQGYLIIL YILLIIIRFA LFIGAYPLIS RIGLKSSKPE MIFQAFGGLR GAVGISLAIV LDNTVREAAE EGDFKYVGQT NKVFGFVGGI AFMTLCINAT VAGPLLRRLG LADTTAIRKK IIESYKLHLR YETIEELIRL LAQPRFAKIN FALIRDHVDW LKDLRQDEVL KAYKDYRNTH NHEKNYRDPN LSKVSPYLED NEKDLEKQMA EHQKENIGIS SDKNSTKVRI AKVTSSMSLI ELRTVFLEIL RSAYARQVEL GELYNRQFLA FSLEQSIDFA LDSVSNGSEL NDWEYVSVVK APWSTSVYTS KGMKYFRKCF GAFVLQDVKY EMMRLNVERC LAFLHAHDTA QRLLSQQFLD EQFSEEESKV IAESRRQCVE AVKLLKSYHM RDVEMIVSHN LCTVLLYNSS RCVEKLHRKG LLKGTEAETI LEKIQESLQR VYACRERDHP GELPVDSDLM SEKDVDEMAP ASG
|
| |