Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47110 |
Symbol | |
ID | 7202024 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 470153 |
End bp | 473052 |
Gene Length | 2900 bp |
Protein Length | 907 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181212 |
Protein GI | 219121727 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0847586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGAAGCAAAA GTGACAACTG GAGGGATCCG TTTCGTTCGA CGCAGCCGTG CTGTTGCACA TTAGTATTGA TATTCTCTTG ACTATCGCTC GTTGCGTGAA TAAACATGCG AACACGGAAC CAGAGACAAG ACGGTGCAGA TGATATTATG AGCAAAGTCA ATGCCACTGA CAGTCGTCAT CGCCGACAGG ATTCTAATAT CAAAGACAAC GAGCAAACAC CAATCGTCCT GCCAAACAGA TCCATCGACG GCTTTACTGA CGACTCATTC CGTGTAACGC AGACACGAAA GAGAATTTTG CGGCAAGGTG TTGCTCCACA AGTAGCGGAA GGCAGTGGAG ACTGTGAGCA GAAGGAAAAA AGCAAGACTC GTGGTAAAAT TAGCAGACTT GGTGCAGCTG GCTTTTACCT AAAAGCGTGT CATCAATCGG GATATGTATT ACAAGCAATA ACGTGGGCCA AAGCTCGTCC GTTTGTTGTC GGAGCCATGC TATTTTCCAT TCCTTTTATT TTGCGGCCGT CTCGCTTTGG GCGGTTCTTA AGCTTTATTG GCCTGATGCG TTGGTACCCC GCGTATTTTG GAAACCATCC AATAGCGGGA GGTCCGTTAG GCCGTAAAAT CGATTACAAC TATCTTACCA AAGTATACGA AAGGACTAAA CGAGAAAATC AAGAGAAGCT GGGAAAATTG GGGATTACAG AGCTGCCCTA TATTGAAACA TTGAATGCAA AGGAGCAAAT CAACGTTTTG AATTTGAACA CGCGAGCGGC GATACGAGAA CTTTTAGTTC GACGAGAAAG ACGTGCTGTG ATGTGGAATG AATTAGAATT GCAATCACAA GGCTTAAAAT TGGCTACCGA CTGCAAACAG TGTGACGGTG TGGCGGGTTG CTTAAATCCT TGCGGAATAC CTCGGATTCT TTTGCTTCCG CAGCTCCACA ATTGGGAACC GCCGCGGCCA GGTCTAATAT CCGGCGCCAA CTTGATCCCC TTCACATTTG ACGAAACCGA AGTGCGCGAT TTCTTGTCTG TCGCTCACCC GCAGCTTCGG AATCATACCA AGCAAGAGAC CAACGTTTCC ACGACGGTGT GGGCGCTAGC AGCCTTGACT TCGTATGGCG GCGTCTTTCT TGGGGATCAG CGTCGCTCTA AGATTGATAC AGCGAATAAA GTCCTGTTTG GGGGTTCGAA TCGTGGGATC CTTGATATAG CCTTACGAGA TACGCCGGTG GCCATCGTTT CCCTAGAGAG AACTGGAACC GATTTTCGTC TCAAGATGCT GATGACTACA CCAAACCATC CTTTCTTAGA ATGTGCTCTT GGCGAGTTGC AAGCAATGAG CCTTTTAAAC GTTCCACAAC TGTGGAGAAT GCTCATGGTC TCTAAAGAAC CTTCGTTCTA CGTCGATGAA GGTTGGAAAC GGGCGTTAAT AAATTGCCCT GCAAAGAAAG AGACTGGATG CTGTCACTCC AATTCTTTGG TTTCCGTCAC AGACGTGGCC GCGGATTATG AAAGCCAAAT GCTGAAAGTC GGCAAAAGTG TGGTATATGT GCAAATAGTG AAAGAAGAGA TATTGGAGGT TAGGTCAGCA AAAACAGCCC AAGTTGAAAT ACAACATACT TCGCGGGCAT CAACTGAACC CTCTACAAAG GTTCGGTTGG AAACGCTTTT GCGCCTTCAT AAAGCCGACC CAGGATGGTT GTGTACTAGG TGCATCAAGA CTCCATGGTA CGGCTCCATG GAAAAATGTT CGTTCTTTTG TCCAACAAGG TACGAAGAAC TCGTTTGTCG GTCTCCGGAT AAGCCTCCTA GACGCGAAGT CCCAGTAGAT GTGAATGTTT TGGTTCCTTC CCAGCCATCA GCCCAACGTA TTCCACGCAT TATACATCAG ACCTGGTTTG AAGACCTCAC GATAGACCGA TACCCGCAAC TTGTAAGGTT GCAAAACTCT TGGAAACAAA GCGGTTGGGA GTATCGATTT TACGACGATG CCGCGGCAAA GGAGTACGTG ATCGAAAACT TCCCCTATCA CTTTGCTGAA GCTTTTGACA GTTTAATTCC TGGAGCTTAC AAGGCCGACT TTTTTCGGTA TCTTGTGCTG ATGAAGACGG GGGGTGTATA TGCTGATGTG GACGTTATGC TCGACACAAA TCTTGATTCA TTTATCACTC CATCCATGTC CTTTTTTGCG CCAAGAGATA TCGTGGGTGA ATACGCAGGA CAGCCATTTT GCCTTTGGAA TGGACTTATC GGTAAGTCGA GTGTGCCAAA TTATTGCGAG TGCAATGCTC CAAGCTGTTG CTTACATATT TAATATTCAC AGGAGCCGCT CCAGGGCATC CGTTTCTGAT ACGAGCTGTT GAACGTCTTG TAAATCTTAT CTTGGATCGC TCAGACTTGT ATGACATGGA ACGCGATATT TGTCGAAGAT CGGAAAAACC GATAGAGCTT TGGAAAGTGA GAGCAGAGCC TTTGCTGCTT TTTTCGGGGC CATGCGCCCT CGGCGTTGCC GCAAACGAAG CTTTGAATCG ATCGTCACTT GAGCCATTTG ATATAGGATG GATCAGCATG GAAAATCTTG GTTTCGGTGG GAAAGATGAT CATGGTGATG CTCTGATTTT GGTGGGCGAC AAATTTGACA TGGGCGCCTT TCGAATTTCG GACCCGGAAA GAAATTTTGT TGTTGCTTCA ACAGACCTGG ATGGAGTGGA AAAAAAAGCA AGGATCATGG CGAATCCAAC GATAGTCGAA CGGAACTTAA ATGACACACG ACAGAAGCAA CTACCGCACT ACAGTAAAAC AGCAAAGGGA GTCTATGTCT GGGGAAGTGC TCGTGTCTAC AAAGACAATG AAGTCGCCAA CGAAAAAATT AAGTTTTTCC CTCAATACCA TGACGAGTAG
|
Protein sequence | MRTRNQRQDG ADDIMSKVNA TDSRHRRQDS NIKDNEQTPI VLPNRSIDGF TDDSFRVTQT RKRILRQGVA PQVAEGSGDC EQKEKSKTRG KISRLGAAGF YLKACHQSGY VLQAITWAKA RPFVVGAMLF SIPFILRPSR FGRFLSFIGL MRWYPAYFGN HPIAGGPLGR KIDYNYLTKV YERTKRENQE KLGKLGITEL PYIETLNAKE QINVLNLNTR AAIRELLVRR ERRAVMWNEL ELQSQGLKLA TDCKQCDGVA GCLNPCGIPR ILLLPQLHNW EPPRPGLISG ANLIPFTFDE TEVRDFLSVA HPQLRNHTKQ ETNVSTTVWA LAALTSYGGV FLGDQRRSKI DTANKVLFGG SNRGILDIAL RDTPVAIVSL ERTGTDFRLK MLMTTPNHPF LECALGELQA MSLLNVPQLW RMLMVSKEPS FYVDEGWKRA LINCPAKKET GCCHSNSLVS VTDVAADYES QMLKVGKSVV YVQIVKEEIL EVRSAKTAQV EIQHTSRAST EPSTKVRLET LLRLHKADPG WLCTRCIKTP WYGSMEKCSF FCPTRYEELV CRSPDKPPRR EVPVDVNVLV PSQPSAQRIP RIIHQTWFED LTIDRYPQLV RLQNSWKQSG WEYRFYDDAA AKEYVIENFP YHFAEAFDSL IPGAYKADFF RYLVLMKTGG VYADVDVMLD TNLDSFITPS MSFFAPRDIV GEYAGQPFCL WNGLIGAAPG HPFLIRAVER LVNLILDRSD LYDMERDICR RSEKPIELWK VRAEPLLLFS GPCALGVAAN EALNRSSLEP FDIGWISMEN LGFGGKDDHG DALILVGDKF DMGAFRISDP ERNFVVASTD LDGVEKKARI MANPTIVERN LNDTRQKQLP HYSKTAKGVY VWGSARVYKD NEVANEKIKF FPQYHDE
|
| |