Gene Information |
Locus tag | PHATRDRAFT_49558 |
Symbol | |
ID | 7198185 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 35105 |
End bp | 39356 |
Gene Length | 4252 bp |
Protein Length | 1106 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184384 |
Protein GI | 219128363 |
COG category | |
COG ID | |
TIGRFAM ID | |
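
The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the reported Gene Length). The function names `gene_length` and `coding_length` are hypothetical helpers introduced only for illustration. The sketch also compares the gene span with the number of bases needed to encode the 1106 aa protein plus one stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein: 3 per residue plus one stop codon."""
    return protein_aa * 3 + 3


# Values taken from the record above.
span = gene_length(35105, 39356)   # 4252, matches the reported Gene Length
coding = coding_length(1106)       # 3321
non_coding = span - coding         # 931 bases not accounted for by the CDS

print(span, coding, non_coding)
```

The 931 bp difference is consistent with the record flagging the gene as spliced; how much of it is intron and how much is untranslated sequence cannot be determined from these fields alone.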
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.022156 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CAACGTGCGA TACGAAAACA AGAACTACTA AGGGATCTGT TTGTCTCAGG TACAATGGTA AGCTTTCAGG ATTGCAGACA CTACGAAGCT GCATATCATG CTAGTAGCAG ATTGACCGAT TTCGGGCTTG AGCGTAGAGA GACTGACCAT TCGAGCCGCA CAGCTTTTGG CTCACAAGGA AACACGTATC CCCTGGAAAA CTTTTCAGAT CCATCCGCAC ATTTTGGCCA ATGGTCGGTA CAGCACAGCA GTGCAGGGAA TCTCCACCAC GGCAGTTTGA GCGCCAATCC ATATGCGATA ATGCCGCATG TTCCTTTCAA CGAGACTCAA CAAATTAACC CCAATATTTT GAATCCCGTC CGCCTGCAAC AAAATCATAG CCAGTTTGCG TCAGCTGACA CTGCTGAAAG AGAGGGATTT TCAAACTCTA GCAATCAGCA AAAGGCTCCG TTGGGTCGTC ATAGAACACA GGTGGTGCAT GACTATCACG ATCACTACCG CGACCCAGGA AACTTCGAAA GACCAGCAGC AGCGGGCAAG TACCCTTCCA AAGAGGTTGG AAGCGGCAGG CCCTCCTTTC GTGGTGGCAC TTCAATTCAC TTCCCGGAGC GGCTGTTTGA AATGCTGGAC CGAGAAGATG AGCTAGAGTT GGCTCATATA GTCTCATGGC AGCCACACGG ACGTTCCTTT TTGGTCCACC GAACGGCCGA TTTCGTAGCC ACTGTATTGC CGAGGTGCGT CTGTGGAGCA ATCCTTGATC CGTAGATTCT CCTTTGACTA TATGCTTATC AAAAGCTGGT CTCGTTTTGT CTGTCCTACT ATAGGTTTTT CCGTCAGAGC AAGTTTACCT CGTTTCAACG TCAGTTGAAC CTGTATGGCT TCACCAGATT ATCCGTGGGT AGAGACAACG GAAGCTACTA TCACGAGCTA TTCCTGCGAG GCTGTCCCAA GCTCTGTCGC CGAATCATCC GGCGGCGCGT CAAAGGCAAT GGCGTGAAAC CAACTCCATC TCCAAGAACA GAACCAGACT TCTACAACAT GGAATGGTGT GATCAAAATG GACCCAAGAA ACAAGCGGAT TCTGACGAGG ACGAAACCTC GCTTCCAATT GCAGCTGCGC TGGCGCCGTT ACTAGATGAG GGATACGAGA ATTTGCGTCA CCAAGCCCTT CGACAGCAGC AAAATGACCA GTCGCAGCAG CAGGCGTCTT TCTCTCGCCA GCAAGAAGCG GATAGTTTCT TGTACAATAG AAGAGGCCAA AGCGGATTGC GGTTATCGTC CGGAAGAAAG CCTAACTTTC ACCACCAATT TGTGAGCGAA TCTTCCTTTC AGCCAGTGTG TCGTTCGCTC TCTCGCGATC AAAAGAACGA GCATCCACTA ACTAGAGATT GCTTCGTTCC CCATGCCGTC CTGGCTAGCA GAATCTCATT TCGGCCGCCA AAGAGGGCAA AAGTAAGAGG TGTGGGCAAT CCCGAAGTAT TCGGTAGCAT CACGGATTTG TTTTCCAGTG ACGACGACGA ATGCAATGTC AGGGATTCGA ATCTCCACCC CGATAAGCGA TTTTAAGGCA AACTATAGTG AGGGATCGGC CTGTCTATAA CTTGCTACGG AAACGCTTGA GGCAGCAGAC GATCCAACTG ACTGGAAATG TAACTTGCAA ATATTTTCAA AGCTGCCCTT TTGCAACCCT ACTAGTGCTC CATGGTTTTT GCATAGCCAG AAGCAAGACC ACCGTTTTAA CAGTCTTACA TAAATTAGGA GGGAAGCCAC AGATGTACGA AAGGCATCAC AGTCGAAACG 
GCTATTTGTG ACAATAGGTT GCTCGTCGCT GACCGTGAGT TGACAGCGCA GAATGATTCG GGGCATTTTC CTGTAAGGCC TGTCCTCCCT ACGAGTGTGG TCATCGGGGT CACAAAGGAT TGCTATGACC GTAGGCGTTC ACATGCAAAC CACATTTTTC TTCAAGCACA GGAAAAACCT ACTTCACAAA ATAGGAGAAA GGTATCATGG GGGCCACTGC ATCTCGAAGC GAGCATGTAC ACAAGGCAGC GCGTAAAGTA TCACAGGAGC CCGAAAACGA TCCCGTTAGT CAACTGGTAT GTACTGTACT GAAAGACGTC CCTTTCTCGT TTATGAGTGC AGACAAACCT CATCTCCATT GCTCCTTTAG CCCTTAGATC CGTCAAAGGA CTTTTTCGAC CATGTGCACT CGTATTTAGA GCTACGCCAG CGCTTGGATG AGACAGCAAA AGCAATCGAC CCCGATGGTG CGTTGAAAGC ACTGACAGTC GATGCCCTCT TGGGCCGACG AGACCATGTC GATGGCACTT TTCCAGTAGA ATTTCAGCAG CAAAGCGCTT ATGGAGAGAC CGTTGCCGAC ACCGAAGAAC TTTTGGTAGC AGCTTCGTTG GTCCAAGCCG AGTTTCGAGA CTTCTTAAAG CGTATACAGA AGGATATGCC TTGGCTTGCC GAAGAGGATC TCAGGGGTAA ACAGAGCAAT TCTTGCAACT TTGACAAGAA TACCGACCCC TCAACTCGTA TTTGGTTGGC ACCGATGAAA GATGCCCAAC GTATTGTCGA CAAGGCTTCC CTTAAGTACG CGGACCGATA TCCTGGGCCA CCGGAATCCT GGGTCTACGA CGTCATCCGT GCCAGCGTTG TTTGTATGAA TGCGACAGAA ATCAAGGATG TGGTCGCCTG GCTCTCCTCC AATTCGCATT CCACACCCTT GGTGGCGGCG AAGAATCGCT TTCAATCGCC AACGAGCATT ACTAAATACC GAGATTTTGT CTTCTTGATC AAATTGCCTG TCGCGAATGC TTCGTTTCAT CATGTGTGCG AGCTGCAAGT ACATCACGCT GCCATGGTCC AGTATCAATC CCAGTCGCAG CACTACTATA AGAATTTGCG ATCCTACTTT TCCCACGTGA TTGATGTGGA CGATTGGGAG CAGCGTATGG CAGATTTAGA AGCAATAGGA AGATATTCCA AGTACGATAC CATGGACGAT GGGCTTTGTA CCAAGGTATC GGCGTGCGGA AACGTTTTCC GTATTGAGCG TTTGGCTGAT CTGTTGGAAA CCAGTCTTCC TCCCTGCACG GACGCGGCGG TTCGCTTGTA CCAAAAAGCA TTGGTACTTG TCATAAGCGA CTCTAACATG AAGAGCCTAG CTGTGGCGAC CGTATACGAA AAACTGGGAG GTGCTCTATC CAGACGTGGA AAATACGGAG CCGCCCTGTG TCTGCTGGAG GCTGCTTTCG AACTCCGCCA AACCGATTTG GGCGAGACAC ATCCGGATAC TCTGAAATTG CAGAACGAGG TGGGCGTGTG TCATCATAGA AATGGCAAAT ATCAAATGGC ACTGGCGACT TTCAAGACTG CTTTAGAGAC TCTCGAGCTT CAAGCTGGAA AGCGGCATAC TCACACTGCC CAAGCACACA ACGACATTGG ATGTTTACTT CGCGATATGG GCAAGTACAC AGAAGCTCTG GATCACCACC AGCAGGCACT TCAAATTCGC GAAGCTGTTC TGGGAAAGAA GCATACGGCC ACGGCCTCTT CTTACGATAA CATCGGCGTT GTGATGCAAG AGAACGGCGA CTTTGAATGG 
GCGCTGCAAT ATCATAGGCG GGCTTTTATT GTTCGCCGTG CTTTGCTCGG AGATCATCCC TATACAGCCG TGTCCTACGA AAACATAGGT TTACTATTGA ATCGTCAAGG CAAGTCGCTA AGGGCATTGA CTTTACTTGG AAGAGCTCTA GAGATCCGCG AAGCCTTCCT TGGCGACCGC CACCCGGACA CGGCACGATC TCTGAATAGC GTCGGTCTCG TTTTGGCCAA GCTTGGTCGT ACTCCCGAAG CACTAAGCTT TTACCAGAAA GCACTCACCA TCCGAGAGGA TCTGTTGGGT AATGATCATC CGGAAACTGG AGTATCCTAT AGTAGTGTGG GAACAGCGTT GCGTCAGCTT GGCAACTATA ATGAAGCGCT CGAATATCAC GAAAAGGCTT TGGTCGTGGC GTCGGCAGCT AACGGCACGC ATCTCCAAAC AGCGGTTTTC CATTGCAATC TCGGTACGGC CGAAGCCCAC CTCGGAAACT ACGACACCGC CTTGAAGCAG TACCATCAGG CTAAAGCGAT ACGCACGCAA GTTCTCGGCC AGAGTCACCC CGATACACTG GCCGCACAAA GGTCTGTAGA CGCAATCCTG CGTGCCAAAG TTCAAGCTTG GGGACGGTAC AACATACAGT AA
Protein sequence | MPHVPFNETQ QINPNILNPV RLQQNHSQFA SADTAEREGF SNSSNQQKAP LGRHRTQVVH DYHDHYRDPG NFERPAAAGK YPSKEVGSGR PSFRGGTSIH FPERLFEMLD REDELELAHI VSWQPHGRSF LVHRTADFVA TVLPRFFRQS KFTSFQRQLN LYGFTRLSVG RDNGSYYHEL FLRGCPKLCR RIIRRRVKGN GVKPTPSPRT EPDFYNMEWC DQNGPKKQAD SDEDETSLPI AAALAPLLDE GYENLRHQAL RQQQNDQSQQ QASFSRQQEA DSFLYNRRGQ SGLRLSSGRK PNFHHQFVSE SSFQPVCRSL SRDQKNEHPL TRDCFVPHAV LASRISFRPP KRAKVRGVGN PEVFGSITDL FSSDDDECNE KGIMGATASR SEHVHKAARK VSQEPENDPV SQLPLDPSKD FFDHVHSYLE LRQRLDETAK AIDPDGALKA LTVDALLGRR DHVDGTFPVE FQQQSAYGET VADTEELLVA ASLVQAEFRD FLKRIQKDMP WLAEEDLRGK QSNSCNFDKN TDPSTRIWLA PMKDAQRIVD KASLKYADRY PGPPESWVYD VIRASVVCMN ATEIKDVVAW LSSNSHSTPL VAAKNRFQSP TSITKYRDFV FLIKLPVANA SFHHVCELQV HHAAMVQYQS QSQHYYKNLR SYFSHVIDVD DWEQRMADLE AIGRYSKYDT MDDGLCTKVS ACGNVFRIER LADLLETSLP PCTDAAVRLY QKALVLVISD SNMKSLAVAT VYEKLGGALS RRGKYGAALC LLEAAFELRQ TDLGETHPDT LKLQNEVGVC HHRNGKYQMA LATFKTALET LELQAGKRHT HTAQAHNDIG CLLRDMGKYT EALDHHQQAL QIREAVLGKK HTATASSYDN IGVVMQENGD FEWALQYHRR AFIVRRALLG DHPYTAVSYE NIGLLLNRQG KSLRALTLLG RALEIREAFL GDRHPDTARS LNSVGLVLAK LGRTPEALSF YQKALTIRED LLGNDHPETG VSYSSVGTAL RQLGNYNEAL EYHEKALVVA SAANGTHLQT AVFHCNLGTA EAHLGNYDTA LKQYHQAKAI RTQVLGQSHP DTLAAQRSVD AILRAKVQAW GRYNIQ