Gene Information |
Locus tag | PHATRDRAFT_45808 |
Symbol | |
ID | 7200817 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 345729 |
End bp | 348912 |
Gene Length | 3184 bp |
Protein Length | 784 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180218 |
Protein GI | 219118903 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
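The coordinates and lengths in the record above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the coding sequence includes one stop codon; neither convention is stated in the record, and the helper name `inclusive_length` is hypothetical.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both counted."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields above.
start_bp = 345729
end_bp = 348912
protein_length_aa = 784

# Span of the gene on the replicon (assumes inclusive coordinates).
gene_length = inclusive_length(start_bp, end_bp)

# Coding bases implied by the protein: 3 per residue, plus one stop codon
# (assumption: the stop codon is part of the CDS).
coding_length = protein_length_aa * 3 + 3

# Bases within the gene span that are not accounted for by coding sequence.
non_coding_length = gene_length - coding_length

print(gene_length, coding_length, non_coding_length)
```

Under these assumptions the span matches the reported Gene Length of 3184 bp, and the coding length is shorter than the span, which is consistent with the record marking the gene as spliced.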
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.558065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCATAC AGACCCTTAC TTTTGGAAAA TATATCCCTT TTGGCATAGG GGCAGACTTT GTCACGCTGG TAGCTCCTGA ACTCTCCAGT GCCACTCCCA TACCTACCTT TTTCCGGAGT GAAGCTCACA GGTGCCCTCT CCAAGATATC GCGAGTCTGA GCCAAAAGAA ACTCCTCAAG TCGCAACAAG GAGAAGATCA ACACTTGCTT TCCTTTTTCA ACGGTCTCTG TGGAGGTACC TATCTAGAAA TGGGCGCTTT GGACGGTAGG CTGTACAGTA ATTCATTTGC TTTCCACAAA GCTTTGGATT GGAAGGGTCT ACTTGTGGAG CTGACTCCAG AAAGCTATCT ACGCTTGGTG GAGAATCGGC CCAGCGAACT AGCCGTCGTT AACGCGGCGG TATGCGATCA ACCAAAGAAG ATACACTACT ATTCGAAGCA AAGGCAGCCA GCCGTTTCAG GAGTATGGGA ATTCGCTCCT ACCGAGTTCC GGGAATTATG GTGGCCGGGT ATCACGCTCG CTGATACTCA AGAGATCGAT TGTAGACCCC TGAGGGAAAT CATTGCAACA AACGTTGGAG AACCAGCCTT TTTCGACTTT TTTAGTTTCG ACATCGAGGG CTCGGAATTC ATGGCGTTGC AAGGGCTAGA TTTTTCCAAA GTGGGTTTCG GAATTATATT CATTGAGCGC CAGCCAAATA ATCCAATGAA AAACCTGGCA ATTCGTACTA TTATGGAGAG AAATGGATAC ATTTATTTGT ACGAAAAATC GAACAGCGTA TGGTTCGTGA ATGCTCATTT TGATGAAATC TACAAAGATG TCATTGGCAC CTAGAAAACT GCAACAATAA TGGACTAGAA TTGTGCTGAA AGACCCGAAG AATCCTGCAT ACTTTCGTAG AAGATCAAAG TTGTAATACA TAGAATTATA GACTTGTGCA ATAGACTAAC AGTCAGGGAT ATGAACCCCA CATATAATGC TTGTTTCAGG GGTTTTGCTA GTCCCCAGAC ATCGCAGCCA CAGACATATA GACATGGAGA GATGTGCCCC CTCAAAACCA CAACGGGAAA AATAGGGATC CACAGCTTGT AAAATAAAAC TTCAACACTG CATGCAATAA CAATCAGAAA TGCAAATTCA AGTTGTCCCA ACATTGGCCT TGTTGAGAAC GGGAAAGTTG GCAACGAGAA AGTTGGCAAA CCCTTCTTGT TAATAGAACA AATTTTGAGG AAAACCGCTA TTTCTGCCTC TACGACTTGG AGAAATCGCT ATTTCGGTCA GATCTGAAAA TTGGTAAGTA CTTTTTGGTG AGCGCCAAAC GGAATTAAAA AAATAATCTA ATGCTCGGTG TATGCTATGT ATATGGTATG GCGGAGGTAT CGTGCTGGCA GGGATGAGCT GCTTCCAACC CATGACCTAC CCTTTCAATC TATTTCACCC AACGTCGCCG AGCACCTTAT CCTTACCCTA CATATCGACA CAAGCGTTCG TCAATGGATT CTACCGACAC GGACCGGGAA ACCTTCCTCC GCGGACAGCG CCCTACGGGA ACCTCGGGAA GTGTGGACGC CCCGCGTACC CGCGCTCCCG TCCTGTGACC TCTGGTCCGA ATCTCCCCAA TCTACCCAAC CCCAAGCGTG TTTTTTTTCG CGACGCTCGC GTTTACGCCC TCGTAGGGCA CATCCAAACC ACCGACGTGG TTGACGAATT GTGTCAATTG GTCGTTGTTG ACGGTGATGT TGCCGCCGAA GGTGAAATCA TCACTTCTCG CGTCCAAACA GTGCCTCACC AAGTATCAGC 
CCTGGTCGAT ATGCAACTCC ATCCCGGGCA CTATGCTCTC TCCTGCCGGT ACCGCTCTAA ACAATATGCC GATGCTGGTA GTGGAGGCGG TACCAGTGTC GGTAGAAATA ACGGTATCGG TGACTCGCAC GACGACACCT CCCTCTGGAT CCACTACCCA CCGTCCCACA GCACCATGGT GAGCTCCCAA CCGACCGTAA CTGCAGCCTC TAGAGCCGGC CACGACGCTC TGTTGGATGC TCTGGCACTA CGGGAGCGTA CTCGGATTCT CAATTTGTAC TACAGTCCGT ACGGGGCCAA CCCGACGGAT GCCGATTTGG AAGAAGCGCT CCACCAATTG TGAGCGAGCT GAACTCGACA AGGAGTCGCA GATGGAAGCT CACGAAGAAC GCTGGCGACA ATTCCTGTTG ATTGTGTGGG AAGAAGAACG TATGATGCGC TTGCCGGTAG CGATGGGTCA AGTTAGTTGC AGTACTGACC TTGATATCTT GGTACGCTCA ACGGCAATTT CCGTACTTGC GCAGTATGGC GACGGCCCAC CGCTGGGTAC CACCACGACT CCACTACTCT TGGATCAAAT TGCGTCTAAA CTCGTCACCG CCTTGGAATG GACGAAGTTT CCGAGGCACG CTTGTCCAGT CCAGAACAAG GCGTCCATTC TGTGATTGCT CGAGGTGATT TGGCTTGGCA TGCTACTGAA GTGGAACAAC TACCAGCGCT ACTGCTTGTT TCCTTGCAGG ACTGTTTATG GGACGTCGTG ATGCCCTACG AGTCCACTTT ATTGATCAAA GGCCTAAAGA ATGCCACGGA AACAGATCTT ATTCTGTTTT TGCTACGGCC ACCGGTCAAA TCCACCTTGC CCGGGCTAAT CAACTGGGAC AGCTTCTGCC TCGTCCATAG ATGAACACCA ACGCTTGGCA ACTTCCAGCT TAGCGACCTG CTGTATCAAT TCGATTTGTC GGCTAACGCT AGGCAGATGT TTGGTCTTGT CCCAAATTAT GACAACACGA CCAAATTATA TGTCGGCTGC CTAATTGATT TATTTTCAGG CTGTGGCGAC AATGTTTGTC AGCACTCAGC ATATCGCCCT GTCAACTACA GTCTACCATC ACCTTCTACG GATAGTCCTT CCTTGTTGTC TCCACTGACG AAACGACCAA GCTTCGCTAT TGTCCGTTTT ATCGTGTCAG GGGAACATGG TGCGTATCCA TCACTATCAA ACAGTTTTAG TACGCAAACT TCTCCCGGGT CGCAGCGAGA CCGACAAGTA CTTCAATACC ATCCGCCACA TCATAGTGAT GCAAGGGGGT GTCACTTTCC ATTGCCTGAG GTGACGGGTG GTTAAGAGTT CCTCTTCTTC CTGCAACCAA TCTGACTCTT GCAT
|
Protein sequence | MAIQTLTFGK YIPFGIGADF VTLVAPELSS ATPIPTFFRS EAHRCPLQDI ASLSQKKLLK SQQGEDQHLL SFFNGLCGGT YLEMGALDGR LYSNSFAFHK ALDWKGLLVE LTPESYLRLV ENRPSELAVV NAAVCDQPKK IHYYSKQRQP AVSGVWEFAP TEFRELWWPG ITLADTQEID CRPLREIIAT NVGEPAFFDF FSFDIEGSEF MALQGLDFSK VGFGIIFIER QPNNPMKNLA IRTIMERNGY IYLTNFEENR YFCLYDLEKS LFRSDLKIGI VLAGMSCFQP MTYPFNLFHP TSPSTLSLPY ISTQAFVNGF YRHGPGNLPP RTAPYGNLGK CGRPAYPRSR PVTSGPNLPN LPNPKRVFFR DARVYALVGH IQTTDVVDEL CQLVVVDGDV AAEGEIITSR VQTVPHQVSA LVDMQLHPGH YALSCRYRSK QYADAGSGGG TSVGRNNGIG DSHDDTSLWI HYPPSHSTMV SSQPTVTAAS RAGHDALLDA LALRERTRIL NLYYTELDKE SQMEAHEERW RQFLLIVWEE ERMMRLPVAM GQVSCSTDLD ILVRSTAISV LAQYGDGPPL GTTTTPLLLD QIASKLVTAL EWTKFPRHAC PDCLWDVVMP YESTLLIKGL KNATETDLIL FLLRPPLSDL LYQFDLSANA RQMFGLVPNY DNTTKLYVGC LIDLFSGCGD NVCQHSAYRP VNYSLPSPST DSPSLLSPLT KRPSFAIVRF IVSGEHGAYP SLSNSFSTQT SPGSQRDRQV LQYHPPHHSD ARGCHFPLPE VTGG
|
| |