Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44347 |
Symbol | |
ID | 7197826 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 290217 |
End bp | 292744 |
Gene Length | 2528 bp |
Protein Length | 646 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178201 |
Protein GI | 219114811 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00207317 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACCA GAGGTAGACG CGAATCTAAG GATTCACCGT GAGTATAGTA CATTCTGACA AATTGATAGC TTCCTTATTG ACATTGCAGT CCTATCGAGG GTAACAGATG GCTAATTCTC TGCTCACATG AGAGGCTGTT CTTCAACAGT AAAAAAGGTT GCCAAACTTC AACATTGAAA ATAGCCTCTA GAGAGTTGGA TGAACATAAG CTCTTCATTT TACGAGAGGT TCATTTGATA TGCACCCCCG AGGGAACCTT ATGGAAGCGG GTATGAACTG ATGCGGCAGG CCATATCTCA ATGAAAACAT CGGCCTTTCT AGGCCTCAAG CAAACGAAAT CTTTCATGCT CAATTAATAG CTAAGACAAA CACGGCCTTT GAAATTCAGA TTGGCCTGAC AGCAAAATAG CCTAGATTAA GTTCTTGATG CGTGCACACT ATGTAGCCAT CCACTATTTG ATAGATTTCG CTGACAATCA ATCTTTTTTA GGCCTCCCCA AGGTCTCACG AAATCAAAGT TAAAAACTAA TCCCAGCTCC TCAACCCCAT TTCTTGCATC ACTCATACTT TTGAAGCACG ATCTCCAAGT TACACTCTCT TATCAAGGTA AGTATGCCCT TTCCGAAGAT AGAAAGAGGT TTCGTGCAAC AAGTGCTCAC TGACCACATT TTGTTTTACA CACGCTATCT CCTATTAGAT GAAATTGACG AAAGCTTTTA ACATACTGGC TGTCTTGGCA GGTGTCAACA CCATTGCCGC TGATTGCACG CATGGTCGTC TCTTTGTCTC CGACATGGAC TCGGCTAATG TTTATACCTA CGAGATTGAT GGAACTAGTA ATCCCGTCTT GCTCAACACC CTGCCGACGG TGACAGGAAT GGGCCCGCAG TTTTTGTACA CGTCGTCTAC AGAAGGAGCT GTTACCGTTG TCTACCGTGG ATTGGAAGAA ACTGCGTATC AGGATGGAGC TATCAGCTTC ATCCGGGTTG GAGTCACTCC AAGCAGCCAC GAAGTCGCAG GTTTCTCTGT CGAGAAGGAA GACCCTTCTC TGGTTGACGA CTTTTTTGTG AGCTGTGCCA GACCTATACA TCACGTGGCT CATGACCAAA AGATTGCCAT ATTCTGTGAC GGATCATTTG AAGATAGCGT CAACTCAACA GTCTGGGTTG TTGACGAACG CTTTTTTGGG CAAGGCAACA AGACATTGGT CTTTTCCAAG ACGCTGGAAG GTTCTCATCA TGGTGTTGCC GTCCCAGTGG ATGAGGACCA TATTCTTGTC TCCGTGCCCA CACCGGAACG CGTGGCAAAT GATCCTAATG CAAGTGCACT TCCTGACGGG TTCCATGTCT ACGACTATGA TATGAATTTG TTGCATGGCC TGAATGAGGA AGAAGATCCT AGTCGTTCTT GCGCGGGTTT CCACGGAAGC GGTGTAATTG ACAACACCTT TGTGTTTGCT TGCGATCAGG ATCATGGCGG GATCCTTGTT GTTGACTATG GTCAAGCAGG TGTTACTTAC ACTTCTCGAG CTCTCTCCTA TCCGGATGGC TTTGATGCCC ATCGGACTGG AACCTTAACA GAGCATCGTG ATAGCAACGC CATTGTCGGT AATTTTGCTG ATAGAGCCAC TGGAGATTCT AAGCTTGTTT CATTCGTACC GAAGCAGCAA TCTGATGAAA TCACTGAAGG ACAACTTCTG CCTCTGGAAT CGGGTCAATG CAGTTTCAGC TTTGAGCAAT CGGGTGGTAA CCTAATCCTT GCCTGGATGC CGACAGGAAA TCTTCAAGTT TATGCTATTG AGCCTGAATG GATGCTACTT GCCGACATCC AGGTAATCGA TGACATGTCT TCTTGCGACG GAACTTCAAT GGCACCGGGT CAAGGACATG CCTACATCAT GCAAGGAACT TCATTGATTG ACGTTGATCT TCATGACTTG ACGTCGCCGG AAATATCTAG CATTGATCTT GGCTTTATGC CAGCATCAGC AGTGGTGGCT GGAGTTCCGG CTGGTTATGC CTGCGAGGCT CCCAGCTTTC CTGAGACCTC TTCCACATCT GCTGTTGATG GATGGATCTC GATTGAGCAA GTCCTTGCTC CTGCCGGTTC TACTGAATCA ACAGAATTTC TTCGGTCTTT CCGCAATGAT ATTGCTAGAA GTCTTGGAGT TGGCCTCAAT CGCGTTTTCG TCGAAGAAAC AGTCAAGGAA TCGGATAGTT CCACCGTTGT CCATGTCAAG ATGAGCGACC CCACGGAACA TGATGCGAAC CCCGCCACAG GAAAGCAACT CCTCGACCAG CTCATTGCCA GCGGTATGAG TTCGGCAACT AGTGTGTCTA GCCAAGCCCC TTCTCAGGCC ACGGGAGGCA ACGGCGGAGG AGATTCTTGG CCAACTGGGG CTACCGTTGG CATTGTGCTT GTTGCAATTG TTGCCATTGC ATCGATTGTA GCGGCCGTCT TGTTCAAGAA GCGCGAGAAA AAGGCCCTCT TTGACTTGGA AAAGGCCAAT AAAGGGCAAT CGGCCTAG
|
Protein sequence | MTTRGRRESK DSPPPQGLTK SKLKTNPSSS TPFLASLILL KHDLQVTLSY QGVNTIAADC THGRLFVSDM DSANVYTYEI DGTSNPVLLN TLPTVTGMGP QFLYTSSTEG AVTVVYRGLE ETAYQDGAIS FIRVGVTPSS HEVAGFSVEK EDPSLVDDFF VSCARPIHHV AHDQKIAIFC DGSFEDSVNS TVWVVDERFF GQGNKTLVFS KTLEGSHHGV AVPVDEDHIL VSVPTPERVA NDPNASALPD GFHVYDYDMN LLHGLNEEED PSRSCAGFHG SGVIDNTFVF ACDQDHGGIL VVDYGQAGVT YTSRALSYPD GFDAHRTGTL TEHRDSNAIV GNFADRATGD SKLVSFVPKQ QSDEITEGQL LPLESGQCSF SFEQSGGNLI LAWMPTGNLQ VYAIEPEWML LADIQVIDDM SSCDGTSMAP GQGHAYIMQG TSLIDVDLHD LTSPEISSID LGFMPASAVV AGVPAGYACE APSFPETSST SAVDGWISIE QVLAPAGSTE STEFLRSFRN DIARSLGVGL NRVFVEETVK ESDSSTVVHV KMSDPTEHDA NPATGKQLLD QLIASGMSSA TSVSSQAPSQ ATGGNGGGDS WPTGATVGIV LVAIVAIASI VAAVLFKKRE KKALFDLEKA NKGQSA
|
| |