Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47245 |
Symbol | |
ID | 7202336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 16140 |
End bp | 17836 |
Gene Length | 1697 bp |
Protein Length | 480 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181470 |
Protein GI | 219122268 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.548849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGTCAAACG CTTTTCCAAG CTGTGCTCGC AGTCTAATCA TGGGATTCGA AGCCTTCTTA TAAACTTGAA GCGGATTTCT GAGCCGCAAA GGTAGAGCAA GTAAATAAGT ACCTCCACCC GACCTTGCTT TAAGTGCACC GCTATTTCCA AGCCTTAAGC TTTACTTCAC ACCTCGATCG AGCTTGCGGT GGGATGCCGT CACGAAGTGT TATTGAAGTC GATGTTTTAA TTCCGGTTCA CAACGCGAAG GATACACTGC GGGCTACGAT CGAGTCCGCC ATGAATCAAG AACCATCACA CGATGACGAC AACGTTGAAA ACCAAATCGA CCTCGATGTT CACATTTGCT GCTATGATGA CGGTTCTACA GATACGAGCT GGTCCATACT CAAAGATCTG GAAGATCAGC ACATGAAAAA TTGTCGGCAA TGCTCCTCCA TCCGTTGCGA CGGCAGCAAG CGCTCGCGAG TGCTGACGAA ACTGTGGTTA GGTAAGGAAG CAACATCTCG AGGAGCAGGT TACGCGAGAA ACCGGGCGGC TCAACTTCGA CCAAATCCAA ATCCCGATGG TTTTCTTTGC TGGTTGGATT CAGACGACTT GATGGCACCG ACACGGATAT ATCGCCAAGT GCAGTATTTA CTCTCTCTAG AGGAGGAAGC TCGCAAACGA GCACTCTTGG GATGCACATT TGAACGCGAC CCACCCGATT CAACATGGCA CTATTCTGCT TGGGCCAACG GTTTGACCGA TGATCGGCTA AGTCTGGAGC GGTTCCGAGA ATTGACCATA ATTCAGCCTT CATGGATGAT GCAGCGATCC CGATTTGCGG AAGTGGGCGG CTATGTTGAA GCCCCTCCCC TAAACGACAG TGATGATTCC GTTGAATGTT CCGTATCTAT CCAGAAATTC GAACTAATAC ATCCAGTATT TGACACCCCC ACGACGCTGC GATTAGCGGA AGATTTGCGG TTTTTTCATG CTCATCTACA CTCGAATGGA ACCCTCAATC TATTGCGTCA TGATTCTCCC CTGGTGATAT ACAGGCATCG TGCCGGTCTT TCTCAAAGCA CGACAACACC ACGGAAACTA TTGCTTCAGT TACGAACGCT AGCGTTTGAA CGAATGGTTC TTGAATCGGG GGAAATATGG AAAGAGAACG GTTTCTGTAT CTGGGGTGCG GGCCGAGATG GAAAGGATTT TGTTAAGGCT CTGTCGGATA CGAACCGTAA AAGAATTCGT TGCATGGTCG ACGTAGATGA CAGGAAGATT GCTATTGGCT CTTACGTTAA TCGAGATATC AGAGTCAACA TTCCTATTAT GCACTTTTCT CTTTTGGCAA AGGACGAAAG CTTGCGTAGC AGCCTCTACG AGCAATGGAC AACGGGACAG AACCATAACC TTCCCGGATT TGGTAAGATT CGGAAGGGAA GAAATTTGAC AGGCCCGCAA GGGCTCCCTT CAGCAAAGAA ACCCAAGTTG TCGAACAAGG GTAGCGTTGA CCCGCAATTC AAATCCCTCC TACGCGAGCT GCCCGTTGTA GTTTGTGTCT CCATGTATCG GACAAATGGT GCATTGGAGC ACAACGTGAA GCAAATTGGA CGCATCGAGG GTGAAGATCT ATGGCATTTT ATCTAATAGT AAGACGGCTG GCTACTTTCG AGGAAATAGT AAACAGGCCT CAGAGGTTTT TGTTGGT
|
Protein sequence | MPSRSVIEVD VLIPVHNAKD TLRATIESAM NQEPSHDDDN VENQIDLDVH ICCYDDGSTD TSWSILKDLE DQHMKNCRQC SSIRCDGSKR SRVLTKLWLG KEATSRGAGY ARNRAAQLRP NPNPDGFLCW LDSDDLMAPT RIYRQVQYLL SLEEEARKRA LLGCTFERDP PDSTWHYSAW ANGLTDDRLS LERFRELTII QPSWMMQRSR FAEVGGYVEA PPLNDSDDSV ECSVSIQKFE LIHPVFDTPT TLRLAEDLRF FHAHLHSNGT LNLLRHDSPL VIYRHRAGLS QSTTTPRKLL LQLRTLAFER MVLESGEIWK ENGFCIWGAG RDGKDFVKAL SDTNRKRIRC MVDVDDRKIA IGSYVNRDIR VNIPIMHFSL LAKDESLRSS LYEQWTTGQN HNLPGFGKIR KGRNLTGPQG LPSAKKPKLS NKGSVDPQFK SLLRELPVVV CVSMYRTNGA LEHNVKQIGR IEGEDLWHFI
|
| |