Gene Information |
Locus tag | PHATRDRAFT_45712 |
Symbol | |
ID | 7200517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 1018404 |
End bp | 1020101 |
Gene Length | 1698 bp |
Protein Length | 480 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179767 |
Protein GI | 219117965 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00011488 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGGGATTTGA AGCCTTCTTA TAAACTTGAA GCGGATTTCT GAGCCGCAAA GGTAGAGCAA GTAAATAAGC ACCTCCACCC GACCTTGCTT TAAGTGCACC GCTATTTCCA AGCCTTAAGC TTTACTTCAC ACCTCGATCG AGCTTGCGGT GGGATGCCGT CACGAAGTGT TATTGAAGTC GATGTTTTAA TTCCGGTTCA CAACGCGACG GATACACTGC GGGCTACGAT CGAGTCCGCC ATGAATCAAG AACCATCACA CGATGACGAC AACGTTGAAA TCCAAATCGA CCTCGATGTT CACATTTGCT GCTATGATGA CGGTTCTACA GATACGAGCT GGTCCATACT CAAAGATCTG GAAGATCAGC ACATGAAAAA TTCTCGGCAA TGCTCCTCCA TATGTTGCGA CGGCAGCAAG CGCTCGCGAG TGCTGACGAA ACTGTGGTTA GGTAAGGAAG CAACATCTCG AGGAGCAGGT TACGCGAGAA ACCGGGCGGC TCAACTTCGA CCAAATCCAA ATCCCGATGG TTTTCTTTGC TGGTTGGATT CAGACGACTT GATGGCACCG ACACGGATAT ATCGCCAAGT GCAGTATTTA CTCTCTCTAG AGGAGGAAGC TCGCAAACGA GCACTCTTGG GATGCACATT TGAACGCGAC CCACCCGATT CAACATGGCA CTATTCTGCT TGGGCCAACG GTTTGACCGA TGATCGGCTA AGTCTGGAGC GGTTCCGAGA ATTGACCATA ATTCAGCCTT CATGGATGAT GCAGCGATCC CGATTTGCGG AAGTGGGCGG CTATGTTGAA GCCCCTCCCC TAAACGACAG TGATGATTCC GTTGAATGTT CCGTTTCTAT CCAGAAATTC GAACTAATAC ATCCAGTATT TGACACCCCC ACGACGCTGC GATTAGCGGA AGATTTGCGG TTTTTTCATG CTCATCTACA CTCGAATGGA ACCCTCAATC TATTGCGTCA TGATCCTCCC CTGGTGATAT ACAGGCATCG TGCCGGTCTT TCTCAAAGCA CGACAACACC ACGGAAACTA TTGCTTCAGT TACGAACGCT AGCGTTTGAA CGAATGGTTC TTGAATCGGG GGAAATATGG AAAGAGAACG GTTTCTGTAT CTGGGGTGCG GGCCGAGATG GAAAGGATTT TGTTAAGGCT CTGTCGGATA CGAACCGTAA AAGAATTCGT TGCATGGTCG ACGTAGATGA CAGGAAGATT GCTATTGGCT CTTACGTTAA TCGAGATATC AGAGTCAACA TTCCTATTAT GCACTTTTCT CTTTTGGCAA AGGACGAAAG CTTGCGTAGC AGCCTCTATG AGCAATGGAC AACGGGACAG AACCATAACC TTCCCGGATT TGGTAAAATT CGGAAGGGAA GAAATTTGAC AGGCCCGCAA GGGCTCCCTT CAGCAAAGAA ACCCAAGTTG TCGAACAAGG GTAGCGTTGA CCCGCAATTC AAATCCCTCC TACGCGAGCT GCCCGTTGTA GTTTGTGTCT CCATGTATCG GACAAATGGT GCATTGGAGC ACAACGTGAA GCAAATTGGA CGCATCGAGG GTGAAGATCT ATGGCATTTT ATCTAATTGG AAGACGGCTG GCTACTTTCG AGGAAATAGT AAACAGGCCT CAGAGGTTTC TGTTGGTAAA GACTAATGTG TATCGTTTAG AAGACATGGC AGGCATGC
|
Protein sequence | MPSRSVIEVD VLIPVHNATD TLRATIESAM NQEPSHDDDN VEIQIDLDVH ICCYDDGSTD TSWSILKDLE DQHMKNSRQC SSICCDGSKR SRVLTKLWLG KEATSRGAGY ARNRAAQLRP NPNPDGFLCW LDSDDLMAPT RIYRQVQYLL SLEEEARKRA LLGCTFERDP PDSTWHYSAW ANGLTDDRLS LERFRELTII QPSWMMQRSR FAEVGGYVEA PPLNDSDDSV ECSVSIQKFE LIHPVFDTPT TLRLAEDLRF FHAHLHSNGT LNLLRHDPPL VIYRHRAGLS QSTTTPRKLL LQLRTLAFER MVLESGEIWK ENGFCIWGAG RDGKDFVKAL SDTNRKRIRC MVDVDDRKIA IGSYVNRDIR VNIPIMHFSL LAKDESLRSS LYEQWTTGQN HNLPGFGKIR KGRNLTGPQG LPSAKKPKLS NKGSVDPQFK SLLRELPVVV CVSMYRTNGA LEHNVKQIGR IEGEDLWHFI
|
| |
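The derived fields in this record (Gene Length, GC content) can be cross-checked from the coordinates and the gene sequence. The following is a minimal sketch, not the database's own method: the function names `gene_length` and `gc_percent` are hypothetical, the coordinates are assumed to be 1-based and inclusive (which is consistent with the reported 1698 bp), and how the database rounds its GC figure is not stated in the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string.

    Whitespace is ignored, so the 10-base blocks used in the
    'Gene sequence' field can be pasted in unchanged.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Start bp and End bp taken from the Gene Information table.
print(gene_length(1018404, 1020101))  # 1698, matching the Gene Length field
```

Passing the full Gene sequence string to `gc_percent` gives a value to compare against the GC content field. Note that the Protein Length (480 aa) is shorter than 1698 / 3 because the gene sequence begins upstream of the start codon.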