Gene Information |
Locus tag | PHATRDRAFT_29157 |
Symbol | |
ID | 7203195 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 522021 |
End bp | 523634 |
Gene Length | 1614 bp |
Protein Length | 441 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182409 |
Protein GI | 219124224 |
COG category | |
COG ID | |
TIGRFAM ID | |
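The Start bp, End bp, and Gene Length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive, which is consistent with the values in this record. The function name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = feature_length(522021, 523634)

# Coding length implied by the 441 aa protein plus one stop codon.
coding_length = (441 + 1) * 3

print(gene_length)    # 1614
print(coding_length)  # 1326
```

The implied coding length is shorter than the genomic span, which fits the "Is gene spliced | Yes" field. The remainder would be non-coding sequence within the span, although the record does not give the exon boundaries.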
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.122862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACACCGACG CGACCCGATC TTCAACTCCA CCCCACCATG AAATTCGTTC AAGCTGCCAT CTTCGCCCTC GCTGCCTCGG CCAGTACTGC TGCTGCTTTT GCCCCTGCCA GTAAGTTTGC CAATCCGTCG TTACTTGGCG TGCATGGATC CGTTAGGGAA CCGAATCACG TATCGAAATG CCGTTCCGAC GCGAATCTAT TGCGTTCCAA ATCACTATTC CATATCCAAT CTAACTCGAG CGCCCTTTGC TCCTATTCTA TAGAAACCTT TGGTGTCCGT AGCTTTGCCC CGGTTTCTCT GATCGAGCTC GAAGCCAAGG CCTCCATTGA AGACCTCAGC GATGCCGAGC TTAAGGGAAA GAAGGTTCTC GTCCGTTGCG ACGTCAACGT CCCCCTGGAC GGTAAGAAGA TCACCGACGA TACCCGAATT CGTTCCTCCA TCCCTACCAT TGAGTACTTG AAGAACAAGG GTGCCATCGT CAGTGTCTGC TCACATTTGG GACGCCCCAA GGACGGCCCC GAAGATAAGT TCTCCCTCGG ACCTTGCGCC GAGCGCATGG GGGAACTCCT CGGACAAACT GTAACCCTCG CACCCGACTG CATCGGTGAA GAAGTCGCCA AGATTGTCAA TGATGCCAAG GAAGGCGATG TTATCATGCT CGAGAACACC CGCTTCTACA AGGAAGAGAC CAAGAACGAC CCGGAATTCG TCGAAAAGCT TGCCGCTCCC TTTGACCTCT ACGTCAACGA CGCCTTTGGA ACGGCCCACC GTGCCCATGC CTCCACCGAA GGTGTGACCA AGTTTCTCAA ACCTTCCGTT GGAGGATTCT TGCTCGCCAA GGAACTCGAA TACCTTGATG GTGCCATTAG CAACGGAAAG AAACCCATGG CCGCCATTGT CGGTGGATCC AAGGTCTCTT CCAAGATCAC CGTCCTCGAA GCCCTGCTCG ATAAATGCGA CAAGGTCATC ATCGGAGGTG GTATGGTATT CACCTTCCTC AAGGCAAAGG GATTCAATGT TGGTACCTCC CTCGTCGAGG ATGACTTTGT GGATACCGCC AAGGAAGTCT TGGCCAAGGC CGAAAAGTTG GGCAAGGAAA TCCTTCTTCC CATTGATATC GTGATTGCCG ACAAGTTCGA CGCGGAAGCC GAGACCAAGG TCGTAAGTGC CGAAGAAATC CCTGATGGCT GGATGGGTCT GGACAACGGT CCCGAAACCA CCGCACAGCA GAAGGAAGCA CTTTCCGACT GCAAGACTAT CATCATGAAC GGTCCCATGG GAGTCTTTGA GTTCGAAAAG TTCGCCAAGG GAACTTTTGA CCTTGTCAAC ATCCTTGCCG ATCTTTCCAA AGACAAGGGT GCCATTACTA TTATCGGAGG AGGTGATTCC GTTGCCGCCA CTGAACAGAG CGGACGTGCC GGTGACATGT CCCACATCTC CACCGGCGGT GGTGCCAGTT TGGAGCTGTT GGAAGGAAAG GTCCTCCCCG GTGTGGCCGC GCTCAACGAC AAATAAAGTT GTCCTCTGGC GGCAGTGAGC AATTTACGAG ACGCGTGATG CGCTACACGT CACACACAGT GTAAATTCAT ATAGACCGAT ACTTTGTCAT CGCT
Protein sequence | MKFVQAAIFA LAASASTAAA FAPAKTFGVR SFAPVSLIEL EAKASIEDLS DAELKGKKVL VRCDVNVPLD GKKITDDTRI RSSIPTIEYL KNKGAIVSVC SHLGRPKDGP EDKFSLGPCA ERMGELLGQT VTLAPDCIGE EVAKIVNDAK EGDVIMLENT RFYKEETKND PEFVEKLAAP FDLYVNDAFG TAHRAHASTE GVTKFLKPSV GGFLLAKELE YLDGAISNGK KPMAAIVGGS KVSSKITVLE ALLDKCDKVI IGGGMVFTFL KAKGFNVGTS LVEDDFVDTA KEVLAKAEKL GKEILLPIDI VIADKFDAEA ETKVVSAEEI PDGWMGLDNG PETTAQQKEA LSDCKTIIMN GPMGVFEFEK FAKGTFDLVN ILADLSKDKG AITIIGGGDS VAATEQSGRA GDMSHISTGG GASLELLEGK VLPGVAALND K
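Both sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before measuring length or base composition. The sketch below shows one way to do that, using short excerpts copied from the start of each sequence. The helper names `normalize` and `gc_fraction` are hypothetical, and the GC value shown applies only to the 20-base excerpt, not to the whole gene.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (whitespace ignored)."""
    dna = normalize(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence and first three of the protein sequence.
dna_excerpt = "CACACCGACG CGACCCGATC"
protein_excerpt = "MKFVQAAIFA LAASASTAAA FAPAKTFGVR"

print(len(normalize(dna_excerpt)))      # 20
print(len(normalize(protein_excerpt)))  # 30
print(gc_fraction(dna_excerpt))         # 0.7
```

Applied to the full protein sequence, `len(normalize(...))` can be compared against the Protein Length field. Applied to the full gene sequence, `gc_fraction` can be compared against the GC content field.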