Gene Information |
Locus tag | PHATRDRAFT_45723 |
Symbol | |
ID | 7200876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 34553 |
End bp | 36231 |
Gene Length | 1679 bp |
Protein Length | 541 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179961 |
Protein GI | 219118373 |
COG category | |
COG ID | |
TIGRFAM ID | |
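The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region ends with a single stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 34553
end_bp = 36231
protein_len_aa = 541

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_len = end_bp - start_bp + 1

# Assumption: the coding region is the protein length plus one stop codon, times 3.
cds_len = (protein_len_aa + 1) * 3

# Whatever remains would be flanking sequence that is not translated.
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)  # 1679 1626 53
```

Under these assumptions the span matches the listed Gene Length of 1679 bp, and about 53 bp of the listed gene region would lie outside the coding sequence.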
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.136457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGGCCTTGAC CAAGGGCGAG TGCTCTCTGT GTTGACTGTG AATACGTGCG AGAATGAGAA CAAAAGGCGT CCGACAGTCG CAACGCACGG AGCTCGATCG CTGCTTTGAT ATCTGCCCAA AATCCCCATC ACCACCACGG AATGGAGAAC GAATAGGGCG AGACGAACGA GCAATAGTAC CGATGCCGGT ACGCAATATC ATGGCGTGTT TGCAATGCGT AGCGGTAGTG ACGGTACTGT TCTTCGGTGA AGAGGGAGGA CGATCGTCCA TGCTGTCCGT AGCGGCATTC TCCACGAGAC CACCCGTCTC CAAGCCGCTC ACCACACGCT CGTCTTCACA ACGACTTCAC CTTCACTGGG AAATCTCACT ACCCCTTGTC GTCTACAACG GCAACGAGAA TTCAATGAAC GACAGCAACC ACGATAATCT TTCCCAGCAC CTTTGTCGAG CATTCAACGA GATTGTGGAA TCGACGGAAT TCTTTGAGGT TGCAAGTACA CTGGATGAAA ACAACGTCGA CACAGTGGAC AATTCTAGCA AAAAGAGCAC GGTGAAGAAG ATACAATCGT CCCACAGTCA CGACGGTGCC AGTGTCGGGT CCTGTAGGAT CCCACTCTTT GGTTCGCAAG TGACGGTACA GATGCAGTTA AACTTGCAGT CGTCTGGCGT GGAAAGTCCA CAACAGCCGC GTTCACCAAA ACCGACTCGT ACGGTAACCC TGTCCATTGA TAATTGCAGT GACAACACGG AAATCTCCTG GATATGTACC AAGTTCTACG AATCTAGAGC AAGCCTGCAG GGTATTTTGG ATCTTTCTGT TTTGCATCCT TTCGTCGTAG CTCCTCCATC ATTCTCCATG GCAACCACCA CCTCCAAGCT AGATCCCGGC GTACCCGCTG CCAGTGCACC CCTTGCCGGG TCTACCCGCA CTGCCGCTAT TCTAAAAAGT TCACCAACCG AACATGACAT CGACTCGCAT CAGAATAGTA TAATGGCACG GTTACAACGA CACGGATACG TCGTTGTGGA CGCTCCCGAA CTTCCCCGCA CAAATCTAGT TCAACACGCA GCACTCACTG ACTTTCTCAC AGAAAAGACA GGACAAGGGG ACAGCATACG AACAGACACG GTACACTATT TGACGCGCAC CGAGGCTGCC GCATGTGGGT TGACCGATCA TTTTGACGTT CTCACAAGTC TCGCGTCTTA TTTGAACGAG AACTACGAAA TAGAAGAATC GCCTTACAAA CCATTGCCAC CTGCCACCGA AGATTCCCCA TTGACCAATC CTGAACGTAT CCAATTGGCG GAATACGAAA ACAGCGGTTT CTATACTGCT CATAGTGACA ATCCGATTGT CCAAGATTCT GGTGGAAGTG GTTTAGTTCG GGAAAACTTC CGTCGCTTCA CCGCTATATT GTACATGAAT GAGGGCTGGA CTAAAGGTGA TGGTGGTGCT GTGCGACTGT ATCTACGTTC TCAGGCTCTA CAATATGTTC AGGAGGTATA CAGTGAGAGT GAAGAACCAC GTTGGCACGT GGTCGATGTC CTGCCAAGCA ACGGACGATT GTTGCTTTTT GATTCGCGAT TGGTGCACTC CGTCGAACCG GTCCTGTCGG AGCAAAAGCG ACGAGCGCTC ACCTTGTGGA TAAAGCGACC GATAGAGGGC GGTGTTTGA
|
Protein sequence | MRTKGVRQSQ RTELDRCFDI CPKSPSPPRN GERIGRDERA IVPMPVRNIM ACLQCVAVVT VLFFGEEGGR SSMLSVAAFS TRPPVSKPLT TRSSSQRLHL HWEISLPLVV YNGNENSMND SNHDNLSQHL CRAFNEIVES TEFFEVASTL DENNVDTVDN SSKKSTVKKI QSSHSHDGAS VGSCRIPLFG SQVTVQMQLN LQSSGVESPQ QPRSPKPTRT VTLSIDNCSD NTEISWICTK FYESRASLQG ILDLSVLHPF VVAPPSFSMA TTTSKLDPGV PAASAPLAGS TRTAAILKSS PTEHDIDSHQ NSIMARLQRH GYVVVDAPEL PRTNLVQHAA LTDFLTEKTG QGDSIRTDTV HYLTRTEAAA CGLTDHFDVL TSLASYLNEN YEIEESPYKP LPPATEDSPL TNPERIQLAE YENSGFYTAH SDNPIVQDSG GSGLVRENFR RFTAILYMNE GWTKGDGGAV RLYLRSQALQ YVQEVYSESE EPRWHVVDVL PSNGRLLLFD SRLVHSVEPV LSEQKRRALT LWIKRPIEGG V
|
| |
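As a rough consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below is illustrative only: it assumes the standard genetic code (the Translation table field in this record is blank), uses only the first 90 bases of the gene sequence, and assumes that translation starts at the first ATG.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (an assumption:
# the record does not state which translation table applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First 90 bases of the Gene sequence field, with the display spaces removed.
prefix = (
    "TGGCCTTGAC CAAGGGCGAG TGCTCTCTGT GTTGACTGTG AATACGTGCG "
    "AGAATGAGAA CAAAAGGCGT CCGACAGTCG CAACGCACGG"
).replace(" ", "")

# Assumption: translation begins at the first ATG in the listed sequence.
start = prefix.find("ATG")

# Translate the first ten codons from that position.
codons = [prefix[i:i + 3] for i in range(start, start + 30, 3)]
peptide = "".join(CODON_TABLE[c] for c in codons)

print(start + 1, peptide)  # 54 MRTKGVRQSQ
```

The translated peptide matches the first ten residues of the listed protein sequence, with the first ATG at position 54 of the gene sequence.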