Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45245 |
Symbol | |
ID | 7200261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 591093 |
End bp | 592727 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179245 |
Protein GI | 219116901 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.947975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAGC AACCACTACC TTCGCCCATG TTTCATTTGA GTCCAGTTAA GCCGAAGGCA GCAATTTTGT CACTCCCTCC AGATTCGCCC GGAGTTAGCG TGATTCAGGA CGGTAACTCC GATGGTAACC GCGTTTTTTC TGTCGTGATT CCAGATGGAG TGAGAGGTGG CGAGTACTTT CGCGTAACCA TGGCAGGTGA GACGATGAAA GTTAAATGTC CGATGATGGT AGATCCTGGC GAGCGCGTTC GAGTTACGTT GCCACCCAAA TCACCCCTAT CTCCCACCGA TATAAGTGTG ATTGCGGCCC GTCATCCTGG GCCTTACTGG ACCCACATTC CTTACCGTGC CAAACCGGGA GAGCGCTTTC AGCTGAGTGT ATTTGGAGCC GTTGTAAATG TGAAAGTACC ACCAGGAGGA CATGAGGGCA TGAGAATCTG GTTTATGCTA GGAGAGAAGA AAGGCGAAGA AGATATGGCC GATGGCCACG CTCCATCTTT GGGCCAAATT GATGTGCCTC CTCTCTGCGA CAAAAGCGAT TCTTCAAACG GTTTGGAAAG AAGAGATACT ACAACGAACA GCGATACAAT GAGAAAGGAA GACACCATTA GGCCTTCTGA AAGCGAACTA TCCCACACTG AAGAAGATGG CATTTCCGAC GACGAAACGG AAGAACCATT CCGTCCTTCC TTGCGCCCAT CTACGCTGCG AGGTATGTCC TTTGACGACG ACGAAAAGAC TCCTTCTACC TACTGTTCCG ATCCCGATGC CCATCCGAAC GGTGAAGAAA AAAACGTGTT CCACGATGCA GTGGAGGAAA TCGGCAATCG TGTCCGCCGT GGTGGTCGCC GGGTCGCACT CCCAGAACCG CCACTCTACG AAGCCGAAGT TCTTTGTTTG CGGTACGTGG GCGGGGAAGG AGAACATCGT GAGTATGGTT GGGAAAAACC GGACTGGACA ACCAAAGACT TACGGCCAAC TGGTAGGAAT GTCAAAGACG GCTCGACGCT GGCCAAACCT GTAACGTTGA TTGCCAATGT TGTCGGTACG GAGCAGCATC TGTCGTGGGA GAGGCCCGCG TGGACGAAAC AGACTCTCAA GGTGACCGAG AAAGGCCAAA GGGCTAAAAA TGGCGAAACT CTCGCGGCTC CGGTGACGAA CATTGCCCAT GTTGCTGCGG AAACGGCTGA CTACCAATTT CAAAAACCAG AATGGACCAG AAACGCCGGA TTGCGAAGCA CAGGAAAGTT CGAAAGGCTG CAAGAAGGGA AGGATATTGT TCGTCCAATT GGAGGTATCA AGCATATCGA CAATGTGAAC AAAGTTGATG TGGAGCTCCT TTCCTCTACA TCCAGTCTGG TAGAGCTCCC ACGTCATGAA TCAGTTACGA CTTCCTCGCA TGTCCCGGAA ATCGTAGTTC CTTTGGACCC AGCTGATGCG GACAAAAAAT TGGAGTTGGA GGACTCTATC TCAGAGGTTT CTCAAGTCGA TAAGGACACC GTCGAACCGG ACGAAAGTGT GGAAGAATAC GACGAAGATA TCTTTTCCGA GGAAGAATAC GAAGAATATG TTGTTGTGAG AGAAGAAGAC GTGGAAGAAG ATGTCGCATT GGCTGAGATT CGAATCGCGA CGTAA
|
Protein sequence | MQEQPLPSPM FHLSPVKPKA AILSLPPDSP GVSVIQDGNS DGNRVFSVVI PDGVRGGEYF RVTMAGETMK VKCPMMVDPG ERVRVTLPPK SPLSPTDISV IAARHPGPYW THIPYRAKPG ERFQLSVFGA VVNVKVPPGG HEGMRIWFML GEKKGEEDMA DGHAPSLGQI DVPPLCDKSD SSNGLERRDT TTNSDTMRKE DTIRPSESEL SHTEEDGISD DETEEPFRPS LRPSTLRGMS FDDDEKTPST YCSDPDAHPN GEEKNVFHDA VEEIGNRVRR GGRRVALPEP PLYEAEVLCL RYVGGEGEHR EYGWEKPDWT TKDLRPTGRN VKDGSTLAKP VTLIANVVGT EQHLSWERPA WTKQTLKVTE KGQRAKNGET LAAPVTNIAH VAAETADYQF QKPEWTRNAG LRSTGKFERL QEGKDIVRPI GGIKHIDNVN KVDVELLSST SSLVELPRHE SVTTSSHVPE IVVPLDPADA DKKLELEDSI SEVSQVDKDT VEPDESVEEY DEDIFSEEEY EEYVVVREED VEEDVALAEI RIAT
|
| |