Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_27735 |
Symbol | |
ID | 7201191 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 668146 |
End bp | 670329 |
Gene Length | 2184 bp |
Protein Length | 538 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180482 |
Protein GI | 219119443 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.326245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGGCCGAA ATCCTTCTTT CTTAGGCCCA AGTTAAATTT GCTTCTTGCA CCACTATTTT AACAATCATG GGTGAAGAAG AATACTTGAA ACAGTCGTCG CGCTTAGGGA ACTTGGATCG GTAAGAGGAT CGAGGACGGT TTACGTGTTC TGTGTTAATA GTTTGAAAGA TACACCAACA AGTATGCGTC TAACGTCCCA ACCCTTCACT CTCACACATG TACGAAATCA ACGCGGACCA GTCCACAAGT AGCCATCTTG GACTTTGGGT CCCAGTACTC CCACTTGATT GCCCGTCGCG TCCGCGAACT CCACGTTTAC TGTGAACTCT ACAGTTGCCA GGTTGACGCC GCCGAACTCG CCCAACACCA ACTGACGGCC GTCATTCTTT CCGGCGGTCC GAATTCCGTG TACGAAGAAG GCGCACCGCA TGTGTCTCCA GAGACCTGGA AACTTATTCA AGATCGACGT ATTCCCGTCC TCGGAATCTG CTACGGTATG CAGGAATTGG CACACGTATT TGGAGGCCGC GTGGAAGCAG GTCTCAAACA CGAGTACGGC AAGGCCATGG TACACCGGGT AGAAGGCTGC GATTCGCAGT TGTTTGCCGA CATGCCCAGC GAATTTCAAA TGTGGATGTC ACACGGAGAC AAGTTACATG CCGTACCGGA CGGGTTTAAA GCCGTCGGCG CGACAGCCAA CGCCGAATAC GTTGCGATTG AAAACTTGGC CACCCGCATG TGGGGACTCC AATTTCATCC CGAGGTCACG CACTCGCCTC TGGGGAAAAC GTTGCTCCGA AATTTCGTGA TCACCATTGC TGGTGCCACA CCGGATTGGG TCATGACGGA TTACGCACAA GAGTTTATCG AAGAAGTCCG GGCCAAGGTA GGACCGGATG GTCACGTTTT GGGAGCCGTC AGTGGTGGAG TGGATTCGAC TGTGGCTGCC GTACTTATGA CGAGGGCAAT TGGTGATCGC TTTCACGCCG TGCTCGTTGA CAACGGCTGT CTGCGCAAGG ACGAAGCCAG TGCCGTGCTC AAGCGGATGC GGGAAGACTG TGGCGTCAAT CTACGCTGTG TGGATGCGAG TGACAGGTTT TTGGATCTAC TCAAGGGAGT GACGGATCCG GAAAAGAAGC GAAAGATAAT TGGTGGAACA TTCATCGATG TGTTTCAGGA AGAGGCTGCA AAGATTGAAC GAGAAAGCGG CCCTTGTCAG TATTTACTGC AGGGAACATT GTACCCTGAT GTAATTGAGA GTATCTCCTA CAAGGGACCG TCGGCGACGA TCAAGACACA TCACAACGTT GGTGGCCTCC CAGCTTCAAT GAATTTAAAA CTAATCGAAC CGCTACGGGA ACTCTTCAAG GACGAGGTGC GAGAGCTCGG CATGGCCTTG GGCATTGATG AAGCTAGCGT CTGGCGGCAC CCCTTTCCCG GTCCAGGTCT GGCCATTCGC ATAATTGGGG AAGTAACGGC CGAACGCGTC AAAATATTGC AAGAAGCTGA CGCCATTATG ATTGAAGAAT TGTGGCGTTC GGGACATTAT CGTGCGATTG GACAAGCCTT TGTCGTTCTG TTGCCCGTTA AGTCGGTGGG GGTCATGGGC GACGGTCGCA CGTATGAGAA CGTCGCAGCG GTCCGTTGCG TGGAAACCAC TGACTACATG ACGGCCGACT TTTACCATTT GCCATACGAC GTAATGGGCA CAATGAGCTC ACGCATTATC AACGAAGTGC GGGGCATCAA TCGCTTGTGT TACGATATCA GTTCCAAGCC GCCAGCAACA ATTGAATGGG AGTAAAGAGA AGAGGTTGTT CTGTTCTTTG GTTATCGTTC CTAGACTTGA AAATCATGTG TTTTTCACAG CTAGGTAATA CCGATACAGG AATGGAATGG ATATATTTAG CTTTGATCCT GTTTCATAGC TGGAGAGGCA ACTGATACGA CTCCTTACAT TAGTTTTGAC CCTGTTACCT GTGTTCTGAA ATGTTGCTGG GGTTTTCTCC GTCGCCAGCC AGACGGGAGT CCATGCGCTG TTGTTCGTGA CAGCTGACAA CTTATTGACG CAATAGCAAA GTGCGAGGAG GAGCACTGAC CTACAATAGC CTCACCATAT GATTCTTTAT TGCGTACCAG AGAATTCTCA TGAATCCAGT ACTATAATTG ACAAAAATGC TTCG
|
Protein sequence | MGEEEYLKQS SRLGNLDRPQ VAILDFGSQY SHLIARRVRE LHVYCELYSC QVDAAELAQH QLTAVILSGG PNSVYEEGAP HVSPETWKLI QDRRIPVLGI CYGMQELAHV FGGRVEAGLK HEYGKAMVHR VEGCDSQLFA DMPSEFQMWM SHGDKLHAVP DGFKAVGATA NAEYVAIENL ATRMWGLQFH PEVTHSPLGK TLLRNFVITI AGATPDWVMT DYAQEFIEEV RAKVGPDGHV LGAVSGGVDS TVAAVLMTRA IGDRFHAVLV DNGCLRKDEA SAVLKRMRED CGVNLRCVDA SDRFLDLLKG VTDPEKKRKI IGGTFIDVFQ EEAAKIERES GPCQYLLQGT LYPDVIESIS YKGPSATIKT HHNVGGLPAS MNLKLIEPLR ELFKDEVREL GMALGIDEAS VWRHPFPGPG LAIRIIGEVT AERVKILQEA DAIMIEELWR SGHYRAIGQA FVVLLPVKSV GVMGDGRTYE NVAAVRCVET TDYMTADFYH LPYDVMGTMS SRIINEVRGI NRLCYDISSK PPATIEWE
|
| |