Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_26077 |
Symbol | |
ID | 7197869 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 457961 |
End bp | 459652 |
Gene Length | 1692 bp |
Protein Length | 410 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178228 |
Protein GI | 219114865 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.076803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGGAAACCAG ATGATGGATA CCATATCGAA CGTCTCACAA TCAGTCTTGG AGCCCTTATG GAGTCGTCGG GGTCTCTGAC AGTGAATTCG AAAGACTTTT GACTGTGAGA TCTATTCACC AAGGCGAATC CGATTGATGT TGGCTCCGCT GTCGGTGACA CATTGGCCCC AATTTTTTGT TTTCGTCATG TCTCTGCTGC TGCGACGGAG CGCTGCCGGT TTCACCGGTA CTTTTCGTCG CCGACGGTGG TCTCGGACGA CGTTGACTTC GGACAGTGCA TCACGATTGT CATCGCTGCC ACCACTATCG GCAGAGCCTC TGCACTGGAA CCGCGAAACA TCTGAACGTC GATCCTATGC TACCAACACG CCGATCGTAA AGCAAGCCAA TGTTATTTCG CTGACGGATA TAGAAGATCC CGCCAACAAT CCGTTCAACC GGGCCGCTCC CCTGCCGGAA GGAGTTAAAA TTCTTGCCAT TGGCGCTACC ATGGACGATT TTGACGTTGC TTCTTTAGAG AGAGAAGGCG CCAATGCCAT TTTTGTCAGT CACCCGCAGG CACGGGAACC CTTGGCCCAG CTGTTGACGC AACTGCCGGA GATTGAATGG GTGCACACTC GCTCGGCGGG CATTGATTTC GTCACATCGC CTACACTCGA ATCCTGGAAG GGCAAGCTCA CGAACGCCAA GGGTCAGTTC AGCAGTACGT TGGCAGAATA TACACTCATG GCCTGCTCGT TTTTGTAAGT CGTACAGCAC TGATGTGTCG AGCTTGTAGG GTAAAGTCAA CCACCAGTCT TCGACTCTTC TGAGCCTTAC CAGCTTCTGC TTTTCCTTAG TGCCAAAGAC TTGCCGCGGT TGATGCGCAA CAAGAAAGTC AAGTGTTGGG ACAAATACAA TGTCTTGGAG CTCAGAGGTG CGACCTTGGG TATATTTGGG TACGTATCTG CTGCACTTCT GCTCGTCTTT TTCTTTGTAA AGGTCAATTT CTTACAGGCA GTTTTTGTTC AAGTTCAGCT ACGGTGACAT TGGACGGTAC GTACATGAAA ATGCAGTCAT CTTACGAACG GATTTTCAGT CCAAACTCAA AATGCATTTT ATTTCTTGTA GAGCTTGCGC GAAACTGGCG GCCGTGTACG GTATGAAGAT CATTGCGTTA CGACGACACC CCAAGCCCGA TCCACTTTGT AATGAAGTGT ACGGCAACGA CAAGGATAGT TTGAATCGCT TGTTTGCCGA ATCAGATTAC GTACTGTGCT CAGCACCTTT AACGGCCGAA ACCCGCGGAA TGATTGGTAA AGAACAGTTC GATCATGCCA AAGAAGGGGC CGTTTTTATA AATCTTGGAC GGGGGCCGAT TGTCGATGAA GTTGCCTTGA CCGATGCACT GAGTAACGGT AAACTCAAAG GCGCCGCATT GGATGTATTC ACGGTAGAAC CGCTGCCGAA TAGCTCGCCT TTGTGGGAAA TGGACAACGT GCTGCTCTCA CCGCACAACA TGGACCAAAC CGCAACTTTC ATGCACGAAG CGACCGAGTT CTATGTGGAA GAAAATCTGC CAAGGTTTGT CCGGGGTCAG GAATTGCTCA ATATCGTCGA CGCCAAGGCG GGATACTAGG TATTGCTAGA ACAGCAACAT GAGGATGCTT TTAACCTATC TTACATTAAT GTAAACGAAA AGAAACGGTT TT
|
Protein sequence | MLAPLSVTHW PQFFVFVMSL LLRRSAAGFT GTFRRRRWSR TTLTSDSASR LSSLPPLSAE PLHWNRETSE RRSYATNTPI VKQANVISLT DIEDPANNPF NRAAPLPEGV KILAIGATMD DFDVASLERE GANAIFVSHP QAREPLAQLL TQLPEIEWVH TRSAGIDFVT SPTLESWKGK LTNAKGQFSS TLAEYTLMAC SFFAKDLPRL MRNKKVKCWD KYNVLELRGA TLGIFGYGDI GRACAKLAAV YGMKIIALRR HPKPDPLCNE VYGNDKDSLN RLFAESDYVL CSAPLTAETR GMIGKEQFDH AKEGAVFINL GRGPIVDEVA LTDALSNGKL KGAALDVFTV EPLPNSSPLW EMDNVLLSPH NMDQTATFMH EATEFYVEEN LPRFVRGQEL LNIVDAKAGY
|
| |