Gene Information |
Locus tag | PHATRDRAFT_38445 |
Symbol | |
ID | 7203426 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 115154 |
End bp | 116469 |
Gene Length | 1316 bp |
Protein Length | 390 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182617 |
Protein GI | 219124661 |
COG category | |
COG ID | |
TIGRFAM ID | |
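The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which matches the listed 1316 bp), and that the genomic span runs from the start codon through the stop codon with no untranslated regions; the record does not state either point. The function names are hypothetical, chosen for illustration only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def implied_noncoding_bp(gene_len: int, protein_aa: int,
                         stop_in_span: bool = True) -> int:
    """Bases in the genomic span not accounted for by codons.

    Assumes the span holds only coding sequence, the stop codon
    (if stop_in_span is True), and intronic sequence.
    """
    codons = protein_aa + (1 if stop_in_span else 0)
    return gene_len - 3 * codons


# Values copied from the record: Start bp, End bp, Protein Length.
gene_len = feature_length(115154, 116469)
noncoding = implied_noncoding_bp(gene_len, 390)
print(gene_len, noncoding)  # prints: 1316 143
```

Under these assumptions, about 143 bp of the span is non-coding, which is consistent with the record marking the gene as spliced.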
| |
Plasmid Coverage Information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.261739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCGT CGTCTACCAA CGACGACGGC GACAACGTCG CCGTGGCCTT TGGGTTGGTG ATTGGTGCGG GGGCGGCGAC GGGGCTCGGC GCAGCGGTAG TCTTCGTACC AGCCTTGGTA CGTTTGGCGT CACGGAAAAC TCTAGCAGCC GCCTTGGGTC TTTCGGCCGG AGTCATGACG TACGTCTCCT TCGTGGAGAT TTTGGGAAAA GCCCGTACGG CCTTTGTCGA TGCCGGATAC GAAGAAGATC GGGCCTACAT TTACGCGACA CTCTGCTTCT TTGGTGGCGT CGTTCTCATG GTGGTACGTA CGTAGTTTCG CCAGTGGAGT TGGAACGATT GGCATCCACT TTGTGTGTAT GTGTCGACTA CGTCTCGTTT TTGTGTGTGT GTGTTATCAT GAGTCGGGGG CTGACGAGAC ATTGTTGTGA CGTTCTTTTC TTTCAGGCAC TCAATTATAT GGTGACCTGG TTGTTGGGTG GGCATCATCA TCACCACCAC CACCACGATT TCCCCAAAGA CCACATTACC CACGCGGCAA AGACGCAAGA GATAACCAAC GTGGACGATC CTTCGGCGCC GCACCCGGCG GACGACAACG CGCCCTTGGC CTGTCCCTGT TGCTCGGACG ATCCGGCCGG GGATTGGCAA GCCGTCCAAG ATATGGCCAG CGAAATCGAA GCCGTGGAAA AAGATCACAA AGTTTGGGAC GGAATACACG AACCGGATTC TCGGGCTCGT CCGGATTCTG CGGACTCGTC ACCGCACGCT ACTCCTTTTT CAGCCACGAC GCATCACGCA TCAGATTACG GCGACGACAG TTCACACGAT GCCTTGGCGG AACCCACGGA CGAGTCCAAA AAACTCTTAC GCATGAGTTT GAACACTGCG CTGGCTATTG GAATTCACAA TTTCCCCGAA GGCCTCGCGA CGTTCGTCGC CGCGCTCGGG GATCCCAAAG TGGGAGCCGT CCTGGCCGTC GCTATTGCCA TTCACAATAT TCCCGAAGGT CTCTGTGTCG CCATGCCCGT ATACTACGCT ACGGGCAACC GCTGGAAGGC CTTTGGCTGG GCCATGCTAT CGGGCATGTC CGAACCAGTG GCGGCGCTTT TGGGATGGGC TGTCCTGGCT AGTTCTTTTT CCGATACCCT GTACGGGTTG CTCTTTGGTA TGGTAGCCGG CATGATGGTT GTTATTTCCA CACGCGAACT ATTGCCAACG GCGCATCGTT ACGATCCAGA AGATTGCGTC GTGACGTACG CCTTTATTTC TGGTATGTGC ATTATGGCGC TCTCTCTGGT GCTCTTTTTG TTGTAG
|
Protein sequence | MDPSSTNDDG DNVAVAFGLV IGAGAATGLG AAVVFVPALV RLASRKTLAA ALGLSAGVMT YVSFVEILGK ARTAFVDAGY EEDRAYIYAT LCFFGGVVLM VALNYMVTWL LGGHHHHHHH HDFPKDHITH AAKTQEITNV DDPSAPHPAD DNAPLACPCC SDDPAGDWQA VQDMASEIEA VEKDHKVWDG IHEPDSRARP DSADSSPHAT PFSATTHHAS DYGDDSSHDA LAEPTDESKK LLRMSLNTAL AIGIHNFPEG LATFVAALGD PKVGAVLAVA IAIHNIPEGL CVAMPVYYAT GNRWKAFGWA MLSGMSEPVA ALLGWAVLAS SFSDTLYGLL FGMVAGMMVV ISTRELLPTA HRYDPEDCVV TYAFISGMCI MALSLVLFLL
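The sequences above are displayed in space-separated blocks of ten characters, so they need the whitespace removed before any length or composition check. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names are hypothetical, and the input shown is only the first two blocks of the gene sequence, so the printed value describes that excerpt and not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the block spacing used for display and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungapped nucleotide string."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence, copied from the record.
excerpt = "ATGGACCCGT CGTCTACCAA"
seq = clean_sequence(excerpt)
print(len(seq), f"{gc_fraction(seq):.0%}")  # prints: 20 55%
```

Passing the complete Gene sequence field through the same two functions would allow a comparison with the listed Gene Length and GC content, assuming the database computes GC content over the same span.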