Gene Information |
Locus tag | PHATRDRAFT_43631 |
Symbol | |
ID | 7197347 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1052431 |
End bp | 1054071 |
Gene Length | 1641 bp |
Protein Length | 479 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178059 |
Protein GI | 219112615 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.370877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTAATTTCA TGCTTGCAGT TTATGTAAGA CGCAAAGAAA AACGCTTTTG TAGCTTTCCG CTGCTTACAA CGCAAGAATT CAAGCCACTA TGAAGACAAG AGGGCATTGT GGTGTTTTTT CACTCCTCCT TGCTACTGTT GCTTTCAATG AATTTCTAGC CGTCAGCGCA TACTCAGCAT CTTTTGCACG GGCAGCGGGG CTGGACAGCA TCGCCCCGTG GAGGTATTCC CAATACCCTT CCCCTTTCGA GTATCCAACA GTGTGTCGGA CAACTTCTGA GAAACTTTGT GATCCAGACG GGATTTTGAA CGACAGTGAA GTTGAGCGTG TCGATCGTGT ATTGAAGACT AGCCGTGAAT TTGTACTCCC CTGTAAAGCA GAAAGCAACG TCGAAGGAAT AAAAGGGAGA GTGATAAATA TTGAGATAGC AGTGGCTCTC GTAAAGCAGG TGCGTCGCCA GCGCCTTCCT GCACCATTCC TATGTTGCGC TGGCTGATGC TTCAACCTTA TTATAGATGG ACCTCCTCGA ATTCGAAATG AACAGCAAAC AACATGAACG CGCTGCGGAA GTATTTGCAA GATCACTGCA TAATGAGTGG GGAGTAGGAG TTACTAACAG CTGTGGAGGA ACCGGTATTT TATTGTTTCT TTCCGATTTA GATCGTGTGA TCTATGTTTC ACGTGGAACA GCACTTAAAA CAATTTTGAC CGATCGTCGA TTGGATCGGG CTATGAACAA GATGAAACCG CTGTTACAAG AGAAGAAATT CGAAGAGGCC ATTTTGAGCG CTGTGGAAGA GTTTGAATTT CTTATTCAGT ATGGCAAGCC GCACACATGG GAACTGATCA ACGACTATAT TACCAGGTAC GGCGGTCTCT GTTGGGTCGC TGTTTTTTTA GTTTTTGCAG GCAGGAATAT CCATGTACAA ACCAAAAAAC AGAGAGAATA TGCCAAGGTT CGCAGTCATC TGTCAGAAAT GGATCGTGCG CGAGCTGAAG CGCTGCAAGG TCGTTTCTGT GCGACGTCAT GCCCAATCTG TCTCGAACCA TTCCCAGATC ATGCCACCAC CAGCACCCGT ACCCCGGAAC AATTGGGCTC CGATAATCTC CCAATCAAAC TACTGCGCTG CGGACACGTC TTTGACCATA ATTGTTGGCT AGAATGGGCA AGCAAAGGTC AAGGTCAGGT TACCAAATGC CCTATTTGCC AGCAAGATGT AGGCATGGGG GAAGACCTCA CAACAGCCAG AAATACTCAG TCACTCTCAC GGCGGTCAAG TCGGGTTGTC AGTGATGATC TTGATGACAG TATTGGACAT CGAGGCTTGG CTGCGGAAGG AGAGCGATTT CTTAATCTCC ACAATCGTGA ACGCAGTTTT CGTCTCACGC AGCTAGGATA CCAGTTTCCT CAGATCATTG GGCCTCACCA AATTCAGCAG TGGTCACAGA ATGATTACAA CGGAATGTTG GTACAGGATC CCACTTTTAT AAGCAGTGAT CCGGTGTCTG GTGTGGGGAG CTCTGCCCGC GGTGTTGGGA TCAAAAGCAG TTTTAGTGGC GGGTCCAGCG GAGGCGGTCG TTGTGGTCGT TGGTGAGATA CTTGCAAATA GCAACGCTAT TAGTGTTGAA GCCATGACTT T
|
Protein sequence | MKTRGHCGVF SLLLATVAFN EFLAVSAYSA SFARAAGLDS IAPWRYSQYP SPFEYPTVCR TTSEKLCDPD GILNDSEVER VDRVLKTSRE FVLPCKAESN VEGIKGRVIN IEIAVALVKQ MDLLEFEMNS KQHERAAEVF ARSLHNEWGV GVTNSCGGTG ILLFLSDLDR VIYVSRGTAL KTILTDRRLD RAMNKMKPLL QEKKFEEAIL SAVEEFEFLI QYGKPHTWEL INDYITRYGG LCWVAVFLVF AGRNIHVQTK KQREYAKVRS HLSEMDRARA EALQGRFCAT SCPICLEPFP DHATTSTRTP EQLGSDNLPI KLLRCGHVFD HNCWLEWASK GQGQVTKCPI CQQDVGMGED LTTARNTQSL SRRSSRVVSD DLDDSIGHRG LAAEGERFLN LHNRERSFRL TQLGYQFPQI IGPHQIQQWS QNDYNGMLVQ DPTFISSDPV SGVGSSARGV GIKSSFSGGS SGGGRCGRW
|
| |
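The numeric fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp coordinates are 1-based and inclusive, which is consistent with the reported Gene Length of 1641 bp. It also assumes the protein length excludes the stop codon. The helper names (`gene_length`, `coding_length`, `gc_fraction`) are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1052431
END_BP = 1054071
PROTEIN_LENGTH_AA = 479


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus an optional stop codon)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def gc_fraction(seq: str) -> float:
    """Fraction of G/C among the A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    span = gene_length(START_BP, END_BP)
    coding = coding_length(PROTEIN_LENGTH_AA)
    print("gene span (bp):", span)
    print("coding bases incl. stop (bp):", coding)
    # Bases in the span not accounted for by codons. For a spliced gene
    # these are plausibly intronic, but that is an assumption, not a
    # statement made by the record.
    print("remaining bases (bp):", span - coding)
```

Passing the Gene sequence field to `gc_fraction` gives a value to compare against the reported GC content. Note that the displayed sequence may include flanking bases beyond the annotated span, so the result need not match exactly.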