Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33054 |
Symbol | |
ID | 7197278 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1414613 |
End bp | 1415782 |
Gene Length | 1170 bp |
Protein Length | 363 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177825 |
Protein GI | 219112147 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.608605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGACT ACCGACCTAG TTTGACCCCC GAGGAACAGC GTAAGTTAGA GTTGCTACGA CATGCCAGCT ACAACACTGC TCCCCGGGAA AGTACTCGTA TAGAGCGTCT GCCCCCTCCC CAGCAAGCGA TGATATCGCC GTCGGAACTG AGTCGCCTAA GCCTAGCAAA CATCGAAGCT CGGATGGCTT CTCAAGGGGC GCTGGCTGCA CGTGTTCAAG CTGCTCAGCA AGCTCATGCC CAGGCTCAAG TTCAGAATAT GCAGATGATG GGGCGTCCAC AGCTCTCTCC CGCCCTGCTG TCCAAACATC GACTGGCCTT TCACCCGTCG TCGCCCATAA AGCAAGGCCA GAGGCCGTAT TCACCGCAAA AGCAAAGTGT CGGTGGCTAT CGTTCTCCAA TGAGACAACC CACCTACTAT AGATCGCAGA ATGAATTCTC TTCCTTCCCT TCTCCGGCTT CATCATTTCT TTCACCAGTT GAACACGCGC ACGCATATCC CTCTCCTGCT AGTTTCCCCT CGTCAACTAC TAGCTTTCCT TCTTCGGCGG GGAGCTTCCC ATCGTCTACG GGAAGAATCA GAAGCCCGGC AGAAGGACTG AACATGCTTC GAGCAGTAAG CTTGGGAATG CGAGGAGATC GTTCAGAAGG CCTTCCTCCC ATGTTAGCCC CTCCACTGAC CTCTCATATG GCACCGCCCC AATCACACCC ACCGCCGCGA CACAATAGCA ATGCGTCAAT GGATGACATG TCAACTTCTA CGATACACCC TTCAGAGGGA AAGTTGTACA TTGACGAACT GCAACCTTAT GACGTTCTTT GTGGACGAGG TGGAAAGTCG AACCATCACC CCGGAAAGTA AGTCAGTGTT GCAAATCCAT AGGTGGTTGG ACATGCATCT CATTTTGTCT AACTCTCATC GCTTTGTGGC TTCAGCAAAC GGTACCGACA CGTCGTCAGC GAAATGAAAA TGATGTATCG CAAAACAGAA GCAAAAGCGA TCAAAACCGA TCTTAGTCGT GCTATTGTGG AACATGTATG CAACTATGGA GGACGGTTTA TAAAAAGAGA AGAAAACTCG GGTCGATACT ATGTACTCAC CAAATCTGAA GCCAGGAAAA AGACCAGCCA AGCTCTGCGG GAAACCAAGG AATTAAAGTG GACCGCGTAG
|
Protein sequence | MMDYRPSLTP EEQRKLELLR HASYNTAPRE STRIERLPPP QQAMISPSEL SRLSLANIEA RMASQGALAA RVQAAQQAHA QAQVQNMQMM GRPQLSPALL SKHRLAFHPS SPIKQGQRPY SPQKQSVGGY RSPMRQPTYY RSQNEFSSFP SPASSFLSPV EHAHAYPSPA SFPSSTTSFP SSAGSFPSST GRIRSPAEGL NMLRAVSLGM RGDRSEGLPP MLAPPLTSHM APPQSHPPPR HNSNASMDDM STSTIHPSEG KLYIDELQPY DVLCGRGGKS NHHPGNKRYR HVVSEMKMMY RKTEAKAIKT DLSRAIVEHV CNYGGRFIKR EENSGRYYVL TKSEARKKTS QALRETKELK WTA
|
| |