Gene Information |
Locus tag | PHATRDRAFT_47990 |
Symbol | |
ID | 7203217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 635963 |
End bp | 637193 |
Gene Length | 1231 bp |
Protein Length | 391 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182266 |
Protein GI | 219123926 |
COG category | |
COG ID | |
TIGRFAM ID | |
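The length fields above can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the convention that reproduces the listed 1231 bp) and that the protein length does not count the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int) -> int:
    """Bases needed for the protein plus one stop codon (assumed not counted in aa)."""
    return 3 * (protein_aa + 1)


length = gene_length(635963, 637193)  # Start bp / End bp from this record
cds = coding_bases(391)               # Protein Length from this record
print(length, cds, length - cds)      # prints: 1231 1176 55
```

The 55 bp difference is consistent with the gene sequence in this record, where the ATG that opens the listed protein begins at base 56 and the final codon is TAG.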
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.077823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACCAGATAT GCTCGCCAAC GTAACTTGCC AAATAAAACT AATCGAAAGC ACACAATGTC AGCGGCGATC AGGAATGAAG CTAACAGTAA GCCTCGAGAC CGACGCTTGC GATCCAACCT TCTCATGAAG CTTGTGATTA TGTCCGGAGT CGGAGCACTC GGCGGTTTTT GGTTGTTCTA TTCCGTGTGG GAAACACAGT GCTCGGATTT ACTCAATCAG GCACAGATTC GGCACAGTGC AGTTATCGCC GAGCTTCAAG AACTACATAC TAAACAGACT AGTGCATTAC AGAATTGCGT CGAAGGAGAT GCCACGAAGG AGAAAGCTTT GGCAGAACGA TTGAAAACCC AAAGCACCTT AGTGATTAAG CACCATGATT TATTGCAACA GTACGATGAA GCAAAATCGC GAATAGCTCG TTTAGAAGTG GAACTCGAGG GTGTGACGCA AACAAATCAA TCGTTGAATG AGCAACTCGA AAGGCTACAA AATGAAGTTG CGGAGTCAGC ACGTTCGACA CAGGGTGCCG AGAAGCAAAT CGTATTGCTG CGTACTCAAC TGGAGGCAAC CAGATCCGTG ATAAAGCAAT CGTGTGCTTT CAACGAAACT ATCGACATGA GTTCGCAGCG ACACAAAGAA GAGGTTTTGC AAATATACGC AGCCATTCAA AGACAGAGTT TTGCGCAGCT CTTTCAACGA TTCGGCGAAG GTCCTTACGA AGTTGAATTT CTGCTTTCCA CCGAATCGAC TACGGAAACG GTTGCGCATG AAACTTTCCG AGTAGAGTTG CTGCGATCAA AAGATATGCC TCACACAGTT CTAACATTTT TGAGCCTTGT GGAGCTCCGT CTCTACGACG GAACAACGAT TGCAGGAACA GATGGGACAG TCATCAGTGG TGGGATCCCC AAACAAGCTC AGACACGTGC GCAATCGTAT CTTATGAGAA TGTACGTGGA GCATGGATTT GGTTTTTCTC CTCTCGTAAT TGAAGAAACA TCTCCAACAA TGCCTTGCAT GGCGCATACT TTTGGTTTTA CTGAAAGAGG GCCTGGTTTT ATAATTCCAC TCGAGAGCAT GTCAAAGAAC GAAAGCCCAT CTTGCCCCGG TCGTATTTCG AGCGGCCGTG ATGTATTGGA GCGACTAGCA AGAGACCGAG AGAGTCAGCT TACAATTATT GAAGCTAAAC TTGTCAGTCG AGATATCGGT TCGCACGACA CCGAGCTGTA G
Protein sequence | MSAAIRNEAN SKPRDRRLRS NLLMKLVIMS GVGALGGFWL FYSVWETQCS DLLNQAQIRH SAVIAELQEL HTKQTSALQN CVEGDATKEK ALAERLKTQS TLVIKHHDLL QQYDEAKSRI ARLEVELEGV TQTNQSLNEQ LERLQNEVAE SARSTQGAEK QIVLLRTQLE ATRSVIKQSC AFNETIDMSS QRHKEEVLQI YAAIQRQSFA QLFQRFGEGP YEVEFLLSTE STTETVAHET FRVELLRSKD MPHTVLTFLS LVELRLYDGT TIAGTDGTVI SGGIPKQAQT RAQSYLMRMY VEHGFGFSPL VIEETSPTMP CMAHTFGFTE RGPGFIIPLE SMSKNESPSC PGRISSGRDV LERLARDRES QLTIIEAKLV SRDIGSHDTE L
| |
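As a spot check that the gene and protein sequences agree, the sketch below translates the start of the gene sequence. It assumes the standard genetic code (the Translation table field is empty in this record) and a coding start at 0-based offset 55, derived from 1231 - 3 * (391 + 1). Only the first 90 bases are included to keep the example short, and all names are illustrative.

```python
from itertools import product

# Standard genetic code (an assumption), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First 90 bases of the gene sequence from this record, spaces removed.
GENE_PREFIX = (
    "AACCAGATATGCTCGCCAACGTAACTTGCCAAATAAAACTAATCGAAAGC"
    "ACACAATGTCAGCGGCGATCAGGAATGAAGCTAACAGTAA"
)
# First 11 residues of the protein sequence from this record.
PROTEIN_PREFIX = "MSAAIRNEANS"

CDS_START = 55  # 0-based offset; assumes the coding region runs to the end of the gene


def translate(dna: str) -> str:
    """Translate complete codons only; a trailing partial codon is ignored."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


translated = translate(GENE_PREFIX[CDS_START:])
print(translated)                    # prints: MSAAIRNEANS
print(translated == PROTEIN_PREFIX)  # prints: True
```

Taking the first ATG in the sequence would not work here: an ATG also occurs at bases 9 to 11 in a different reading frame, which is why the offset is derived from the listed lengths instead.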