Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35048 |
Symbol | |
ID | 7200044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 952486 |
End bp | 953574 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179545 |
Protein GI | 219117501 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.541966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAGG CTAGCAAGAA CAAGACGCGA ATGCGAGAGG AAAGCAAGGG GGCCTTCCGA GTAGCCAAAG AGGAAGAAGA AGCATCTATT GGTGGCTGTC CTGAGAACAA TAATCCTGTG CACAAGACTG TCGACGAGGA CAAAACGCCA CAGGAAAAAG GGGGATCGAA CAAGTCTTCG ACAGTAACGG GGAAGAGCAT CATTAACAGT GGAGACGCCG AGTCATCGGA TGAAGACGAC TTGCTCGAAG CAGCAGCCGC CTGGGCGGAG GGAGACGACG ATGACAAGCA AATGAAAAAC CTTAAGGTGC ACCAATCGAA CACCAAAAAG CCGCCCCAGA ACAAGAGCAA ACAATCAAAG GACAAGTTGT CAACTTTGAA TGATAATACG GCGACTGCCG CAAGCTCGCT TTTATCGGAG ACTTCGCCAG ACAGAATGTG GTCGCTACAC ATTACGCAGT TGGACTTTGA CACCACCGAG TTCGACCTGC GTCAACATTT TGTCACGCGA GGATGCGCGC TTTCGTCCAT TCGACTCGTT TGCGATCGCG GATTGAACGG CAAGAAACTG TTTCGGGGAG TTGCCTTTGT CGATGTCCTG GACGAGGAGT CGTACAAGAC GGCATTGGCC TTAGATAAAA GTGATATGTT GGGACGCAGA ATCAATGTGC GACCAACCAA GACCAAGTCC GAGTTGGCTG ATATTGTGCA ACGTACCAAA GAGATTGTAA AGGAAAAGAT AAAATTGAAT TTAGAAGAAA TGGATGAGCG AGAGGCTAGC GAAAAGTCGC ATACGTCGCC AAATACGGAT AAGAAGAGAT CGCGAAAGGA CAAGCAAAGA GATGGAAAGG AGCGCAAACC GAAACGTCGC AAAACCGAAA TGCTAAAGTC CACAGACGAC AATACGAAAG GCAATGCAAA GACAGAAGCG GTAGTTCCCA AGGATGCAAA GCAGGCAACA AGAGGGGTCA AGAATCAATC TCCCACGGGT TCTAAAACGA TTGGCAACAT AGATCCGAAC CGAAAATTGT CGAAAAAGGA ACGCAATCGC AAAGCAGCCA TCTTGATACA AATGCGAAGA AGAAGATAG
|
Protein sequence | MAKASKNKTR MREESKGAFR VAKEEEEASI GGCPENNNPV HKTVDEDKTP QEKGGSNKSS TVTGKSIINS GDAESSDEDD LLEAAAAWAE GDDDDKQMKN LKVHQSNTKK PPQNKSKQSK DKLSTLNDNT ATAASSLLSE TSPDRMWSLH ITQLDFDTTE FDLRQHFVTR GCALSSIRLV CDRGLNGKKL FRGVAFVDVL DEESYKTALA LDKSDMLGRR INVRPTKTKS ELADIVQRTK EIVKEKIKLN LEEMDEREAS EKSHTSPNTD KKRSRKDKQR DGKERKPKRR KTEMLKSTDD NTKGNAKTEA VVPKDAKQAT RGVKNQSPTG SKTIGNIDPN RKLSKKERNR KAAILIQMRR RR
|
| |