Gene Information |
Locus tag | PHATR_33478 |
Symbol | |
ID | 7203875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1031761 |
End bp | 1033047 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186172 |
Protein GI | 219113177 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
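The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS includes its stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table.
length_bp = gene_length(1031761, 1033047)
print(length_bp)                  # 1287
print(protein_length(length_bp))  # 428
```

Under these assumptions both results agree with the Gene Length (1287 bp) and Protein Length (428 aa) fields.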
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00955149 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCTGA CAATAACAGA TGGAGATCCA AAAGAGTACA GGACCTTTGT AGATGCAATA CCAACTTTCT ACCCTCTTTG CGAACACAAG CTTTGTCACT GGCATCTACT GTATCGCAGT AATCTCATGA AGGTGCAGAC TGGAAAATGT GGAGTTAAAG CTACTATTCT ATTCCGCGTA GTTGTTCTTT GGATTGAGAG CTGGATGACC AAAATTGAGA CACAAGAGGA ATACGAACTT TCTAAAAGGC TCTTGGCTGA TTGGCTTGCA ACCCCCGAAG CTATTGATGT CAAATTGGGT GGTATGGGGC AAACTATTGT ATCGCAAATT AATGCGTACA TGACACTGTC ACTTTTTCCT CATGAACAGC GCTGGGCTAG ATATCGCTAT TTATACACAC AAGCATTCAA CACATCTGCA AGCTTGTATG CCGAGGCAGA AAATAGTGCT TTAAAACGAT GGGGCGACGG GGTCAGGCCA AGCTTTTCCG TACCAAAAGC AACTCAGGTT ATAAACGAAG GGACACAACC TAGGTCAAAG AAGAGGCATC AAAAAGCTGT TTACAATTTA AATGCTGCCA AGACGAGAAA GCCTGCCTAC TATGCAAACA TTGGGGATTT AGTGGATTAC ATTCAAGATT CTCTTTCCAA AGATTTTGAA GCAGCTGCTT CATTTGTGCT CTTCCGTCCA AATGCAGACC AGTTTTGGGT CAAGCAAGCC ACTTGCAAAA GCAAAAACAC GGACATTCGG AAAATCAACA ACAGTAGCTA TTACAAGTAC ATGATTCCGC AGTTTGAACG CACACGAATT GTGGAGCTTG TTAATATTGA TGGTACATTC TATTTGGTGT GTAGCTGCGG AAAATTTCAG CGACAAGCTT CCCCATGTGC CCATCTTTAC AAGGTTCTTG GTCAATCACC CACGTCAACC GATGTCTCTG TACGCTGGAC AAAGCACTGG GATGTGTATT TGCACCGAAG TGGCCACAGT GACCTGTCAA AGCATTTGGA AGACCTGTAC AAACAGGAGC GACCAGGTCC AGTATTTGTT GATAGTGGTC AGTGGGTGAT CGGAAAAGGT GAAAAAGGGT CAAATTTTTT CGAAACTTCG CTTCCGTACA AGCCCCCTGT CATACGAGAT TTTAATCGAT GGGCAGTGTC TTCGCAAACA ACTGGAGCTG ATTTGAGTGG GACCAAAAAT ACCACAAATA TGTATTTTTC GAGTGGAATG GTGCAAGAAT CAACAAGCCT GTCCAGAGAG CATGCATTCC AGGATTCATT GCATTAA
|
Protein sequence | MHLTITDGDP KEYRTFVDAI PTFYPLCEHK LCHWHLLYRS NLMKVQTGKC GVKATILFRV VVLWIESWMT KIETQEEYEL SKRLLADWLA TPEAIDVKLG GMGQTIVSQI NAYMTLSLFP HEQRWARYRY LYTQAFNTSA SLYAEAENSA LKRWGDGVRP SFSVPKATQV INEGTQPRSK KRHQKAVYNL NAAKTRKPAY YANIGDLVDY IQDSLSKDFE AAASFVLFRP NADQFWVKQA TCKSKNTDIR KINNSSYYKY MIPQFERTRI VELVNIDGTF YLVCSCGKFQ RQASPCAHLY KVLGQSPTST DVSVRWTKHW DVYLHRSGHS DLSKHLEDLY KQERPGPVFV DSGQWVIGKG EKGSNFFETS LPYKPPVIRD FNRWAVSSQT TGADLSGTKN TTNMYFSSGM VQESTSLSRE HAFQDSLH
|
| |