Gene Information |
Locus tag | PHATRDRAFT_50037 |
Symbol | |
ID | 7198733 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 219998 |
End bp | 221295 |
Gene Length | 1298 bp |
Protein Length | 368 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184842 |
Protein GI | 219129326 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
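The Start bp, End bp, and Gene Length fields above agree if the coordinates are read as 1-based and inclusive (221295 - 219998 + 1 = 1298). A minimal sketch of that check follows; the helper name `gene_length` is hypothetical and is not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates.

    Assumption: both coordinates are on the same replicon and end_bp >= start_bp.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
print(gene_length(219998, 221295))  # 1298
```

A 368 aa protein needs (368 + 1) * 3 = 1107 coding bases including the stop codon, fewer than the 1298 bp gene length, which would be consistent with the record marking the gene as spliced.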
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAACATTTT TCGACGGAGT CTATTGGAAC GACTAGGGCT ATCTGCGAGA AAAGGTCGAA GTTTTTTTTG CAACGTGCTG ATCCACAAGC AACCAGCTTG TATATGCTAT TTGGCGACAG CGCACTACAC GCACGTTCCA TATAGTTCCA GCTTACACTG GAAGCTCTCC ATGATGGCGG ACGACTGCCC CATTTGCTGC GAGCCTTTTT CGCAGGCCGA TCACGCTTAT CCATTGCATT GCCCAACCCC CACTTGTGCC TTCAATTTCT GCTGCAATTG TGTCACTTCC ATCCAAAAGT CTGCTGCAGA TGGCTATCAA GAAGCCTCCG ACGGTTCTCG ACAACTTAAA GTACAAGTCC AGTGCCCCCA GTGTCGGGGC AGGTACGTTT GTGCAACCTA CAGCAGCACG TCCAATAATG CAATTGTTCC TGCCGTTCTC TTAATGCGGC AAGCTTCCGA GTTGGAGGCC GTAGTTTCGA CGAAAGACTC AGACCTATCG GCTACCGAGC TCGCCACCAA ACACCAATTT TGTCAATCGT GGAGCCTGCG CGATTTGAAA GATGCCTTGG AAACGTTAGA AACGTACCAC TACGAAATCG GGAAAAATAT CGGTCGGTCG AGTTTAGCGA CGCTAGACTG GGAGTCCTGG GCCCACGCTT TACCGGAACA GGCCTCCGGC AACAATATGA GTTGTTTACC ATCATGCATG ACCGGAGATG GTGCCAAACA CCCTAGCTCG GTAGAGATAG ACCCTTCATT GTTTTTAGGA CTGGACGAGT TCGTGACGCG AGATGAGCAA GTCTTTGTTC ACAATCTCCT AACATCCGGT GATGTACAAG GTCTGGTTCA GGCAGCACAA ATATTGCAAT CCATCTTGCA ACTTGCTCAA TCCGGTACCG CTACGATACA ATCGGCATCA ACCAAGACAC CAGTGCAGTT ACAGAGCTTG CGCGAACGCT TTCCTCTTCC AGCCCGAATG CCTCGTTCCG TCAATTTGCC CGTCTATGAT CCTATGGCAA AATACAAGTT GCTCAAGTTT GACAACAAGA ATACGCTGGA GATTGCCTCA CTCCACCACG GGGCCGGTAA ACTGGGCTTG CGCAAGCGAG ACGTGGTAAC GCATCTGGAA GGCGAAGCAA TCTTGGATTA CGATGCCTTT GTCAGTATGC TACAAGCCTA CTACGAACAA GATCCGGAAA CCTCTCTAGC CTTGGTTGTC AATGCAGACA AAGAAACGGC ACAAGCTCTG CAAAGACGTT CGCAAACCAT TATTTGCGCA TCTACGCGTA GGCTTTGA
|
Protein sequence | MMADDCPICC EPFSQADHAY PLHCPTPTCA FNFCCNCVTS IQKSAADGYQ EASDGSRQLK VQVQCPQCRG SSTSNNAIVP AVLLMRQASE LEAVVSTKDS DLSATELATK HQFCQSWSLR DLKDALETLE TYHYEIGKNI GRSSLATLDW ESWAHALPEQ ASGNNMSCLP SCMTGDGAKH PSSVEIDPSL FLGLDEFVTR DEQVFVHNLL TSGDVQGLVQ AAQILQSILQ LAQSGTATIQ SASTKTPVQL QSLRERFPLP ARMPRSVNLP VYDPMAKYKL LKFDNKNTLE IASLHHGAGK LGLRKRDVVT HLEGEAILDY DAFVSMLQAY YEQDPETSLA LVVNADKETA QALQRRSQTI ICASTRRL
|
| |
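The GC content field can in principle be recomputed from the gene sequence above, which is printed in space-separated blocks of ten bases. The sketch below is one way to do that; the function name `gc_percent` is hypothetical, and how the source database rounds its value is an assumption not confirmed by this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a sequence.

    Whitespace (such as the spaces between 10-base blocks) is ignored,
    and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First three 10-base blocks of the gene sequence in this record.
example = "CGAACATTTT TCGACGGAGT CTATTGGAAC"
print(f"{gc_percent(example):.1f}% GC over {len(''.join(example.split()))} bases")
```

Passing the full Gene sequence field instead of `example` gives a value that can be compared against the reported GC content.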