Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48167 |
Symbol | |
ID | 7203503 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 398783 |
End bp | 400462 |
Gene Length | 1680 bp |
Protein Length | 413 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182676 |
Protein GI | 219124785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATTATGAAA TCCTGTTTCC TTCGAGAAGT GACCGGTGAG GTTACTGCAA GGTTCGGTGT CGAATAAAAT TGCTACTGCC TCGGTACCTG TGCCATCGGT AAACTTCGGC AACCTGTGTT GCAGACTTTC ATTTTGTTGG CCACTGTCGC TGCAGTACTT GCGTATCCGT GAGAGTGGAG GACTATCTGC TTTTCAAAAG GCGAGCTGTA GTATCCAAAT CAACATACAA TTCACCATGG CCGCTTCCAT CTTTGATGGA TGCTCCGACA CATGCGGCTG GATAGCCGCT GTTCTAGCGG TGTTAAGCTT TGGAAGCTTT GGTGTCCCGA TCAAGCTCGC CGGCAAGGTT GAAGTGCACC CGCTGGTTAT GCAAAGCTAT AAGACGTCCG TCTGCTTTTT GACATGCTGG TTGGTAATTC TTTTGGGCGA GGAACCACGA TGGACTCCGT ACGGTATTGT TTCGGGGCTC TTTTGGGTAC CCGGCGCCGC TATGGGGATC TTTGGTATTC GTAATGCCGG TTTGGCAGTG GCTGTGGGAA CTTGGAGCTC CATTACGGTC CTGACCAGCT TCTTTTTCGG AATCATTGTA TTTCAAGAGC GTGTCAAGAG CTTTTACCAG ACATGCTTGG CATTTGGTTG CCTCATCATT GGTTTGATTG GTATGAGCCG GTTTTCGGCA CACCAGCAAC AAGTGGATAC GTTGGCTGTT TCTTACCGAT CCGTCAAGAC TGCAGCTTCG CATCCTCTTG GCCTTGGTCA AAAGTTGAAA CGTGCTGGTT CCACAATAGC GGAAAACTCG ATCACAGTTC CGCTCGTTGG AGCCTCGGGC GTTATTCCGA TGGAAATCGA GCCATTTGCA ACGGATGGGG AAGACATTGT TATGGGCACG TACGACGATG CCAAATCGGT GTTAAGCAAG GATCGTCTGG TTCTTTTTGG TGGACGAGTC TCTCTGACGA GAAGGCAAAT GGGTATTCTT GGAGCCGTAA TCAACGGCGC ATGGGGTGGC ATGAACTTGT ACGTTCTGGA GCAAGAATGC GTTCCGTAGT TGTTAAGTAA GATGACCGAG CCCGGCTTAC CAATTTCCGT TCTTCGCTTG TCGCAGGATT CCCTTACACT TCGCTTTGCA GGAAGAGGAC ATGACTGGTG CTGGCTATCT AATTAGCTAC GCAACTGGAT CTCTCATTGT CAATACATGT ATATGGCTGG CTTTCCTTGG CTACTACCTC CACCAAACGA ACGGACACTG GAATGAAGCA GTTGACTGTC TACCAAAATG GCATTTTGAG CATTTACTAA TACCAGGTCT GATGGCCGGT CTACTATACA GCTTTGGTAA TTTTTGTTCT ATCCTGGCCG TCACCTATCT CGGTCAAGGT ACCGGCTTTT CTTTTTGCCA AATGCAACTG TTCGTGAGTG GTCTATGGGG TGTCTTTTTC TTCAAGGAGG TACAGGGAAC GGACACAATA ACCAAGTGGT TCATTTCCGC CTCCGTTGCC GTCTTGGGAA TTGTGTGGCT AGCCCATGAA CACGAAGGAG GATCCGGAAT GCATCGATTT AGATAGTCCT GACATCGCAA CAGCGGGTGT TTGTTCATAA AGGAGACGAT TGGTTCTCGA GCTGACGACA GCTTTGGTAG ATTAAATTTA ACGCAAACTA GGCATAGCGA CTGCGTCTCA
|
Protein sequence | MAASIFDGCS DTCGWIAAVL AVLSFGSFGV PIKLAGKVEV HPLVMQSYKT SVCFLTCWLV ILLGEEPRWT PYGIVSGLFW VPGAAMGIFG IRNAGLAVAV GTWSSITVLT SFFFGIIVFQ ERVKSFYQTC LAFGCLIIGL IGMSRFSAHQ QQVDTLAVSY RSVKTAASHP LGLGQKLKRA GSTIAENSIT VPLVGASGVI PMEIEPFATD GEDIVMGTYD DAKSVLSKDR LVLFGGRVSL TRRQMGILGA VINGAWGGMN LIPLHFALQE EDMTGAGYLI SYATGSLIVN TCIWLAFLGY YLHQTNGHWN EAVDCLPKWH FEHLLIPGLM AGLLYSFGNF CSILAVTYLG QGTGFSFCQM QLFVSGLWGV FFFKEVQGTD TITKWFISAS VAVLGIVWLA HEHEGGSGMH RFR
|
| |