Gene Information |
Locus tag | PHATRDRAFT_39557 |
Symbol | |
ID | 7195234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 121910 |
End bp | 123541 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183554 |
Protein GI | 219126627 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
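The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a consistency check. It assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 121910
end_bp = 123541
protein_length_aa = 543

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons, the last one being a stop codon.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(gene_length_bp)       # 1632
print(codon_count)          # 544
print(expected_protein_aa)  # 543
```

Under these assumptions the reported gene length (1632 bp) and protein length (543 aa) agree with the coordinates.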
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCGT CGACCCCGTT CGCGTCCGTC GCACGGTCGC ACCACGCCCG ACGTCGGAGT GATCCGTCTC GTCGTTCCGG ACTGCACTGC TTAAGGCTGG TACTTCCGCT GGCTATCCTC ACACTCGGCA GCTGCTCCTT GTGGATTCTG CTCGCGGACT CCAAGTCAAT CAACAGCAAT CATCCCGATC CTTCCGAAAT CATTGCTCTC GCAGCCACGT TGGAAACCAC GCAACAACGC CACACGCGTG TAGTCACCTC CGGAGTAACG GCTACTCCCA ATCTTCTACC CCCTTCCCTA CCCGGGACCT TGGCCGACTT TCGTCACCAA CCCCGGGTGC CCTTGGAACC TCCCGTTCGA CTCTGGAACC AATCCAAACA TGATTACGAT CTGGTACACG TCATTTTGAC ACGCTTCCAA CAACACCAGG CCGATTTGGA GCATTTGGGA CGCGCTCGTT GGGAGCTCTT TCGGACCTTT TGTGCACCGT CCCTGCGCGC ACAGACCAAC CAACAATTCC TTTGGATTCT ACGGGTCGAT CCCGATCTCT CACCAGCATT GAAACGAGAT TTGTTACGCA CGGTGGATGG CATGGACAAC GTACTCGTTG TGGCATCGAA CGGATCACAG GAAGATGGTC TCCGCAATCC ACACGGCAAT CGCGATATTA CCCAACCCAA CGTGAGCAAC AACAGTAGTA ACAACACTAC TGGTAGTACT TCCGTTTGGT ACGGCAGTGT GGAAACCTTT CGATCCTACC AGGAAGCGAG TCAAACACGC ATGGTATTGG AAACCAATCT GGATGCCGAT GACGGACTGG CAGTGTCCTT TGTGGAAACG CTCCAACGTC AAGCGGACGC CACTTTTCAC ACGACGAGTC ACACCGGGAC CACGGTGGAG AGCGACACCG GTAGCTCGTT CGACCCAAAC ATTGCGTGGC GCATATATTG CGTGAATCAT CACGTCACCT GGCAATTCTG GGCACCGTGG AGGAAGACCA ACGACGATAC CAACAACGAT AGGAGTACCG TGACGGAACG AGGCAGTCTA GAAGCCCAAC ACGACTTGGA CATTTGTGTG ACACCAGGTC TAACTTGGGC TTCTCGACCA CGCACCCCTC AAACCTTTAA GTACATGCGT TGGCATTGGC GTATTCGTGG GACTCTACCC CGGTGTTCCG ACGACCATGA AAACAACGAA AACAACAAAA ACAACCGTAC CACGGCACTT TCGGGATGCT GGTCCTACGT GCGTCCGGTC CAAACGGTAG AAGGGACGTC CAGCGTCTCG CAATCCGTTC TCTCCTCAGT CGACTTTTCA CCCTTGGCCA TCCGGGCACG GACTCCTACC AGCGCCGGTA TGAATGACGT TGCAACGGTT GGCGGGACCG TCTCCACATC AACCGCTCGA GCCAAATTGC AACGCCAACA ACAACAAGAT GACTTGCTTT GGCGAGACAA CGTGGCCGAA ACGATTTTTG GTGGAAACAC TACGACTATC GTTGAGGCTC GAGAAAACAT GCAGACCCAC TTGGTAGACA TTGTGCAAGA CGCCCTGCAA GGTCAATGCA CCAAGGGACA TTCGTGTCAC AACAAGAGCA AACTTGCTCT GACCCGATTG TTGGAACAGT GA
|
Protein sequence | MAASTPFASV ARSHHARRRS DPSRRSGLHC LRLVLPLAIL TLGSCSLWIL LADSKSINSN HPDPSEIIAL AATLETTQQR HTRVVTSGVT ATPNLLPPSL PGTLADFRHQ PRVPLEPPVR LWNQSKHDYD LVHVILTRFQ QHQADLEHLG RARWELFRTF CAPSLRAQTN QQFLWILRVD PDLSPALKRD LLRTVDGMDN VLVVASNGSQ EDGLRNPHGN RDITQPNVSN NSSNNTTGST SVWYGSVETF RSYQEASQTR MVLETNLDAD DGLAVSFVET LQRQADATFH TTSHTGTTVE SDTGSSFDPN IAWRIYCVNH HVTWQFWAPW RKTNDDTNND RSTVTERGSL EAQHDLDICV TPGLTWASRP RTPQTFKYMR WHWRIRGTLP RCSDDHENNE NNKNNRTTAL SGCWSYVRPV QTVEGTSSVS QSVLSSVDFS PLAIRARTPT SAGMNDVATV GGTVSTSTAR AKLQRQQQQD DLLWRDNVAE TIFGGNTTTI VEARENMQTH LVDIVQDALQ GQCTKGHSCH NKSKLALTRL LEQ
|
| |
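The gene and protein sequences above can be checked against each other by translating codons. The following is a minimal sketch, not the database's own method. Because the Translation table field is blank in this record, it assumes the standard genetic code. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only the first 30 bases and the last 12 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: the record does not
# state a translation table, so the standard one is used here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the blank-separated 10-base grouping used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    s = clean(seq)
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s) - 2, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


# First three 10-base groups and last two groups of the gene sequence.
first_30 = "ATGGCAGCGT CGACCCCGTT CGCGTCCGTC"
last_12 = "TTGGAACAGT GA"

print(translate(first_30))  # MAASTPFASV
print(translate(last_12))   # LEQ*
```

The two fragments translate to the first ten residues (MAASTPFASV) and the last three residues (LEQ) of the protein sequence, followed by a stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.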