Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47105 |
Symbol | |
ID | 7202018 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 423920 |
End bp | 427165 |
Gene Length | 3246 bp |
Protein Length | 1038 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181206 |
Protein GI | 219121714 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00493639 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA CCAGAAGCAA TACTTTTAGT GATGCTACTG TGGAGTATGT TGTTGGAACT GTCCTGGATG CTAACTCTGA GTCTCCTTAC AGGCTTGTCC TGAAGGAAGC TGGTATTGAA TCTATGATGG ACATTCTTGA GTTGACTTTG GATGACCTAC TGTTTCTGCA GTGGACTTCA GGAGAGGAGA CCCCCAAGAA GTTGACCCTT GTCCAATCCA AGCGTATCAT GCACCTCATT GCCTGGCATA GAACTCAGGA TGACCCAAGC AATGTGGATT GGTTTTCATT GACTCCTACA GTACTCAGAC AGTTTAGAGA AGGTGAAATC TATTCCAAGC CCCAATGTGC AGATGAGATG TCTGAAAATT CCTACACTGT TCCCCATGCC CAGCCAGCAT TGCCTAATGC TGCTTATGAC TTTGACAAAG GTACTGAGAG AAGTATTGCT GACTATCCTG TTCTGAAAGA AGCCAAACAA TGGTCCTCTT GGAATCGCCA GACCAGAGCT CTTGCCCTTT CCCATGGCCT ACTGAATGTA TTTGACCCTG CCTATGTTCC CCTCAACCCT GATGAAGCCT CTCTGTTTGC TACTCAGCAA AGATTTGTCT TCAGTGTGTT TACCATGGCC ATCAAGGAAA CCAAGGGAAT GATTATCATA CGCCAGCACT CTGATGAAAA GAATGCTATA TCTCTGTTTG GCAATGCTCA GCTTGTCTAC ACTGCCTTGA TGGCTGCCTA TGAGGGTGGT GTTGTGGCTA CACTTTCTGC TCAGTATCAT GAAACCCTTC TCTTGAATTA CAACTTGAAC AATTCCTGGA CCAAACCTCT TGTGACCTGG TTTTCTTCTT TTGAGCACAA ACTGCTGGAT TTGGACAATG TGCTTCCTAC TCCCAAGGAT GATGCTTGGA AGCGCAACAG ACTTGAAATG GCTGTGCTTC ATCATACTCA GCTTCAGACC TTTCATTCTA CACTTTCTAC CCAGGCACTT GTGATGGGCA AGCAACATGA CTTCCTCTTT GAACTTGAAG CCTGCAAGAC TCAAGCTGCC AAATTGGATG CGCAGGCTGG CATGAAAGTT AAGACACAGA GGCAGACCAA CAACCATGAG CGTGGTGGGA CCAAAGGTAC TGGCCATGGC AATGCTTCCA ACAGGAAGGG CAATGCTGGC CGTGGTGCAT CCAAGGGTAA AGCCAAGGGA GGCAAGTATT CTAACTATAT TGACCCTGCA AAATGGAATG CTATGACCTC AGAAGAAAAG CAAGCTGTCT ACGATGCATG CTCTACACCC AGTGCAAGCA ACACTCCCAA TGCAAACCCA GTCCCTTCCT CTGTGCTCAT CAACCAAGGT ACTGTGCAAC CAAGTTCTAC TTCTGATGCT GTTCCAACTG GACAACACAT TGTCCCCAGT AGTGGTGCCT CTGCTCTCTC TGGTACCACC AACCAGTCTT TCATCAGGCA ACTGCTTTCT AATGCTACTG CCAGAACTCC CACTGTACCT ACTTCTTCCC ATGATGGGGA GATTGTCATT GATGGGCTCA GGTTCAGACA TGTTAATATG CACAAGGTCA GCTACTGTGT TAGCAACTAT GACTTGGCCC TTCAGACCCA CCAGGGTTCA CTCATTGATG GTGGGGCTAA TGGTGGCATG TCTGGTGCTG ATGTCAGAGT CTTGGAAAAG GGATTTGCCA CTGCTGATGT TACTGGTATT GGCAATCATG CTGTTTCCAA CTTACCTATT TGTCAGGTTG CAGGTGCTAT CATGACTACA AATGGATTGA TCATTGGCAT CTTCAGTCAG TATGCACATT TTGGAAAAGG AAAAACTATT CATTCTAAGC CTCAGATGGA ACAGTTTGGA CTTACCATTG ATGACAGATC CAGACTTAGT GGAGGGCAAC AAAGAATGGT AACCCCTTGT GGACACATCA TCCCTTTGCA CATTTGCAAT GGTCTTTGCT ACATGGATAT GCACCCTCCC AGTGATACTG AAATGGATGC CCACCCCCAT GTCTTCTTTA CTGCTGACAT GCCTTGGGAT CCCTCCATTT TGGACAATGA ATACACAGAG CATGAGTTCT CTGACTGTCT TGCACCTGAA GACTTCACTC CTTTGGATCA TCGTGTGAAC CAATTTGGAA CTACCACTGA TTCTGATTTC TACCTTGACA CTTGTATTCA TGCTGTCCAT AACATGCAAC TTGTGCATTC CCAACATGTC TCAACTCAAG TGCCTGACTT GCAAGCACTA CGCCCTAATT TTGGATGGAT TCCTGTTGAG AGATTGAAGA ACACCTTGGC TAATACAACT CAGTATTACA GGGCTTCCAT CTCTTACCCA TTCAGAAAGC ATTACAAGTC AGTTTCCCTG CTGCTAATGT TCACAGATTG AATGAATGGT TTGCTACTGA CACCTTCTTT AGCAATGTAC CTGCTCATGA TGATGGTTAC ATGCACCATG GTGGTGCTAC TATGTTGCAA GTCTATGCTG GCAAGGACTC TGGATACTTA GCTGGGTATC CCATGAAGAT GGAAGGTCAA ATGCCCCAGA CTCTAGAAGA CTTTATCCGT GATAAAGGTG CTCCTTTGGG CTTGTTCAGT GACAATGCTA AGGCCCAGAC CTCCAAGGCT GTTGAGACCA TTCAGCGCCT CTACCATATT GCAGATGCTC AATCTGAGCC TCACTATCAA CATCAAAACT TTGCTGAGCG CTGTATCCAA AACATCAAAT GTATGATCAA TACTATTATG GATCGCACTG GTACCCCTGC CAAGTACTGG CTCCTTTGCA CTCTGTTTGT CATTGACCTA TCCAATCACC TTGTGAGTGA TACACTTCAA GCAACTCCTT TGACCCGATG CTTTGGTATT CCCACTGATG TTTCTGCTTA CCTCACTTAC CATTGGTGGC AATTGGTTTA TTTTGAGAAC CATGATGGCT CTTTTCCCTC TACTCCTAAG GAAGGCCTTG CTCATTGGGT TGGTCCTACT AATATGAAGG GGGATGTATT GACTTATCAG TTGCTGACTG TGGATACTCA GCAGCTGCTC TTTCGCTCCA ACATTTGTCC TGCTACCACT GACCCCATGG TCCCTAATGC CAGAGTTGAT GCCTCTGCTG CTCCACATCT TCACCTGGAG GCAGGGGAGG AAAAGGACCA GTCAGACAAC ATCAAGTCTA TCTCTGCTTT CCAAAAGATT GATCCTTCTT ATGTAAAACT GCCTCTCTTC TCTCCAGATG AGTTAG
|
Protein sequence | MTTTRSNTFS DATVEYVVGT VLDANSESPY RLVLKEAGIE SMMDILELTL DDLLFLQWTS GEETPKKLTL VQSKRIMHLI AWHRTQDDPS NVDWFSLTPT VLRQFREGEI YSKPQCADEM SENSYTVPHA QPALPNAAYD FDKGTERSIA DYPVLKEAKQ WSSWNRQTRA LALSHGLLNV FDPAYVPLNP DEASLFATQQ RFVFSVFTMA IKETKGMIII RQHSDEKNAI SLFGNAQLVY TALMAAYEGG VVATLSAQYH ETLLLNYNLN NSWTKPLVTW FSSFEHKLLD LDNVLPTPKD DAWKRNRLEM AVLHHTQLQT FHSTLSTQAL VMGKQHDFLF ELEACKTQAA KLDAQAGMKV KTQRQTNNHE RGGTKGTGHG NASNRKGNAG RGASKGKAKG GKYSNYIDPA KWNAMTSEEK QAVYDACSTP SASNTPNANP VPSSVLINQG TVQPSSTSDA VPTGQHIVPS SGASALSGTT NQSFIRQLLS NATARTPTVP TSSHDGEIVI DGLRFRHVNM HKTHQGSLID GGANGGMSGA DVRVLEKGFA TADVTGIGNH AVSNLPICQV AGAIMTTNGL IIGIFSQYAH FGKGKTIHSK PQMEQFGLTI DDRSRLSGGQ QRMVTPCGHI IPLHICNGLC YMDMHPPSDT EMDAHPHVFF TADMPWDPSI LDNEYTEHEF SDCLAPEDFT PLDHRVNQFG TTTDSDFYLD TCIHAVHNMQ LVHSQHVSTQ VPDLQALRPN FGWIPVERLK NTLANTTQYY RASISYPFRK HYNNVPAHDD GYMHHGGATM LQVYAGKDSG YLAGYPMKME GQMPQTLEDF IRDKGAPLGL FSDNAKAQTS KAVETIQRLY HIADAQSEPH YQHQNFAERC IQNIKCMINT IMDRTGTPAK YWLLCTLFVI DLSNHLVSDT LQATPLTRCF GIPTDVSAYL TYHWWQLVYF ENHDGSFPST PKEGLAHWVG PTNMKGDVLT YQLLTVDTQQ LLFRSNICPA TTDPMVPNAR VDASAAPHLH LEAGEEKDQS DNIKSISAFQ KIDPSYMS
|
| |