Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47067 |
Symbol | |
ID | 7202153 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 315757 |
End bp | 319539 |
Gene Length | 3783 bp |
Protein Length | 1237 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181182 |
Protein GI | 219121664 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.375267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACGGT CCTGCGCCGA TTTCGTGCGT ACAGCGAGCT CGTGGAAAGA TCGAATGTAC GTATTCAACT TCCGGGTCAC GTTTTACGGT AGTAGGGGAT GGCCACACTG TCCGTACTTC AACAGACAAA ACCGGAACCT TATCTGGAAA TTCTTCACCG GCCAAGATCC ATCAATTTTT TCCCTTGGGC GTCGCCGTTT TGGAGTATCT CCCTCCTTCT GCTATGCTCC TTGCGCTGCT GTGAGCTTCA CGGAAGCCGA CACCAGTCAT CCTCCACCTA CTTTTACAAG TGCCAACTGT TTGTTGGAAT TTGTCCGTGT CTGCTTGACC AGAATGTCGA ATGGTATGGC AACGGCGACT GCCCTCGGCT TTGGCGGTCG GGCGCCCGCC CTGGTGCTTG ACTTGCTATC AAGGGCCGTC CATGAGGAAG GGATCGTGAC GGTATCGCCA AGGCGCTCGC AAGCTTTACC GCTGGCCTTG ATGGAAGCGC CGTTTCGGAC GTTTCTCAGC AACGACAATA ACAACAACAA GACGGAGAGT GGAGGCGGAA AATCACCTCT TGACTTGCTC GTGTACTGTC TACCAACGAG TGTGACGGAA TATCTCAACG CTGCTCAAGA AGCCACTCTC GTCCGTTCCA AGCTCCTGTA TGGCAGTTCC GATCGCCAGA TCGTGGTGGT CATATACGAT GTCCACAAGA ATGCTTTTGT AGGCGGTGAG ACTGCGCAAA TCCTCGACGC GGTTGGTGGC AACAATTCGT ATCGACAAAA ACTCGCCATG GCGAGTTTGG CGTCGCTTGC CGAACTCGAT CAAAAGGCGG CCGCTCAGCT CGGCCCCGAT CGTCAACAAG ATTCCGTGAT TGTCGTAGGT GCCGGAGGCC GCGAGCATGC GCTCGCTGTG GCTTTGGCGC AATCGCCCCT TGTCGGAAGA GTCCTGTGTT GCCCTGGTAA CGGGGGTACC GCCGTCGAAG GTGGGAAAAT CGCTAACGTT CCCAACGGTC AACAAGACAA CGAAAGTGTC GTAGCATTGG TCAAGGAAAC TAACGCTGCA ATGGTGGTGG TGGGTCCGGA AGCTCCTTTA GTTGACGGTC TGGTGGATGC CTTGGCGAAG GAATGTCCGG GTACCATGGC GTTTGGTCCG ACACAAGCGG CGGCAGAACT GGAAGCTTCC AAAGCGTTTT CCAAAGACTT TCTGCAGGAG CACGGTATTC CCACCGCAAA GTATCGCAAC TTTACGGACG TATCTGAAGC AATCGCCTAC GTGGAAAGTT TGGATGAGTC AGATCGACAG GTCGTGAAGG CGTCGGGTCT CGCAGCCGGA AAAGGGGTCT TACTGCCAAC GAACAAAGCC GACACCATTG CGGCTGTCAA GGAAATAATG TCCGACAAGG CTTTCGGCAA TGCCGGTGAT ATCTGTGTCA TTGAATCCTT CCTGGTAGGA CCCGAAGCTT CCTGCTTGGC ATTTTGCGAC GGAAAAACTG CTCGTCTCAT GCCAGCGGCT CAGGACCACA AACGAGCCCT CGACGACGAT CAAGGTTTGA ACACCGGAGG GATGGGAGCC TACGCACCAG CACCGTGCGT CACTCCCGTA TTACAGCGTA CCATTGAGGA AATGTGTATC AAGACGGTCC AGAAAATGGC AGAGCGTGGT ACACCGTACG TTGGAGTGTT GTACGCGGGT ATGATGCTGA CGCCGAATGG CCCGTACGTT CTTGAATTTA ACTGCCGGTT TGGGGACCCC GAGACTCAGG TTGTCTTACC GTTACTCGAA ACGGATCTGT ACGAGATCCT GACGGCGTGC TGTTCGGGAA ACCTGGATGC GATAGATGTA CGTTTCAAGG AAGGTCAATC GGCGGCAACA GTTGTTTGTG CTGCCTTGGG ATACCCTGAG GTATATCCAA AGGGTATGGA AATAACCGGT TTGGATGCCG CAAATTCTTC AAATGGGGTC AAAGTTTATC ACGCGGGCAC GGATGTAGAC AACGCCGGCG TCACGCGTTG TTCGGGTGGT CGTGTCTTGG CCATAACGGG TACCGGTAGC AGTCTTAAGA ATGCACTTCA GTCAGCTTAT AACGGTGTCA AAAGTATTCA GTTCATCGAT GTACATGGCA AACATCAGTT GCACCGACGA ACAGATATTG GCAAGAAGGC AACGCAAAAG AACCTCCGAA TTGGGGTGCT GGGTTCAACT CGCGGAACAG CTCTGATTCC CGTCGTTGAA GCGTGCCGGA GTGGAGAGCT CGACGCCGAA ATTGTGGCAT TGATCAGTAA CAAGTCCTCA GCCCCTATTC TTGAAAAAGG GAGAGCTCTT GGCGTAACAG TTCTGTCAAA ATTCATTTCT GCGAAGGACT TGAGCCGCGA GCAGTACGAT TCTGAGTGCA CCGCTGCTTT GGTGGCCGCT GGTGTGGACT TTGTTTTGCT TGTAGGATAC ATGCGAATTT TGTCCAAGTC GTTCACAGAC TTTTGGAAAA ACCGATGCAT CAACGTGCAC CCGTCTCTTC TCCCTAAACA CGCCGGCGGT ATGGACCTTG CAGTGCATCA AGCCGTCATT AACGCAAAAG AAACAGAAAG TGGTTGTACG ATTCATCAGG TAACGGAAGC CGTTGATGGC GGCCCTATTG TCATACAGAA GAGAGTATTG GTAGATAGTG GAGACACTGC AGAATCGTTA AAAGTCAAAG TGCAATTGCA AGAGGGACCA GCGTTTGTTG AGGCAATCAA GCAGTTCTCC CAAGGTGCTA CTATAAGTTA TGCCGATGCG GGAGTTAGCA TTGACGCCGG CAATAAGTTT GTGGACTTGA TAAAACCACT TTGTAAGGCC ACTCGTCGTG CAGGATGTGA CGCTGATCTC GGCGGCTTTG GCGGACTGTT TGATCTAGCA GCGGCTGGCT ATGATTCGGC CAATACAGTC ATTATCGGTG CCACAGATGG TGTCGGTACG AAACTGCGCA TTGCACAAGC AACCGGGAAA CACGAGACAA TTGGCGTTGA TCTCGTTGCG ATGTGCGTCA ACGATTTGAT TGTAGCCGGT GGCGAGCCAT TGTTCTTTCT AGACTACTTT GCCACTGGGC ATCTAGACGT CCACGAGGCA GCTGCGGTTG TAAAAGGGAT TGCCGAAGGC TGCCAACAAG CTCAATGTGG ACTCATAGGC GGAGAAACTG CAGAGATGCC ATCCATGTAC GCCCCTGGTG ACTACGATGT TGCAGGCTTT GCCGTTGGCG CTGTTCCTCG TGATAAAATT CTTCCCTGTA GTATTTCCTC CGGAGATGTG TTGTTGGGCC TTGCAAGCAG TGGCATTCAC AGCAATGGCT TCAGCTTGGT ACGGAAGCTC ATCGAAAAAG AGGGGCTAAA CTATTCAAGT CTATGCCCTT GGGAAGAATC TGGTGTTACG ATTGGAGATT CGCTCTTGAC GCCCACTAAA ATTTACGTTA GGTCATGCCT TCCCATGATC AAAAACGGAC TGCTGAAAGG CCTGGCTCAT ATCACGGGAG GCGGTCTTTT GGAGAACCTT CCTCGAAGCC TTCCGTCTGG TGTTTCCGCC GAAATTACTG CGCATCCAAA ACTACCTCCT GTGTTCAAAT GGATGAAAAA AGCTAGTGGT TTGTCGGATA CGGAGATGCT CCGTACCTTT AATTGCGGAA TTGGAATGGT TCTCATCCTT TCTCAGGAGA ATGTTGGCGA GGCAAGAGAT CTGCTTACCG CAAGCGGAGA AACAGATTTT TTCGAGTTGG GTGTTTTGGT GGAAGGAGTG GGAGAAGTCG TTATGAAAAC TACTCTTACT TAG
|
Protein sequence | MSRSCADFVR TASSWKDRIQ NRNLIWKFFT GQDPSIFSLG RRRFGVSPSF CYAPCAAVSF TEADTSHPPP TFTSANCLLE FVRVCLTRMS NGMATATALG FGGRAPALVL DLLSRAVHEE GIVTVSPRRS QALPLALMEA PFRTFLSNDN NNNKTESGGG KSPLDLLVYC LPTSVTEYLN AAQEATLVRS KLLYGSSDRQ IVVVIYDVHK NAFVGGETAQ ILDAVGGNNS YRQKLAMASL ASLAELDQKA AAQLGPDRQQ DSVIVVGAGG REHALAVALA QSPLVGRVLC CPGNGGTAVE GGKIANVPNG QQDNESVVAL VKETNAAMVV VGPEAPLVDG LVDALAKECP GTMAFGPTQA AAELEASKAF SKDFLQEHGI PTAKYRNFTD VSEAIAYVES LDESDRQVVK ASGLAAGKGV LLPTNKADTI AAVKEIMSDK AFGNAGDICV IESFLVGPEA SCLAFCDGKT ARLMPAAQDH KRALDDDQGL NTGGMGAYAP APCVTPVLQR TIEEMCIKTV QKMAERGTPY VGVLYAGMML TPNGPYVLEF NCRFGDPETQ VVLPLLETDL YEILTACCSG NLDAIDVRFK EGQSAATVVC AALGYPEVYP KGMEITGLDA ANSSNGVKVY HAGTDVDNAG VTRCSGGRVL AITGTGSSLK NALQSAYNGV KSIQFIDVHG KHQLHRRTDI GKKATQKNLR IGVLGSTRGT ALIPVVEACR SGELDAEIVA LISNKSSAPI LEKGRALGVT VLSKFISAKD LSREQYDSEC TAALVAAGVD FVLLVGYMRI LSKSFTDFWK NRCINVHPSL LPKHAGGMDL AVHQAVINAK ETESGCTIHQ VTEAVDGGPI VIQKRVLVDS GDTAESLKVK VQLQEGPAFV EAIKQFSQGA TISYADAGVS IDAGNKFVDL IKPLCKATRR AGCDADLGGF GGLFDLAAAG YDSANTVIIG ATDGVGTKLR IAQATGKHET IGVDLVAMCV NDLIVAGGEP LFFLDYFATG HLDVHEAAAV VKGIAEGCQQ AQCGLIGGET AEMPSMYAPG DYDVAGFAVG AVPRDKILPC SISSGDVLLG LASSGIHSNG FSLVRKLIEK EGLNYSSLCP WEESGVTIGD SLLTPTKIYV RSCLPMIKNG LLKGLAHITG GGLLENLPRS LPSGVSAEIT AHPKLPPVFK WMKKASGLSD TEMLRTFNCG IGMVLILSQE NVGEARDLLT ASGETDFFEL GVLVEGVGEV VMKTTLT
|
| |