Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47784 |
Symbol | |
ID | 7203034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 20321 |
End bp | 24363 |
Gene Length | 4043 bp |
Protein Length | 1317 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182145 |
Protein GI | 219123674 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.686852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACACGGTTC AACTCGCGTT GCAACGTCTG GATGTGGATC GTGTTTTGAA TTGCTTCTGT TGTTGTTGTG GTCATTTTGT GAGTGTGGCA TGGCGTCTCG GAGCACGGGC GGTTCCCAAT TTCGGGGTCG TCGACGGGGA GACGCGGTGC CCAACCGTAC GAGTGAGGAC GCTCGCAGCG CCAGGGTTTC GCCTGCGCCT TTATGGATCG CGCCGCCGTC ACCAGTCCGT CGCGTCGACA CGCATCCCCA TCCGAGCAGT AGTCATGGAA TCAACCGGCG CGGCGCCATT TCCGACGGTC ATCCTCACGA TCGTCATCAC CGACGGTCCA ACAACCACAG CGACCTGGAT ATGTCGAACA CTCCTCCGCG CGTGGATCCT CGGTCCAACT TTGCTCCACT CCGCATGCGC GCCGACTCGA CGGGATCCAC CCGGGGGGCC TTTTACTCCG GGCACGTAAC GGCCAGCAAT ACCACCACGG GTGCGACGCC AAACCCTGTC GGGGAAGCGC AACAGCCATA CCTACAAGAA CAGCAGCGTT CTACCGGTAT GCGAACTACT GCGCCCCGGC GTCGTAAAAA GAAACCAGCC GGAGTCGCAT CTTACCAAGA GCCAGCACCA AGAGCTCCAG CATTGTCTGA TCAAATTCGT CCCTCCAGCA TGGCTGATCT GGCCGATCTA GCCTTTGCCT CGCTCGAAGG AGAAGATCTA GAGTTGGGCA ACGGATCAGC GGAGTTGCAC ACCCTGGTCT CTCGCCGCGG ACGGAACGTG GACGGGAACC ATCCATTATC GACAATGACC GACGGCAATA GATACAGTTC CGCAATTCCA GAACACTCTG CCCGACCGAA CATCTTGGCT ACCATCAGGG ATGAATCTAG TTCTAGTCTC CGCGAAGTAC CTACAGCGGC CCGTTCGCAA CGATTTTCCG ACAGTATGCG ATATTCTGAA AATTCCATGG GTTCCTTGCT CGGAACAAAT TGGGTAGAGC GTGGCCCAGG AACTGGTGGC TCTCCCCGTT CGTCTCTCAA TACAGCCTCC AGTAGGCAGC CACAGCGACC CCGAGACGAC GAAATTGCTA TGTTTTCCTC TTTGGTACAT GCCGACTACG GCGCTACCGG TGGCGAACCC GTTAATGTCG CTGCTGCCGT CGCGCTTGAA CACGCTGCCG AAGAGCAAAA ACACCTCCTG AACGAAACCT TTTCGAGTGA CGACGATGAT TCGTCCGGAA CTTACAGCGA CGAAGAAACG GAGCACGACG ACGTCCACCG GGGTTTTGCT GATCAGCTTG TCATTTTGTG GACTGCTTGG TTGACCCAAC AAGCTCATTA CGACGAAGAT ACGGGTCAAC CATATTTCGA AGATGCGACC GGGTGGACTC CGGCGGGCTT TGTTCGGCAC TACCTTTACA ACCCCTTGAC ACCCGAATTT ACCTCGCTCC AGCAATTTTG TTGGGCAGTC ATTCTTGGTG TACTCATGGG CTTTTACACG GCCCTGTGGA AGTACGTCAT TGAAACCGGC CTTGACTTTG TGTGGGAAAC AGTACCCACC TGGTTATTGC AGGTGGGTGT CTTTACAGAC ATCGACGGTG CGTTTCCGCT TTATCACTAT ATGTGGATTT GTCCATCTAT CTTTAGCGGA GTCCTATCGT ACGTTTTCGT CGTGTTGCCA ATAAAAATTC CGGATCAAAA CGAATGGATC AACTGCGTGC ACACACGTGG CGTCCAAGAT TACCGTACAT TTGGCACTCT CTTTGTCCTC TCGACGCTCG GTATGCTTTC CGGTCTTAGC CTTGGACCGG AATTGCCTTT AGTACTGACA GCTGGTATGG TCGGTTCCTG GCTGGGCCTA GTGTGCAAGC AAAGTATGCT GCAGGCGAGA GTTATGAATC TGACAGCCGC TTCTGCTGCG GTGGGAGGAT TCTTCGGTTT TCCTATGGCA GGAGCCTTGT TTGTGCTGGA GCTCCCTCAT CGAATGGGGC TTCAGTACTT TGAGGCTTTA TCCCCGGCGA CTATTTCGTC GATTGTGGCC GTTCTGGCTA ATCGATTGAT TACGGGCAAC GATGTCACAG GTTACTACAG CTACCCGTTC CTAACAGCGA CCTTGCCGAG TGAGATTTTC ACTAGTGCTA TTGTGTATGG CTTGTTCGGT GCAGGTGTGG GTATTATATA CGTCAAGTGG GTAGTGTGGG GCAAAACGTT GGTTCACGAT TGGTTCCAGG CACCACGCGA AAATGACATT AGTCCAATAA CTGCTCCTGC GGACCACTCG GGAAATGGAG TAAGAGAAGA AGTCATATCT TTGGTGTCGC AAAAGGTTCA GAAAAGCATA CCGGAAAATA GAAGCATGTT GTCCCGCACG ATAAAATGGT TCCGCTGCGT CATCAAGGAA GAACCGAAAC GAGCAGCTGT TGCCGGTGCT CTTGCTGGAT TTATAGTGGG CGTGATTGGA ATGTTCGTTC CTCATACAAT GTTTTGGGGC GAAGCACAGC TCCAGAATTT GATTGATAAG GGACGCACTC CTCTTCCTAT ATTTGGCCTC GCTGGTGAAC CAACTTCAGC TTTAGTTGCG CTCGGCTACT GCATGATAGA TCCGAATGAT CCAGAAGCCG TCAAGGCTGG GTTTGATGTA GGCTGTTCTG CTGTGATTTC ATTTGCAAAA ATCGTCGTGG TGGGTCTCAG TCTTGGTACG GGCATTATTG GTGGTCAGTT TTGGGGTCCG CTATTCGTCG GCTGCTCAGC GAGTCATCTT TTTACTGATG CGGTCAATAT GTTTGCCGAC AAGTTTGGCT TTGGACAAAG CCTCGCCGCT TATCCCTGTG TCGTTATCCT ATGTACGATG GGTAGCGCTC ATGTTGTTAC ATTTCGTGCC CATATGGCTA TTATGCTAAT TCTGACTCTG ACAATCAGCG CATTCGACCC AGATGGTGGA AGCAGTATTG GTGCTTTCAA AGTAGCCGGT GATTATTCCG CTGTTTTCCC GCTCCTTGTT GTATCTGTGT TCGTCGCTTT GATGGTCTCC CGTGGGACGG TCTTTTACAA GACGCAGCGA TCACGGGGCG ACATTATGGC CGTACCGGAA GTCTTGTGTG AACCAGGTAT GGAAGGTCGC CCTATGATTA TGGACTTTGA CATTGCGGCA GATGGAGCCT CTTTCATAGA TGCTGTAAGT GACACGGATG AAGATTACGA TGATCGTAAC GATACGAAGC TCTCCCCTAC TACCTCCTAC ACCGGGAGTT ATCAAGTGCG CGCAGCAGAT GATGGTATGA CACAAATAGA CATCGAAAAC GAATTTGCTG GGAGAGCGGT ATGGAACAAA GCAAGTTCTT TGCGACCCTC TGTTTCACGA ACTCAAGCGG ATGTTCGCAT AGGGACGAAG GACGCTCCCC GCCAGTTTGT AGAATATGAT TCAGGCGGAA TTATTCCTCG CTCTCTCTCA AACCCTATGA GTGTTGATGG AGAGCTACCA GGTCTGGACG ACCTCCTTCG CCGGACGATG GTACCAAAGC CAACCTATGC TATGTCACCT CATCGGCATC GGCGCACTCA AAGCGCTCCT ATTGCACCTG AGCCTTCCTT TTCCGGAGGC GGTAGTGCGA GCCCCGACGT AAAGCGTGTG GAACGCTCAC GTGGTCGCGA CTCTTTCAGA TTCGATATCC CGATACGGGA ACGAAGCAAC AGTGGGAGTA GCCGTGGGAG CTTAGTTCGT GTCACCAGTT ATGGTGAGCT GCAGCAACAG CAGCCATCTT TGTTGGACCA AGCCCGTATG CGAGCGGCTT CGTCGGCTGC TGACTCGCGT CATCACCGCG TGCCAAGTCT GCCCTCAGGT CGACACTCTC GCAAAAATTC CGAGTCCAGC ATGAGCTACG TGAATACAAA CAATATTTCG GCAGTTGGTA CAGACGACTC TGGTGCGCTG ACACTCGATG ATATCGAAAA GTCATTCCAA AACGTTATGA ATAAGCAACT AATGGGCAAT TTACCGCAGA GCCGCTCGCC CTGGACCAAC AATGGCAACA GCGGAAGGTC GACTGGCTCT TAA
|
Protein sequence | MASRSTGGSQ FRGRRRGDAV PNRTSEDARS ARVSPAPLWI APPSPVRRVD THPHPSSSHG INRRGAISDG HPHDRHHRRS NNHSDLDMSN TPPRVDPRSN FAPLRMRADS TGSTRGAFYS GHVTASNTTT GATPNPVGEA QQPYLQEQQR STGMRTTAPR RRKKKPAGVA SYQEPAPRAP ALSDQIRPSS MADLADLAFA SLEGEDLELG NGSAELHTLV SRRGRNVDGN HPLSTMTDGN RYSSAIPEHS ARPNILATIR DESSSSLREV PTAARSQRFS DSMRYSENSM GSLLGTNWVE RGPGTGGSPR SSLNTASSRQ PQRPRDDEIA MFSSLVHADY GATGGEPVNV AAAVALEHAA EEQKHLLNET FSSDDDDSSG TYSDEETEHD DVHRGFADQL VILWTAWLTQ QAHYDEDTGQ PYFEDATGWT PAGFVRHYLY NPLTPEFTSL QQFCWAVILG VLMGFYTALW KYVIETGLDF VWETVPTWLL QVGVFTDIDG AFPLYHYMWI CPSIFSGVLS YVFVVLPIKI PDQNEWINCV HTRGVQDYRT FGTLFVLSTL GMLSGLSLGP ELPLVLTAGM VGSWLGLVCK QSMLQARVMN LTAASAAVGG FFGFPMAGAL FVLELPHRMG LQYFEALSPA TISSIVAVLA NRLITGNDVT GYYSYPFLTA TLPSEIFTSA IVYGLFGAGV GIIYVKWVVW GKTLVHDWFQ APRENDISPI TAPADHSGNG VREEVISLVS QKVQKSIPEN RSMLSRTIKW FRCVIKEEPK RAAVAGALAG FIVGVIGMFV PHTMFWGEAQ LQNLIDKGRT PLPIFGLAGE PTSALVALGY CMIDPNDPEA VKAGFDVGCS AVISFAKIVV VGLSLGTGII GGQFWGPLFV GCSASHLFTD AVNMFADKFG FGQSLAAYPC VVILCTMGSA HVVTFRAHMA IMLILTLTIS AFDPDGGSSI GAFKVAGDYS AVFPLLVVSV FVALMVSRGT VFYKTQRSRG DIMAVPEVLC EPGMEGRPMI MDFDIAADGA SFIDAVSDTD EDYDDRNDTK LSPTTSYTGS YQVRAADDGM TQIDIENEFA GRAVWNKASS LRPSVSRTQA DVRIGTKDAP RQFVEYDSGG IIPRSLSNPM SVDGELPGLD DLLRRTMVPK PTYAMSPHRH RRTQSAPIAP EPSFSGGGSA SPDVKRVERS RGRDSFRFDI PIRERSNSGS SRGSLVRVTS YGELQQQQPS LLDQARMRAA SSAADSRHHR VPSLPSGRHS RKNSESSMSY VNTNNISAVG TDDSGALTLD DIEKSFQNVM NKQLMGNLPQ SRSPWTNNGN SGRSTGS
|
| |