Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48315 |
Symbol | |
ID | 7203738 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 98743 |
End bp | 101931 |
Gene Length | 3189 bp |
Protein Length | 831 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182771 |
Protein GI | 219124986 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000929997 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGACACGAGA TCCGCTAGAG CTATCTTTAC AAGATCGATT TTTACCAAGT CAAGATGGGG GAACCCAGCT CAAAGACTCT TCCGCGCGCA CCCGAGGCTG AAGTCCAAGA CGACAACATC GAAGACGGCA ACGGTAACCT AAACAAGGAA ACAACGGGGA CTTCGGATAG CGACACCGAT CGCGAGTTTG ATGTCGACAA GTACCCAGTC CGAGAATGGG AGTTTGTCAT CCCGGGCTTT CGGGATCCCA TTTCTATCAA TCCGGTGGTA TCGGCAATTG GTGTGATAGT CCTTTGGGGC TTGGCGATCT GGTGTATGGG TAAGTCAACA TTTGTGTCGT TTGGGAGATT GCGATGAGGG TTCTGGTTCA AGTTGTTCAT CTATGCATAC GTAAATGGAA TCAGAGTGTT TTGTTTACTT TTAATCTTGT TTGCAAGAAG GCCCTCAAAA AATTTGATAC GTCTGTAACG GATTCGCGCC GATTACTGAC TACGGAAGAA TCCGCTTAAG TCGTTCTGTT ACCTGCCGCA ATGTACTGTA AGCATCACTT GCAATCTCAT GAACAATTTC TTTTTGTGCT TAAACAGTTG ATCCTGATGG CTCTCGTGAG ACTCTTGTGG GATGGCGTGG AGATGTTACT CTCTACTTCA GCTGGCTGTT TATGGGATCA AAAGCCATCT TTTTTTTCTA CCTCATCTAC GTCGTCTTCA AGTACGGTCA CGTCAAGTTA GGTCGACAAG ACGAGCCCCC CGAGTTCTCG ACTGGCGCCT ACTTTGCCAT GATTTTCGCC GCCGGTGTAG CCGTCGGTCT ATTTGTCTTT GGAGTAGCCG AACCTCTCTG GCATCAAGAA AGTCACTACT ACGCCAACGC CGGATACCGC TCGCAGGACG AGATTGACAT GTTTGCCCTC AACATGACAG TTGCCAACTG GGGCATTTCC GGTTGGGCGC CGTACTTGAT TGTGGCTGTC GCCATGGGAT TGGCAGGACA TCGCTTCAAC TTGCCCATGA CATTCCGCTC GTGCTTTTAC CCCATTCTGG GTCAGTACAC GTGGGGCTGG ATTGGTGATT TGATTGATGG CTTTGCTATT GTTGTGACTG TTGCCGGTAC GTGCAACAAC CATAGCACCC TGTCGATATG TTTTCTGACC CACGAAGACA TGATTCTCAC TATGCGTTTG GTTATCTAAT ATTATTCACG CTAGGCGTTT GTACCAGTCT CGGACTTGGC GCAATTCAAA TCGTTGTCGG ATTCCAGTAC CTTGGATGGG TCAAAGACGA TATCACCCAG GACGAGGTGT CTCGTGTCCA GAACGCAACC ATTTGGGTCA TCACTGTCAT TGCAACGGCA AGTGTGATAT CTGGATTGAA TGCTGGTATT CGTATCCTGT CTACCATTGC TTTCATGCTG GGCTTGGTGT TGCTCTTCCT CGTTTTCGTA ATGGATGATA CAAAGTACCT TCTTAACCTG CAAGTTCAAG AAGTTGGCTA CTACTTGCAG CATTCTATCT TTCAGCTGAA CTTCTGGACA GACGCTTTTG GGCAGATCCG TGAGGGTGGC GGTCGCGCTG TGGACGGTGC AGCCGCCGCT GCCTGGTGGA TGGATGCATG GATGATCTTC TACCAAGCCT GGTGGTAAGT GTGTTCTTTT CAGTTCAGAC GTTGTGGAAG ATCCTGTTTA CACATAACAA TTTCACGGTC ATCTCACTCA TAGACTCTTC TACATTTTAC AGGGTATCGT GGTCGGCTTT TGTCGGTCTT TTTGTTGCCC GCATTTCCCG AGGACGCACC GTTTCTGAAG TTATCATCTA CAGTCTGGTC GCTCCCGTCG CCTACTGTAT CATTTGGTTT AGTATTTGGG GAGGAGTCGG TCTACGCCAA GCCCGTCAGG GTCGGGAACT GGAAGCGCTC GGAGGCACCC TGTTCAACGA CACCGAACAC TTTCTGGTGC CTGGCAGTAC CAACTGCTAC GATGTCCCGC AAGAGACGTT GTCCCAAGAT GGTACTGTTG TGTTTGAGAA CCATCTTCTG GGAGTGACCC CTGTTTGTCA GTTTGATTCT TCCCAGTCTA ATACGGCTGC CTTCAATGTG CTATACTCTT TCAGCTTCCC GGACTCCTTT GATACCGGCT TTGGACCTAC TTTGTCGGTG TTGTTTATCA TTTCTCTGGC CATTTATTTT GCGACGAGCT CGGATTCCGG ATCTTTGATT GTGGACCATT TGGCGTCGAA TGGTCGCAAG AACCACCACT GGATCCAGCG CCTCTTCTGG GCCGTGACGG AGGGTGCGGT TGCCACGGCT CTGCTTTCTG CCGGAGGTGA ACAAGCCCTT CAGGCTGTGC AGGCCGCGTC GATTGTGTGC GGGCTACCTT TCTGCTTCAT GCTCTGCTAC CTTTTGCAAT CCATCGAACT ATTTTGTCGT GAGGCCTTGA TTGTTGGCGA CGGGCAAGAC TACCGCATCC CTGCCCAGTC TACGTTTTCG GTCCCAATTT ATGGCGGGAT CTTCAACAAC ATGGAGTTTT TGACCAGCGC TGGGTCCGTG AACCCCAAGC GCATTGAGCT TGGTATGGAC AAGGCAACCA CATTCCACGT CGTCGAATTT ATTAAGGGCC TCTTTGTCCC CTTTGTTTCC TTACACAAGG TCTTGAGCGA TGCGTACCCG CGTAACTCTC TGTCGAATAC GGCGGTAACT GCCGTTTACA CTGTTTGCTA CTATATGTGG ATTGGTATAT TCGCGTCTCT CGGATCGAAA GAAGGCTTGA TCGGCTGGGG ATGGCTACTG TTCTTTGCGT GTGCGTGCAT CTTGGGCTCA GTGCGCGGTG GATTCCGTGC TCGCTACAAT GTGCGTAGCA ATATTCTTGG CGACTACATG GCGTCTCTGT TTTTCTGGCC CCAAGTGTTC ACGCAAATGC GTCAGCACTG CGTGGAACTC AACTTGCCCC AAGACCACGG CGACCTTCCC TCGGAAAAAG AAAAGAAGCT GGACGGCTCC GATTCCGACG AAGTTGCTGC GTAGGTAGCG AGTTCCTCTA TATGTTGTGG TGACAGGGAA TGTGTTCTGT GGGTGGCAAT GCCGCTCCCA AAAATTGAAA AGGTGTGTAT GCCATCTGGC TTCTTTTTAT CGTCTTTTCA AATCTGTCAT TCACATTTCG TTAAAAGCAA TTATTAATAG TAGTTCTCTC TTTGGCTTG
|
Protein sequence | MGEPSSKTLP RAPEAEVQDD NIEDGNGNLN KETTGTSDSD TDREFDVDKY PVREWEFVIP GFRDPISINP VVSAIGVIVL WGLAIWCMVD PDGSRETLVG WRGDVTLYFS WLFMGSKAIF FFYLIYVVFK YGHVKLGRQD EPPEFSTGAY FAMIFAAGVA VGLFVFGVAE PLWHQESHYY ANAGYRSQDE IDMFALNMTV ANWGISGWAP YLIVAVAMGL AGHRFNLPMT FRSCFYPILG QYTWGWIGDL IDGFAIVVTV AGVCTSLGLG AIQIVVGFQY LGWVKDDITQ DEVSRVQNAT IWVITVIATA SVISGLNAGI RILSTIAFML GLVLLFLVFV MDDTKYLLNL QVQEVGYYLQ HSIFQLNFWT DAFGQIREGG GRAVDGAAAA AWWMDAWMIF YQAWWVSWSA FVGLFVARIS RGRTVSEVII YSLVAPVAYC IIWFSIWGGV GLRQARQGRE LEALGGTLFN DTEHFLVPGS TNCYDVPQET LSQDGTVVFE NHLLGVTPVC QFDSSQSNTA AFNVLYSFSF PDSFDTGFGP TLSVLFIISL AIYFATSSDS GSLIVDHLAS NGRKNHHWIQ RLFWAVTEGA VATALLSAGG EQALQAVQAA SIVCGLPFCF MLCYLLQSIE LFCREALIVG DGQDYRIPAQ STFSVPIYGG IFNNMEFLTS AGSVNPKRIE LGMDKATTFH VVEFIKGLFV PFVSLHKVLS DAYPRNSLSN TAVTAVYTVC YYMWIGIFAS LGSKEGLIGW GWLLFFACAC ILGSVRGGFR ARYNVRSNIL GDYMASLFFW PQVFTQMRQH CVELNLPQDH GDLPSEKEKK LDGSDSDEVA A
|
| |