Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_52461 |
Symbol | |
ID | 7195117 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 318781 |
End bp | 321769 |
Gene Length | 2989 bp |
Protein Length | 979 aa |
Translation table | |
GC content | 61% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183465 |
Protein GI | 219126439 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000189103 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACCT CGGCTCATTT CAAACTGAGC GACTTTCCTC ACAAAGTCCT CGACCCGATC GCCACCCTCA CCGTCCCACC CACCTACGCG ACCATCAAGC GTGCCCAACG CCAGCTCATG ACTAACGCCG CCGCCATTCC CACACTCAAC GGAGGTGGCG CCCACGGCCA TATGGCCTTG ACCCTGACCG CCCTTGCCTA CGCCGACATC AGCGACGTCC CGTTCGTCAT TCCCGTCGCC CCTCCGGCCA ATCCGCCTCC CGGCGCCACG CAACCGCAAA TCACCGAAAA CAACCGCATT CATCAACGCG ATGCTGACAT CTACAACCTT TATGTCGCCG TCAACAACGC GCTTCGGCAG CAACTTCTCG ACGCGGTTCC CCGCATTTAT GTCCGCGCCC TCGCCCATCC CATGTTCGAG TTTAGCAACG TCACGTGCCT CGACTTGCTC TCGCACCTCT GGACCAAATA CGGCACCATC AAGCCCGCCG AGCTCCAGAA AAATTTCCAG TCCATGTACA CCCCTTGGAA CACAACCGAG CCGCTCGAAT CCGTTTTTCT TCAGCTCGAC GAGGCCATCG CTTTCTCCAT TGACGGTAAC GACCCCATCT CGGAAGCTGC GGCTGTTCGC GCAGGCTACG AAGTCATTGC GCACTCGGGC CTGCTCCCCC TGGACTGCAA AGAATGGCGC AAATTGCCTA CTGCTGCTCA CACACTTGCC CATTTCCAGC AGCACTTTTC CCTTGCCGAC GACGACCGGC GCCTCACGGC CACCACCGGT TCCCTTGGAT ACGCCAATGT GCTTGCTGCT GCCCCCTCTC TTGCTCCTGC CACAACCTCC GACACTCTCA GCCTTCCTTT CTCCGCGCTC TCTGTGTCCC AAACTTCTGT CTCTTCGCCG GACATGACCT ATTGCTGGAC CCATGGTACC AGCAAAAACC GGCGCCATAC GAGCGCCACG TGCAAGAACA AGGCCCCTGG CCATCGCGAC GACGCGACCG CCACCAACAC TCTCGGCGGC TCCACCAAGG TTTGGACCGC TCCCAAGCCC CCTGAATAGG AAAGAGGGAC GGCTACGCCG ATGGTTAACT CTAGTAATAC CGATTCTTTA AATCATATTA CTCGTCTTAA TTCATCTGTA GTCCCCTCCC CGCCTAGTCC CCATACCTCG GCCATTGCTG ACACCGGTTG CACCGGCCAT TACATCACCG TCAACTGCCC CCACACCCAC AAACGTCCTG CAAGCCCCAG CCTTGCCGTC CGTGTCCCTA ACGGCGCCGT CCTCCGCTCA AGCCACATTG CCACCCTAGC CCTCCCTGGC TTCTCCCCTT CTGCTTGCCA GGCCCACATC TTCCCCGGGC TTACCTCGCA CCCACTCATT TCGATTGGAC AACTTTGTGA CGACGGCTGC ACTGCCACTT TCTCAGCCAC TCGCCTCGAG ATCCACCGCG ACACTACACT ACTCCTCTCC GGCACTCGTG CACCCACTAC CGGCCTCTGG CACCTTGATC TTACCCCTGC CAAGCCTCCT GCCACAGCCC ACGCTCTAGT TCCCAACACT CCCCTCGCTG ACCGCATCGC TTTTGTTCAT GCCTCGCTCT TCTCCCCGGC TATCTCCACA TGGTGCCAGG CCCTCGACTC CGGCCATCTT GCAACCTTTC CTGCACTTTC CTCCCGCCAG GTCCGCAAGT ATCCACCTCA TTCCCCCGCC ATGGTCAAAG GCCACCTCGA CCAACAACGC GCAAACCTTC GCTCCACCAA GCTTCCCCCT GTAGGTTCCC CCATCACGAC GGAACCCCCT GCCGCCGCTG TGCCCGACCT TGACCCTCCC GACGCCCACC CCGTCACACG CACACACCAT GTCTTTGTTG CCCACCAACG GGTTACCGGT CAGATCTACA CGGACCAACC GGGCCGCTTC CTCACTCCCT CCAGTGCCGG CCACAACGAT ATGCTTGTTC TTTATGATTA CGATAGCAAT GCTATCCACG TCGAACTCAT GAAGAACAAG TCCGGCCCCG AGATTCTAGC AGCCTATAAG CGCGCTCATG CTCTTCTCAC CCAGCGCGGC CTTCGTCCCC AACTTCAGCG TCTTGACAAC GAAGCCTCTG CAGCCCTCCA GTCCTTCATG TCCTCCGAGC ACGTGGACTT TCAGCTAGCA CCCCCTCATC TACACCGTCG TAATGCCGCC GAACGGGCCA TACGCACCTT CAAGAACCAC TTCATTGCTG GCCTCTGTAC CACAAACCCG GATTTTCCCC TTCATCTTTG GGACCGACTC CTCCCACAGG CCCTCATTAC CCTCAATCTT CTTCGTCGCT CCCGCATCAA TCCCAAGTTG TCCGCCCACG CACAACTTCA CGGTGCCTTT GACTACAACC GCACCCCGCT TGCTCCTCCA GGCACCCGCG TCTTAGTCCA TGTCAAGCCC GCTGTTCGCG AAACCTGGGC CCCCCATGCT GTCGAAGGTT GGTATCTCGG CCCCGCTCTC AACCATTATC GCTGCCATCG CGTATGGATC ACGGAAACAC GTGCCGAACG TGTTGCCGAC ACCCTTTCCT GGTTCCCGAC CCGCATTCCC ATGCCCGCCG CTTCGTCCAC CGACCGCGCC CTGGCCGCCG CCCGTGACCT GGTCCATGCC CTCCAGAATC CTTCCCCGGC GTCTCCGTTC GCCCCCCTCG ATGCCACCCA GCACCAGGCA CTCACAGATC TTGCCACCCT CTTTGCCACT GTGGCCGCCC CAGCCGACGA CGTCCCTGCA CCCGCTCCCG TGCCTCCGGT CCGTCCCCCT GCCCCAACAA CTCCCCTTGC TCAGGTCCGC TTTGCCGTTC CTCTTGTCAC GGCCAAACAT GCCCCGGCAC TTCCGAGGGT GCCCATTCCG GCCCCAGCCC TTCCGAGGGT GCCCACCCTG GCCACCTATC ACTCTCGCAC CGGCAACCCC GGCCGTCGCC GCCGCAAAGC ACGCACACAA CCGGCAACCC CAACCCTAG
|
Protein sequence | MSTSAHFKLS DFPHKVLDPI ATLTVPPTYA TIKRAQRQLM TNAAAIPTLN GGGAHGHMAL TLTALAYADI SDVPFVIPVA PPANPPPGAT QPQITENNRI HQRDADIYNL YVAVNNALRQ QLLDAVPRIY VRALAHPMFE FSNVTCLDLL SHLWTKYGTI KPAELQKNFQ SMYTPWNTTE PLESVFLQLD EAIAFSIDGN DPISEAAAVR AGYEVIAHSG LLPLDCKEWR KLPTAAHTLA HFQQHFSLAD DDRRLTATTG SLGYANVLAA APSLAPATTS DTLSLPFSAL SVSQTSVSSP DMTYCWTHGT SKNRRHTSAT CKNKAPGHRD DATATNTLGG STKERGTATP MVNSSNTDSL NHITRLNSSV VPSPPSPHTS AIADTGCTGH YITVNCPHTH KRPASPSLAV RVPNGAVLRS SHIATLALPG FSPSACQAHI FPGLTSHPLI SIGQLCDDGC TATFSATRLE IHRDTTLLLS GTRAPTTGLW HLDLTPAKPP ATAHALVPNT PLADRIAFVH ASLFSPAIST WCQALDSGHL ATFPALSSRQ VRKYPPHSPA MVKGHLDQQR ANLRSTKLPP VGSPITTEPP AAAVPDLDPP DAHPVTRTHH VFVAHQRVTG QIYTDQPGRF LTPSSAGHND MLVLYDYDSN AIHVELMKNK SGPEILAAYK RAHALLTQRG LRPQLQRLDN EASAALQSFM SSEHVDFQLA PPHLHRRNAA ERAIRTFKNH FIAGLCTTNP DFPLHLWDRL LPQALITLNL LRRSRINPKL SAHAQLHGAF DYNRTPLAPP GTRVLVHVKP AVRETWAPHA VEGWYLGPAL NHYRCHRVWI TETRAERVAD TLSWFPTRIP MPAASSTDRA LAAARDLVHA LQNPSPASPF APLDATQHQA LTDLATLFAT VAAPADDVPA PAPVPPVRPP APTTPLAQVR FAVPLVTAKH APALPRPFRG CPPWPPITLA PATPAVAAAK HAHNRQPQP
|
| |