Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34728 |
Symbol | |
ID | 7200186 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 247328 |
End bp | 250409 |
Gene Length | 3082 bp |
Protein Length | 951 aa |
Translation table | |
GC content | 60% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179164 |
Protein GI | 219116739 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.148187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACCT CCGCTCATTT CAAACTGAGC GACTTTCCTC ACAAAGTCCT CGATCCGATT GCCACTCTCA CCGTTCCCCC GACCTACGCG ACCATCAAGC ATGCTCAACG CCAGCTCATG ACCAACGCCG CCGCCATTCC CACGCTCAAT GGTGGTGGCG CCCATGGCCA TATGGCCTTG ACCCTCACCC CCCTTGCCTA CGCCGACATC AGCAACGTCC CGTTCGTCAT TCCCGTCGCC CCTCCGGCCA ATCCGCCCCC CGGTGCCACG CAACCGCAAA TCACCGAAAA CAACCGTGTC CATCAACGCG ACGCTGACAT CTATAACTTG TATGTTGCCG TCAACAATGC CCTCCGCCAG CAGCTTCTCG ATGCGATTCC TCGCATCTAT GTGCGCGCCC TTGCGCACCC GATGTTCGAG TTCAGCAATG TTACCTGCCT TGATTTACTC TCGCATCTCT GGACAAAGTA CGGCACCATC AAGCCCGCCG AACTTCAGAA AAATTTCCAG TCCATGTACA CCCCCTGGAA CACCACTGAA CCGATCGAGT CCGTGTTTCT CCAGCTGGAC GAGGCCATTG CGTTCTCCAC CGATGGCAAT GACCCCATCT CGGAGGCTGC TGCAGTTCGA GCCGGCTACG AAGTCATTGC GCACTCTGGC CTGCTCCCTC TTGACTGCAA AGAATGGCGC AAACTGCCTC TTGCTTCTCA CACCCTTGCA AATTTTCAGC AGCACTTCTC CCTTGCCGAC GACGACCGGC GCCTTACGGC CACTACCGGT TCACTTGGCT ATGCCAACGT TCTCGCTGCT ACTCCCTCTC TGGCTCCAGC CACGGTTTCC GACACCCTCA GCCTGCCTTT CTCCGCGCTC TCTGTGTCCC AGCCTTCTGT CTCCTCCCCG GACATGACCT ATTGCTGGCC CCATGGGACC AGCAAGAACA GGCGCCACAC CAGTGCCACT TGCAAGAACA AGGCCCCTGG TCATCGCGAC GACGCGACGG CCACCAACAC CCTTGGCGGC TCCACCAAGG TTTGGACTGC CCCCAAACCT CCCGAATAGG AAAGAGGGAC GGCTACGCCG ACGATTAAAA CTAGTAATAC CGATTATTTA AATCATATTA CTAGTCTTAA CTCGTCTGTA GCCCCCTCCC CGCCTAGTCC ACACACCTCA GCCATTGCCG ACACTGGCTG CACTGGCCAC TACATCACGG TCAACTGCCC TCATACCCAC AGGCACCCAG CCAACCCCAG CCTAGCAGTC CGTGTCCCGA ACGGCGCAGT CCTCCGATCG AGTCATGTTG CCACCCTGGC CCTTCCTGGT TTCTCCCCTG CCGCCTGCCA AGCACATATT TTTCCTGGGC TTGCCTCCCA TCCGCTCCTC TCTATTGGAC AACTGTGCGA CGACGGTTGC ACGGCAACCT TCTCGGCCAC TCGGCTCGAC ATTCATCGTG ACGCTACACT GCTGCTCTCT GGTGCCCGCT CCCCCCACAC TGGTCTCTGG CACCTCGATC TTGCCCCCGC TCCCTCTCCT GCGACGGCCC ACGCCCTTGT TCCCCACACA CCCCTTGCCG ACCGCATTGC TTTTGTCCAT GCCTCGCTCT TCTCCCCGGC ACTTTCAACG TGGTGTCAGG CACTCGATTC CGGCCATCTT ACCACTTTTC CCGACATTTC CTCCCGACAA GTCCGCAAAT ATCCACCCAG CTCCTCCGCC ATGGTCAAGG GTCACCTCGA CCAACAACGC GCAAACCTTC GCTCCACCAA GCTTCCCCCT GTTGGTTCCC CCACCACGAC TGCACCCCCT GCCCGCTCTG TACCCGACCT TGATCCTCCC AATGCCCCAC CAGTCGCACG TACGCACCAC GTCTTTGCTG CTCATCAGCG CGTCACCGGA CAAATCTACA CCGACCAACC AGGCCGTTTC CTTACTCCTT CAAGTGCCGG CCATAACGAC ATGCTCGTAC TGTATGATTA CGACAGCAAC GCCATCCACG TTGAACTCAT GAAGAACAAG TCTGGCCCCG AAATTCTCGC CGCCTATAAA CGCGCTCATG CTCTTTTCAC CCAGCGAGGC CTCCGTCCAC AACTCCAGCG CCTCGACAAC GAAGCCTCTG CAGCCCTCCA GTCCTTCATG ACCTCAGAGC ACGTCGACTT TCAGCTGGCA CCCCCCCATC TACACCGTCG TAATGCCGCC GAACGGGCCA TCCGCACCTT CAAGAACCAT TTCATTGCTG GCCTCTGCAC CACGAACCCG GATTTTCCCC TGCATCTTTG GGACCGCCTC CTCCCCCAGG CCCTCATCAC CCTAAATCTT CTTCGTCGCT CCCGCATCAA CCCCAAGTTG TCCGCCCACG CACAGCTTCA CGGTGCCTTT GACTACAACC GAACCCCGCT TGCTCCTCCC GGCACTCGCG TCTTGGTCCA TGTCAAGCCG TCCGCTCGCG AAACATGGGC CCCCCATGCT GTTGAAGGTT GGTATCTCGG CCCCGCTTTG AACCATTATC GCTGCCATCG CGTATGGATC ACAGAAACAC GAGCCGAACG TGTTGCTGAC ACCCTTTCTT GGTTCCCGAC CCGCCTCTCC ATGCCTTCCG CCTCCTCCAC CGACCGAGCC CTGGCCGCCG CCCGTGATCT TGTCCATGCG CTCCAAAATC CCTCCCCCGC CTCCCCGTTT GCGCCCCTCA ACGCCCACCA GCACCAGGCC CTCACACACC TTGCCGATCT CTTTGCCACG GTGGCCGCCC CGGCCGACGA CGCCCCCGCA CCTGCTCCCG TGCCTCCGGT CCGTCCTCCT ACCCCAGCAC TTCCCCCAGC TCAGGTCCGT TTTGCCGTCC CTCTCGTCAC GGCCGAACAT GCCCCAGCAC TTCCGAGGGT GCCCGTTCCT ACCGCCGCAC TTCCGAGGGT GCCCCCCATG GCTACCTATC ACTCGCGCAC CGGTAACCCC GGCCGTCGCC GCCGCAAAGC ACGCAAACAA CCGGCAACCC CAACCCTAGT TCCGGCGCAT CCACACAACA CCCGCACCCG ACCCTTTCTT GTCCCGGCCT CCGCCAACGC AGTTGTCGAC CCCGCAACCG GCGCCTCCTT AG
|
Protein sequence | MSTSAHFKLS DFPHKVLDPI ATLTVPPTYA TIKHAQRQLM TNAAAIPTLN GGGAHGHMAL TLTPLAYADI SNVPFVIPVA PPANPPPGAT QPQITENNRV HQRDADIYNL YVAVNNALRQ QLLDAIPRIY VRALAHPMFE FSNVTCLDLL SHLWTKYGTI KPAELQKNFQ SMYTPWNTTE PIESVFLQLD EAIAFSTDGN DPISEAAAVR AGYEVIAHSG LLPLDCKEWR KLPLASHTLA NFQQHFSLAD DDRRLTATTG SLGYANVLAA TPSLAPATVS DTLSLPFSAL SVSQPSVSSP DMTYCWPHGT SKNRRHTSAT CKNKAPGHRD DATATNTLGG STKPPPRLVH TPQPLPTLAA LATTSRHPAN PSLAVRVPNG AVLRSSHVAT LALPGFSPAA CQAHIFPGLA SHPLLSIGQL CDDGCTATFS ATRLDIHRDA TLLLSGARSP HTGLWHLDLA PAPSPATAHA LVPHTPLADR IAFVHASLFS PALSTWCQAL DSGHLTTFPD ISSRQVRKYP PSSSAMVKGH LDQQRANLRS TKLPPVGSPT TTAPPARSVP DLDPPNAPPV ARTHHVFAAH QRVTGQIYTD QPGRFLTPSS AGHNDMLVLY DYDSNAIHVE LMKNKSGPEI LAAYKRAHAL FTQRGLRPQL QRLDNEASAA LQSFMTSEHV DFQLAPPHLH RRNAAERAIR TFKNHFIAGL CTTNPDFPLH LWDRLLPQAL ITLNLLRRSR INPKLSAHAQ LHGAFDYNRT PLAPPGTRVL VHVKPSARET WAPHAVEGWY LGPALNHYRC HRVWITETRA ERVADTLSWF PTRLSMPSAS STDRALAAAR DLVHALQNPS PASPFAPLNA HQHQALTHLA DLFATVAAPA DDAPAPAPVP PVRPPTPALP PAQVRFAVPL VTAEHAPALP RVPVPTAALP RFRRIHTTPA PDPFLSRPPP TQLSTPQPAP P
|
| |