Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49524 |
Symbol | |
ID | 7195856 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 506303 |
End bp | 509374 |
Gene Length | 3072 bp |
Protein Length | 926 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184150 |
Protein GI | 219127871 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.168314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATC CCCACGGCTT TGCTGTAGCT ACCGACACCG CAGCAATAAC AGCAGTATTA GCAGTACCGG CAGCAACAAC TACCGAAAAA ACCGACACTT TACAATTAAA CGTAGACTCA GCCACCAACA CCACAACGTG TCGAAAGCCG CATCGTGACG GTAAAGACGA CGAGAACGAT GACAAGCACG TCGATTTTAT CGTTGCTTCC GTCGACTCGC CGACGATGGA CTTGTCACAT ATTCCGAATC ACGCGGCGCG GGTCGCGCAC GGTAGGACTG CCACTGCTGG AGCACGTATA CACCACGGAG GCCATTCCGA TCAGTTCTCG CACACCAACA GCGAAGGAGG ACAGTCGTAC CAAATTATGG AGTACGATTT GGCCGACAAC GAACACGGTT TCGGTCGACA CGTTTACGAC TACAGCGACA AAGCCGACGA ACACAGTGGC ACCGACGAAG ACGACGATTT CGTCAATAAT CCCATGCTAT TGGCGGCGAC GGAACCGCAA ATTGAGGACA ACACACCCGC AGCCGCTGCA CTACGGGCCA ACGCCAACGT TAAGGTATCC ACCGTTGGCA AGGCCGCTCG TGCCGTACAC ATGCAGTCGG TGCGCAAACT CATCAAACGA GCTACCGGTA CAACCCACAG TGGATTCGTA CGTGGAAAAC CGCCCCGCTT GCCACCGGAT GTTCAGGAGG AAGAGCAGTC ACAAGGTGAG GAAGTATCGC TGGAAGCCCA CGAGCAACAT AATCGTCGAG CAATACACGG ATCCTCCGCA CTACATCCAC CGTTTCACCA TCCGCAACCA CTACAACCAC ATCCACACGC ACATAGTCTC GCACACCCTC CACTTCCAAC ACAAGCACCA CAAGCCATTC CACCGACATC ACTGGAATCG ACAGTCGTCC TGGGCGTCAG GGTACAACCA CATGATGTGG ACAGTGCCGA TGTCATGCCG GATGCCGTGG TAAAAGGGAC CGTCGAAGCA GGGAAAAAAG GTGAGCACAG CCAGCGCTCC TGTGTAGATT GCACCCACTC TGATCGTTCT CACAACATGT CTTTGCTCGA CAACTACAAC AACAACCATA GTGCTGGTCT CCGTGGATCC CCACGACATG CTCAAATTCT TGCGATGGCG GCGCAAGAAA AAAGCCCACG GACAGCAACC TCCAGTTGCC AAGTCCTACG TAAAGGGCAA GGTCATCGAC GGAGAACACG AACTTTATAC ACTGTCGATT GCCGTAATGA TTGGAGTTAG GACGTCTATT TCCTCCACGA ACGCTGTGAT TAACGGTACC ACGGCACCAC AAAATCAGCT ATTTCCGGAT GCCACCGCAT CGCAGCCCAG TAAACGATGG GTCCAATCAA CCGATTTCAG CGCATCCGAG AAGTACGAAT TTCGTCCCAA AGGTGGCGCC ACCCCACCGC ATAGACTGGC GCATACCTTC AAATTCAAAG ACTACTCGCC CATCGTATTT GCCTACTTGC GACGCATGTA CGGTGTCAAC GAATTCGATT TCTTACTATC CGTATGCGGC AACGCCAATT TTATCGAATT CATTTCCAAC GCCAAATCTG GACAATTCTT TTTCTATTCT AGTGACGGAA AATACATGAT CAAGACTATG ACCAATGCCG AAAGCAAATT TTTGCGGAGA AGTACGTGTT GTTATGACGA GTCTGTACTA GATTCTCTGA GGAGCGCGAT GACAATGGTC TCACTTGTTT ATCTTTCAAC AGTCCTCCCA AGTTATTTTC GACACTGCTG CGAGAATCCC AATACTCTCA TTACGCGTTT TTTTGGAATG TACCGCGTCA AATTGTATCA TTTGCGACGA AACGTCAAGT TTGTCATTAT GAACTCGGTA TATTACACGG ACAAGTATTT GCAAAATTTT TACGACTTGA AAGGTTCCGT AGTCGGTCGA AACGCCAAAC CTGGGCAAGC GGTCAAAAAG GACAACGATT TGCGCCAGGG ACTACCGGAG TCTGCCTTGT CGTTGCGACC GCCGGTGCGA ACCCACATGC GCGACCAAAT TTCCGCTGAT TGTGAGTTTT TGCGGCAGAT GGAAATTATG GATTACTCTA TGTTGGTAGG AGTTCATCAC GTGCCACCGG TGGAAGATCA CAGTCTCGCC ACGATTGGGT TTCGTGCGAG CGCACGGACG TCGGCCCAGC GCATGCGCAA GGGATCCTTG GAAATGGATT CCGGTTGGAA ACCGCTGCCG CAGGACCCGG TCACGAATGG CGCCTTGGTA GACGGGAGCG GCTCGGAAGA AAAAATCACC GATTTTTCGG TTGTTTCTGA TCGTCGGTAT AATCGCAATG AGTCATTGGG AGGTTTCTTT TTGGACGATG GGCTTGAGGA CGATGAAAGC AGCTACTTGA TGGGCAGCAG TAGACGTTTC GAACCGCGTT CGCCTTCATT CCACGAAGAA ACGGAACGAA AACGTCAAGC CACCATTGAG AAACTATACT GGCCTTTTCA TCGACTGTTC GATATTCATG GGTATCGCCT TTTGGAACCC GTACAATGCA CCAAGTGCTT CGCTGCTCCT TGCAACTGCG ATTCCGACGC CTCTCTACTC GAAGGCTACA AGATCCCCAT ATTCGTCAAG CCTCTGTCCG AACGTAAGGA CGGTGGTCTT GAAATGGATA CGACCGGGCG CCAATTACCT ATGAAGCTGA AAGGTCCACA CGGCGATCAG CTATACGAGG GTAAGATCTT TTACATGGGA ATCATCGACG TGTTACAAGA ATATACTTCT CGGAAACGGG TTGAGTCCAG CTATCGGGCC TTGACCAGTA GTGGAAAATT TGAAGCCAGC TGCGTCCCGC CCGACGTTTA CGGCGAACGC TTTGTACGAT TCTTTGACGA ATATACTGTT GGCATGACAC AGTCGAAGGA GGGTACGAAA GGATCTATAG AGAAGACTAA GAACACCCCC TGAAATAAAA CAGGCTGAAT CCCGAATGGC TTTACCTGCG GGCATTGTGC AATCAGTCCC GGGGTCGGAT GTAGTAAACT TTGCTAACAA ATGAACAAAA TTAAGGCAAT TTTATTACAA GG
|
Protein sequence | MTDPHGFAVA TDTAAITAVL AVPAATTTEK TDTLQLNVDS ATNTTTCRKP HRDGKDDEND DKHVDFIVAS VDSPTMDLSH IPNHAARVAH GRTATAGARI HHGGHSDQFS HTNSEGGQSY QIMEYDLADN EHGFGRHVYD YSDKADEHSG TDEDDDFVNN PMLLAATEPQ IEDNTPAAAA LRANANVKVS TVGKAARAVH MQSVRKLIKR ATGTTHSGFV RGKPPRLPPD VQEEEQSQGE EVSLEAHEQH NRRAIHGSSA LHPPFHHPQP LQPHPHAHSL AHPPLPTQAP QAIPPTSLES TVVLGVRVQP HDVDSADVMP DAVVKGTVEA GKKVLVSVDP HDMLKFLRWR RKKKAHGQQP PVAKSYVKGK VIDGEHELYT LSIAVMIGVR TSISSTNAVI NGTTAPQNQL FPDATASQPS KRWVQSTDFS ASEKYEFRPK GGATPPHRLA HTFKFKDYSP IVFAYLRRMY GVNEFDFLLS VCGNANFIEF ISNAKSGQFF FYSSDGKYMI KTMTNAESKF LRRILPSYFR HCCENPNTLI TRFFGMYRVK LYHLRRNVKF VIMNSVYYTD KYLQNFYDLK GSVVGRNAKP GQAVKKDNDL RQGLPESALS LRPPVRTHMR DQISADCEFL RQMEIMDYSM LVGVHHVPPV EDHSLATIGF RASARTSAQR MRKGSLEMDS GWKPLPQDPV TNGALVDGSG SEEKITDFSV VSDRRYNRNE SLGGFFLDDG LEDDESSYLM GSSRRFEPRS PSFHEETERK RQATIEKLYW PFHRLFDIHG YRLLEPVQCT KCFAAPCNCD SDASLLEGYK IPIFVKPLSE RKDGGLEMDT TGRQLPMKLK GPHGDQLYEG KIFYMGIIDV LQEYTSRKRV ESSYRALTSS GKFEASCVPP DVYGERFVRF FDEYTVGMTQ SKEGTKGSIE KTKNTP
|
| |