Gene Information |
Locus tag | PHATRDRAFT_28568 |
Symbol | |
ID | 7201976 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 827192 |
End bp | 830271 |
Gene Length | 3080 bp |
Protein Length | 707 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181449 |
Protein GI | 219122221 |
COG category | |
COG ID | |
TIGRFAM ID | |
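The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence includes one stop codon. The function and variable names are hypothetical.

```python
# Illustrative consistency check of the record's coordinate fields.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    return end_bp - start_bp + 1

START_BP = 827192   # "Start bp" from the record
END_BP = 830271     # "End bp" from the record
PROTEIN_AA = 707    # "Protein Length" from the record

gene_len = inclusive_length(START_BP, END_BP)

# Assumption: the coding sequence is 3 nt per residue plus one stop codon.
cds_len = PROTEIN_AA * 3 + 3

# Whatever remains of the gene span is non-coding (introns and flanking bases),
# which is consistent with "Is gene spliced | Yes".
non_coding_len = gene_len - cds_len

print(gene_len, cds_len, non_coding_len)
```

Under these assumptions the computed gene length matches the Gene Length field, and the coding portion is shorter than the gene span, as expected for a spliced gene.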
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.43424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAAAAGGAT TGACGGCAGC CTTGTTGCCC TTTCAAGTCG AAGGCGCATC GTGGATGTAT CACCAAGAAA CGCAAAAACC AGAGATTCGC GGAGGGATCC TAGCCGACGA AATGGGAATG GTACGCCTGG TGTGCAGGTT GCCGTGCCAC GGAAAACTTT ATTGCGCTGT ACCTGCTGTG CGTCGTTCGG GTCCACCAAA CGTCTTTTGC CTAACCTGAC CTTTTTATTC CTTTCTATCT AGGGGAAAAC AGTGCAAACA ATCGCTACCG TGCTGGACAA TCGTCCAAAA CTACAGCACA GTCGTCCCGG AGCTAAACAT CCGCCCTCAC TTCCCGATGT CACCGAACGG CAGTTGGAAG AGACACTTTG GAACCGAGCA GTCTCGGATT GGAAGCACGA AATGGACATG TGCAACGTTC CGCCCAAGAT GCGTCCCCAC AAGTATGCCG CGGCTCGCGC CGGCACCCTC GTGGTCTGTC CCGTCATTGC CCTGCATCAG TGGAAAACCG AGATCGAAAA GTTTACCGAG CTAGATACCC TGTCGGTGGG CATATATCAT GGTCCAAATC GGGCTACGGA TATGCCACCC GAACTGATGC AAAAATACGA TGTTGTGCTC ACCACGTACC AGGTTTTGGA ACAGGATTTT CGCAAAATGA TGTCGCCCAA TAAAATCAGC TGTCCCAATT GTGGGGGCAA GTTCAAAGTC GACAAGCTGC GAGTTCATCT CAAGTACTTT TGTGGAGATG GCGCTGAGCG TACCGAAGCG CAAGCACGTC AACATCGTGC CCGTGATCGG GACGAGAACG GTAGTGGTCG GGGTAATACC AATCGTGGTA TTGGTGGTGC AAGGGGCAAG AAAGATAAGG TTAAAAAGCC TCTGACCCCA ACAAAGAAGC ATTTGTCCAC CAAGAGTGTG GCAAAAACCA AGCAGGCGAC GAGACGAACA ATTCGGGTAA AGAGTTCGGG AGACTACGAA TCCGACAGTG AACTTTCGTT GGACGAACCG TTTCTGGCAA CTCCGCCGCA ATCAGGCCGT CCATCGCGAT CAGCAGCTTC AAAAGCTTCG AAACGCATGT CCAAGACGCT CAAGGAATGG GGTCGGGAGG GGCGTAACGA CAACGATGAA AGCAGCTTCG GCTTTGTTAG CGAGGGGGGA GACAGCGACT CATCCGATGA AGATATTCCG CCAGTGACCG CTGCGAAACT TAAATCCGTC GCCAAGAAAC GGACAGTGAG CCAACGAGAA TCGTCTCATG AGAGTGCGCT GGATCGCGCT TGCGAAAAGC AACGCAAAGC GATGGACAAT GTCAAAAAGC AAAAGACCGG GAAAAAGAAA ACGTTGGGCA AGAAAGGCAA GAAGAAATTC GATAATGAGG GGTCGTCTGA ATCAGATTCC GAAGGGAAAG CAAGCGATCC CATTAATGAT ATCGATATGA ATGAGTTGAT GAAGGAAGCC ATGGTGGGTT CGCGCTTTAG TGTGCTCCAC AGCTTCTGTT GGTGGCGAAT TATCCTTGAC GAGGCCCATT TTATTAAATC ACGATCGAGT CAAACTGCCG CTTCTGCGTT CTCACTGTCG GCTATTCATC GTTGGTGTCT GTCGGGAACG CCACTCCAGA ACCGTGTTGG AGAATTGTAC TCGTTGATTC GCTTTCTCCG AATCGATCCC ATGGCGCATT ACTTCTGCAA AGCGAAAGGA TGCGATTGCA AATCAATTCA TTACCGCATC AAAGACGGCA AGTGCCAGGA CTGTAGTCAC CACGCCTTTT CACATTACGC ACATTTTAAC CGGTACGTCC 
TGAATCCTAT TCAGCGAGAT GGGTACAGCG GTGACGGACG TCGAGCTATG TTCAAATTGA AGAACGAAGT TCTCGACAAA TCCTTGCTAC GTAGAACGAA AGAAACTCGG GCAGAAGATA TGAATTTGCC GCCACGACTG GTGACGATTC GACCCATTCG TCTACATCCA GTCGAGCAAG ATTTTTACGA TGCTCTCTAC ATGAACACTA AGGCTTCCTT TAATGACTAC GTTGATGAAG GAACCTTGCT GAACAATTAT GCGCACATCT TCGATCTTTT GACAAAAATG CGCCAAGCGG TCGATCATCC GTACATGATT GTTCACTCTA AAAAGAATAC CGAGAAGCGG CGATTGGAGC AGGGAGCTCC AGTCGCGAAC GGATCGGTGG ACTGTGATAT CTGTCATGAA TCTCCAACGG AGCGTGTCGT CAGCTCTTGT TGCGGTTCTG GCTTTTGCCG TGAGTGTGTG GTTGAATACC TCACCGGCGC CGGTGGTGGG AGCACCCCGT GCCCTTCCTG CCAATCCCCC TTTTCCATCG ACCTCAACCA GGCGAGTACT GAAGCACCAG TGGATGACGG TACGCTCGCG TATGGTGTCA GAGAGTCGCA GAAAAGTGTC GATTGTTCAT CAATTCCGTC GTTGAAAGAG CTGCAGCATG TTCCTTCGGG TTCTATTTTA CGACGGATCA ATCTAGCCGA GTTCGCCACA TCGTCGAAGA TTGAGGTCTT GGTCCAAGAG CTCGTTGCTA TGCGCAAGGG TCGGCCAGGT AGCAAAGCCC TCGTGTTCTC CCAGTTCGTC AACATGCTGG ACCTCACTCG CTGGCGCATC CATTCCGATC CCTGCTTAGC TGACTTAGGT CTCGGGGTTC GAATATTGCA CGGTGGAATG GACGTCAAGT CTCGCGATGC TACCCTTCAA GCATTCCGAG AAGATCCGAG CGTCCGAGTT TTACTCATGT CGCTGAAGGC TGGCGGTGTT GCACTGAACT TGACCGTCGC TTCGGAAGTG TATCTGTTAG ATAATTGGTG GAATCCAGCT GCAGAAATGC AGGCAATTGA TCGTACTCAT CGTCTCGGAC AGTACCGTCC AATTCGCGCT GTGCGATTCA TTGCGGAGGG CACTGTGGAA GAGCGCGTGT TGCAACTGCA GGAAAAGAAA AGGTTGGTGT TCGACGGTAC CGTGGGCCGA GATGCTGGCT CTTTGAAAAT GTTGACGGTA CACGATATGA AAGCCCTTTT TACTTGAGTT TTAGTTATAG CCAGGGAATT ATGAGTTATG
|
Protein sequence | MYHQETQKPE IRGGILADEM GMVRLHEMDM CNVPPKMRPH KYAAARAGTL VVCPVIALHQ WKTEIEKFTE LDTLSVGIYH GPNRATDMPP ELMQKYDVVL TTYQVLEQDF RKMMSPNKIS CPNCGGKFKV DKLRVHLKYF CGDGAERTEA QARQHRARDR DENGSGRGNT NRGIGGARGK KDKVKKPLTP TKKHLSTKTM VGSRFSVLHS FCWWRIILDE AHFIKSRSSQ TAASAFSLSA IHRWCLSGTP LQNRVGELYS LIRFLRIDPM AHYFCKAKGC DCKSIHYRIK DGKCQDCSHH AFSHYAHFNR YVLNPIQRDG YSGDGRRAMF KLKNEVLDKS LLRRTKETRA EDMNLPPRLV TIRPIRLHPV EQDFYDALYM NTKASFNDYV DEGTLLNNYA HIFDLLTKMR QAVDHPYMIV HSKKNTEKRR LEQGAPVANG SVDCDICHES PTERVVSSCC GSGFCRECVV EYLTGAGGGS TPCPSCQSPF SIDLNQASTE APVDDGTLAY GHVPSGSILR RINLAEFATS SKIEVLVQEL VAMRKGRPGS KALVFSQFVN MLDLTRWRIH SDPCLADLGL GVRILHGGMD VKSRDATLQA FREDPSVRVL LMSLKAGGVA LNLTVASEVY LLDNWWNPAA EMQAIDRTHR LGQYRPIRAV RFIAEGTVEE RVLQLQEKKR LVFDGTVGRD AGSLKMLTVH DMKALFT