Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43016 |
Symbol | |
ID | 7196229 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1801809 |
End bp | 1805526 |
Gene Length | 3718 bp |
Protein Length | 1066 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177376 |
Protein GI | 219111249 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAACTACGA GAAGGGTATA CGACGCCGCG TCTCTACCTT TTACTCTACA GGAGCCAATC CTACCGCTCT CCGCTCTCCT AGTACTCTCA TTGTACGGCA CCAATCGTTT CGGCATCGCG AGTAGTTCAC TGTCGAATCA TGTCGGAAGA ACCACCGGTG GGGGTCAATC CGAATGCGGA AGATGAAACT CCCGCTCCAG AAGGGGAGAA GAAGCTATCG AAGAATCAAC TGAAAAAGCT CGCCAAGGGA AAGGTACGAA GCGGCCCTGG GCGAACTCTG ACGTTTCGGG TCGAATGCGG GTGTTTAAAT ACCAGCTTAC CTTTTCTCTT GTCCATTCCG TAGGACAAGA AAAAGAAAGA CAAGCCTCAA TGGAACGCGC CGAGTAAGGA AAAAGTCAAA GCTCCGGTCG CTACGACTTC CTTCGTCAAC ACGACCCCAA AAGGGGAAAA GAAAGATCTT TCGGCTCCCA TGGACGCCGC ATATCATCCG TCAGCAGTCG AAGCCGCTTG GCAAGACTGG TGGGAAAAGT GTGGTTACTA CAGCTGCGAT CCGAAAGATG CGGTCGATCG CCCCGTAGAT GAAAAGTTCG TCATGGTGAT TCCACCACCG AATGTGACTG GATCGCTGCA TCTCGGACAC GCCCTGACGG CGGCCGTCGA AGATACCCTG ACGAGGTGGC ACCGGATGAA AGGACACGCC ACTCTGTATG TTCCCGGTAA GTTTCCATAG TGCATTTATT CGTTCGTTCG TCGTTGGCTT CTGTTTCCCC ATCACGCACG GGTCCTCTGG GCAGCCACGG GTGAAATCAA TTCACAATAG ATGGCAATCG GCTCCGTGCA AAAATTTACG CAAAACAACC CCTCGCACAA CGAGTTCTTC ATCAACGCAC TATTACTCAT TGTTTTCGGG GCTGACATTT CACAATCACA GGTACGGATC ATGCAGGAAT TGCCACGCAA TCGGTGGTCG AAAAAATGCT CATGAAGAGC GAAGGAAAAA GCCGGCACGA TTTGGGGAGG GAAGAATTTG TGAAAAAGGT GTGGGAATGG AAGAAGGACT ATGGAAGTAA GATCACCAAT CAATTGCGAT CATTGGGCAG CAGTGTCGAT TGGTCGCGTG AACGGTTTAC GATGGATGAA ATGTTGAGTA AAGCTGTCGT CGAAGCCTTT AATCGATTTC ACGAAAAGGG ACTCTTGTAC CGTGCGGATC GTCTAGGAAA CTGGTCGTGT GCATTAAAGT CCGCCATTTC CGATATCGAA GTCGACTTTA TTGAACTCGA AGGCCGTACC TTTTTGGACG TCAAAACACA TAAGGGCAAT CCCAACGACC CCAATGGTCG GTACGAATTT GGGACCTTGA CTTCGTTCGC CTATCCGATT GAAGACTCAG AGGAACAGAT TGTTGTTGCC ACTACCCGAC TGGAAACCAT GTTGGGAGAC ACTGCTGTTG CCGTTCACCC GGACGATCCT CGATACACCC ACTTGCACGG GAAGCATCTA ATCCATCCCT TCAACGGACG CCGAATTCCT ATTGTTTGCG ATAAAGAACT GGTCGACATG TCCTTTGGTA CCGGAGCAGT CAAAATTACT CCCGCCCATG ATCCAAACGA CTACGAGTGC GGTAAACGTC ACGAGCTGGA GTTCATCACA ATGTTGACTG CGGATGGTTC AATCAACGAA AATGGTGCCC CTTTCACCGG CATGATGCGG TACGACGCTC GCATTGCGGT AGAAGATGCG CTTAAAGAAA AGGGATTATT CAAAGGCAAA GAACCCAACA AGATGCGATT GGGTCTATGT TCGCGCTCGG GTGATATTTT GGAACCCATG ATTACTCCGC AGTGGTATGT TAACTGCGAC GGTATGGCTA AGCGGGCTAC CGATGCGGTA CGCAATAAAG AGTTAACAAT TCTTCCAGAG GAGCAAGAGA AGACGTGGTT TCATTGGTTG GATAACATCA AGGACTGGTG CGTCAGTCGA CAACTCTGGT GGGGTCACCA GATACCGGCA TGGTTCGCCA CCAAGAAGGG AGAAAGTTTA GAAAAGAATG ATATGGCCAA CAACGACCGG TGGGTTGTTG CTCGGTCCGC TGAGGTAGGC AATAGGATGA AAGCATGCAC ACATTTTTTT GTTGCCGTAC TTCGTCTCAC AATACTGTAT CTCGCAGGAA GCTCTCGAGA AGGCCGCTAA ATTACTTGGC TGTCCCGCTG GCGACATCTC AATTGAGCGG GATGAAGACG TTCTTGATAC GTGGTTTTCC TCTGGACTGT TTCCTTTTTC TGTCATGGGA TGGCCCGATG ACACTTCTGA TTTGAAGGCG TTTTATCCTA CGTCTCTACT CGAGACTGGT CTCGATATTC TCTTCTTTTG GGTGGCTCGT ATGGTTATGA TGGGTTTGGA ACTAACCGAC ACACTACCAT TCCACACCGT CTTCCTTCAT GCCATGGTGC GGGACAAGGA AGGAAAGAAA ATGTCAAAAT CTCTTGGAAA TGTGATCGAT CCTCTGGAGG TCATCAACGG GTGCTCATTG GCCTCTCTGC AAGAGCGTCT GGAAGGAGGC AACCTTCCGG CGAAGGAAGT GGAACGATCG AAGAAGAATA ACGAGCTCGA GTTTCCTGAC GGTATTCCAG AATGCGGATC GGATGCACTT CGGTTCGGCC TTATGGCCTA TATGGTCCAG GGACGTGATA TCAATCTTGA CGTCAAACGT GTTGTTGGGT TCCGGCTGTT TTGCAACAAA CTTTGGAACG CCACACGTTT TGCACTTCAA TTTGTTGCGG ACTTTACGCC TACTCCGACT CTGTTGGACG ACCTAATGGC TAGCGGCAAA ATGGCGACGC GAGACAAATT TATGATATCT CGATTGATGA AAGCGGTGGA AGCCGTCAAC GATTTCTTCT CGAGCTACCG GTTTGGCGAT GCACAACAAG CGGCCTATGC TTTGTGGATT GAAGATCTTT GCAACACATA CCTGGAACTG ATCAAACCCG TCGTATACGA CATGAGTGTC AACAACATAG ACAATCGGTG GGCAGCACAA GCAACGCTTT GGATCGCAAT GGAAACAGGC CTTCGGTTAC TGCATCCAAT GATGCCATTT GTTTCTGAGG AGCTTTGGCA GCGACTTCCT GGACGTGGGA CGCTAGGCAA AACGGAACCT GAAACTATCA TGCTCGCCCC GTACCCCGAA ACTCACAACT CTTACAAAAA TGAGGCCGTG GAGCAATCTA TGATGAACAC AATGGCTGTG GTTAATGCCT GCAGATCACT TCGTCAGTCG TACAACATTG CCAACAAGGT ACAGACACAT TTCTTTGTGA ACGTATCTGG ACTCGCGCTA CATGCCGTTC TCGACCAACT GGATGACATC AAGACACTTG GAAAAGCTTC TGCCATTGAT ATTAATCTTT CCCCAGCAGA CACACCAGAA ACTGTCGGAA CTGCCATTGT CAATGATCAG CTGACTGTTC TGATTGACTT ACAGGGACTG GTTGACTACA AAGTTGAGAT TGGGCGTCTG CAAAAGAATC TAAGGTCTAC TCTACCAACA ATTTCGACTC TCGAAATGAA AATGGCTACT GATGGTTATA CAGAAAACGT TCCAAACGAT CTTCAAAAAG CGAATCTAGA GAAACTTGAT TCGCTTTTGA AAAAGAAGTG TGATCTCGAA GAGGCTATTG CAAACTTTGA ACGTCTGGCC TTATTGGATA AGAATTAA
|
Protein sequence | MSEEPPVGVN PNAEDETPAP EGEKKLSKNQ LKKLAKGKDK KKKDKPQWNA PSKEKVKAPV ATTSFVNTTP KGEKKDLSAP MDAAYHPSAV EAAWQDWWEK CGYYSCDPKD AVDRPVDEKF VMVIPPPNVT GSLHLGHALT AAVEDTLTRW HRMKGHATLY VPGTDHAGIA TQSVVEKMLM KSEGKSRHDL GREEFVKKVW EWKKDYGSKI TNQLRSLGSS VDWSRERFTM DEMLSKAVVE AFNRFHEKGL LYRADRLGNW SCALKSAISD IEVDFIELEG RTFLDVKTHK GNPNDPNGRY EFGTLTSFAY PIEDSEEQIV VATTRLETML GDTAVAVHPD DPRYTHLHGK HLIHPFNGRR IPIVCDKELV DMSFGTGAVK ITPAHDPNDY ECGKRHELEF ITMLTADGSI NENGAPFTGM MRYDARIAVE DALKEKGLFK GKEPNKMRLG LCSRSGDILE PMITPQWYVN CDGMAKRATD AVRNKELTIL PEEQEKTWFH WLDNIKDWCV SRQLWWGHQI PAWFATKKGE SLEKNDMANN DRWVVARSAE EALEKAAKLL GCPAGDISIE RDEDVLDTWF SSGLFPFSVM GWPDDTSDLK AFYPTSLLET GLDILFFWVA RMVMMGLELT DTLPFHTVFL HAMVRDKEGK KMSKSLGNVI DPLEVINGCS LASLQERLEG GNLPAKEVER SKKNNELEFP DGIPECGSDA LRFGLMAYMV QGRDINLDVK RVVGFRLFCN KLWNATRFAL QFVADFTPTP TLLDDLMASG KMATRDKFMI SRLMKAVEAV NDFFSSYRFG DAQQAAYALW IEDLCNTYLE LIKPVVYDMS VNNIDNRWAA QATLWIAMET GLRLLHPMMP FVSEELWQRL PGRGTLGKTE PETIMLAPYP ETHNSYKNEA VEQSMMNTMA VVNACRSLRQ SYNIANKVQT HFFVNVSGLA LHAVLDQLDD IKTLGKASAI DINLSPADTP ETVGTAIVND QLTVLIDLQG LVDYKVEIGR LQKNLRSTLP TISTLEMKMA TDGYTENVPN DLQKANLEKL DSLLKKKCDL EEAIANFERL ALLDKN
|
| |