Gene Information |
Locus tag | PHATR_33295 |
Symbol | |
ID | 7204380 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 542624 |
End bp | 546324 |
Gene Length | 3701 bp |
Protein Length | 1186 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186367 |
Protein GI | 219113567 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCACA ATCCGAGGTC GCGCAAAGGT GATATGGTTT TCCCTCGACC GTGAATGCAG CGCCCTCCGA CAATGAGGAT CAGAAGAGAT CCTCGTGCAA GCATCTGCAA CAAACTTGCG GCGAAACCTG TTCTCTCTAC ACCTACCTCG CAGCTGTATT CATCGTGAAC GTGCTGTTGT GAATTATCTG GCAGGCCGAG ACTCTCGTAT CATCAGCATG GTGGTTGCGG AAGTTCCCCT GTGGGTTATG CACAGCAGCA ACGCGAGTTC CAATACTACT CTGGACGGTG CCAGCGGACT CGCAACTGGT GGTCCTTTCA ACAAGCTTTC GGATGGACAT TTTTCCCGTC ACCATCATTT CGAGCACAAG GCGGCTTCTT CCTTGGCCTT GTTGGACTCG GCGTCGGGAA GGGCAAGAAA GTCTGCCATT TACGCCGTAG ACGTCCATCC CGACGGCCGA ATCTTTGCGA CAGCGGGAGG CGACTGTGCG GTCCGAATTT GGAATACACA GGCCTTGTTT GCTCCCAAAA ACAAGGGCGG TAGCTTTGCT GTGGCAGGTT CTTCGGATCC GACCGGCAAA CCCAACAATG CTACCACAAC CTACGTGAGC ACGAGTGCCT CGTCGGGACC GGAAGCCTCG GAAAGTAGCG CTGGGGAACA AGAAAGTGAC GTTGGTCCCC GGGACGAAAT GGTTCACGAT CTTAATAGTT TCGTCCGTCG CAAAAAAGAC CCCACAGTGA ACCAATCCTC GGCGGTACCC GCCGAGACGA CGAAATCATC TACTGCATCC GGGCAGTCGG TAGCGGACTC GTCTCCTGTT CGCCCCTCGA CGCACAAAAG ATCGCACCAT CAGCATCGTT TACTATGTAC ATTGTCCGCA CATTCCGGTT CATCGGTACT GGCGGTGCGA TTTTCAAGTA CCGGAACCTA TTTGGCTTCG GCGGGGGATG ACGGTTGTGT GTGTATCTAC ACTCACAACG AGGACACGGA GGGTAACCTG ACCCAAGAAC CGTCACCGCA CGACGAGCAC TGGTCTCGGA TCAAACTTTG CAGGGGACAT GGCTTGGATG TTGTGGATTT GGCTTGGGCA CCGGACGATT CATACCTAGT TTCCTGTTCG CTGGATTCCG AAACACCGAT CATTGTGTGG AAAACGACTC ACTTGGGATC TTCTCGTCGA GCCAACGCAA CGAGTATGAT ACTGAATCCA TTCAAGGTTC TCGGTAGAAA AGAACACACG AGTACGGTCA AGGGAGTTTC TTTCGATCCG GCAGGATCCT ATTTTGCGTC CTCGGGTGAT GATCCAGCCG TCTGCGTCTG GCGCGCGCAC GATGATTGGG GTTTGGAAAC CAAGATTGAC GCTAGCAGCG GTATCTTTCG GCGCTGGAAA GAAGACGATA CTATGGCCCT GTCTTCGCAA AGTCTTTTCC GCCGAATTAG TTGGTCAACT GATGGAGCAT TCTTGTGTTC CACTAATTCG GTAGTGAAGA ACAAGCACGT GGCGTCCACT ATTAGTCGCG ATGGGTGGAG TGTGAGTAGT GCGTCTTCGG CGGCAGCGGG AGCGGCCAAC CTAGTGGGGC ATAAGCAACC AGTGGTCGTC AGCCGCCACG CGTCGCAGCT GCTGAGTGCA CGCAAAGCCA ATGTTTCAGG AGGCCAAAAC GGTGACGACG ACGAAGAACC CGATTACGCA ACTCTGTTGG CACTCGGCGA CAAGCGAGGC TTCGTAACGA TCTGGAGTAC CAAGAAATCA CGTCCGATTT TTAAACTTCA ATGCAGTGAA AGTCGCAGTA CGGTCACTGA 
CATGGCCTGG GGTTCCTTAC CAAGAGGGGA TCTGATGCTT TTAGTAACAT TTTTGGACGG TCAGGTTGTG GCCCTGCGCT TTGAAGTACC CAGTGAGTTG GGAAATTTGT TGAGTAAATC TGAACGGGCA CGCGTATTTC AGCTCAGGTA CGGTATCGAT GTGAACGATG TGGAAACTTT TGGACAGCGC CATCTATTTA CAGGAGCAAG CTCAGGTCCT AACCTGATCG AAAACGCTTT GCAAATGACG TTGGAGCACA CTCATACTGG AATAGACGAT ATGGACGATG ATACCTCGAC ACCAGGTCCG GAACCAGAAG AGAGATTGAA TGACCTGCAA GCGGTTTCAA TTCGTTCGAA ACAGAAGGAA AGCCTCTCGA AAGGAAAGAA GCGCATTCAA CCAGTTCTTA TGGCTGTTAC CAGCAAGAAA ACGAAGCCAG GAGCCGAACT TCTCAAGACC AAAGCCGTCG AGCCAAACCA ATCCATCGAT CCTTTGCAGA ATGCTATTGA TGCAGCCAGC AAGGCCTCAG CCGCCATGGC AACCGCCGAT TCCACCAAGC GCGACATTAC TGTAAACGCT GCTCCCATGG ATGGCCTGTC CGGAACAGCG CAGTCAAATT CCGCTCGGCC ATCAGTGCGA CCCAATGGAC CTAGCTCTTG GATGGGAACA ATTCTGCCGC ACAGCTCCGA ACGGATCCAT TCTCTGGATC TCCCGCTTCT TGGTTTACAG TCAATGGATG TCACCACAGG ATTCGCAGAG CCATGCGTCG CCGAATGCAC CAACTCGGTC AAATTACCGG TGGGGTCACG GACAACATCA ATTCCATGTG TCGATGTAGC ATTATCTCGT GATGGCAAAA TATCGTGGAA GGATCAAATC CCTGGTACAT CATGCTCGGC CATTGCGGCC AGTACCACGT TAATGGCTGT CGGAACAACG GATGGATGCC TACAATTGTA TGGCACATCT CCTACGATCG GTTGGACATG TGGTCAGAGC TTTCGTTCTC ATCCGTCGTT AGTTCTTGGG CATCCCATTG TCTCATTGCA ACTTCAAGAG ACGCAGGGGG AAGACGACGA AATCTTTGCT ACTTTGCTCA CCTTAACGGG AGATGGTACA TTTGCCGTCT ACTCTGTTCT TCCAGTATTG CAACTACAAT TCAAAGGATC TGTTATGCCG GCGATGTCAC ACATGGCTTT GGGTACATCG TTGACGTCGG AGCAACATTC GATAAAAATT TCCAGAATAC AAATTACTGA AACCAATCGT GTATTACTGC TTCTATCGCT ACAGACAGTT GACAACGCAC AGCTTCGAGG CGGATTGCGG GGGACTACTC AAATTGACGC AGGGGTTGGT GGCTCGTTGC AAGCGTTTGT CTTCGACCAA AAGGCCGAAC TTTGGATGAA GGCGGCGGAC AACCGCTTTG TCCTTTCCGA CTTTTACAGT GCTCTACCGT CTGCGAAATT TAGCCCCAAT GGAGAGCTGT CAAGGTTGGA AGATGCTGTT CGAATCGGCG CACTCCAAGC GAGCATGAAG CCAGCTCAGC GCGGTCGTCT ACGTGATACC GACCGCCATG CGGACGAGAT GTTTTCCAGA GCCGATTTGG AATCTGGGAA TTTTATTCCG ACTCGGGCAC ATTGCGAGGA TCGAATGGCT TGCGCTATTG CACTCGAATC TGCGGACGAA TTCAAAAAAT GGTTATCATT GTATATAAAA GTATTGTGTG TGGTGGGGCA CACTGATTTT CTCAGAGTTT TGGTGGATAT CTTAATGAAC GAACCCAAGG ACAAACGAGA AACGATTCCT 
GATGGCATGT GCTGGTGGAT GTCGATCGCC CCTACAGTGG TGGGTTTGGA CAAAAGGACA CTTGTCAGGT CGCTAGTTAT TCCCGAGATG A
|
Protein sequence | MNHNPRSRKG DMVFPRPCIH RERAVVNYLA GRDSRIISMV VAEVPLWVMH SSNASSNTTL DGASGLATGG PFNKLSDGHF SRHHHFEHKA ASSLALLDSA SGRARKSAIY AVDVHPDGRI FATAGGDCAV RIWNTQALFA PKNKGGSFAV AGSSDPTGKP NNATTTYVST SASSGPEASE SSAGEQESDV GPRDEMVHDL NSFVRRKKDP TVNQSSAVPA ETTKSSTASG QSVADSSPVR PSTHKRSHHQ HRLLCTLSAH SGSSVLAVRF SSTGTYLASA GDDGCVCIYT HNEDTEGNLT QEPSPHDEHW SRIKLCRGHG LDVVDLAWAP DDSYLVSCSL DSETPIIVWK TTHLGSSRRA NATSMILNPF KVLGRKEHTS TVKGVSFDPA GSYFASSGDD PAVCVWRAHD DWGLETKIDA SSGIFRRWKE DDTMALSSQS LFRRISWSTD GAFLCSTNSV VKNKHVASTI SRDGWSVSSA SSAAAGAANL VGHKQPVVVS RHASQLLSAR KANVSGGQNG DDDEEPDYAT LLALGDKRGF VTIWSTKKSR PIFKLQCSES RSTVTDMAWG SLPRGDLMLL VTFLDGQVVA LRFEVPSELG NLLSKSERAR VFQLRYGIDV NDVETFGQRH LFTGASSGPN LIENALQMTL EHTHTGIDDM DDDTSTPGPE PEERLNDLQA VSIRSKQKES LSKGKKRIQP VLMAVTSKKT KPGAELLKTK AVEPNQSIDP LQNAIDAASK ASAAMATADS TKRDITVNAA PMDGLSGTAQ SNSARPSVRP NGPSSWMGTI LPHSSERIHS LDLPLLGLQS MDVTTGFAEP CVAECTNSVK LPVGSRTTSI PCVDVALSRD GKISWKDQIP GTSCSAIAAS TTLMAVGTTD GCLQLYGTSP TIGWTCGQSF RSHPSLVLGH PIVSLQLQET QGEDDEIFAT LLTLTGDGTF AVYSVLPVLQ LQFKGSVMPA MSHMALGTSL TSEQHSIKIS RIQITETNRV LLLLSLQTVD NAQLRGGLRG TTQIDAGVGG SLQAFVFDQK AELWMKAADN RFVLSDFYSA LPSAKFSPNG ELSRLEDAVR IGALQASMKP AQRGRLRDTD RHADEMFSRA DLESGNFIPT RAHCEDRMAC AIALESADEF KKWLSLYIKV LCVVGHTDFL RVLVDILMNE PKDKRETIPD GMCWWMSIAP TVLFPR
|
| |
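The lengths reported in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates (which matches the reported 3701 bp), that the gene span covers only coding sequence plus introns, and that the coding sequence includes one stop codon. The function names `span_length` and `coding_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table of this record.
START_BP = 542624
END_BP = 546324
PROTEIN_LENGTH_AA = 1186


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally counting one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(START_BP, END_BP)      # genomic span of the gene
cds_length = coding_length(PROTEIN_LENGTH_AA)    # assumed spliced coding length
non_coding = gene_length - cds_length            # remainder, presumably intronic

print(f"Gene length:       {gene_length} bp")
print(f"Coding length:     {cds_length} bp")
print(f"Non-coding length: {non_coding} bp")
```

Under these assumptions the span works out to 3701 bp, matching the Gene Length field, and the difference between the span and the coding length is consistent with the record marking the gene as spliced.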