Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34625 |
Symbol | |
ID | 7199696 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 1072931 |
End bp | 1076361 |
Gene Length | 3431 bp |
Protein Length | 675 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178907 |
Protein GI | 219116224 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000047159 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTCC GGAAACATAG CAATCGTACA AATGCACCCT TTATTGTGAG TCCACGTGGC CACGAAATAA GGCTGACAAT ATTTCTGTGC GCACCTCCTT AACTCCATGT GCATAGTTTT GATTCACACT TGCGACATCC ATGTTAGGCG AATCATGTCC ATGAGACATG TCGCAAGGAG GCATACCGTA ACATTCGTCA TAATGCCCAA CCTGTGTTTA CCATGCACTG TCAAACAGCG TTACTATCCT ACGTACGTAC ATTGGTTGTA CTATGTGAGA GATTAACAGT CTTGACAGCC CGTCAAACCA CTGCTGCCAA CTGCACAGTG TTTTTGTGGT AAACATTTTT CTTGAGTTTT ACTTAAGATT GTTTCCAAAT TTGTACGAGG TGTACATTTT TGGTAACAAA CCGCTGACAT TCTTTTGTGG TGTATAAGTG CCAGAGGTAT CACTTAAGGT GGTATTTATT GGCTTGTGCG ACACATCGTT TGGAAGCCGC ATCCCCATCC CCTCACGTAA ACTTGTACTG AGAACCCACT TGTAACCCTA ACGTTGGCTC AGCTCCAGGT TTTCGTAAGT AAGTTTTCGT TGCCTCCCTA CTTCCTCGGA ATTGCTGGTG CCGAAAACCC CTTGGAGTCT TCAGGGTTTG TTGGCAAGTC GTTCCGTGAT TACGGGTTGC TCTGGTTGGG TGAGGTAGAA ATACCAATAG GCCCAAAAGA CTCCAACCCT GATTCTGGTT GGGACGGGCT TACTACCCGG CGTAAAATAG TACAGGGGGC CAATCAAACC CTCGACAGTT TGCGTACGCG TGCGAGCGTA ACAGACACTA CCTACGGAGG TAGAATTGAT TGTACCAGTG CTAGCTTTGT TCACTGGTGT GGGGTCATTT AACGAAGTCC GCAATAACAA AACAGCAAAG GACCCAAACA CAAACGCTGT TTAATGGTTT GAGAACCTCA AGCATTCTCA TATTGCTCTG GCTACCTAGC CCATCATGGT GTTGACCAGT GAGGATGTCT ACGCGCACTT CTTAGACAAT GTGTTCTTGC TCTCCCAGGA GCATCCGATT CGACTATTTC TTATGCAACA AGGTTTTGAA TCTATGGAGG ACCTCTTTGG CATATTTGAA GATGATCTCA GTACCTTTGG ATATTTTCGC ACTGCTACTC TTAGATTCAA CAAGAACCCT CAATGGTCCC TTTTGTCCCT TGCACACCGT CAGATTCTTC GACACTTCCT GCATTGGCAG GCATCTCTTT GGCATCAAAA GGGAAGCAGC TTGAAGTATC TGGAGCTTGT TACATTGACC GACCAGGATT TCGCTCAGTA CCGACAATCA ACATTGGAAG AGATACTTCC CGATGACACC AACAAAGTTC ATGCTTGCTC TAGCATGCCA CCTGTTCCTA GCAAGACGGT CCTCATGCAC ATTCTGGACA ATGTGTTTGT GCTCTCCCAA GATCACCCAA TCCGTTTATG TTTTTTGCAG CAGAGCATTG AATCTATGGA GGATTTTTTC AGTTTTCTTG AAGATGGCAT TGATGCCCTC ACCTTTTCAC CCACACCTGC TGACAAAGGC AACTCACTTC CACAACGGAT GCCCATGAAG CTTGGACACT GTTGGCTTCT GCAGGCTTTC TTTGACTGGC AAGTATTGCT TGAATGGGAA AAGGGGAGTT TTTTGGAGAA TTCGGAACTT GCTGCATTGA CCAAAGCGGA TTTTACTCGT TACCGACGAT CTGCAATCAA GAAAGCCTCA ACTGCATCCC TTATGCCATC AGCTTCTATC CTTGGTTTCA CCAGGAAGGG CCTTACACCA CAGGATGGGG AGGGTTGTTC TAAGCCCATC AACTTCGCCG AGTCGCATAG AATCGAAAGC AAAAATTCCT GTGTCTTAAA GGGTTTACCT GATTCCACTC CTGACAACCT CATTGGAGAA ACTTCGCCAA CCAATAGTCA GGATGATGGG GAGCAATTTC GTGAACAACA AGCTTTTACC AAAAGTCAGA ATAATGGGGA GCAAGTTTGT GAATGGCAAC CTTTTTCAAC TGATAGTCAT GATGATGGGG AGCAATCCTA TGGATGCACT GTACGAAAAT TTCTGGGTTG TGATAAACCA TGCAACATGC AATTTTCGGT GGATATCAGT GATTGCAAGT GTGATGAAAG CTTTGCCTGT ACCGAAAGTG ATGCCAAGTA TGATGAAAAC CTTGTTTCTA TCGAAAATCT AGACAAACAC GAAACCAATC TAGATGAAAA GTGGCACAAG AGGGACCGAG AATGGTGTTT CAAGGCATCT CCACCCATGT TAGGAATCTA CAAAGGTCCT AGATACAATG GGTTTGTTCA ACAGGAAACT AGGGATTCTA CCTGTGAACC TCTAGATGTC TGTCACGGTA TAGTACCATG CGACATGTCT TCTAATGATG TCCGTCAGTT GGATGTCAAC ACGGTGAACG GCAGTTGGAG TGTTCTTGAG AATGAAGTGA GTCGTATGGG AGTTAGTTCG GTCTTATTGT ATGGAGTCCC TAGATACAAT GATGTATCGA GCGTTGGAGA GAACGCGACA GAACGTGTCG TGTTCTGACG GTGGCAACGG AGTGGGACAG ATAATACTTA TTGCCTGCTT AACGGATGTA AATGGATGCG CTCCATTTTC CACGGCATGC ATGTTATGGA AGGTATTCTC GGTGTCACTA TAAGTATATG TGAGCGCGGG TCATGGTATA GGAATACTAT GACCGAAGAA AGATATTTGT GTACGGTTGA GTTTGTTTCT CAGTTTTGTT GTTTTGTTGG CAAATCGGAG AAGCAACGAG AACTAGTTTA TAGCACTAAT TCTTCAATCA TTGTTTTATA AGTTCTTTGA TCCGTACTGC TAGTTTACGC ACAGGTCCTC GTATCCGCTG CCTAACCCGA GGGGAGCGTC GCCAACGAGT AGAGTTTTAC AGTCCCTAAG CCCCTTAGGT GTATCGCCGG TCGTTTGTCA TAACGCCCTC TCTGTTTACC AAGCTCTGTC CCTAACAGCG CTTCTGTCCG AACGGTGCAC ACTGGTGTAT ACCGTAACAG ATTAGCAGTC TTGGTTTCTC GTCAAATCAC TGCTGCTCGT CGCACAGAAT TTGGTAAACC CACCTTGAGT CCGCCTCAAG GAAGTGTTGA GCTTGGTACC AACCGTACCT TGTTTGACAA CACTGCTTAC ATTGCTTGTT GTTGCATAAA GTGTCAGAGG TGTCTACGGG GACATTTATC GGCTTGTGCT TCGCACCTTT GTCAAGCTGC CACCACCACC CTCCTGACCA TTGGTATTGA AAACCCATTA GTACTTCAGG GTTTGTCAGT AAGTCGTCTT GGCCTCCCTA CTTCCTCGGA ATAACCGGCA CTGGAAATCG TCTAGGTTTT CTTCTGGATT TGCCAGTGGG TTGTTTCGTG A
|
Protein sequence | MRLRKHSNRT NAPFIANHVH ETCRKEAYRN IRHNAQPVFT MHCQTALLSY LQVFVSKFSL PPYFLGIAGA ENPLESSGFV GKSFRDYGLL WLGEVEIPIG PKDSNPDSGW DGLTTRRKIV QGANQTLDSL RTRASPIMVL TSEDVYAHFL DNVFLLSQEH PIRLFLMQQG FESMEDLFGI FEDDLSTFGY FRTATLRFNK NPQWSLLSLA HRQILRHFLH WQASLWHQKG SSLKYLELVT LTDQDFAQYR QSTLEEILPD DTNKVHACSS MPPVPSKTVL MHILDNVFVL SQDHPIRLCF LQQSIESMED FFSFLEDGID ALTFSPTPAD KGNSLPQRMP MKLGHCWLLQ AFFDWQVLLE WEKGSFLENS ELAALTKADF TRYRRSAIKK ASTASLMPSA SILGFTRKGL TPQDGEGCSK PINFAESHRI ESKNSCVLKG LPDSTPDNLI GETSPTNSQD DGEQFREQQA FTKSQNNGEQ VCEWQPFSTD SHDDGEQSYG CTVRKFLGCD KPCNMQFSVD ISDCKCDESF ACTESDAKYD ENLVSIENLD KHETNLDEKW HKRDREWCFK ASPPILAVLV SRQITAARRT EFGKPTLSPP QGSVELGTNR TLFDNTAYIA CCCIKCQRCL RGHLSACASH LCQAATTTLL TIGIENPLVL QGLSVFFWIC QWVVS
|
| |