Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45998 |
Symbol | |
ID | 7201061 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 917688 |
End bp | 921011 |
Gene Length | 3324 bp |
Protein Length | 250 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180142 |
Protein GI | 219118750 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000053085 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACTAACAGC CAGATTACTT ACCGTTACGG TAACAGGTTT GGCGTTGGTA GGCACACACA GGTATATCTG GCGCTACCCT GTTCTGGATA CTCTCTCTAT ACATTTACAA ATAGAAACAG TATCGACTAG TCACTCTCCA AGTGTAGCAT GTTTCGCAAT CAATACGATA CGGATGTGAC GGTGTGGAGT CCGGAAGGAC GACTCTTGCA GGTGCGTGCG TTTGCTTCGT TGGAATTAGA TTTGAAATTA TTGGAGCTGA AAGAAACATT GGATGCCTTG GAGTTACGGC ATTAGTTCAA CGACTCTGGA GTCTTTGCAT TCTTGCCGGT CCCTTGCTTG CTTACTTGCG TATACAATGC ACACACTTTC ACGCACGCAC TATTACTCAC TTCACACACT GACTAACAAC GTTTCACGCT CGACAGGTCG AGTACGCCAT GGAATCGGTC AAGCAAGGAT CCGCCTGCGT AGGACTACGC TCGGACTCCA TTTGCGTTTT GGGTGCGCTC AAACGTTCCG TATCCGAGCT CAGTAGTCAC CAGAAGAAGC TCTTGCACAT TGACGACCAC ATTGCCGTTG GCATTGCCGG ACTCACCGCG GACGCGCGTT CACTCGCCAA GTCTCTCCAG AGCGAGTGTC TCAACCACAA GTACGTCTAC GGGACGCCCA TCCCGCCGCA TCAGCTCATG GCCGACCTCG CCGACAAGCA CCAGCGGACC ACGCAAACCT ACGTCCGTAG ACCCTTTGGA GTTGGACTCC TCGTCGCCAG TGTCGATACC ACCCGCCAAA CACCGCACTT GTACCAAACC TGTCCCAGCG GGAACTTGTA CGAATTCGTC GCATCCGCCA TTGGCGCACG ATCCCAATCC GCACGCACCT ATCTCGAAAA ACACGTGGAC GACTTGGCCA ACGCCACCCG CGATCAACTC ATCGTACACG CACTGCAAGC CCTCACCGGA TGTGTCTCGG GAGACGATGA ACTCACACCC GACAACGGGT CCATCGTTGT TGTTGGAAAT GACATCCCCT ACACCATGAT TGAAGGAACC GACTTGCAAC CCTTTTTGGA CCAGTTGGAA ACACGAGAAA CCCCGGACGA CGATGATGAC GGCGACGCAC CCGACGAAGC GGAGAGTCCA CTAGTGCCGG AGCCGCAAGC CATGGAAACC TAACTGGTAT CGAGTCATAC CCAGCTTTCT ACAGGGTCTC CGAACGATTC CGGTCGGTAA TTGGGGCGCG TTGCCGTTCC AGATAGGATT TCCAATTCAC ACTTGCCTAC TAATGTAGGG TCATACCTAG CCTTGCAGTC TACACTCGTT TACTACACTT TCTTTACATA CATATACCTA TACACAGTCA CATGCCCATA CAACGGTGCC GTCACCACGA ATCTCCAGCC AACGTCCACA CAAACGTATA CACACGCACA GTGTGCATCC AAATACAAAT CCAACACACA CATATATATA GAAAGCAAGG CATTGACCTG GTGGTCGCCG CTACTTGCCA TTGGACAAGG ACAGCAACTG CAAAGCCGCA ATCAACGCAT CACCATCCCC ACCGTCGACG TTCTCGTCCG CTTTGGCAGC GGGGAGCGAT GTTTCCTTGA CGGGATCCTT CGCCAACGAT ACAGCCAGAA CNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NGTGGCTGCT CATGACACAA TAACTTTTCG TTGGATGTTA CCGATACTAT ACTGACTCCT ACGCAGTATT CGGTGATCAT ACCCATTTCC ATCAGACTCC TCCCTGACTG CATCCAACTT TGCCTTCATA AAAAGAAGCG CACCTCGCAT CGAAATCGAG CATGACGCCC TTCTATTTCC TGCCTTCGTG GCTACTTGCT GCTACGCTCC TACTTCAACG ATCAACCCCA GCATTCTCCA AGTGTGGCAC AAAAGACCCA TCGCCCTTCG AACAGCGCTT AGATCAGATT CGAATCAAGC ACTTAAAAAA CTCACCTCAA GGTCGCCGAT TGATTGCGGA CTCTTGCGAA GATCTCTGCG TCCAGTGTGT AGAGGTAGAC GTCTACTTTC ACCTGAGCGC CCTTCCCGTT CCGACAGACG ACGGCAGCGA ACAGTTCATT TTCCCTCACC CCCTCGAGTC GGTGGCTCGT ATCTATGCAT TCGACACGAC GGTGACCGTA GATGACTTTG CGTCGCTGCA AGATATCTAC AACCTGATTG ACACGAACAT GCAGGTACTC AACGAGAGGT ATGCGGAGTC TCCGTTCGTA TTTACTTGGA AAAATGCAGA CCCGGCCAGT GCCAGCGTTT CCGCCAATCC AGACATGGTG GACCTTGTCG TGGACACCTT ATTTGATGAG AATGGCGTGG TTTCTGAACT ACACACAGGG GATGCTAGCG TGCTGAACGT TTACCTGACG CATTCGCAAT GCCTACCAAC AGAGGAAGCC GACCCAGACA CAGGCGAAGA GGCTCTCGAC TGTAGCATTG TTGGTGTAGC CGTCTTTCCA AGCTATCAGC AAGCCAATCG TCGAGCCGAT GGTGTCTATG TCAACTACAG TACTCTATCT GGGGGAGGGT ACGTACACAG AAATGGCTCA AATCTCTCTG TGCCAGGCAA ATGGTAGCAC GCAACGACCG ACTTCACTCA CGCAATATGT CTTCCTTCCT TTCCAGCCTT CCGAGCAACG ATGTTGGTCT GACTTTGGTT CATGAAGTAG GCCACTGGCT GGGACTGTAT CATACCTTCC AGAACACTGG TGAAGATGAG GGTGCCGACC CATGCTCACC TGAGAACGGG AACGACTACG TTGCTGATAC GCCTGTACAG TCCGCCTCGT CCCAGGACTT GTACGAATGC TCGTTAACGT TTTACGCGGG CGAAGAGCTC CCAGACTCTT GTCCCGACTT GGCTGGAAGC GACCCTGTGT TCAACTACAT GAACTACGTC TCGGACGAAG AATGCTGGCC TCCTGGTGTA GGTGAATTTA CGTGCGGACA GTATGAGCGC ATGTACATGC AGTGGCTACT GTACCGCCAG TCCAACGAAC CCTGCCAAGA CAATGAAAGC GAAATAAACA TTTTGATGGA GATCAGTGAA CTCTTTTCGG AAGAAAATGC TTTTTACCTT ACATACGTCG ACACCGGTGA AGTGGTGCTT AATTCGACGC GTGATTTTGA AGTCCTTGGG CCTCCTTATC AAACTGAAGT GAAGTCTGAC TTTTGTGCTC CCGTTGGACA ATACTCGTTG ATGCTCGTGG ACAGTGCTCA AAACGGATTC CTGGACGGTG GATTTGTCGA AGTGACTGTC AGTG
|
Protein sequence | MFRNQYDTDV TVWSPEGRLL QVEYAMESVK QGSACVGLRS DSICVLGALK RSVSELSSHQ KKLLHIDDHI AVGIAGLTAD ARSLAKSLQS ECLNHKYVYG TPIPPHQLMA DLADKHQRTT QTYVRRPFGV GLLVASVDTT RQTPHLYQTC PSGNLYEFVA SAIGARSQSA RTYLEKHVDD LANATRDQLI VHALQALTGC VSGDDELTPD NGSIVVVGND IPYTMIEGTD LQPFLDQGSR PRHRRRGSRL
|
| |