Gene Information |
Locus tag | PHATRDRAFT_43101 |
Symbol | |
ID | 7196878 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2033437 |
End bp | 2035838 |
Gene Length | 2402 bp |
Protein Length | 330 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177429 |
Protein GI | 219111355 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
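The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence consists of one codon per residue plus a single stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2033437
end_bp = 2035838
protein_length_aa = 330

# Assumption: both coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding sequence = one codon per residue plus one stop codon.
cds_length_bp = protein_length_aa * 3 + 3

# Bases inside the gene span that are not coding
# (introns and any untranslated regions).
noncoding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, noncoding_bp)
```

Under these assumptions the span length agrees with the listed Gene Length of 2402 bp, and the gap between the span and the coding length is consistent with the gene being marked as spliced.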
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGCCATATG CGATATCGGT GTTCCGGGCG AAGCCAAAGA ACAGAAGTTT TGTTGGATGA GATCGCAAAA CTCTAAATGC GCGAAAGAAA ACAATGGGAA ACCATGCCTA TGGAAGTCGT CAGCGGGAGA CGGTATCAAC GATAGTCCGT CGTTGGACGC AGTCCATCGA CGCTTCGCAA GCAGACCACA AAAGAAAGAC TTCTTCTCTC TTTGTTCCAC TGCGCTTGTG GTGGCCCAGG AATTTGCTCT AAGTTGCTTT TTGCTAGCTC GGCATCGTGT CGCACTCTAT TTCGAACAAT CCCGATCAAC CCCAGAAAAT GCGTTTACAA TCAGTCAGAC CAACTTAGAC CAAGCGACCG TTGCCATGTC TGCGGCCCTC ATGTTGGTGC TGTTTTGCAG TAGTCGCTCT AATACAATTC CGCGACAGAA ACGAAGAGAG AAAGCTCAAC AGCGCTTGTC GGACGCAATT CTGCTTGGTA TTGTACTCCG TCTCCTTGCC AGCGTTTTGC GAACCTTGAC AGCTTCTTAC TCCTCCGATA CTGTCGAGGC CTTGGCCACT ACAGGTATGA CTTTGCACGT TGTAGCGTGT GACTATTCGT ACGCCAACGG CCGACGTCCC CATGGAGAGA TAATTCGTCC TTTGATATCG TCGCAACGTC CGGTCTTTCG AGGAGGCACC TTTTCTCTGA ATGCGGCTCT GTTTGCGACA ACACTATTGG TGAGCCGGGT GGAATCCAAT AGCATGGCAT ACTTTTTAAT ATCACTTGCA ATCGTCATGT TCGCCTTTTA CCCCGACGCA AGGCACGCTA TCGCCAATAG CTATCCCCCA TCGAGGAGCG GTACGTCCGT GAGTTGCTAC ACATTTTCAA CTTCTTTGAT ATATTCTTGT GATAGCACTT TTGATGGCAA TCCCTTTCTG TATTTACTAA ATCTGCTGCT TCATTCCGTC TCAACTCCGA AACTCAGGAT TCCCGTGGCT AATTACAGCA GCAATCTCGG GCTCAACTTT GGTGTTACTG AACAACCACG AAAAAATACT TTTCTTAGTA GGCATGACAT CGCTAGTTTT TGTGGTACCA TTGTGGAATT ATATCTCTCA GTGCAACAAG GTGTGGTTTC GTGGACCCTG GGACATACCG ACGCGTTCTT CAATGAAGTT GGACAATAAA TTGTGAGGGT GATGTCACCA GCTCATGCTT GTTCGATTCC TCTTCTAGCC GTCATCAAGA TAAAGAATCA AAGACTCTCC TTTCTCTGCG ACCTTTCCTC AAGATGGGAA AGAAGGAAGG GAGCCTTCTC TCCGTTGATA TTTTCAACAA CAGGATTTGA AGTAATTTTC ACTTAGTGGA GATTTGCCCC CTTTGCAAGG AGGATCTGTT TCATCTTCCA GTCAAGATGG CACGAAATTG GGTAGCATTT CTCGTCTTTT TGCTGTCAAA AATATTACAT GTGATTCACC TTATCAATCC GGTGACAGGC CAAATGGTAC AAATGTCCTC CAAAGCATAC TGGAGTGAAC CATTTTTGTC CCTTGATTAA GGCCGCCTGT TCTCGCATGA TGTGTTACAT TGTGCTGGGA AAGGAAGTGG TATTTTTAAG AACAATCGTG TTACGGCGAA TGACAAATCT CAAACAGGGT TCCAAATGAG CTGGTGTCGC GTTGGCATGT GAAGACAATA TGGGTGTCAA CAATCATCAG TACAAGGAAC AATCGCACAT GAGCTATCTA ATGAAAGCAG GTGATGTTTG CGTTGGCTAC AATTTGAAGG AAACACAATT ATCAGCGACG AAGCCGATTC 
GCTTCGATCT GAATTGAAAC TCCTTTGGCG CAACTTTTAC GGAGCCGTCG CTAATGAAGA ATCTGACGCT GTCAAAAGCG GCAAATGTGG CGCCTTCAGC GCTTGGATGT GGCCGTTGCC GAGACAGCAG CAGCAACTGG TAAGGTTGTC AAACACGCTG TCGAAGCCGA TGAAATGGAC AAAGAAGATT TCTGCAGGAA GTTGAGGCGG ATAATGGCAT GCGCTTGAAA ATGAATATTT ACTAGAGCGA AAAAAGATGC ACCAGCCACT ATGGGAGTGG AAGGAGACAC TACTGAAGAT GAAGACGGCA AGGACGATGA AGATGACCCA CAGATCACAC TCGACGAGTT GCTAGACGGG CTGGTTTGCA CCAAGGCCCG GAAATTGACA TGGTTGATCC GCAAGTCCTC GATCCACTTG TAAAGGGCGA AAAAGCAGCA AAAGACGGCA TTCGAGCTAG GAACATTTGT TACAAGGATG CGGCTGTTCC AGTCTCGAGT GGATGGGGGA ATCAATTCCC AGGCGGGTTT GTGCAATAAA GTCGGCAAAA ACCTTTCAAT ATTGATGTCG TGTCCCCAAC TTTCACAGTG AGGAGTATGC AG
|
Protein sequence | MRSQNSKCAK ENNGKPCLWK SSAGDGINDS PSLDAVHRRF ASRPQKKDFF SLCSTALVVA QEFALSCFLL ARHRVALYFE QSRSTPENAF TISQTNLDQA TVAMSAALML VLFCSSRSNT IPRQKRREKA QQRLSDAILL GIVLRLLASV LRTLTASYSS DTVEALATTG MTLHVVACDY SYANGRRPHG EIIRPLISSQ RPVFRGGTFS LNAALFATTL LVSRVESNSM AYFLISLAIV MFAFYPDARH AIANSYPPSR SGFPWLITAA ISGSTLVLLN NHEKILFLVG MTSLVFVVPL WNYISQCNKV WFRGPWDIPT RSSMKLDNKL
|
| |