Gene Information |
Locus tag | PHATRDRAFT_45427 |
Symbol | |
ID | 7200674 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 122771 |
End bp | 124527 |
Gene Length | 1757 bp |
Protein Length | 436 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179798 |
Protein GI | 219118030 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Number of covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Number of covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGGAATCAA TCAATCAATC AATCAGCGTA TTGTCAAGCC GTCCATCTAC GCAGGCATTG TATCTCTCGA CAGGGAATCT CTTGCTGTCG CTAGTAACTT GCATTAGATC ATTTAGCGAG GAACAATTCA GCAATCCAAA TATCCGGAAC GAAAATGGTC GTTCGAAGAA GGAAAATTGC CGCCGTGCTG TTGCTTACTT TAACACTCTT TTCCAAGGGT GCATCCTTTC AGACCGTGAC GTCGTGGGGA GTACACAGTC GCTGTCGTCT TGGTCGACCA GCCGCTAATC CTTATCCAAC GCCAACCACT CAAGGGATAG TGGACCTTCA ACATGGCAAT CCACGTGGGT CCGCGTCCTC CACCTCTCTA AATATGTTCA TGGGATCCGA CGGTGGACTC TTAGGGATAG GAGGACCGGA GCTGGTAAGT GTATAAAGTT GTGTTGACAG GACACCGTCT GGAGATGACT ACGAATGCGG CCACCAGCGC AACAGCATGA CTCGCGTTTC GTTCGCTACG ATCCTAGATA CCGCATTCTC ACCTAGTTCT CCCTGCCCTT TGTGCTCAAA ACGTATGCTT TGGTTAGTTT ACTATTTTAC TGGTCGGATA CTTTGTGCTA GGCCCGAGTG ATCTGTACAA GCTCGTCAAG GAAATTGGCA AATTCATTCA AAACATCCGT TCCCTGGGTA CGGATTTGTC GACCACCTTC GAATCAAACA TGGAAAATCA ATTGCAGCTA CAGGAATTGC GCAAAGCTCA GCGTGAACTC ACGGACGCCT TTTCGTTCCG TCGATCCATC AACGTGGACG ATTCGGAAGC CTTCGCTACC ACTGCAACCA CACCGCGCGC CGAGGAAGCT TTGGTAGGTG GGGTGGCCGC CCTCGGCAAT GATGGAGAGG ACAATGCCAC CCGCAAACGC AAAAAGATCA AACGTCGCAA ACGACAACCG ACGGAAGAGG AATTGGCAGC CCAAGCAGAG GCAACAGCCG AGCCCGCCCC ACCTTTGTCT ACGTCAACAT TCAGTACGTC CTCCGTGGCC ACGGAGGCAC CGATGAGCAG TGTTGGGAAC GTTCCCGATT TAATTATGCC AGGAAACACA GAAAGAGAGA GCACGGGTGA CCCGTTCGTG AACGACACAC CGTATGGGGT GAGCCAGTCT GGATCAGAGT CGAAAATGAC TCCGGAAGAA GAAGCCGAAG TGGAACGGGA GTTTGCAAAG TACAGTAGTG ACCCGATGCC CAGCAGCAGC AACGATGTCA GTGGCTGGTA TGGAGCTCCG GATCAATCGA GGTACGATGC AGACGCGCAA AGTCGCTTTC AGCAGCAAGT GGCTGGAGAT TGGAACAAAA GTGTAATGGC GAACGAGGAC AAGCTGTCGC CGCTCGCCAA GGTCATGGAA ATGTTGGCTG TGCTGGAAAA AGAAAAAGTT GCCAAAACTC GACTTTTGGA AGAAGAATTT CGAAAGAGAG CCGAAATGGA AGACGCATTT TACCAAAAGC AGCGGACGTT ATTGGAAGAA GCGGCTGCTC AAGTGCAAAC GGATGCATAC AGTTCGTTCG GTAAGGGTGG CAAGAGCAGT AAACGAGACG TGGATAACGA CAAGAAGAAC AGTACTTTTG TCATTTGAAC GACGAAAGCA TCCCGTCCGT CCCGCTCTTC AATAGTATAG CCATTGCGTT CGATCTGTCG CTACCATCTT ACCTAAAAAT TTACGCCTAG CTTTGTTTTG AATATCTAAA CATTGTACTC TACAACA
|
Protein sequence | MVVRRRKIAA VLLLTLTLFS KGASFQTVTS WGVHSRCRLG RPAANPYPTP TTQGIVDLQH GNPRGSASST SLNMFMGSDG GLLGIGGPEL FTILLVGYFV LGPSDLYKLV KEIGKFIQNI RSLGTDLSTT FESNMENQLQ LQELRKAQRE LTDAFSFRRS INVDDSEAFA TTATTPRAEE ALVGGVAALG NDGEDNATRK RKKIKRRKRQ PTEEELAAQA EATAEPAPPL STSTFSTSSV ATEAPMSSVG NVPDLIMPGN TERESTGDPF VNDTPYGVSQ SGSESKMTPE EEAEVEREFA KYSSDPMPSS SNDVSGWYGA PDQSRYDADA QSRFQQQVAG DWNKSVMANE DKLSPLAKVM EMLAVLEKEK VAKTRLLEEE FRKRAEMEDA FYQKQRTLLE EAAAQVQTDA YSSFGKGGKS SKRDVDNDKK NSTFVI
|
| |
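The record's coordinates and sequences can be sanity-checked with a few lines of Python. The sketch below is illustrative only: the helper names (`clean_sequence`, `inclusive_length`, `gc_fraction`) are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive coordinates, which is consistent with the reported gene length of 1757 bp. It also assumes the spaces in the sequence fields are only display grouping (blocks of ten residues) and carry no meaning.

```python
def clean_sequence(blocked: str) -> str:
    """Strip the whitespace used to group residues into blocks of ten."""
    return "".join(blocked.split()).upper()


def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end_bp - start_bp + 1


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the Gene Information table.
print(inclusive_length(122771, 124527))  # 1757

# First two blocks of the protein sequence, as displayed in the record.
print(len(clean_sequence("MVVRRRKIAA VLLLTLTLFS")))  # 20
```

Passing the full Gene sequence and Protein sequence fields through `clean_sequence` and taking `len` should give values comparable to the Gene Length and Protein Length rows. Because the gene is spliced, the gene length is not expected to equal three times the protein length.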