Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_36995 |
Symbol | |
ID | 7204754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 836918 |
End bp | 838195 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185964 |
Protein GI | 219121482 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0016758 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTCAA ATGAAAAGTC AGAACGAGCG AAAAAGCGCT GGAGACTCCT GCGACGCGCT CTGACGAAGC AAACGGTGGA AAAGACTACC GGCTTCCCAG GCTACTGTTT GATATCAGGA ACGATAACGG CGTCGGACGA TGATAGCTTG CTGGGTGCCT TGTCATACAT CGACTATACA ACAAATGTGG TGCAGCAGAT CGAGAGCTCG TTGCTGGCTA GCTTGGCCCT CAACACATCG TTGCGGCAAA TTATCTTCTC CTTTGACCAC GAATCGCTAC CCCTGGGGCG TCCTCCTAAT CTCATTGAAC TTTCTCAGTC CCTTCTTTCG AGCTGTGATC TCACGTCCTG CGAAGTCGAA TCATTACCCA AAGAGGCCGA TCATACAGAA TCGTCAGCGC TCCGTATTTC CTTTGAGCCA TCCACCGCAT CCTTGTGCTT TGCAATAAAA TCGTATACCT CGCCGACTGT TGCTGGCGTG ACCCTAGCAC TTACCCGCGA ACGGGTCCGC CTTCAGATCA GCCTCGACGA ACTAACGACG CATCATACGA CATCCATTGA CAATACGGGA AATATATGCG TATGGGATTG CGAAACTACG CTGGCGTGGG CCGTCCTTGA ACAGCTCGAA AATATACTCG GGAATCTTGG TCGTTCGGTG GTTGTTACGG AGCTGGGTTC TGGCATGGCC GGTCTGGCCG CCTTGAGCTT GTGCCGCTCG CTTCCCCCAT GCTCCACCAT CTATGTGACT GATGGACATA TAAATAGTAT TCAGAACAAC ATGGTGAATC TGCGATTGAT GACAGTGGCC GAACTTAGAC CCAGTACTGT CGACGTGTTT TGCCAAAAGT TAAAATGGTC CCTGGAGGAG GACCAAGACA AGCTAGATCT TCCGGAAGAG GCGGATCTGG TGTTGCTCTC TGATTGCGCG CATTTTGAAC ACTATCATGG GGAGCTTCTT TGGACGCTTA TACGTGCAAC CAAAGTAGAC GGTCAAGTAT GGATGTGTCA TCCCGAGCGA GGCAATTCAC TTTGGAGGTT TCTCCAACTC ATTGACTCAG TTAACGACAG CGCTTCCTAC GGACCCCTTT TGCGCATAGT ATACTGGAAT CATGATCTTT TGGACGAAAG GCATCAAACT TTTCTCGAAC GTCACTTAGA TACGTACGAT CCCAACGTAC ATCGTCCAAG GATATATCGA TTGCAAAAGC TTCGTAAAAA CACGGAGATC GATCGAGCAG CAATTAAGCT ACACATTCAA AAGCGTGATT GCAATTAG
|
Protein sequence | MNSNEKSERA KKRWRLLRRA LTKQTVEKTT GFPGYCLISG TITASDDDSL LGALSYIDYT TNVVQQIESS LLASLALNTS LRQIIFSFDH ESLPLGRPPN LIELSQSLLS SCDLTSCEVE SLPKEADHTE SSALRISFEP STASLCFAIK SYTSPTVAGV TLALTRERVR LQISLDELTT HHTTSIDNTG NICVWDCETT LAWAVLEQLE NILGNLGRSV VVTELGSGMA GLAALSLCRS LPPCSTIYVT DGHINSIQNN MVNLRLMTVA ELRPSTVDVF CQKLKWSLEE DQDKLDLPEE ADLVLLSDCA HFEHYHGELL WTLIRATKVD GQVWMCHPER GNSLWRFLQL IDSVNDSASY GPLLRIVYWN HDLLDERHQT FLERHLDTYD PNVHRPRIYR LQKLRKNTEI DRAAIKLHIQ KRDCN
|
| |