Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46667 |
Symbol | |
ID | 7204587 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 128641 |
End bp | 129964 |
Gene Length | 1324 bp |
Protein Length | 407 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185828 |
Protein GI | 219121199 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000285814 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTCGT ATTTGGCATT GGTTCCTCGC GGACTGCAGC ATGTGGTCCA ATCCATGTTG CACGAGCAAC TCACGAGGGA CGGCCGGTCA ACGATGACGG TGCGTGTGGA CGTTGTCGGT CAAGTGGAGT ACGACTACGG TGCCGATGGA GACCGTGACG AGAGCGAAGC CAAGTATGCC CAACAGATGC GGGACAAACT AGTCGCTCAT CAGTCCTCGC AACAACGCAA AGGCAAAAAG CGAGCTCGAC AGAACGAAAG CACGGGGACT GCCACCTGCC GTGCACCGAC CGGAACGATG CACATCGATC AGACGTCCAC ATTGAGTGTG GGTTACGATG ATACACGTAT CGTTTGGAAT ACTCCGGGAG CTTTGCAAGG AGTTGTGTGG CTGCGTATTG TCACAAACGC GACGACAACT TTGGTAGACA GTCTTCGTTG CGTTGGGCCA CTGTTGTTAT TGGTGGATCT TTGGGAGGAG CAGAACGTGA ATATCAGCGA ATCGCAATCG ATCGACCAGG CTTTACAAGT CTTTCAACAA CACTGTAAGC AAGACATGTT TGACAGTGTT TTGCAGGTTT GGGAACGGTA CGTGATGGAC TGCTGGAACT TGACAGTGTC GCAAAAAGAA TCGATACGGA ATCGGCTTTC CGGGAAGGAG CCGGTACGGT ATCGACTGAG CTGCCTCCGC AACGACAGTA AGAGTTTTTC CTATTCGCGA CAAGAACTCC TGACACGGGC CGCCAGCTTT GTCATTCCCG ACAAGTTTGG CAAGACTTGG ATTGTGGATT TGGAGAATTA CGATATCGAA ATTGTGCTGC TCCTGCGTTC GAACCGTGTC GCAATTGGCC TGGCCACACG ACCCTATCAA TATCTGGGAG CCAAGTCGTT CGACAAAGGA GCTCTTCCCC CGGATGTAAG TAGGCCCTAT CTTTCCGGTC AAGTCTTGTC GAAAGTTTTG CGACTACGCC CTAGTACGGC ACAAATACTG CTGCATCAAG CAAAGCTACA ACCTGGTGAT GTCGTTCTTG ATCCGTGTGT TGGCATTGGA ACCATACCCC TGGAAGCGAC GTTGCAAGGC ATCCCAGTTT ACGCGGTCGG TGGAGATTTG GTTCTGGGTC ACAATCAGCT GGGGCCCATT GCTGCCCGTT ACGTCCGCGA ATGCCGCACT GTACAACGCA CAGAGAGTCA GTCATCCGGT GCCGCGGACT TGCTTGTCTG GGACGCATGT CTCGTACCCA TGCGTGACGG ATGTGTGGAT GTGATCGTGT CAGACTTACC CTTTGGTCAG ACCTGCCTCA GTAGCGGAAA ACTATCCCAA ATGA
|
Protein sequence | MNSYLALVPR GLQHVVQSML HEQLTRDGRS TMTVRVDVVG QVEYDYGADG DRDESEAKYA QQMRDKLVAH QSSQQRKGKK RARQNESTGT ATCRAPTGTM HIDQTSTLSV GYDDTRIVWN TPGALQGVVW LRIVTNATTT LVDSLRCVGP LLLLVDLWEE QNVNISESQS IDQALQVFQQ HCKQDMFDSV LQVWERVAKR IDTESAFREG AGTVSTELPP QRHFVIPDKF GKTWIVDLEN YDIEIVLLLR SNRVAIGLAT RPYQYLGAKS FDKGALPPDV SRPYLSGQVL SKVLRLRPST AQILLHQAKL QPGDVVLDPC VGIGTIPLEA TLQGIPVYAV GGDLVLGHNQ LGPIAARYVR ECRTVQRTES QSSGAADLLV WDACLVPMRD GCVDVIVPAS VAENYPK
|
| |