Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33584 |
Symbol | |
ID | 7204119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1305462 |
End bp | 1307855 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186502 |
Protein GI | 219113837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGTA GTATCGCGCC CTCGCAGACA AGGAAAAGGC AGTTTCCGTA TTTGCGTCCC GACAAGGTCG ACACATCAGC CGAGATCTTG TCATCGTTTT CCGATATCCG CGTCAAAACG TTCGAAACCA TCTACGCCGA CGCCGATGAC GAGCACGCGG CATGGCGTCC CGACAAGTCT CCGAAAGGAT TTGTGGACGA ACCTATTCGC GACGTAGTAG ATTTGATCAA TACGCACCCG TCATTTGCGA CAGTGTCTTC TTGTTCCGGA CGAATTGCTC TTTTTGATTC GTCGCTGCAA CAAACAAACG ATGATGGTCT TGTAGAGAGT GGAAAAGGAA TTGGAGGGTG GCTAATGGTT TGTCACGAAG AAGCGGAACC CGTCTGTTTG CTCGATATTT TCCACGCAAG CGCTGATACA GGTCATCTCC ATAATGAAGC GGAAATGCCA CTTAGTTTTA AATTTGAGCC AATGCTGCTG CACGTTGCAG CTGCCAGTCT GTCTCGTGGG CAGCAACTCT TGCAATTAGC TTTGCAATTG GGATTCCGGG AATCTGGTTT GGTAGTGACA GATCTTCGTG TTACAGTGGC TATTCGAACT TATAGCTTGG CCTTGACGGT GCCTCTCGCT CGGCATGGGG CGTTTCGTCC TCCCGATGAG TATCTTCGAG CCCTTGCTGT AGAAGCCAAC AGACGGATGC GAGTCAACAC AGAAAAGATT CAAAGGCTGT TACATTCCTT GACGGAGCAC TTTTTCCGTC CCGTTCCCTT GTCTTGTCGA ATACGAGTCC AAGCGCTTCC CCAATTGGGA TTGCGGTCAC ATTCCGCAGT TGCCGTGACG AGACCTCGTA CAACAGACAG CATCGACATT ATTGTATTCG GGGGATATGG GAGAGGTCCA AGGCTAGCAA AGAACACTGG AAGAAGTTTA CAGGGATCAC AGAGATCCAG CCATATTTAT TGTTTGACGA GAGCGAACGG GGTCTGGGAG GATGGCTGGC ATGAGATTCC ACAAGGCCAT CCAAGTGATT TAGGAGATAC GTGCATTTCT CCGTTTCGAT TCACTTGCCG CCCAGTAGCT TTGACGACCC GTGAGGGTAG TGCAACGTGC GTTCTACCTG GCGGAATATC TATTGTGGCT ATATTTGGAG GTCGAACAAA CCCGGCTAAT CCACTTGGAG ACTTACTTCT CTACGACCAC GAACACCACC CCGGAATTCT ATGGGAACCC AACGACATCC GCGGATGTTT ACCAGAACCT TCTTGGGGGC ACACACTAAC TGCCATGCCT TTCGGGAGTA GCTCTAATCG TCTGGCGGTT CTTTGTGGTG GAAGAAACGA AAGGGAATGT TTGGGTTCGA TTTATATCTT GTCGGCGGTG AGAGATAACG AGCAAGCCGC ACACTTGATC TGGGAAGAGG TCGTCACGTC ACCTCCACTG CAAGGCGTCT TCCTCCATTC CGCTGTTGCT ACGAGCCACG ACTCGCTTCT ACTGTTTGGT GGATTGAATA AGCCTTTGGA CATTTTGGAG GCTTTCGACT ATATGACATG TGCGTGTGCG CATAGCGTTG ACCTTTGTAG TGGAAAACTA ACTCCAATCG ACAGTAAAAG CTGTCCCTGT CTTTTTGGGC ATACGGTGGT ACCTTTGGTC TCGAGCGAAA ACCAGGGATT TCAAAGTCAC TTCCTCTTGA CGGGTGGTCT ACAGAAAACG TCGCAAGGAG GGAATTTTGC CACATCAGCT CCATTTCGAT GTGTTTCCGT GTCAAAAAAT GGTCCGGATC TTTCCTTTGA GCAGCATGGT ACTATAATTG AAGAAAGCGA CGAAAAATTT GATTTTGGGT CCTTGCTGGA TCACAGCTGC ATACCTATTG ACAATTGCTC GCGTGAGCCG CACAAATTTA TTTCAGTAGG TGGTGGGGTT GCAGGATTTG CCTTTGAGCA GTGCTTTGCC GAGTCGTTTA CCTTTGAGGT TCAGCTTGTG CCAAGTGCCA ACAGTGTGGG AGACGACATA GTCGCTGGAA AAGCAGACGC GACCTTACGA AAATCGAACT CGACCGCGGA CAATTCGCGT GTGGCCACGC TGCATGCTTC GGAGTCGGCA CTGATCGATG TGCTGTACGT GGACAGACGA AACGCGAAGA AAGCGAAAAC AATGTTGGAA GAGGCGTCAT GGCTTGACAA GAGACATCGC ATGTTTCCAG CTGACAGTCA TGCACCTATC CTAGATGTCG AGAAATGTAT TGCACTCCCT GTTTTGGAGT CATGTTTATT TGCATTGGAC GCGATGGAAA CTGGATCCAT CAATCTCGGG AAAATTATCA TAGGTAGAGG AAAGCAGTCG ATGCCACTCA GTACGGCTGC ATATGCGAAC CAAGCGAAAA AGATAAATGC GTAA
|
Protein sequence | MSSSIAPSQT RKRQFPYLRP DKVDTSAEIL SSFSDIRVKT FETIYADADD EHAAWRPDKS PKGFVDEPIR DVVDLINTHP SFATVSSCSG RIALFDSSLQ QTNDDGLVES GKGIGGWLMV CHEEAEPVCL LDIFHASADT GHLHNEAEMP LSFKFEPMLL HVAAASLSRG QQLLQLALQL GFRESGLVVT DLRVTVAIRT YSLALTVPLA RHGAFRPPDE YLRALAVEAN RRMRVNTEKI QRLLHSLTEH FFRPVPLSCR IRVQALPQLG LRSHSAVAVT RPRTTDSIDI IVFGGYGRGP RLAKNTGRSL QGSQRSSHIY CLTRANGVWE DGWHEIPQGH PSDLGDTCIS PFRFTCRPVA LTTREGSATC VLPGGISIVA IFGGRTNPAN PLGDLLLYDH EHHPGILWEP NDIRGCLPEP SWGHTLTAMP FGSSSNRLAV LCGGRNEREC LGSIYILSAV RDNEQAAHLI WEEVVTSPPL QGVFLHSAVA TSHDSLLLFG GLNKPLDILE AFDYMTCACA HSVDLCSGKL TPIDSKSCPC LFGHTVVPLV SSENQGFQSH FLLTGGLQKT SQGGNFATSA PFRCVSVSKN GPDLSFEQHG TIIEESDEKF DFGSLLDHSC IPIDNCSREP HKFISVGGGV AGFAFEQCFA ESFTFEVQLV PSANSVGDDI VAGKADATLR KSNSTADNSR VATLHASESA LIDVLYVDRR NAKKAKTMLE EASWLDKRHR MFPADSHAPI LDVEKCIALP VLESCLFALD AMETGSINLG KIIIGRGKQS MPLSTAAYAN QAKKINA
|
| |