Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42688 |
Symbol | |
ID | 7196026 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 797454 |
End bp | 800329 |
Gene Length | 2876 bp |
Protein Length | 700 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176656 |
Protein GI | 219109805 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATCCAACGG CGATGATGGC GGCGGCAGCC GCCTTTACAA AAATGATCCA ACACCAACGG CCGTAGCATG GAGAGCATCC AACCGCCGTC AACGGCGGCT CTGGTTGTCT CGTCTACTAA TGCGCGAAAG ACTCCGATTA TCCTGAGTGC AGTATACGGA AGGAAGCATT CGTGGACAAG TCTGTGCGCG GTCGCCTTTA TTGCTAGCTA TTTTTGGGTC GCCCGAACGA TCGAGGCCTT TTCATGCATT TCCTTTAACC GTGCTCAATA TCGGTCGGTC TCTGTAACGT TGCCGTCTGC CTTGTTTCGG AATGGAAATA TTCTGCGGGC ATCTCCGTCA GATTACGTAC ACTCGTTTCA ATCGAACGCA ACACGGACTA GTGCAAAAAA TAGGTCGATA CCGCTGCACA ACCCTCAAAA CATGACGGTC ATACGGAAGC AAAGTGCTCG AAAATCAGCG TTGCAACGTT CCTCGCAGAA ACGACCCATC CTTGCGCGAC TTGGACAACT GCGCAATCGT GTCCGGGCCA ACCAGACAAG CCCAGTAGGG ATTGACGGGT CAATGGGATT GCCATCGCAG CGAGAATGTG ATCAAGTACT AGCGGATTGT GTGGCTCTGG ATGAATGGGA ATTGGTTCTA GAGACTCTCG ATTTGATGAA ATCAGTTGGG CTCTCACAGC AACATTCCAC CTATCTTGCC TGCCTCGAGG CTTGTTTCCC CGTCGCCAAC GCGGCGTCAG CCAAAGAAAT ACTCTACGCA ATGGAGCAGG CCGGGGTAGA AGTGACAGCG GACGATATAG TTTGGGCCAT TCTCATATAT TGTCGCGCGT CTAGCATGAG GCCAAGGAAC GATCCTGTAT GGTTGCCGCT CGCACTTCAA CTGATCCAGG AACATCCGAA CGTTTCTGTG GCAGCATACG ATGCCGTCTT GTCGTGCATG GTCGAAACGA AACAGTGGAA AGAGGCGGTC CGACTATTGC GAGGAATGGA ACAACAAGGT TCCACTGGTC CAGCCCTTTC TACATATCGA TTTGTTTTGG AAAGCTGTGT AGCTAGCGAT CAACCCACAC AGGCGGTTCA AGTACTCCAG TCGTGTATTC ATCATGGTCT AGTGCCAACC CTCTATTCGT TCGAACTTGT CATCGGCGCA TTAGCCCAGA AAATGCAGTG GCGTCGAGCA CTACAACTCG TTGAGTTGAT GCGTCAGATT GACGTATCAC CCAACTTGGT CGTTTACAAT GCCGTACTTT CGGCATGTTC CAAAGCCAAG GAATTCTTAC CCGCGCGGCG ACTCCTACAT CAAATGCGTA GGGAAGGCGT TCAGCCAAGC ATTCGGTCTT TCAATGCCGT GATTGCGGCT TGCGCCAGCG CGGGCCAATG GCAGGATGCT ATACAGGTTT TGGATCAATG TCATCGAGAA CCCGGAATTC AACCGGACAT TTATACCTAC ACCAACGTCA TGCGCGCCTG TGCAAAAGGT ATGTTTGCAC AAAGATTTGA AAATCTCCTT GTTGCCTCGG ATTGGATCTA ATCGCTTTCA TGCGCATCGT TCTCATTCGT ATTGTACATA TTATTTTGTA GCTGGCAAAA CTCGCAAAGC ACTGACACTG CTTCAAGTGA TCAAGGATAA AAAGTTGCCG CTGGATGCGT ACGCCTACAC TGCCGCCATC GAGGCTTGTG CGAAGGCAAG TATGTCTAGA AAGGCGCTCG AACTTTTGGG CGAAATGGAA GGCATTGGTA TCGCGCCATC GGGAGTAACT TATAGTGTCG CCATTACAGC GTGTGGAAAT GGTGGTGAGT GGGAAAAGGC TCTCGGACTG TTGGATACAA TGCGACAGAA AAATTTGAAG ATCAATCTGA TCACATACAA TGCGGCGATT ACAGCCATTT CCAAAGCCGC CAAGAAAACA GCTAAGAATA GGGGACAGAG TGGGAAGTTA CACAGGACTG TTATGAGAAT GTTGGACCAA ATGCGTGAGG ATGGAATCGA GCCGGACGGG TTTAGCTTTT CAGCCGCCAT TAGTTGCTGC GGTTCCGAAG GACATTGGAA AGAGGCGTTA GAACTCATGG ACATCATGCA GAAAGGCGGT CCGAGAACCA CTCCGAACAA AATCGCTTAT ACAGCAGCCA TTGCGAGTTG TGGACAATCT GGGCAAGCCG ATGAAGCACT AAGGCTGTTT CGGCAGATGA AGGATCAAGG ACTATCGGTG GATCGCGTAG CTTACAATGC TGTTTTCTCA GCTCTTCGTA TTGCCAAGCG AGCTGACTTG GCTTTTGATT TATGGGCCGA GATGGTTGAT ACAAAGCCTT TCGAAGTAAA TTCGATTGCT GTAGCCAAAC TTGACGAGGT CAGCACTCCT GATATCATTA CTGTTACAGA TGCAATCGGG GCCCTTTCTT CTGCTTCAGA CACAGTAGAG GACCGATATC GTGTTGATAA AGTCTTTGCA GAAGCTGTGC GGCGTAATAT CGTCCTTCGC AGCGATACCC TGGATTCCAA ATGGGAAATC GATTTATCTG GAATGTCGTT TCCTGTAGCG CGGGCCGCTT GTCGCTACGT TATAACGAGC ATCAGTAAAA GCACAAGCAA CCGCGAATTT GAAGACCTGA CCTTCATTAC TGGTGTCGGT GTTGGAAAAA GCTTTTACAA AGGCTCAAAC GGGGTCCCCG ATTTGCCAAC GCAGCAAACT TCTCTGCAAA AATATGTGCA GCAGATCTTG ATATCTGATT TTCACCCAGA AATTGAGTCT TATGTTCCGA GTCTAGCGAA GGGAACAATC TGCATCGGTT CCGAAAGTAT TTGCAAATGG GCAGAAAGAC AGTAAGGCCA ATCAATCTAC CAAATTTTTT TAAAAATAGA GCCTATCACT TAACAC
|
Protein sequence | MESIQPPSTA ALVVSSTNAR KTPIILSAVY GRKHSWTSLC AVAFIASYFW VARTIEAFSC ISFNRAQYRS VSVTLPSALF RNGNILRASP SDYVHSFQSN ATRTSAKNRS IPLHNPQNMT VIRKQSARKS ALQRSSQKRP ILARLGQLRN RVRANQTSPV GIDGSMGLPS QRECDQVLAD CVALDEWELV LETLDLMKSV GLSQQHSTYL ACLEACFPVA NAASAKEILY AMEQAGVEVT ADDIVWAILI YCRASSMRPR NDPVWLPLAL QLIQEHPNVS VAAYDAVLSC MVETKQWKEA VRLLRGMEQQ GSTGPALSTY RFVLESCVAS DQPTQAVQVL QSCIHHGLVP TLYSFELVIG ALAQKMQWRR ALQLVELMRQ IDVSPNLVVY NAVLSACSKA KEFLPARRLL HQMRREGVQP SIRSFNAVIA ACASAGQWQD AIQVLDQCHR EPGIQPDIYT YTNVMRACAK AGKTRKALTL LQVIKDKKLP LDAYAYTAAI EACAKATLRI AKRADLAFDL WAEMVDTKPF EVNSIAVAKL DEVSTPDIIT VTDAIGALSS ASDTVEDRYR VDKVFAEAVR RNIVLRSDTL DSKWEIDLSG MSFPVARAAC RYVITSISKS TSNREFEDLT FITGVGVGKS FYKGSNGVPD LPTQQTSLQK YVQQILISDF HPEIESYVPS LAKGTICIGS ESICKWAERQ
|
| |