Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50095 |
Symbol | |
ID | 7198691 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 433960 |
End bp | 438052 |
Gene Length | 4093 bp |
Protein Length | 982 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184953 |
Protein GI | 219129557 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAACA ATGGCACATA CAAGCAAAAA GATTTGGATC AGACTGCTAG AAGCAATACT CAAGCAACGC TTTGGATGAC AGACTTCTTT GTTGGGCTTG TTGTTCGCAA ATTGCACCAC TTGCACAGTA TGTTTGGAAC GTGTCCATTC TTAGCAGTTT CACAAAAGCA CAGGTTGTTG CGGCTCCAGC ATTGCGGTCA GCTGCAAAAG TGGCTATGTG ATTATAATAG AAAAAGGCTT ACATTCATAG TGGTTGTAAA TTAAAAATTC TTACTTTACA GTCAATACTG TTTCCTTAGA ATATAGCAAT CACAGCGTGT ATGGCGGCGT TGTGTGGGGT GCGGGCAAGC GCAGTCCGGC AGGTGGCAAG TTTGCGAATT TCGAAGCCGT CTTCCCACTA TTCCATCAGC TGCAGAGTGA TGTGAATGTT GACAGTGCAA CCCAGGCTGA ATCCGCCAGC GGTTGACTTG GCGCGGCGTG CTGGTTGGCG AGTGCTGTTT TTGTTTGCGG ACGACACCTT CCAGAACAAG CCGGGAGGCT TCTTTCCCCC GTCTTGGCGG TTCGGAGTTT GACTGGCTGG CTCAATGCGG TATTGGGATA GTGCCTTTTG TACGGATCCT CGGAGGAGTG TGAAATTGTT GGATATGGAG CAGACTTAGA CGTCGTTATC CCGGACAATG CGTACAAGTT CCTGAAGTTG GCGGCGGAGC GACTATTGAG GGGGGGGGGG GACAGAGGAA TGAATCCAGT GAAATCGACG ATTTTAATTT GGGCTTTGCA AACAGGCACG CAACCCAAGA AAACAGTTCC ACTACCGTGC GGCTACGACA ATTCCTGTAG GCTACGTTTT CCTCTAGAAG TTGTGTAGAT TGGGACCGGT GGACTCTTTT GATCCCAAGC GTTCCTACTG CAGATCCTCG TGCTTTTACG GTACTCGCGA AGAATGCTTG CAAAGCCGTG CTGGGCGGCG TTGGTGCTGG ATAGTGGTGA CGGCTTTGCA GCAAAACCGC ATTGCGGTGG GAGAACGTCT CTGTATGGTC AGTGCTGGTC TGACAGGGAT TTAGGATGAG TTTGTTAGGT GGAAAGGTAA ACTTATCGAT GATCAAAGAG AATTACTTAT TGAAAAGACA AACTATGTTA AAGATCCCGG AGCAGTTACC TACTCAAAAG ACAAACCAAC TTGAAGATCC CGGAGCAGTC ACCTACGCAA AAGACAAACC AACTTGAAGA TCCCGGAGCA GTTACCTACG CAAAAGACAA ACCAACTTGA AGATCCCGGA GCAATTACTT AGAAATAGTT TTTGGATAAA TTGTATACAG CTAGGTATAC CCCTGCTTCT CTGGGATCTT CCTGCCCTTA GGCACCAGCG AGGGGACACT GGAGGGATCC CGTGATGGAG GAACCGAGGG CAGTCAAAAT GGCACCAGCG AGGGGACACT GGAGGGATCC CGTGATGGAG GAACCGAGGG CAGTTGCAAT GGTACATCCG TTGGACTGGC CGATGGAAGT CTTGATGGGC TTTCCGATGG AGACAATGAC TGCATGGAGG ATGCTAGGGG CGACAAGATC ACAGCGATGT ATGGGATGGT GGACGCAGTC TTGGTATCAG CATCGCCACG CGTCCAGTCC GCAGAGCTAT TAATCAGCGA TAGGTCCGAG TTAGTAGCGT TCTCGATGGA ACCGACTTCA ACCTTAATAC CGGCTTTTAC CAGGAACATT TCCACGAGGC CGATTCATTT CCTTTATGCT CCCGTCAATG CTCTTTTTGA TGCTTCTGTT GACGCATCCG TCAATACAAT GCCACTCGAA TTCTGTATTT GGAGACAGGT ATGTGTCTCT AAACCTTATT CCATCTTTAG TTTTTACAGG TAAGATAGTC TAGCCTTAGA TTTTCATATG ATCTCAAGCA ATAGCGACAA ACGTGGTCTT TAAGTATTTG AATTAATACA GCGACAGGGC GCGTAAAATG TTCCCTACTT AGGCAAGCAC TCGCGACCCA GCTTTTTTAC TTAGGATTAG TGATTTTGCG GCCTCTTCGA GATAGAAAAA CCCCACGTCA AGCGTCTGTC ATATATACGT TAGACTAAGA CTGGTCTGCT CAGGCAGATA ATGGTATAAT TTGTGCTTTA CTCTGTTTCG TAACTACACT AGTGATCTTG TGCTATTCCA GTGCGATTCT AGTCGGTTAT CGCAGTTTTC TAGAGACCCG GATTTGTGAA TTTCCTCTCA AAAGAGGAAA TTCACTAACT GTAAAATGAC TTTTCGCAGT ACGCCTGGAC ATTTCTTGTC GGTTTTTTGG CATCTTTGAA ATGTGCGACA TCGTTTTACT GTTGGGACCT TGTGTTTGGA ATGCTCATTC GGTCTCTGTA GTCGGCTCAC AGTCAGTTCG AAGGCATGAT TCGGCGAAAA GGATTGCTCT CGTCGGAGCA AACTCTGACG ACGACGGAAG TAGAATCTGA TGGACGCAAT CAGATCGTTT CCGTGGCTGG CAAACATGTG AGACTTCAAA GCAGCACCTG TGTACGAAGC TATGCGCTTC AGCGTCGACG CTATATATTT GCTCTGATTT GTTGCTGTGG GGCTACCGCG TGGGGAGTCC ACCGATACTG GATTGCTGCT ACCGCTTCTC GTGGAAAAGA AATAGAAGCG GAGGATGGCA TGTTCCACAA GTGGGATAAA GTTGTACTAC CGCTGACTGA TCGAGTCGCC GCATTACGGC GGAACACCGC GGATGAATTG GACGACACCG ATTGCATTTT TCGCGACTCT CCAATTCGTC GAAAAGTGTT TGTGTATCCA GACTATGGAG ATACCGCAAA CGGTTGGACA GCGGACGTTT TGTCATCGGC AGGGCAAAAG TGGCAAACGA CCTTGCCGCC TTGGCCTTGG CTCGATCTGC GACGACAATC GCAGGCAAAT CGAACTAGTC ATTACGACAT AGAAGGCCAA CACGTACAGT ACGCCACAGA GCTACTTGTA AGAGAGGTGA TGATCAATCC CAAGTCCTGT CTACGAACGT ACAATCCCGA CGAAGCCACA CTTTTCTACG TACCTTACTT GCCCTCGGTA GAGCATCACA AAGGCAGCAA GTACATCAAT GATATGGCGT TATCTCCGTA TGGGAATGCA ATACTCGATA TTCTCGACAA GGATAATTAC ACGGCTTGGG AAAACACGTT TGGATTGACG GCGAAGTACT GGAAACGTCA TGGCGGGGCT GATCATATTC TTGTCTTCTC CGAACCTATG CATGGACTCT GGCATCCTCG TCAACGACGC GGGAACTACC ATTTTATTCA TTCGCAGAAG CAGCTGCATC CACCAATCGT CATTTCAGTC GAATTAAGTA CCACATTCGT AAAAATGTAC CCCAAGTGTG CCGCCAAAAA TATTCTAATG CCGTACCCCA ACACGGATGG ACGATGGTTC AACGGCAAGC ATCACTCGGA AGCGGTGAAA GCCTCTACGG CTTGGAATGC CTCTCTGAAA GTTTCAATTG CCGCCTTGCC AGAAGAACAA TTATTGGGCC AGGAGCCTGC GCGACCCATC GCTCAATTCT ACGGTGCAGG AAACCACGGA ACCTGCAAAC AATTGCGTCA AGCAATGGCT TCCGACTATT CGCAATGTGC ACTGTCCAGT AAGCTTTTCA AGCAAAACGT CAAAATATCG TCATACGTCA TAGGTATGAA TTTGGCAAGC TTTTGTCCGT GCCCAGGAGG CGATTCGCCG AGCGCTAAAC GGATGTTCGA CGCAGTCTTG GCCGGATGTA TTCCAATCAT CTTGTCGCAA GATTTCGTTT GGCCGTTTAC AAACGAGTTT GATCCAAACC TTGAGCTTGA TCCGACAGTG TTTTCTCTGC GTTACTCAGC AAAAGACTAC GAAGACCCGT TGCTGGACGT CACGACGTGC AGTCCACTTA ATTCCTCTAA ACCAGGTTTG CAAAGTAACT TGGAGCAGAT TTCCGCTCGG GAAATAGGGC GTCTTCGGAA TGGACTTCGG CAAGCTCGGG ATCTTTACAG CTGGTATCAA GTCCGACCCG ACCTTCCCGA CAATCCGTTG TGGGAAAATA TTTTACCGCC CATTTCTTGG TAG
|
Protein sequence | MGNNGTYKQK DLDQTARSNT QATLWMTDFF VGLVVRKLHH LHKYSNHSVY GGVVWGAGKR SPAGGKFANF EAVFPLFHQL QSDVNVDSAT QAESASEQAG RLLSPVLAVR SLTGWLNAEC EIVGYGADLD VVIPDNAYKF LKLAAERLLR GGGDRGMNPV KSTILIWALQ TGTQPKKTKL CRLGPVDSFD PKRSYCRSSC FYGTREECLQ SRAGRRWCWI VVTALQQNRI AVGERLCMVY PCFSGIFLPL GTSEGTLEGS RDGGTEGSQN GTSEGTLEGS RDGGTEGSCN GTSVGLADGS LDGLSDGDND CMEDARGDKI TAMYGMVDAV LVSASPRVQS AELLISDRSE LVAFSMEPTS TLIPAFTRNI STRPIHFLYA PVNALFDASV DASVNTMPLE FCIWRQFLQS AHSQFEGMIR RKGLLSSEQT LTTTEVESDG RNQIVSVAGK HVRLQSSTCV RSYALQRRRY IFALICCCGA TAWGVHRYWI AATASRGKEI EAEDGMFHKW DKVVLPLTDR VAALRRNTAD ELDDTDCIFR DSPIRRKVFV YPDYGDTANG WTADVLSSAG QKWQTTLPPW PWLDLRRQSQ ANRTSHYDIE GQHVQYATEL LVREVMINPK SCLRTYNPDE ATLFYVPYLP SVEHHKGSKY INDMALSPYG NAILDILDKD NYTAWENTFG LTAKYWKRHG GADHILVFSE PMHGLWHPRQ RRGNYHFIHS QKQLHPPIVI SVELSTTFVK MYPKCAAKNI LMPYPNTDGR WFNGKHHSEA VKASTAWNAS LKVSIAALPE EQLLGQEPAR PIAQFYGAGN HGTCKQLRQA MASDYSQCAL SSKLFKQNVK ISSYVIGMNL ASFCPCPGGD SPSAKRMFDA VLAGCIPIIL SQDFVWPFTN EFDPNLELDP TVFSLRYSAK DYEDPLLDVT TCSPLNSSKP GLQSNLEQIS AREIGRLRNG LRQARDLYSW YQVRPDLPDN PLWENILPPI SW
|
| |