Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49555 |
Symbol | |
ID | 7198225 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 25906 |
End bp | 29740 |
Gene Length | 3835 bp |
Protein Length | 1198 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184382 |
Protein GI | 219128359 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTC GTTTGTGGTC CGAAAAGACC AAGGATGTTG ATCGCAATGG CGGCGATTCC ATGAACGACG ATGACGATGA CAGTCTATCT CTGGGTGACT CCTCGGGGGA AGAAGAAGAG TCGTCCGTTG AGTTTCATAC AAGCACGCAC GGGAGCGATT CCGGCAAGAA GTCAGCCACA TCCGAGGAAT CGAGTACCAA CCGGATCGCT CACTCAGAAA CCAAGGCCGT CACTCGGTCC AAGATTCTTT TTCTTCTCAT CATTGCTGCG GCCGCGGCTG GAGTTGGTAC CGCGACTTTT TTCTTTACAA GAAATAAACA AAAGGACGAA TTCCGTGCCG AGGTAAGATC GTAACAGAAG CATCCCATTC ACCGAAGGAA TCTATGCTCC TTCTGCTCAT TTCTCACACT ACGTTGGTGT GTTTCTCTTG GTTCAGTTCA ACAACTTTGC CAAACAGCTC ATTGACGAAT CCGAAAACAA CGCCCGTAGC GTGTTTGGTC AGCTACATAT TCTTTCGACC ACACTTTCAA CCTTTGCCAC TGATACCAAT CAAACATGGC CAAACATGAC TCTCACCCAT TTCGCAGTGA GAGCCGCAGA GATACGTGAA TTTACTGGAG TCGAGCTCGT TGTTCTTGCC CCACTGGTGC ATGTCGACGA AAAAGAAGAA TGGGAACGGT ATGCCTGGGA GCATCAGAAA GAGGCTTTCG AAAATGACCA CATTTACTAT CTAGGAGTAA GTCAGCCCTT GGGAAAGCGA TTCAAGAGTC ATGTTCTGTC TTCGACATGA TGGCTCACTG TTTTTACGCT TCATGTATTC TAACTCAACT GCAACAGAGT GAAAATCAGA CCGAACAGGA CATTGGAACT ATAACCACAG CAATTTACCC GTTTCCTACT TCGACTCGAC AACTGTCTGG CGATGTCCAC CGCCGACGAG TTCAGGACCA AGATCATGGC GACCATGACG TACCACGAGA AAACAATCGT GCGGGTCTAC CTGCAGGAGT TTCGGTCCCT GTGTGGCAGT TTGGACCTCG TGTTCGTGTA GCTTCTTTGG CTCTGCTCGA CCTCTATAGC TATCCCCCCT TCGAGATCAT CGTGAACAAC GTGTTCGATA GCAATCAAGC TCTTCTATCA GGGATCGTCA ATCTTGATGT ACTTGAAAGG TACGTTGACC TGGGTGAAAA CGATCAAAGC AAATTGAGAA GTGTGCTTGC CCAGCCCGTT TTCAAATCTC TACACGATCG GAATTTGGAA GTGGTCGGAT TTCTCTTCGG CGTTCTCCGT TGGCAAGATC TCTTTGAAAA CATCTTACCA CAAGGAGTCG ACGGGCTTGT TGTGAATATG AAAGACTCTT GTGGAGCTGG CTTCAGTTTT GAAATTAATG GCCGCGAAGC TCTGTTCACA GAAGATGATC GAACACGCGA TTCCTTGTAC AACGATATGG TGATCCGAGC TCCATTTGCT CGTTTTGCTA CTGTAGAGGA AGACGACAAG ACAAATAAAG GTTGTACATA CGAGCTCGAG ATTTACCCAT CGCAAAAATT CGAGGCGACT TTTGAATCCA ACGAGCCTTG GCTTTTTACA AGTGTGGTTG TCCTCGTGTT CTTATTCACT GCTGTTGCGT TTATGATTTA CGACGTGATG GTTCAACGCC GGCAGGACAA GGTCATGAAT ACAGCCAAGC GGACCCGAGA TATTGTCACT TCCCTCTTTC CGAAGGACGT TGGACTAAAG CTTATTGAAG ATGCCCAAGC TCAAGCAGCA TTGGAAGCAA ATGCAAAGAG AGGACATTTG TCGAGTAAAG CAAGTCTCAA ATCATACTTA GATGGGACAG GTGATCACAT GCAAGAGAAC AGCAAGCCTC TGGCGGATTT GTTCCCAGCA ACTACGGTGG CATTTGGCGA CATTGTGGGG TTCACGGCGT GGTCTTCCAC AAGGGAGCCT TCTCAGGTTT TTACTCTTTT GGAAACTATC TACCGAGAGT TTGACGATAT TGCGAAACGT CGAAGAGTTT TCAAGGTTGA AACGGTCGGA GACTGCTACG TAGCTGTTTG CGGTCTGCCC GATCCTCGTA AAGACCACTA CGTTGTGATG GCTAGATTTG CCCGTGACTG TCTTCAGCGA ATGCAAGTAG TTTTGAAACG GTTGGAGGTA CAGTTAGGGC CCGATACCGC GGAATTAGGG ATGCGATTTG GTCTGCATTC CGGTCCTGTC ACAGCTGGCG TTCTTCGGGG TGACAAGGCT CGTTTTCAGC TTTTTGGCGA TACAGTCAAC ATGACAGCTA GGATTGAAAG CACAGGTCAA AAAGATCGCA TTCATCTTTC GCAGGACACT GCTGACCTTT TGGTGGCTTC CGGCAGGAGC CATTGGGTGC ATGCTCGGGA AGAAACCGTG CACGCGAAAG GAAAAGGAGA AATCCAGACG TTTTGGCTGG AAATCAAGGG AGAGCCGACC AAGTCTACAA CATCCGCAAG CACCAACTCT GATGATCAGA ATGCTCCATC CGATCCCAAA AGTGCTTTTT CAGCCTCAGT TCCGAGGACC GCAAAAGAAG AGCGGCATGC TCGACTTGTT GATTGGAACG TAGACTTGCT TCAGAGGCTT CTCAGAGAAA TTATTGCGCA AAGGCAAGCA ATTGGAACAG AGGTGGATTC TTCAGAGACA ATAAGATTGC TGGAGCAAGC AATTGCCTCA GATTCCAAGA TCGTGCTGGA TGAAGTGAAA GACGTCGTTG CTTTGCCTAA GTACAACTCC AAAATAGCCC AAAAATGGTC AAAGCAAGAT TCAGTGCAGC TTGATGACAA GATTGTTGGC CAGCTTCGCG ACTACATTCG CACAATAGCG GCCTTGTACC GAGACAATCC TTTCCACAAC TTTGAGCATG CTTCCCATGT GACCATGTCG GTGGTTAAGC TTTTGTCTCG AATAGTCGCA CCAGATCTAA ATATCGTGGA AGATGATTGC GACACAAACA AGAGTCTCCA CGACCATACA TACGGGATTA CATCAGATCC GTTGACGCAA TTCGCTGTGG TTCTATCGGC GTTGCTTCAC GACGTTGACC ACACTGGTGT CCCCAACGCT CAACTCGTTA AGGAAGAGTC GGGAATTGCC GCCGTTTATC GCAAAAAAAG TGTTGCCGAA CAGAACTCGG TTGACATTTC GTGGAATTTA CTTCAAGAGG AAGTATATAC TGATCTACGC CGGTGTATTT ACGGCAACGG TCAGGAATTA CAACGTTTCC GCCAACTTGT TGTAAATACT GTCATGGCGA CAGATATTAT GGACAAAGAG CTCAGCGCGG ATAGAAAGTC GCGATGGAAC ATCGCTTTTG CTCAAGGCGA AGTTGGGAGC TTTCCCCTTC ATGAAAATGT ATGGACAAGT GTCAACCGAA AGGCAACAAT CGTAATCGAG CATCTAATTC AAGCATCGGA TGTGGCGCAT ACTATGCAAC ATTGGCATAT TTACCGAAAG TGGAATGCCA AGCTCTTTGA AGAATGCTAT GCGGCTTACA AAGCTGGGCG TGCAGACAAC GATCCGGCAG AACAGTGGTA CGAGGGCGAG ATTGGTTTCT TTGACTACTA CATCATTCCA CTTGCCATGA AATTAAAATC CTGTGGAGTA TTCGGCGTAT CCAGTGATGA GTACTTAAAC TATGCCCGCC AAAATAGGAA GGAATGGGAA AGCAAGGGCC GCGAAATAGT TGCTGAACTT GTTGAATCTG AAAAGCTTCG AGAAAGCGTG TGTCACAAGA ACACTGGTGC CCAAATTGAC TGTCAACAGT GAGGCATTAG AAAGAATATT TAATTTGCAA ACTTTCATGT TCTGA
|
Protein sequence | MKLRLWSEKT KDVDRNGGDS MNDDDDDSLS LGDSSGEEEE SSVEFHTSTH GSDSGKKSAT SEESSTNRIA HSETKAVTRS KILFLLIIAA AAAGVGTATF FFTRNKQKDE FRAEFNNFAK QLIDESENNA RSVFGQLHIL STTLSTFATD TNQTWPNMTL THFAVRAAEI REFTGVELVV LAPLVHVDEK EEWERYAWEH QKEAFENDHI YYLGSENQTE QDIGTITTAI YPFPTSTRQL SGDVHRRRVQ DQDHGDHDVP RENNRAGLPA GVSVPVWQFG PRVRVASLAL LDLYSYPPFE IIVNNVFDSN QALLSGIVNL DVLERYVDLG ENDQSKLRSV LAQPVFKSLH DRNLEVVGFL FGVLRWQDLF ENILPQGVDG LVVNMKDSCG AGFSFEINGR EALFTEDDRT RDSLYNDMVI RAPFARFATV EEDDKTNKGC TYELEIYPSQ KFEATFESNE PWLFTSVVVL VFLFTAVAFM IYDVMVQRRQ DKVMNTAKRT RDIVTSLFPK DVGLKLIEDA QAQAALEANA KRGHLSSKAS LKSYLDGTGD HMQENSKPLA DLFPATTVAF GDIVGFTAWS STREPSQVFT LLETIYREFD DIAKRRRVFK VETVGDCYVA VCGLPDPRKD HYVVMARFAR DCLQRMQVVL KRLEVQLGPD TAELGMRFGL HSGPVTAGVL RGDKARFQLF GDTVNMTARI ESTGQKDRIH LSQDTADLLV ASGRSHWVHA REETVHAKGK GEIQTFWLEI KGEPTKSTTS ASTNSDDQNA PSDPKSAFSA SVPRTAKEER HARLVDWNVD LLQRLLREII AQRQAIGTEV DSSETIRLLE QAIASDSKIV LDEVKDVVAL PKYNSKIAQK WSKQDSVQLD DKIVGQLRDY IRTIAALYRD NPFHNFEHAS HVTMSVVKLL SRIVAPDLNI VEDDCDTNKS LHDHTYGITS DPLTQFAVVL SALLHDVDHT GVPNAQLVKE ESGIAAVYRK KSVAEQNSVD ISWNLLQEEV YTDLRRCIYG NGQELQRFRQ LVVNTVMATD IMDKELSADR KSRWNIAFAQ GEVGSFPLHE NVWTSVNRKA TIVIEHLIQA SDVAHTMQHW HIYRKWNAKL FEECYAAYKA GRADNDPAEQ WYEGEIGFFD YYIIPLAMKL KSCGVFGVSS DEYLNYARQN RKEWESKGRE IVAELVESEK LRESVCHKNT GAQIDCQQ
|
| |