Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49352 |
Symbol | |
ID | 7195872 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 4287 |
End bp | 10556 |
Gene Length | 6270 bp |
Protein Length | 1967 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184167 |
Protein GI | 219127907 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.248427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTCGT GGAAGTCGCG CCTGTACGCC TTTTTGCTCA AACGTGTTTT GGGACCTCTT CTGGATACAC TTTCTCTGGA TAAATTACAT CAGTCAATCG AAGTGTCCCT CCAAGACGGA AGGCTTGTGT TGAAAGATGT AGCCTTCAAT TTGGATTACA TTGAAAAGAT TCTTGTGGAT AAGGTAAACT GCACTCTTCG AATCAGAGAA GCTCGTATCG CTCGCTTGCA AATCGCGCTA TCACTGGAAG AGCACATACA ACCAGATGAA AGCTACTCGG ACCATCACCA AACGCAACCA AGCACGTTTG TCTGGAGAAT CATGGAGTTG ACCTCATCAA CTTCACCAAA AGTATCCCTG GTAGCTAATT TGGAGCTGGA TGGGGTTGTA ATAGAGCTAG AGCCGCAACC GGCGGCCCCT TTTTCAGCAA GACCTCCCCT GAAGACGCCA ATTGCGTATG AACTTCCTGA GGATGACGCC GTACGTTTGA GCTCGACAGC AAGTTCCCAA ATCTCCACCA AGAGTACACT TTCTGTATAC ATGGAATCGG TGTATTCCTC GTTGCGTTTG AGTGTTCGGC TGAACGACTT AGAAATAAGT TTGTGTAGTG GAATGGAGTC GGGTCGGGAA ATTTGGCTGA GCTTGCATTG CCAGTCTGTA TCGTATTATG ACGCAGCGGC CTCGAACAGT GGTGATGCGT CAGACTACAC AATGATACTT CACAAAGTGT TAGAGTTCAG TGTTGTCAGT CTCACCGTCG GTGAAACGCC TTGTCAACCG AGCTCGGCAG AAAATTCAAA TCATCAAGCG GAGAGAGCCA TGTCAGCAAC GAAAGTTGCA CTTTTGGACG GCTCAAGTCA AGTTCGACTA CGAGTTATGG AGTATCAAGT TGCCGATGCG AATCTCAATA CAGAAATACA AAAAAATTTA CAACAAGACA TTCAAATAAT GCTGAAGCAA CGACTCAATT TACTGGCAAA CAAAGTCTGT CTGGTTCATC TAAGCACTCT GGTTGACGTC CTGCGGAGCA ATTCGAACAA GACTTCTTCT ATATCTCGTG AGTCTACCGC TACGGAAACT CGCAGTATTG CCAGCGATAG CGAAGCAGAC TTGCAGACAC TGGACGGTAT TATGAAGCAG TATAAGGAAG CGAGGTATTT AGCAGATCGT CGAGAGATAC GAGGGGGTAT GCTCCTCCCT GACAAGGAGG GAAACGACAT TACCTTTGAC GCCTTTTTCG ACGCGAATGA CAAAAGCTTC TATCGATATT CAACTGTCTT GCAGCAATCA ACGATTTATG ATGGCAAGGT ACCTGAAAAT GGAAATAATT GTGTACACAC AAAGATTCAT GTTAACCTGG ATGAAGCAGG ACTAAAGATC TCCTTCGGAA CTCTATCGTC AAGTCACGTG GTGTCTCGAC CAGCTTTCTT CGAAGAGTAT ATCCTTGCTA CTTTTGCCGA TATCAATGTC ACTGCTTCCA TGTCTGCATT AACTTCGGAC ATTGGTGTCA GCTTGACGCA CATTGAAGTA GATGATTCAC AAGCCGGCTT TGATCCACTA TCTCAGAGTT CTCGCGTGGA AATTGGAAGC GTTTTGCGTT GTTCCTCTGC AGATTTTCAG GATACCGATC AGACAGATAT ACTTCTTGGT GCACCGTGCA TTTCACTGAG TATGAAGATT GACCGTTCAT CCACACTGCT TGAGGTTGCT GTTGAACCTT TGGAAATCAC TTATCGGCAC TCAACGGTTG CGCATCTTGT AACTCTACTA CATAGCTTTT CCAGTGAAGA TGAATTCTCG CAACAGCAAA ATGAACAAAA GAACCACATC GATGCAGTGC CGAAGAGCGA AAGCGATCGA AGATTTGGTC TTGAGGTTTC TTGTCCAGCG ATAACACTCA TCGTTCCAAT ACTACAGAGA GTGAACTTGG ATGCGCTTTT CGTTCGATGT GGGCAAAAAT GTACAAACAA CAAGGATGGT AAGCCGTCAT TAAGGGTATT ACTGGACAAC GTATCAATCG AAGCCCGATC AAATTTATTT GGAGACGCTA CTCCCGACCT TTCTCTTCTC TGTCACCATG CCATATTCTT TGCATTATCG ACTGTCAATG CCAACTCATT TGATGACGAA GTGCGACGAT TGGATCTTGT TGCGCTCGCA GGGCGCACCG AGGTCCTCCC GTGTATACCG ATCTCCATTC GAATTTATCT GAAAGTAACT TCGTCAGTCT CTGAAAACGA GCCTTCTCTG GCTGAATCGT TGTTCCCTCA AACACCTTCT CTTTCTTCGT TCAAAGCCCG ACAAGAGGAC GAGGACGAAG AGCAACATAT TACACAATTG CTTTCTTCGA ACTTGAAAGG TGTTAATTTC TTGAATCGCA AAGATCTACG AGGACGAGAT CCTCAAATAG AAATGATCGA AAACGCCGTT GAAAGTAGCA GGATTCTGGA GGTATCTCTG CCCCAAATTA TTGGCGACCT TACTAGCAGC GAGCTCTATT TTGTGGTGAA AATGCTGGAG TTTATCCGAC CAGCGAGAAA GGTCGATACT GTGGACACTC TATTGAAACA GTCTGAATCA AACGGTGTTA TTGATACTTC AAACTTGGCG AATGATATCG CAACTCTGGC TCTATCGATA GGAAGTTCTT CTTTGGCTAT CCACAATTTT GTGTCGACGG ATTCGACAGC TTCCTTTACG TTCTCTTTGC GTTTGTCTCA GACTCAAGTA CATATGCTTC TGCAAGATCG TCAGTTTCAA CAAGCTCGCT TTTTAATTCA TGACTTATCT TTTTGTGAAA GTACGTGCTC ATCGTCAAAA ATTTGTCATT CACTGTCAGA TCCGTCCAGC TAATCCAATT GCTATCCCTC AGTAGTTCCA TCAAAGGTGC CCTCTTTGCA AAAGTATACG TACACAACCG ACGAACGATT GATGGCATTA CGCGTTCGAA GCAGCTGCGC TCCACAATCT TTCGCGACAC CCATTTTACA TCGATCTGAA CTATTTGTCC CGATATCTCG GTCAAGTCCG GTCATGCTTT TGGATTGGAT GAATCTAAAG GGCGCCCGTG GTTCAGGTGG ACTGCCTTAC AGAAGTCTTC ATTTGACTTT TTACGATATG ACTTGTAGGT ACGAATACAA CACAATGTGG GTACAACGAC TCTGTTCCTT ATTGGGACAA ACGCAAGGAA GAAAAGAAGG CAACCAAGTT GCCGGAGGTG AAAGCGAATC CAACGCTTCC GGCGGCGAGT TTCCAGAAAA ACAAAGGCTT ATCCGCCTGT TCATTTCTCT TGCGGACTGC AACATCGATT ATAGTTCCCC TGTTCAGTTC GAAACGCCTT CGCGTTTGAT ATGTCGTCTG GGGGATGTTC GCATTTCCAG CAACGTCCTG CTTCCTGCGG CACCTGTACA AGCTGTTAAT CTGTCCGTGG GAGACGTATC GCTCTATCTT TGCAACCGCC GTTTTCCCTA TGATTTCGAA AATAGCAGGA TTGTTGGGGC AGAAGATCTT ATGAAGAACC CACAGACTGT TTCGGCGAAG GATAAGCGGG ACTTCATACC GTCATCATCC GAAAGTGTTC TTCAAGCGAT GAATTGCAGA ACTTTGGTAA TTCTCGATAC GTTCGATGCA GTATTAAGAC TTGCGCATCA CCGATCGTTG ACATCATCGG AGCCTTCCGT CCAAGTCAGC TTTACTGTTG GTGAGCTCGG AGTTTTTGCG TGTAAGGACT CCTTTAAGCA TCTTCTTGCT ACTATTGGAG AGCCTTTCCG AGAGCTGACA GCACTAAATG ATCGAGCACT GCAAGATTTG AAGGCAAAGG TGGATCCGTT CGTAACAAAG ACTCTAGTAG ACGAGGCTTC AAACAAAGAG ATGTCGTATG AATCTTTACA CAGACATACT GAAGCCTTGG CAAGTCTGAA GCTACGTAGC GCTTTGCAGC CCATTACAGG GACTGCTGCC GTCTTGGACC GCCAAGACGA CTTTTTACTC GATGGATATG ACTGGACAGC CATCGGTCCA GATGAACCGG GCTCTGTTGG CGTTATTCCG CCGGGTGACG AGCAATCAGC GCGTTGGTTA GGGAAGTCAA GAGAAATCTC ACAGTCTTCG GACGATCAAA ATGATGATGA CCCCCCTCTT TCGGGCTCGC CCTCTTCACC TCTTGGCATC GGAAAAGAAC TCAGCCAAGG ACTGACGATT ATCACGCATC ACTTCCCTCT CCAACCTCTT TCTGATCCGC TTTCCGCTGG CGACATGGGC ATACAGAAAT ACGCCGGGGC GGATGCCTTT CCATTGATCG AGACCCGGTT GGTCTTGCAT GATATGAGAG TCAAAGTCCG TTTATTCGAT GGCTTTGATT GGCCTGAACT TATTGACAGG GAGAAGCTCC CATTCTGCAA ATCTGGCGAT TTTGTGATCG ATGAACAGAC ACCCTCCACT AACAGTAAAA AAGCTAAAAC AAAAGCTACG GCGGCATTCG TTAGCACCTC AAGATCCAAT CAACATGAAC ATGAAAAAAA TCGAAAAGCG AAACTTCTTG GTGATTTACT AGACGGAAAT GCGATCGATG CCGGTACCTT TAAAAATGTC CCACTTCCAG AAGAGCGCGG TCTGGCGCTG AAGGAGCAAT CCGAGCTGCG TTCTCTTTCG CGAAGGACAG GCAAGTATAT TCAGTTCTCG GCCTCGGGCA TTCGCTTACG CTTGGATTTG CTCGAGAAGT CAAGCAACCA TCGACTAGTA TCTTGCATGA ATATTCGAGC CCATGACTTT TTCCTTGCCG AAACCGTAAG TGGTAGCAAT CCGAAGAAGA TGGCTGGCGA GTGGCTCAAC GAAGCGGACC ACCCACGCGA TACGCAGGAA GGGCTTTTCA TGATGAAGGT AAGCGTTCGT AATGATACAT GTTTCGACGA CGAGCCCACG TCTCACAACA ATCTTTTCAA AGATGGTCAC TTGGCATCCA GAAAGGAGAG TGACGAGCGA CGGTGCGATT GCCAGCGACA TTTGCGAAGC TGCCGTCACA ATTCTCCCCT TACGGTGCAA CTTGGACCAA CGCGCGATAA GGTTTGCCCG TCAATTCTTT CGGACAGAGT TGAGCGTGGA CGACGAGATT CATAGAGACT TGCCATCCGG ATTGCATCAG GTTCCACCAC CGCTTTTTCG ATTGTTTCGA GTAAGGCCAT TCAAGGTAAA AGTTGATTAC ACTCCTCAGA AAGTCGATCG AAAAGCTCTG AAAGACGGAG CGTTCGTCGA ACTTGTCAAT TTGAGCCCAA TCGATGCACT GATATTGACT CTGAAAAAGG TCGAAATAGA AGGAAAGACA GGCTTTGGCG AAGTCCTTCC CATCCTGATC CAAAGCTGGA TTCAGGAAAT ATGTAGTACA CAACTCCTCA AGTTTGTCAC AAACATGAGA CCCTTGGAAC CCATTACGCA GGTTGGCGAG ACAGCTGTGG ATATGATTGT TTTACCTTGG GAAGCCTTTC GGAACGGTGA TAGTGTGCAA AAGGCAATCC GATCGGGAAC GACAAGCTTT GCTTCGACGC TAGTGTATGA AGCTTTTACA GCAACATCGC GAGTTGCTGG GTACGTAGCG GATCAAGTAG GTCGCCTTGG ACGAGAGGAA TTTGACCGGT CCAACCAGCC ATCTCGTCCT CTTGAAGCAC CCCGTCGTGC CACCGATGCG GCGCCACACG CTCTGCAGAG TGTCACTCGT GGGTTACAAG AAGCAAACTA CAAGATTGTT ATCATTCCGT ACCGTGAGTA CCAGAGAACC GGAGCCCGTG GTGCGATTAG GAGCGTAGTA AAGGGCATTC CGGTGGCGAT TGGTGCCCCC ACAAGTGCAG CAGCCGAAGC CATATCCTAC GCCTTGCTCG GAGCTCGCAA TCAGATTCGG CCTGATATCC GAAAAGAGGA AGAATCGAGC CAACGTGGTA TGCATCTAGA TCGTTAAAAG GTCTCAAAGT CTCCTGTCGC GTCGGTACGA CTTGCTTCGA AACAGGTACT GAAGTCGAAG ATGCGGAGCG TGTATACGAA AGCGCTTGTC CCTTCCACCT ACAATCGGAA AATGGTGACT CTAGTGACGG GGTAGACGAG GCAGCAAACC CTGCTCTATA ATTCCTTTAA GACATACAAA CACGAGCTCG TACGGAAAGG CAGATTGATT TTCCGAATCG ATCGCAGCAT GTCGGTAGTA TACCCCCTCA TTTCCCAAGA CCGCGGATCA CGGTGGCAGT ATTTTCTTTA AAGTTTTAGC CTGCGTTACG GAATCGACAA
|
Protein sequence | MISWKSRLYA FLLKRVLGPL LDTLSLDKLH QSIEVSLQDG RLVLKDVAFN LDYIEKILVD KVNCTLRIRE ARIARLQIAL SLEEHIQPDE SYSDHHQTQP STFVWRIMEL TSSTSPKVSL VANLELDGVV IELEPQPAAP FSARPPLKTP IAYELPEDDA VRLSSTASSQ ISTKSTLSVY MESVYSSLRL SVRLNDLEIS LCSGMESGRE IWLSLHCQSV SYYDAAASNS GDASDYTMIL HKVLEFSVVS LTVGETPCQP SSAENSNHQA ERAMSATKVA LLDGSSQVRL RVMEYQVADA NLNTEIQKNL QQDIQIMLKQ RLNLLANKVC LVHLSTLVDV LRSNSNKTSS ISRESTATET RSIASDSEAD LQTLDGIMKQ YKEARYLADR REIRGGMLLP DKEGNDITFD AFFDANDKSF YRYSTVLQQS TIYDGKVPEN GNNCVHTKIH VNLDEAGLKI SFGTLSSSHV VSRPAFFEEY ILATFADINV TASMSALTSD IGVSLTHIEV DDSQAGFDPL SQSSRVEIGS VLRCSSADFQ DTDQTDILLG APCISLSMKI DRSSTLLEVA VEPLEITYRH STVAHLVTLL HSFSSEDEFS QQQNEQKNHI DAVPKSESDR RFGLEVSCPA ITLIVPILQR VNLDALFVRC GQKCTNNKDG KPSLRVLLDN VSIEARSNLF GDATPDLSLL CHHAIFFALS TVNANSFDDE VRRLDLVALA GRTEVLPCIP ISIRIYLKVT SSVSENEPSL AESLFPQTPS LSSFKARQED EDEEQHITQL LSSNLKGVNF LNRKDLRGRD PQIEMIENAV ESSRILEVSL PQIIGDLTSS ELYFVVKMLE FIRPARKVDT VDTLLKQSES NGVIDTSNLA NDIATLALSI GSSSLAIHNF VSTDSTASFT FSLRLSQTQV HMLLQDRQFQ QARFLIHDLS FCEIPSKVPS LQKYTYTTDE RLMALRVRSS CAPQSFATPI LHRSELFVPI SRSSPVMLLD WMNLKGARGS GGLPYRSLHL TFYDMTCRYE YNTMWVQRLC SLLGQTQGRK EGNQVAGGES ESNASGGEFP EKQRLIRLFI SLADCNIDYS SPVQFETPSR LICRLGDVRI SSNVLLPAAP VQAVNLSVGD VSLYLCNRRF PYDFENSRIV GAEDLMKNPQ TVSAKDKRDF IPSSSESVLQ AMNCRTLVIL DTFDAVLRLA HHRSLTSSEP SVQVSFTVGE LGVFACKDSF KHLLATIGEP FRELTALNDR ALQDLKAKVD PFVTKTLVDE ASNKEMSYES LHRHTEALAS LKLRSALQPI TGTAAVLDRQ DDFLLDGYDW TAIGPDEPGS VGVIPPGDEQ SARWLGKSRE ISQSSDDQND DDPPLSGSPS SPLGIGKELS QGLTIITHHF PLQPLSDPLS AGDMGIQKYA GADAFPLIET RLVLHDMRVK VRLFDGFDWP ELIDREKLPF CKSGDFVIDE QTPSTNSKKA KTKATAAFVS TSRSNQHEHE KNRKAKLLGD LLDGNAIDAG TFKNVPLPEE RGLALKEQSE LRSLSRRTGK YIQFSASGIR LRLDLLEKSS NHRLVSCMNI RAHDFFLAET VSGSNPKKMA GEWLNEADHP RDTQEGLFMM KMVTWHPERR VTSDGAIASD ICEAAVTILP LRCNLDQRAI RFARQFFRTE LSVDDEIHRD LPSGLHQVPP PLFRLFRVRP FKVKVDYTPQ KVDRKALKDG AFVELVNLSP IDALILTLKK VEIEGKTGFG EVLPILIQSW IQEICSTQLL KFVTNMRPLE PITQVGETAV DMIVLPWEAF RNGDSVQKAI RSGTTSFAST LVYEAFTATS RVAGYVADQV GRLGREEFDR SNQPSRPLEA PRRATDAAPH ALQSVTRGLQ EANYKIVIIP YREYQRTGAR GAIRSVVKGI PVAIGAPTSA AAEAISYALL GARNQIRPDI RKEEESSQRG TEVEDAERVY ESACPFHLQS ENGDSSDGVD EAANPAL
|
| |