Gene Information |
Locus tag | PHATRDRAFT_48038 |
Symbol | |
ID | 7203253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 791867 |
End bp | 797224 |
Gene Length | 5358 bp |
Protein Length | 1717 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182300 |
Protein GI | 219123997 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.322069 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTGGC AAAGGCAACG GGCGGGATCC GAAGACGGAG AAATCGATGA GGAAGAAGGA GAAATCACAG ATAATCCTCA GCCACCGGTG GCGTTATCGC TGTCTCCTTC GAAGTCTCGT CCCACCGTCA CCCACTCGCT CCCACCCAGT TCCCAAGCAA CAGGTGCCGC ATTCCATGGC GAATCAAACT TTCCTCCGCA GCCGCCTTTG CCGAACCGTC TGAGCAGTAC TAGCGGGCCG AACAGCATCC ATCCCTACCC GGGCGCCAGT CACAGTAGTA GTAACAGCAA CAACAACATA CCTCCCCCGC GTCGTGGAAG CTGGCGGGGC GGACGTGGCG GTACTAGTGT CGGTGGTGGA GCCTTTGAAC GACGTCCCGG TCGAGGGTTG GGCCACCGCA ATACTTCTTT CGGCAGCGGA CCGCCAGCGT TCGACGCGCC CCCACCGGCA CGCAGCCAAA GCTTCCAGTC ATTTCACCGC CACAGCAGCG GAAGCATTCC AGGCCTTCCC GCCAGCAATG TTCTCCCACC GGTAGCGGCT ACTGATCCGA GACGGGCGAC GGATCCACGG TTTCGGGGAG CACCCGGCGT TGCGAACACC CCGGTGTCCG CACCGCATTT TACCGAATCG CGGGCAACCA GCGAGGGCCG GGGCTTTACT ACCTCGTCCA GTAGTGCTCC GGTATCGGTC AGTAGTACTT TAGCGGATTC AGTCAGTCTG GCAACACCCT ACAGCAGCTT GGCTGAAGGC AAGCCACCCC ACGTACTAGC TGCTGAAGAC GCGAAGGTCG TTCGAGGACT CCGTGCCAGT AGTGTCGCGA GAAACGAGTC CATCGGCAAT GATGTTCCAC GGGATAGCAG CGTCAGTGGG CCCTTCCCGC CACTAGGGGG TTCCACGGAA ATTGCCTCTT TTCCTGGAGG TTTTCGCGAT AGAGGTCCAC CGCATCGTCG ACATACTGGC GACTTTCGGG GGCACCCTGG CGGCCACGGC TCGCTCAATC GACGAGTTTC AAGCGAATAT GGTAGCGGCG ACGGACCGAA TAGCGAAGTC CTACCCTTTG GAAGGCCCAA CGAAATTCAT CAATCTGCTA GGCAGGGTTC TCAGCATGCA GTCCCCCCAT CCGGTGAGCC ACTGCCGGTG TCGCGTGGTC AATCCACAGG TAATCAGACT CCTCCACCTC CGTTTCATCG AGGCCCAGCG CCGCCTCCTA CTCAAGAGCA ACAGCAACCT CCATTCCACC GTAGTCATCC GTTACCGCAG GATCCACCGC CGGGAGCTTT CCCTCGAGAC GGGCCGCCTC AAGGAGAAGT GCCATCCTTT TATAGGGATG AGCAATCAGT GTTTTCGCGA AATCAACACA CCCATGGCGG AGATTTCCCA CCGTTCAACC AAAGAGCTTC GGATCAGCCT CCTTTTTCCT CGGGCCCGCC CAACGTAGAG CAACCGCTAT TTCGAGGACC AAGACAGGAT TCCTATTACG GACCCGCTTC ACGCGACGTA AATTCTGGAA GGTTCGGGTC GCCCTCCCAA CGTGATCGCC CCATTGTCAA TGCCCGAGGG GTTTCGGGAG GTCCTCCACC TCCCCCACCA CCACTGGGAT CGCAAGGGGC GCTCACATCC ACACAACATC CCTCTTTAGC TCCTGGTGTG GCCCCGCTTC ATCGACGGAA CGATCCACGC CTTCATCGAG ACCCCGATGC GGAAGGACGA GATTTCGCGG ACGCGGCATC CGCATCCGAG CCGCTTCGAC CCAATTTTCC ACCCGAGCGC ACCGGATTCC CGATGCAAAG TGAACGCGGT 
TTCCGAAAGC CGCCTTTTGG ACAAATGTTT CCACCGGGAG GTGCTACAGA AAATAGTACC GGCTCGGATT CGTTTGGTCG ATCACGCGAA CGGAATACCG CGGCAGCGCG GTCGCCTCAA ACGTCACCAC ATACGCGTAA ACCCGTTTTG AGCTACTTTC AGGAATCGCC GGCGAAGGAA ATTCCGCGTC TGCCTGCCAT AATTGATGCC AAATCGGGCA GTCTGTCGAG TCGAATCAAA TCTGTAGGTC AACATACCGA AGCAAGGACA GAAGAGCCAG AACCTCTTTT GACATCCGTG CTTGGTGAGG ATTCAGTGGA TAGGGCGGAA AAGGTTGTAT TGCTTCTGAC TGATCAAAGG GATAAAGCTA GCTTGGAAAG AGATGATAAG GGATGTAGTG AGCTTCCGAA GAAGCAGACA ATCCTGATTG CGCTGAATCG TATGGACACC AAAATCAAGC TGCTTCAGAA ATCTACCTTA GATAAAGAAG AAGAAGTTGA AGCACATATC GAAAAAGAAA AGGAAGATCA AAAACGGGCT GCTAAAGAGG CGAAATCTGA AGCTGAACGT TTGGAAAAGG AACACAGGCG ACGCCGGGAA GAGGAACAAC AAGCCGATGA AAAGGCCGAA CAAGAGCAGA TTGAAGGTAT GATAGAAGAA GGGCAGGCCG GTTTCGATGC AGATCTAACA ATATCTACAG TGACGTTCGA GACCGATCTC GAAGCAGCTC GTAAGGTAGA AGAAGCAAGG TTTGAGCTAG AATGTCAAGA ACAGATATCT GCGGCTACGG AGCGATTCGA CAATGATGTG CAAACTACAC AGCAAGAGTT GGAGAATTCT ATACAATCTA TTTCGAATAC TCAAAACCTA ATTTCGGCTC TCGAGGAGGA GTACAAGTGC AAGATGGAGG AAGGAGATAC AGTCGGTGAA GAGAAAATGG ATCAACCTGA TCTAGTAAAT ACAGTTTTGG AAGAAAATCG AAGGCGCGCT GCCGAGGCCC ATGTGTCTCA ATGGGCAGGT TTCCCTGTGG TGTCGGATGA TGATGAGTAC GGTGTTTTAG AGAACGAAAA GGATCCTAAA GAAGGTAAAA GTCATGTACG GTGGGCAGAG ATGGCGCAGA AAGTTACCGG AGTCGGAGAT GCACTCTACA ACGAACCTTC GGAAGCGCCG TATTTTGAGC AAAATGAGAG ACTTCATGCA CTGATCGGCC CGCTGGTAAC AGAGCAAATA CGCTACAGTC AACGGCAAGT CGACACCCAC TGGAGAGAAC TTGCCGAAGA ATACGAATAC CGAAGAGTAG TTTACGAGGC TCAACAACTC AAAGATGGCA CGGCTCAGAG AAGGCGCATC AAATCCACAA GTGTGCCCCA TAGGCTCGTT GGGAGCAAAC CTAATGTCCC TATCCTCGAG TCCACATCTG GCCACGGACG CTCGTCGAAC AACCCATATC GTCGGGCACG TAGAGGCAAC GAGGTGCGGA CAGAATACGA ACAAGAACAA ATTATAGCAG AGCTGGCAGC CAAAGAAGCG CTGGAAAAGA GAATTGCAAC TGGGGGATCA GAGCTTCCGC GTCAGATAGG TCAGATCGAA AGAAGCTGGA CAGCCACCTA CATCCAAACA TTTTCGGCGC AAAGGGTTGA CCTTGAGGAA CAGGAGGCAG AGTTACGTAT TACGGGTGTT TGGACGGACA TGGAAAAGTG CATTTTCTTA GACCGATTTA TGCAGCATCC CAAGGATTTC CGCAAGATTG CTTCTTTTCT CCGAAATAAG ACGACAACTG ATTGTGTCGC CTTTTATTAC GATTCCAAGC 
AAACGCTGCC TTATAAGGGT GCGTTAAAGG AACACGTAAT GCGGCGGAAG AGACGTGGCG GATATCCAAT TTGGGAAGCA ACTATTCAAG CCGCCCTCTC GGTAGGTGCA GTCGTTGAAG CAGGGGATAG TGAAGAAAAG CCATTGATCT TCACACTTCC GTTTGATGAT CACACTTTTT CTACTTTTGG CCTTCATCCT TTGAAACGCG AAGTTTTGGA TTTAATGGAA ATAAAAGAGC AGGCTCTCGC TGAATTTGAC GCAGATGAGG ATGCAGACGA CGTTTCTAGC AAATCAGGGC AACCCAAAAA ACGTCCTCGC GATCGTCTTT TCCTGTTGGA TCCGAGACAA AGAAAATTCC TGAAACCCTT GCCCCAGGAA TCGGCTCACG CTACCTGCCT TAAGGTGGAC AGTGGAAAAG CAAGCACAGC TGACGATGAT CACAACGATT CCAAAGAGGG TACAGCAAAA GATGAGTCGG GGCGATTAAC TCCTCTAAGA AAAGCACCCC AAAAATGGAC GGCGTCAGAG AAAAAGATTT TTCACGATAC CTTGGAGAGT CATGGTAGGA ATTGGAGCAT GCTTTCCCAG GCTGTAGGGA CAAAAACGAT TTCTCAGATT AAGAATTACT ACTACGACTA CAAGAAGCAG AAAGATAAAA ATCGGACGAC TGACAAAGAC AAAAAGGTCG AAAGCAAAAC TGAGAGGACC GAATCTCACG AAAACAGTCC TACACCGCCA CATATTGCCG CGGATCAAAG ACCCGGCGAC CAGACTAGTA ACGAGCCGAT TTCGGATTTA CGCAAAAATC AGCCGCCTCG CTATGATCCC CAATTTGAAG CTAAACATAT CGAGCGACAG ATGTTCGAAG TGTTGCAACA GCAAGGGCAG GGTCCATACC CCGAACAAGA AAGGCTTGTC GATCGACGTC CTGTCGAATC GTTGAGTGAT CAAGAATTAT GGGCCCAATT ACACCGACAG GGACTTTTGG GTCAACAGCG AGGGCATTTA TCGGACGAGG CGGCACGGCA CCTTCTCCAG CATCACTCGC AGTCACACCA TCAGCAAGTC CTCTCAAATT TGATGCCCTG GGCTTCGGGA GGGCAACTTC CGCAGCCAGT CAAACGAGCG CAACCAATCA ATGTGCAAGA ATGGGAGCAG CTGCAGGCAA TTTTGCAGAT CCAGCGTCAA CAAGAACAAC ATCGCCATCA GCATCAACCT CACGTACCGC ACAACCCGAT GGCCAACTTG GACCCTCAAA TGCTTGCGTT GGCCCGTCTC GCTGGTTTGG ATTCCAGCGC ATTGGGTATG AACCCGCAAT TATCGCGACT TGCGCATCAT CCTGCAGTTG GCTCAGCTGG AAGTCATGAT GACGCACAAA TGGCTTTAGC ACAACGGCTT CTGAGCTACA GTCAGAGCGC TGGGGGAGGG GGGAATAGTG CCCAGGGGGC GCTAGATTTG TTGACACAGG CCATGAGTCG TGGGGGTGCC GGACGCCATC CGAATCCAGA TCGGGGTTCA GATCGGGGTA CAGATCGGTA CTAGAATGGA TACCTGATCG AGAAAAGTGG TTGGCGTTGG GTGTTGGCCG GGTACACAAG GTTTTTTGTG CATTCAGAAA AGTTCAACAG CTCAAGTCAA TAGTTTTTGT GTTGAACTGC TCCGCTCTCT GCTATCGAGC GCTTGATCCG TTGGAATAGC AAATATCTGC CTCTCTTGAT TTTCTATAGT TTACGGTATT TTCTGACT
|
Protein sequence | MSWQRQRAGS EDGEIDEEEG EITDNPQPPV ALSLSPSKSR PTVTHSLPPS SQATGAAFHG ESNFPPQPPL PNRLSSTSGP NSIHPYPGAS HSSSNSNNNI PPPRRGSWRG GRGGTSVGGG AFERRPGRGL GHRNTSFGSG PPAFDAPPPA RSQSFQSFHR HSSGSIPGLP ASNVLPPVAA TDPRRATDPR FRGAPGVANT PVSAPHFTES RATSEGRGFT TSSSSAPVSV SSTLADSVSL ATPYSSLAEG KPPHVLAAED AKVVRGLRAS SVARNESIGN DVPRDSSVSG PFPPLGGSTE IASFPGGFRD RGPPHRRHTG DFRGHPGGHG SLNRRVSSEY GSGDGPNSEV LPFGRPNEIH QSARQGSQHA VPPSGEPLPV SRGQSTGNQT PPPPFHRGPA PPPTQEQQQP PFHRSHPLPQ DPPPGAFPRD GPPQGEVPSF YRDEQSVFSR NQHTHGGDFP PFNQRASDQP PFSSGPPNVE QPLFRGPRQD SYYGPASRDV NSGRFGSPSQ RDRPIVNARG VSGGPPPPPP PLGSQGALTS TQHPSLAPGV APLHRRNDPR LHRDPDAEGR DFADAASASE PLRPNFPPER TGFPMQSERG FRKPPFGQMF PPGGATENST GSDSFGRSRE RNTAAARSPQ TSPHTRKPVL SYFQESPAKE IPRLPAIIDA KSGSLSSRIK SVGQHTEART EEPEPLLTSV LGEDSVDRAE KVVLLLTDQR DKASLERDDK GCSELPKKQT ILIALNRMDT KIKLLQKSTL DKEEEVEAHI EKEKEDQKRA AKEAKSEAER LEKEHRRRRE EEQQADEKAE QEQIEGMIEE GQAGFDADLT ISTVTFETDL EAARKVEEAR FELECQEQIS AATERFDNDV QTTQQELENS IQSISNTQNL ISALEEEYKC KMEEGDTVGE EKMDQPDLVN TVLEENRRRA AEAHVSQWAG FPVVSDDDEY GVLENEKDPK EGKSHVRWAE MAQKVTGVGD ALYNEPSEAP YFEQNERLHA LIGPLVTEQI RYSQRQVDTH WRELAEEYEY RRVVYEAQQL KDGTAQRRRI KSTSVPHRLV GSKPNVPILE STSGHGRSSN NPYRRARRGN EVRTEYEQEQ IIAELAAKEA LEKRIATGGS ELPRQIGQIE RSWTATYIQT FSAQRVDLEE QEAELRITGV WTDMEKCIFL DRFMQHPKDF RKIASFLRNK TTTDCVAFYY DSKQTLPYKG ALKEHVMRRK RRGGYPIWEA TIQAALSVGA VVEAGDSEEK PLIFTLPFDD HTFSTFGLHP LKREVLDLME IKEQALAEFD ADEDADDVSS KSGQPKKRPR DRLFLLDPRQ RKFLKPLPQE SAHATCLKVD SGKASTADDD HNDSKEGTAK DESGRLTPLR KAPQKWTASE KKIFHDTLES HGRNWSMLSQ AVGTKTISQI KNYYYDYKKQ KDKNRTTDKD KKVESKTERT ESHENSPTPP HIAADQRPGD QTSNEPISDL RKNQPPRYDP QFEAKHIERQ MFEVLQQQGQ GPYPEQERLV DRRPVESLSD QELWAQLHRQ GLLGQQRGHL SDEAARHLLQ HHSQSHHQQV LSNLMPWASG GQLPQPVKRA QPINVQEWEQ LQAILQIQRQ QEQHRHQHQP HVPHNPMANL DPQMLALARL AGLDSSALGM NPQLSRLAHH PAVGSAGSHD DAQMALAQRL LSYSQSAGGG GNSAQGALDL LTQAMSRGGA GRHPNPDRGS DRGTDRY
|
| |
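The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein is encoded by one uninterrupted reading frame ending in a single stop codon (the record lists the gene as not spliced). The function and constant names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 791867
END_BP = 797224
REPORTED_GENE_LENGTH_BP = 5358
PROTEIN_LENGTH_AA = 1717


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein plus one stop codon (assumed unspliced)."""
    return 3 * (protein_aa + 1)


gene_length_bp = span_length(START_BP, END_BP)
coding_bp = coding_length(PROTEIN_LENGTH_AA)
# Bases in the listed gene sequence that are not accounted for by the
# protein plus its stop codon, under the assumptions stated above.
remaining_bp = gene_length_bp - coding_bp

print(f"gene length from coordinates: {gene_length_bp} bp")
print(f"matches reported length: {gene_length_bp == REPORTED_GENE_LENGTH_BP}")
print(f"bases for protein plus stop: {coding_bp} bp")
print(f"remaining bases in gene span: {remaining_bp} bp")
```

Under these assumptions the coordinate span agrees with the reported Gene Length, while the 1717 aa protein accounts for fewer bases than the full span. This is consistent with the listed gene sequence continuing past the stop codon that follows the final residues of the protein sequence.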