Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43482 |
Symbol | |
ID | 7197537 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 578227 |
End bp | 582120 |
Gene Length | 3894 bp |
Protein Length | 1225 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177644 |
Protein GI | 219111785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCTTGT TACGAAGACT ACATAGAACT TCACGTCCTA ATATTCCACT AAGGGGGAAA AGAAAATCGG GAAGCGATGA AGAAACGGCA AGTGATTCAA ACGAGGATCT TGGCCGGAAG CGGTTGCTTT CTAAGGATCG GGCCATATCC GCCCTTTCCC AACCAAACGC TCCTACCCAA ACTCCAAACA CGAACAACGA TTCTACAAAT AAGCAGTCAA CTTCCAAAGC AGCTAGCTTA CTGCCATCAA AGAAGGTTAA GAGTGATCGG CAGGAGGATG AGAGGACAAG GATAAAGAAA AACAACGCTC CAAAAGCTTC CACAGTGCCC TTGCAATCAA GATCTTCCGA CCGCAGGTGG AAAAATTCGA AGCGGAGTCG AGAAATAACA CCGAAAGTAC CCAATACTGC GAAAGCCGCC TATTCATCAC TGCTACCGCC ACCCCTATCG GGATCGAATG GCAGTGACGA CGATTCATTG AGTACAGGCC TATCATCGTG GGAAGAATTT CTTGGCAAAG GGAAAGAGAG TAGTGGCTCT AGGACAACGC CAATTGTAGG ACGTTCGCAA GAGAAGCGAT CTCCTTCAAG AGTGGAAGCA AGGAAGCGTG AGAAGTCGAA GGAGGAGCAG GCTGACGAAA GTGACAACAC TGACAAGCAT CGGAATCTGC CGTCGATTCT GGATCTGTTT CCGTCAGCTC TCTCAACAAA CCCAAACGAA AGGTCTGCAT CCAGCTCAAA AGTGCTGTCG GAAGACGCAG GGAAATCGTA TACATCGCTG GATGGAGTAC TGCCTGTGTC GGATCTGTTT TATCGCTCTT CGATACCGCA AGCGCAACGT ATTGATGGGT CCGAGCAGGC GACCAATGCA ACTGACGATG GCGAGGAAAG CCCGTATCGT AGGGCCGTTT CGCAAACACC CGGCAAGCAA GAAATGGCCA AAAAGCCGTC TTCCACAAAA AAACGCAGTG GTAGAAAAAT GGTTCGACGC GGTATGGAGA TGCTGGTTGG CGGTGTACCG ATCAACGCCG ACCCCCCGCA AAGAAACGTC GATCTCTGTT ACGATAGATT GGCAGCTGAC TGGGCCACTT CTATAAGTTT AAATACCCGA GAATTTGGGC CTCTTTTACA TGGTGCCAGC ATCCCAAAAG TATCGTTGAA GGAACGAGGC TTGTTTTGCG AGTACTTTTG CCATGCCGCC ATGAAATGGG ATGTCTGTCC GACAGATTTA CAGTCCATTA TCGATTCGCA TTCAAAGCAA ATTTCAGGCT TTGAGGTAAG TGCCTGGAAA AAGACCTCCG CACACCTTCC TGGAGGCAAG GACTTGGACA AATTAGCTGA AAGTATTAGC AAGGATATAT TGTCAAGGTT ACACGAAGAC AGCGACACTA GTGATTCTAT GATCACAAAA GATACTCGAC ACGACGGCAA AAGCTCCGGC GACGGAATTC CACGAATAGC TGCAAAGCTT GATTTACACG CTAAGGATTC AAGAAAGCAA AGGTCGAGGA GGTCAAAATC GAAAGCATAT GCTCAGTCTA AAGCAAAGGG ATTTGGAAAG GATAAAAATG GCGCGAGGAG ATCAAAAAAG GAATTTGTGG ACACCGATGC TTTGGCCTTG TACAGCAATG AATTCAAAAT AATGGCACAG CCTATTGGTA TCTCGAAGTC CGATCTGGAA AGCGGCGGAC AAGGCACTCA GGTTTTCGAG TCAGTTTTGC GGAAGGCTTT TGAAACCCAT CCAGTGCTCA TTGAGGCAAT TGGAGAGTTC CACGTCGATA TTTACAACTG TACGATCGAG GGAACTGGTC CAGATTCTTC ATTGTATTCG GTCGAATTTG GGGTATTTCC AAAAAAGGTA ATTCACCAAA GCGAGAAGCC GGAATTACTT CATAAAATGA TAGAAGCTCT GCGTTTTATA TTGGATACAG ACGACTCAGA AGATACATTG AACTCTACCT TGGCAAGGAT CGCAAGCGAA GAATATCGGT GGTCTCCCAG TATTAGGGAG CGTGTTGCTC TGCAGTTTAA TGATCAAAAC GCACCCAACC AAAGGCTTCT CTATACTGTC AATGCTGGGG TACTAGAGTT TGAAATTGGA GTTTCCCGCG CAGAGCTGGA ATCAGGAGGT GATGGTGGCG AAATTTTTCA ATCCGTGTTA GAGAAGGCCA TCGGTGGAGC TATGCGGAAC TCTCTGGCTG GTTTTCACTT TTCTATTACA CATTTCACGC TTGATGATCA CGACGATGGA ACATCGTTAG TTTCTGCTGA TGTACAAATG GAGACTTCTG AGCCGATTGC CCGATCTGAA AATCGTTTGA TCGAAAAAAA TCTTCGGGCG GCTTTGGCTC AAGCTTTTGA AAATGGTAGT ATCATTTTGA ATTTGGCCGC AGAAGCGAAG AAAGAAGAAA GATGGCCTAA GGAAGTACGA GATCGAGTTG TCGAAGAATG TCTATTTGAA GACGATGATG GAGACGAACC TGTATCGGGT CTCGGGCCCG TTTCCTATCC TTTTGGAGCT ACACGGGTTT TGTTGACAGA AGACAACGAC GAAATCGGAG ATACCTTCGA AGTCGATAAG AATGATTACT CTCAGAACGA TTTATTCTTG GGTGGAGGCA ACGACGGTGT CTTCTTTGAC TACTCGGAAG AAAATGCGTT CCGGGCTCCT TTCCGAGGAC AGCTTGGCTT GCGGCTGGTC GATGCGGTCA CGGAACGTGC CAAGCAGCGG CAGCCCCGCG TAATCGCGAT AGGTGATGTC CATGGATGCA TCGATGAGCT ACAAGACTTG CTACGTCAGT GCGACTATCG ACCAGGCGAT CTGGTCGTTT TTCTTGGCGA TCTAGTATGC AAGGGTCCTG ACAGTATTTC GGTCGTTCAA ATGGCCCGTG AAATCGGGGC TTTTGGTGTA AGGGGTAATC ACGACTTTGA AGTAATTCGG TGGCACCAAG CTATCAAGTC CGGAGTAGAC CCTCCGGTAG TAGGCTCAGA GCATTTTCAC ATTGCGTCTT GTTTGAGCAA GGCCGACATG AAATGGATGA ACAGTCTCCC TTGGTACTTG TCTAGCAAAG AACTTGGGTC GCTTTTTGTG CACGCCGGCT TTGTTTCTGG GATCAGGCTT GCAAAGCAAA ACCCTCGCCT GATGATGAAC ATGCGCAGCA TTCTTCCTGA CGGTACGGTT ACATCAAAGT TCTTCAACAA CTGGCCCTGG GCACGTCTCT GGGACGGTCC GCAAACGGTT TTATTCGGCC ACGACGCCGA CCGGGGCTTA CAGCAATACG AGCACGCTAT TGGACTCGAC ACTGGCTGCG TGTACGGTGG ACGATTGACC GCTTGCATAC TTCCCGAAAA GAGGTTAGTC AGTGTGAGTG CAAAGCGGGA GTACTTCAAA TACCGTCGAA AGCACTATGA TTGATGTTAG CTTGGTATTA GCTTATTCCA ATGTCTATAT TGGAGGTAGC TACCGTATAA AAGTGTCCGA GGTAATCCGT CCTCTCTCGC CTACTTAGTT GCTCACTATA ATTATGTCGC AGCTCCTTAT TAACAGCAAA AGATATTGTG ACAGCAAGCA AAGCTGTCAA AAAACGTACT GTATGAGTTC GGTTCCCGGT CACAGCTCCA GTCGAGTCCG TGATCAGGAT TCGATCTCCA TAACCGAGAA CGTGAGCAAG GTCTATTACA TTCTTTACAG TTAACTTTCA ATTTTTAGAT TTCACTACTG ACCGTTTTCT CTGTCGGTGA GATTCTGACG TACGGTGCCA GACACACGCA CGTTGCCGAA GCACTCTCGA ATTCTTTTTT TGCTGGCGTG ACGATTCGCG AATCTTGCTA TCGATTCTGT AGATTCCTAC GGAAGCAGAA TTTCCAAAAC AAGCGCGAAG GCATCAAATT TTGA
|
Protein sequence | MILLRRLHRT SRPNIPLRGK RKSGSDEETA SDSNEDLGRK RLLSKDRAIS ALSQPNAPTQ TPNTNNDSTN KQSTSKAASL LPSKKVKSDR QEDERTRIKK NNAPKASTVP LQSRSSDRRW KNSKRSREIT PKVPNTAKAA YSSLLPPPLS GSNGSDDDSL STGLSSWEEF LGKGKESSGS RTTPIVGRSQ EKRSPSRVEA RKREKSKEEQ ADESDNTDKH RNLPSILDLF PSALSTNPNE RSASSSKVLS EDAGKSYTSL DGVLPVSDLF YRSSIPQAQR IDGSEQATNA TDDGEESPYR RAVSQTPGKQ EMAKKPSSTK KRSGRKMVRR GMEMLVGGVP INADPPQRNV DLCYDRLAAD WATSISLNTR EFGPLLHGAS IPKVSLKERG LFCEYFCHAA MKWDVCPTDL QSIIDSHSKQ ISGFEVSAWK KTSAHLPGGK DLDKLAESIS KDILSRLHED SDTSDSMITK DTRHDGKSSG DGIPRIAAKL DLHAKDSRKQ RSRRSKSKAY AQSKAKGFGK DKNGARRSKK EFVDTDALAL YSNEFKIMAQ PIGISKSDLE SGGQGTQVFE SVLRKAFETH PVLIEAIGEF HVDIYNCTIE GTGPDSSLYS VEFGVFPKKV IHQSEKPELL HKMIEALRFI LDTDDSEDTL NSTLARIASE EYRWSPSIRE RVALQFNDQN APNQRLLYTV NAGVLEFEIG VSRAELESGG DGGEIFQSVL EKAIGGAMRN SLAGFHFSIT HFTLDDHDDG TSLVSADVQM ETSEPIARSE NRLIEKNLRA ALAQAFENGS IILNLAAEAK KEERWPKEVR DRVVEECLFE DDDGDEPVSG LGPVSYPFGA TRVLLTEDND EIGDTFEVDK NDYSQNDLFL GGGNDGVFFD YSEENAFRAP FRGQLGLRLV DAVTERAKQR QPRVIAIGDV HGCIDELQDL LRQCDYRPGD LVVFLGDLVC KGPDSISVVQ MAREIGAFGV RGNHDFEVIR WHQAIKSGVD PPVVGSEHFH IASCLSKADM KWMNSLPWYL SSKELGSLFV HAGFVSGIRL AKQNPRLMMN MRSILPDGTV TSKFFNNWPW ARLWDGPQTV LFGHDADRGL QQYEHAIGLD TGCVYGGRLT ACILPEKRLV SLLINSKRYC DSKQSCQKTY CMSSVPGHSS SRVRDQDSIS ITENISLLTV FSVGEILTYG ARHTHVAEAL SNSFFAGVTI RESCYRFCRF LRKQNFQNKR EGIKF
|
| |