Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_18693 |
Symbol | |
ID | 7204038 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 995612 |
End bp | 1000106 |
Gene Length | 4495 bp |
Protein Length | 1351 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186167 |
Protein GI | 219113167 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.749806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAGAGAAAG CTTCTTTTGC TTCCCAACTG TTTTTTCAAT ACGCTTCGCC CCTCATTCAC AAGGCTTCGG AACGTCGATT GGAAGCTGAA GATTCATTTG AAATCTCCGA GTCACAGAAA ATGGATCACG CTGTCTCGGG TCTGGCGTCA GTTTATCGGG AGCTGAGGAC TCGATCGAAG ACTCGTCAAG AGCAGAAAAA TGTGGACAAA GCCGATCATG GGACGGAATC ACAATCACTG ACATTGACCA AAGCCTTGCT GTTGCATCAA AGACGAAATC TAATTGTGAC AGGGCTGTTG CGCCTTCTGA ATACGTCCAT TCAGGCATTT CCGGCTGTTC TGGTCTCTCG TCTCCTCAAA CTTATCGAAG CCGGTGACAC ACACGCGCCT GCCAAGGCCC TCCAAGCTGC CGTCACTCTT GTCGCGGTAC TTAGCATCAA AATGCTTGTG GAGAACCAGT ACTTCCACAA GGTTGTTAAA TGCTCAACGC AAGTGCGCGG CTCATTGGCG GGGCTGATCT TTGACAAGAG TCTGCGACTG CCTGGTGGGG GCAGCGGTGT CACGCACAAG GATGGCAACG GCGAAACGTC CGCTCTTGGT TCCGGAGGTG TCCTCAATTT GATGCAATCC GACGCAAGTT TGATTGAAAG TGCCGCCTTA CAGTTTCATA CCACATGGGA TGGCCCGCTG CAAATTGCCA TTTACACATA CTTGCTGTAC CAGTATTTGG GGTCGTCTGT TTTCTGGGGG ATCAGTGTCT TATTGGCCAC AATTCCGATC AATAGTGTCA CATTACGCAT CCTTAATCGA CTCGCCAAAT TTGAGAATGA AGCCAAAGAT AGGCGCACCA AACGTACGGC CGAGTCTATC AGCAACATGA AGCTACTTAA ACTTCAAGGC TGGGAGCAAC GCTTTGCGGA TGACATTCGT ATACATCGGA AGGAAGAACT TTCCCGACAC GTTTCCCGTG GTATCGTTCG AGCTGTGAAT ACGGCTGTTT CCAACGCTGT TCCAGCCTTG GTGTTGGTCG TAACGTTGAC TGCCTACGCC AAGACTGGAC GTCCAATCGT GGCGAGTACC ATTTTTACAG CCATTTCACT TTTCAATCAA CTACGATTTC CACTATTTTT TTATCCAATG CTCATTGACT CTCTCGCGAA TGGCAAAAAC GCAATGCGAC GCATTTCGAC GTATTTGTCA TCCGAAGAAA TTACACCATA CGTGAATTTT TTCCCGACTG TCGACGGTAC GGGTGGCTCA ATTGAAATGA CAAATGGTAA CTTTCTGTGG TCGTCGACAA GGTCCATAGA TGGAAATACT ACGTCAATTT CTCCAGCCTT GTCCAACGTC AAACTAAAGG TATACCCTGG AGAAGTAGTT GCAGTGGTTG GAACCGTTGG AAGCGGTAAA AGCGCACTCA TTAGGGGCTT ACTTGGCGAA CTGAATCCAG TGCCGCGAAC AATCATACAA GAAGCGTTGA AGTCCGAGGG GCAGGAGTCA GTCGGAGACG GCGTTCTTGA CCGAGCAACA GTGATTACGC ACGGGAACGT CGCTTATTGT TCGCAAGAAG CATGGCTTCC GAAAGGCACG CTACGGGATG CGATTGTGTT TGGACGGGAG TATGACAAAG ATCGATACCG AGCTGCAATT TATGATGCAG GGTTGGATAA GGATATTGTG AATGATGCCA GCCTCGCGGA TTCGACGGAA GGTATTTTGA GCCATGACAC CGATGTAGGC GAAGGTGGAT CCTCTCTCTC CGGTGGACAG CGAGCGCGAG TGGCTCTGGC TCGGGCCTTG TACGCCGGTG ACGATACCAA GGTATTTCTA TTGGACGATT GCTTGTCAGC TTTGGACGCA AGTGTTGGCT CGATGGTATT TGAACGAATC ACTGCTCGAT TGCGAAAAAC AAACGCCGCG ACCGTTCTAG TCACCAATGA TCCGTCATTG CCAAGACGTT GTGATAGAGT GTACTTAATG GGCAAGATTT CGAGTTTTGG CTCATGCTCA ACAATAGTGG ACAATGGAAG CTACGATGAT CTTCTATCTC GCGGCCACAA CCTGAGAAGT ATCTCAACTG TCGAATCAAA TTTTGGAATT GACAAAAAAA TTGATTCACA GTCAGGCAGT CGTGTTCATT CTGATGCGGG AAGCGCTGTC GATAATATGA TTGCTGCTCA ACCGACAAGG GAGGAGAGGA GAACTCACGT AACCGGTGTT TTGGAGGAAC CCACAAACGC AACAGATATT AAACAATGGC GCCATGCCGA TCCAGAGTGT CAAATAACGA TGGAAAATTG CCCAGATTAT ATTGCTGATC AAAGTGCCGA TCGCACAACG TCTGGACAAT ATCAAAGCAG AGATGAGCTG ACTGATGGCG AAAGAGAGAA AGTATTCCCT GTATTCGCCG CAAAAGGTTC TATCGTTGGA CCGGGTCATT TCAAGGATAC CAGCGCTCTC GCAGACGACG AGCGTCCCTT GATGTCCTCT AGTGTCACGA AGTTAACATC AGCGGATGAT TTCATGTCAG CAGGAGCAGT TCCACGATCC ACATATGTAG CTTACTTCAA ATCGGTGAGG AAACCAATTC TTGTTTTTGC TATGATTGCT TCCTATTTAA TGGCAAATGG CGCACAGTTT TACCAGCAAT ATACGGTTGC GAAATGGACT GAGCTTTCTC ATGCCGATGC CATGGCGGCT GCTCTCGGAG CCAAGTATCT GCGTTCCCTT GTGAATGCTG CGGGCGTCGT TTCTGTCTTC TTATGGTTCC GAAGCGTTTT TACTATGCAA GTTGGTGTCC GTGCATCAGA TTTCTTTCAC TCTCGCATGC TGTCGTCGGT TTTTTCGGCC CCAATGTCGT TCTTTGACGC AACCCCTTCG GGTCAGATTT TGTCCAGATT TGGAAAAGAA ATTGAGACTG TGGATCGAGG AGTGCCAGAT AGCATTGGGT CAGTTCTTTT CTGTTTCCTA CAAATCTTTA TGTCAATTGG TGCGCTATCT GCTATCATAA CGCCTGGAAT GCTTGGTCCT CTAGGTATTG TTACATATAT GTATATCAAG ACTATGGCGA AATTCCGCCC GGCAGCTCGA GACATGAAAC GAGCCGAAAC AAAGACTCGG TCGCCAATTT ACACGCAATT TGGCGAGGCA CTTCGAGGTA CAGAGACGAT CCGTTCTATT CCAGGCGCGA AGCAGACTTG GTCCTCGAAG CATCGCTCAT TGTCTGATCA AAACCTTGGA GTATTTTACA CGGTGAAATC CTTCGATCGT TGGCTTTCAA CCCGGTTGGA ATCACTTGGC AATACAGTCG TCTTTACGGC AGCAGTTGCC TCAGTCTTCT TAACTCGCGC TGGCCGGATG GAAGCTGGAT CAGCAGGCTG GGGTCTCACG CAGGCTCTTG CAATTACCGG TCTACTGACA TGGGCGGTCC GCTGCTTGAC AGATCTAGAG ACTAGTATGA TGAGTGTCAT GAGAGTCAAA GAGCTGACAG ATTTGGACAG AGACGAAGTT GATGTACCCG GGCAAACGTC GAAACGCATG CCAGTCGAAC CTTCCAATGC TGGTGACGCA CTGATGCCGC TTCTTCCAGA ATCTGTACAC TTCAACTCAA CACTGGCCCC GTTGGATAGC AATGTCCTCT TGAAAGATGG ATGGCCCTGG AAGGGCAATG TTATTTTTCG AAATGTTTCA ATGAAGTACA ATCCGTCATC GCCACTTGTT CTTCGTTCTG TAACGGTCGC AATACCGGCA GGAACGACAC TCGGGGTGGT CGGACGAACG GGATCAGGAA AGAGTTCCCT TCTTTTAACT TTGTTCCGAC TAGCTGAGAT AGAGAGTAAC GGCTCAATTG AAATTGATGG TGTGGATATT CGATCAGTCA GTTTGGAAAC GCTGCGAAGT TCGCTTGCGA TAATTCCTCA GGATCCTGTT CTATTTGCTG GTAGTATATC ATATAATCTG GATGCCAGTG GTAACGCAGA GCCGTCAGAA ATGTGGAATG CCTTAAAGGC TGCCTCGCCA AGTCTCGCAC GCCAATTTAT GTCGACAGGT GGTCTCGAGT CCCCAATTTC AGAAGGTGGA AAAAATTTAA GCCTAGGACA ACGACAGCTC ATCTGCCTAG CACGGGCCTT GCTCCGGCAA AGTAAGATTC TCGTCTTGGA CGAGGCTACC AGCAGCGTGG ATTCGAAAAC AGACCAAGAC GTCCAGGAGA CCATCCGTCG AGAATTCGTT GAGAAGGGCG TGACAGTAAT CACCGTAGCT CATCGTTTGG ATACGGTGCT GGGTTACGAC AAGATTGCTG TACTTGGAGC GGGAAGAATG CTAGAATATG GAGCACCAAA CGAGCTTTTG CAGAAAACCA GCGGCGAGCT TCGACGCTTG GTCGATGCTG ATAGACTCAG CAAAGAAAAA GGTTCCAAGC AACAAGCATC AACTAACAAT AAGGTCCCGG CGGGGCTTAT GTAAAGGGAG ATAATACAAA AAAATTTCGA AATAC
|
Protein sequence | MDHAVSGLAS VYRELRTRSK TRQEQKNVDK ADHGTESQSL TLTKALLLHQ RRNLIVTGLL RLLNTSIQAF PAVLVSRLLK LIEAGDTHAP AKALQAAVTL VAVLSIKMLV ENQYFHKVVK CSTQVRGSLA GLIFDKSLRL PGGGSGVTHK DGNGETSALG SGGVLNLMQS DASLIESAAL QFHTTWDGPL QIAIYTYLLY QYLGSSVFWG ISVLLATIPI NSVTLRILNR LAKFENEAKD RRTKRTAESI SNMKLLKLQG WEQRFADDIR IHRKEELSRH VSRGIVRAVN TAVSNAVPAL VLVVTLTAYA KTGRPIVAST IFTAISLFNQ LRFPLFFYPM LIDSLANGKN AMRRISTYLS SEEITPYVNF FPTVDGTGGS IEMTNGNFLW SSTRSIDGNT TSISPALSNV KLKVYPGEVV AVVGTVGSGK SALIRGLLGE LNPSVGDGVL DRATVITHGN VAYCSQEAWL PKGTLRDAIV FGREYDKDRY RAAIYDAGLD KDIVNDASLA DSTEGILSHD TDVGEGGSSL SGGQRARVAL ARALYAGDDT KVFLLDDCLS ALDASVGSMV FERITARLRK TNAATVLVTN DPSLPRRCDR VYLMGKISSF GSCSTIVDNG SYDDLLSRGH NLRSISTVES NFGIDKKIDS QSGSRVHSDA GSAVDNMIAA QPTREERRTH VTGVLEEPTN ATDIKQWRHA DPECSIVGPG HFKDTSALAD DERPLMSSSV TKLTSADDFM SAGAVPRSTY VAYFKSVRKP ILVFAMIASY LMANGAQFYQ QYTVAKWTEL SHADAMAAAL GAKYLRSLVN AAGVVSVFLW FRSVFTMQVG VRASDFFHSR MLSSVFSAPM SFFDATPSGQ ILSRFGKEIE TVDRGVPDSI GSVLFCFLQI FMSIGALSAI ITPGMLGPLG IVTYMYIKTM AKFRPAARDM KRAETKTRSP IYTQFGEALR GTETIRSIPG AKQTWSSKHR SLSDQNLGVF YTVKSFDRWL STRLESLGNT VVFTAAVASV FLTRAGRMEA GSAGWGLTQA LAITGLLTWA VRCLTDLETS MMSVMRVKEL TDLDRDEVDV PGQTNVLLKD GWPWKGNVIF RNVSMKYNPS SPLVLRSVTV AIPAGTTLGV VGRTGSGKSS LLLTLFRLAE IESNGSIEID GVDIRSVSLE TLRSSLAIIP QDPVLFAGSI SYNLDASGNA EPSEMWNALK AASPSLARQF MSTGGLESPI SEGGKNLSLG QRQLICLARA LLRQSKILVL DEATSSVDSK TDQDVQETIR REFVEKGVTV ITVAHRLDTV LGYDKIAVLG AGRMLEYGAP NELLQKTSGE LRRLVDADRL SKEKGSKQQA STNNKVPAGL M
|
| |