Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45473 |
Symbol | |
ID | 7200571 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 293841 |
End bp | 298085 |
Gene Length | 4245 bp |
Protein Length | 1226 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179615 |
Protein GI | 219117648 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCC TGGCACCGAG AGGTAACATT ATTGCACTAT TGGGTCTCCC ACGCGGCAGC GTTATTAGTT TGGATGGGCA AACGGTAGCT TTGAAACGGG ATGACTTTGT AGGCTTTTCG AACGTCCCCG CAAGTGACAG TTGTCATTTT GTAACGATAA GGGCGACTTC GAAAACTAGC ACCACGAGCG ATTTCCCAGC GCAATTGGCA GCAGCAGTGA CGGTTGCCTA CATGACCTGG GACAATGACT TGATTCGCAA ATTCGATCCG CAAACAGAAG AAATGTCCAG TGTACCTGCC GACACTTGCA CAACCGACAA TCTACTGCAC AGAATTGAAA CACGGCACAT CGATGGACAA AGTTTAATTT CTTACCCACA GCTATTAAAC GACGAAAAGG AGAAGGACTG GGTATTGCTA ACAAATCACG TCTCGAAACG ACTGCTACAA AAACGCCACA TAGGAACAAA TGATAAAATT GTTCCAGGAT TGTGGGATGA GCAGGAAAAC GAAGCAATCG ACGGTACACC GATCCACTAC CCCATCATTC CAATCTTAGC CCCACATGCA TCAAGGCATG CAGCTACAAA ACGATACTTG GCAAACTTGT CTCCCTCCGC TCGGACAAAA TTATATACCC ACGAATCACC TTCCGATTCC ATTTTTGAGC GAGTGTTGCT TGAAAAGTAT GACAATGACA GCGGGCTACT CTTGGGCGAT ATACAATTGG CATATGTAGT ATTTTTGCAC TTGCACTGCT TCGTCTCTTT TGAACACTGG CGGGACATGA TCGTCATGCT AAGCTTGCTA TCTGAAAAGG TTTTTGCCAA CGACCGCATG AAGCGCTTTC CAAACAAGCT GATACAAGTG CTCAAATCAC AGTTGATGGT AATCGGAGAA GATTTGATCG AGAATTCAGA CTTGTTCGAC GACAATGATT TATCACCTGC CATGCAGCGT CTGATTGCCA TCTTGATGAA GGTTGTAAAA GATGGTGAAG GGCAGGCTCT TTTGTCTCGA CTATGGGCCA TGCTTCGTAC CCGATTTCCG ATGTTTACAG ACAACGAGTT TCATTTATCA AGTAATTACA GCTCTGGAAC AAACAACGAA GAGACAGAGG AAAGTGACGA GGATAGACCA GTTCTAGTGG CCAGCGAGGA AGTGCAGGCT TCGTTGGCTC GGTCACAAGG CGCGCGGTTG TCAACAGGTA TTGCCTTTGA ACCTGTTTTC ATGAAGACCG AGCTTCAGGA AGCTTATCCA CTCTTACTTG CAGCGGTCAT GCCGCACGAA GATATACTTA TGACGTGCGC CAGGGCGCTG GACGAAGCCA CTGATGTTTC TTTGGTTCGC GAAGCCGCAG CTTACTTGGA GCAAGTCGAG CTACAAAGAC ATACTACAAG GTAAGGCGAC TTGGACTTTT CACCATTGAC CATCAATTTG TAGAGAGATC GAACGACGTG GTGACTAAAA GAGAACCTCC CCTCGATCAT GTACTTTGTT AGCATAACAT GCCTGGATTT ACAGTTATTC ACATAAACTT ACAAAGACAC ATCATTCTTC GACTTCTTTG GTCCCCCACC GTTTCCAACC AATAAAATCT GTTCAGACAT TGTGGATAGT GTCGTATCGA CATAGCTTTG TGTCGTCCAC AAGCCTTTTC TGGCCAAATC CTTGGAAGAC ATCGAACAGT AGTAGTGAAA AAGACATCAT CTCATCGTCT CACTGCCAAA TGAAGTATAG GATCGAATTT ATCGTTCACT GGTACCGGAA CTGGACTTTG TTTTACCTTG TATTTCTACA AGCAGCTATG GCGCAAGTGG ACTTTGTAAA GGATGCGAAG GAAATTCGTT CTTATTTGAA GACAGAGGAG GACGCGTTTG TGACCGATGG CATACATGGT CTTATCCGAG GCTCCTCTTC CATCGCACGT CTCATCCGCA ACGCGTTGCA TACAGATGGG TACTTTGTTT TGCATAGTAT CTTGACGACC AGAGAGTGCC AGCAAGCCGT CGATAGGATG TGGGACTTCG TACATGATAC ATCGGCGGGA TCCGTGCAAC GGGAGCAAGA TGAAACCTGG GCTTCGTGGC CATGTGATGA ACCCATCGAT CGTTGCAATA GTGTGGAAAC ATTTAATGTA AACGGAGCTG GCTGGTTACT TGGAGATCTC CGGGAGCAAT TGGCGGAGAG GGTTTTCGAA GATTTGTTCG GTACTTCTGA GTTGCATTCT TCTAAGGAAG GGTTTGTGTT TGGGCGTCCG AACCATGTAG CCGCCGACGG CAATTCCGAC ATGTCGTTTA CGCGAAACAG CGATGTGACG ATTCGATCTG TAGTGGCGCT TGAGGATGCA GGTGCTGCTA AGGGTGGATT TTCTTTCTTC CCCGCGTCAT TTCGAGCTTC ATTGGAGGAA TGCTTGGACA AAAAGCCAGA AATAGTCAAT CTAAAAAAAG GAGATGTGTT ACTTTGGCGG TCGGATCTCC TTCATGCTTT GATCCCACCT TCACAGCCCA CGCTTCAATT TCAAACATTG GCACTCGTCA GCATGCAACC CGCTAGCCGA ACTCCCGTGT ACCTGCGCAG CCTCAAAATG GAAGCGTACA AACAACGAAG AAGTGGCAGT CATTGTGTCC ACGAAGAGAA CTGGAGCCGA AATAGTGGCA TCGGTCGCCC CTATTTTCGC TGCAGTCCAC CTCTACTAAC CCGACGACAA GCGGAATTGT ACGGTCTAAT ATCATACACA AGCTGTGACG AAGCATGGAA AGAAGAAAAG AAGCGAGCCA TGGTATGCGG TGTGCGTTTC CAAGATGAAT TTGAGGCGCA CACATTTCCT ACGGCTCGCC CGTGTTCGGC TGTACTTGAC TATTTGACGA CAAATAATCC AGGCGACATG ATGGGAAAAG ACAAATATTT AGGTGGAGTG GCATCGCCAT GTGGAAAATA TGTGTATGGC GTCCCAGGAT CGGTACGCTT TTTCTTCTTT TGGCGTTGCG TCTAAAAAGA GTGGACTTAG TAGCTTACAT GGATCACTTG TTCTAACAGG CGCGACGCGT GTTGAGAATT CACGTAGAAG ATGGAAATAT GGACTGTATC GGACCTTCAT TCGAAGGGAA ATTTAAATGG TTACGGGGCG TCGATGTTCC AGCGGAATCG ATGATGGACA AAAGGTATCC GCAAGGATGC TGTTTGGCTC TCCCTTGCAA CCATAGCTCC ATTTTAAAAA TCAATCCATC GACAGATGAG GTTTATTCAT TTGGCCAAGA CACCATAAAA GGCTGCGGCA GCGACGATTG GCTCTACCAT GGTGGAAACC TGGCTTCAAA TGGTTGGATT TATGCGATTC CTGCAAACGC GAAACAAGTG CTTAAATTCC ACCCGGTAAC AGACAAAGTA TACTTGATAG GGCCAAACTT TCCTGGTCGG TGCAAGTGGT TTGGTGGAAT CCTTGGTTCC GATGGTTGCG TCTATGGTAT CCCTCACAAT CAGACCGGCG TACTGAAAAT CGATCCATCA ACAGATCAAG TCGCAATTCT CTACCATGAC AGTGGAAAGC CGCTGCCGGA TGGTCGCTGG AAATGGCATG GTGGTATACG TGCTGGTGAC AAGATCTATG GGTTTCCAAA CAATAGCGAC AACATTCTAG TCATCAACTG TCGCCTCAAA CAAGTCTACA CAATTGGGGA TAGTTCAATT TTGAGATCGG GTAGACATCG AGTGAACAAT GATAATCGTT ACAAGTACCT GGGTGGCGCT TTAACATCAG ACGGTGGGTT TGCCTACCTT TTTCCGTGTG ATGCAGAACG TGTTCTCCGA ATTAATTGCG ATACGGATGA TCTCGCGCTC GTGGGGCCAT TTTTGCTCGA AGGGGAAAAC AAGTTCCAGA ACGGCTTTGC CGCACGTGAT GGATGTTTGT ACGGTATTCC ACAGAGAAGC TCAGGTGTCT TGAAAATAAC ACCGTCTTCG AATCCCAGCG AAGAGGACCA CGTTGATATT GTATATTGCG GCGATGACAT GATAGGTTGC AAAGATAAGT TTGAAGGAGG AGTCCTGGGA CTTGACGGGC GTTTATACTG CATTCCTTTG CGAGGTGAGC AGCTTCAACA TATCTTGTTA AATGCATTCA CAGTCTGCAT GCTCTGACAC ATACTACACA TTCTCAACTA CAGCAAACAC ATGTCTAAGA ATCACACCTT CGTATAGTAC ACCATCCTAA AAAAAAACAA GAATTCCAAA CAGTATTCTA ATTTG
|
Protein sequence | MNILAPRGNI IALLGLPRGS VISLDGQTVA LKRDDFVGFS NVPASDSCHF VTIRATSKTS TTSDFPAQLA AAVTVAYMTW DNDLIRKFDP QTEEMSSVPA DTCTTDNLLH RIETRHIDGQ SLISYPQLLN DEKEKDWVLL TNHVSKRLLQ KRHIGTNDKI VPGLWDEQEN EAIDGTPIHY PIIPILAPHA SRHAATKRYL ANLSPSARTK LYTHESPSDS IFERVLLEKY DNDSGLLLGD IQLAYVVFLH LHCFVSFEHW RDMIVMLSLL SEKVFANDRM KRFPNKLIQV LKSQLMVIGE DLIENSDLFD DNDLSPAMQR LIAILMKVVK DGEGQALLSR LWAMLRTRFP MFTDNEFHLS SNYSSGTNNE ETEESDEDRP VLVASEEVQA SLARSQGARL STGIAFEPVF MKTELQEAYP LLLAAVMPHE DILMTCARAL DEATDVSLVR EAAAYLEQVE LQRHTTRIEF IVHWYRNWTL FYLVFLQAAM AQVDFVKDAK EIRSYLKTEE DAFVTDGIHG LIRGSSSIAR LIRNALHTDG YFVLHSILTT RECQQAVDRM WDFVHDTSAG SVQREQDETW ASWPCDEPID RCNSVETFNV NGAGWLLGDL REQLAERVFE DLFGTSELHS SKEGFVFGRP NHVAADGNSD MSFTRNSDVT IRSVVALEDA GAAKGGFSFF PASFRASLEE CLDKKPEIVN LKKGDVLLWR SDLLHALIPP SQPTLQFQTL ALVSMQPASR TPVYLRSLKM EAYKQRRSGS HCVHEENWSR NSGIGRPYFR CSPPLLTRRQ AELYGLISYT SCDEAWKEEK KRAMVCGVRF QDEFEAHTFP TARPCSAVLD YLTTNNPGDM MGKDKYLGGV ASPCGKYVYG VPGSARRVLR IHVEDGNMDC IGPSFEGKFK WLRGVDVPAE SMMDKRYPQG CCLALPCNHS SILKINPSTD EVYSFGQDTI KGCGSDDWLY HGGNLASNGW IYAIPANAKQ VLKFHPVTDK VYLIGPNFPG RCKWFGGILG SDGCVYGIPH NQTGVLKIDP STDQVAILYH DSGKPLPDGR WKWHGGIRAG DKIYGFPNNS DNILVINCRL KQVYTIGDSS ILRSGRHRVN NDNRYKYLGG ALTSDGGFAY LFPCDAERVL RINCDTDDLA LVGPFLLEGE NKFQNGFAAR DGCLYGIPQR SSGVLKITPS SNPSEEDHVD IVYCGDDMIG CKDKFEGGVL GLDGRLYCIP LRQTHV
|
| |