Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46563 |
Symbol | |
ID | 7201846 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 725233 |
End bp | 730024 |
Gene Length | 4792 bp |
Protein Length | 1245 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181062 |
Protein GI | 219120656 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0629816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATGGAGTAA CTGCAAATTA ACGTAAGCAT CATTACGGTT TCATCCACAC ACGTTTCTCT CCAATACAAA GCATCCTAGG ACCTAGGACC AAGGCACTCC TTCCCTGGTA CGTCTACTCC GCATGACGAC TCCCACGGAA AGCTATATAC TTAGAGAGCA ACCTGATTGT GCGTGCGGGC ACGGAAAAAG GACCCAAGAT TTGCCTCACC AAAAAAAAAT GGACGGCACG AAAGGATGAC CAACGTAAAT CTATGCCAAA AGACCCTACC AGTTTAAACT GAATTTGGAT ATAGTCCTGG GAGCAATGGT GACTCGTTTA AATTTGTCTC ACATACTTCT GTTCGCTTGT ATTCTTCGAA GGAGTATCAG AGCATTTGGG GTAGCGTTCT TCATCTCTAT CAGAAATGAC CGTAAAAAAA AGGGTAAGGT CAAAAAGTGC TATGCTAAAA AAATGATGAT ATTCCTAAGA AAAACCATTC AAATGTCAAA AGACCCATCG ATATGAGTCT GAAAGTGAAA CTCATTTTGG GCTTTCCCCA AGTCATACTT TTCTTTGAGG CGTACAATCC CAAAGAGTTC CACCAGATGC TTTGGGAGAC CTCTCACAGA AGATTGATGG AAATGACCGT GTTGGAGTAG ACCGATCGTG CCCAGACTAG TACACCCTAC GAGATCCATT TTGCTGAAGA AATAAAAATG ACGGAGAATC GAAGAATGAC CAACCGATGT CAACAATGTC AAAAAGACCT TGATACCAAG GATTTGGTAA TTGAGGTACT GTTTGATGTT ACATTTTCAT GCTCTTTTCT GCTCATAATC TTTGTGATCA CAATTGCAAC TCACAGTCAC CATTTTTGAA AAGCTCTCGA GAGAGAACAT CACAAGCGAA GGCTAACGAT GTACTCTCAC CTTGTCCTCG GCTTAGCCAT TCTTTTCCTT GGCGCTAGCC AGTGCAACAG CAACCTGATT CGTGGTAATT CTTCGATGTC TGCGAGCGAA GCCGGCGATT CTGAAGCGCT GACAACAACC ATATCTCGGC GTCTTGTCGA AAATTTTGTT CCTTTGAGGT GCAATGCTGC CATCGGTAAA GCCCCTTGTT CATCTTTCGT CACGATGTTT GGCAATGGCG CTGTTCGATC AAACAGAGTC ATTATTCCAT GTGGCAAGTG CGTTACAATG GACTATCCGG GACCGCAACT AACACTAAAT GATGGCATCG ACATACGAGG CAAACTAGTG TTTCCAGACG GATACCGACT CACGGTTCGG ACGAACTTGG TGTCGGTCCA AGGCGAGCTT GAGATGAGAA GTTCTAAACC AGTTGATGGA AACCCCGATG TACGGTTTGT AATGCAAGGT ACTGAGAGTG CGTCGTTCGC ACCGATCAGC AGCAACGCCG CAAAATGTGG AGGAAATTGC GATGCAGGAG CTCGATCCAT TGCAGTAGCT GGTGGGAAGG TCAATCGTAA GTTCGGATTG TTGCGAAAAT GGCATCCGCA TTTCCTGCAC ACATATACGT GACTGTCGGC ACTTACAGAA ATCCCTAATT GTTGCAGTCA ATGGCCTACC TACTAGCACA CCAACCTGGC TACACGTTCA CGACGTTATC GGAAGCTCGG CTATTGTCGT TTCCAATTCG GTCCGTAATA AATGGGGTGC CGGTGCAACT ATCGTCATTA CCTCCGAGCA CCAAGGTTAC TTTGGCGAGC AAGTTCGTAA AATCACCAGC ATTTCGGACG TCGGCTCCAA TAGCGTGCGT CTCAATCTCG ACCAACCCAT CAACCGTCCG GTCACGCTTC GCGACAGCCC AGACTTTGCC ACCGAGGTAG CCTTGCTGTC ACGCAACATT GTCTTTGAAG GGGCCCCCGG AACAAAGGGT GGCCACTTTT GGATCATGCA CACTCCCCGA GTAAGGCAGC GTATCGAAGG AGTGGAACTC GTTAACTTCG GACAAGAAGG CCTCTTGGGT AGATACCCGG TCCACTTTCA TATGTGCGGG GACGTCTCAG GCTCGGTAGT AGTCAAGAAT ACCATTCGCA ATTCCAACCA GCGCTGCGTT GTCGTCCACG GTACCAACAA CCTCCTTGTT CAAGAAAATG TAGCCTATTT CACCAAAGGA CACTGTTACA TGTTGGAAGA TGGCATCGAG ACGGGCAATC AGTTTGTGCG AAACATTGGC ATACGCACTG TCAAACCAGA GATCATCATT CCGAACATGG GTAGTAACGG CAAGGAATCT GACATGACTG CCAGTACCTT CTGGATCACG AACGCTGACA ACTCGTGGAT CGGAAATGTA GTAGCCGGAT CCGAAGCCCT GGGCTTTTGG TTTGAGCTGT TGGTGCGTGG AAATCTGGCC AACGAGCACC AAGACTTTGA TCCTATGATG GTTCCGACCC GCAAGTTCGA GGGCAACGTC GTTCATAGCG TGTATGGGGT AGGGATGACC TACTACTTGA GCGGTTACAT CCCGGAGACA CTGCAGTACT TCAAAAACAA CAAGTTCTTC CGCAACCATC ACCTTGCGCT CCGTATCCAC CGGGCGCGAA ACATCGTTCT GACTGGCAAC AAGTTCTCGG ACAACAGATA TGCCATTCGA ATCGATCGTG ACGAAGAAAT TCATGTCACC GACACAACCA TTGTTGGTTA TTCCGATCTA TTCAAGGACG TGGTCAGAAG GAATCGCTTT GCACAAGCAC CATGCGCCCA AGGGATATCT TTCCAATCGA CTAATCCATG GAAAGACAAG ATGGATTCGG AGCTCAACGG TGTCATTTTG GACAAAGTCA GATTTTCTGG ATTTTCCAAC GCGGTGTGTT CGTCTTCAAC CGCAATCGAG CTGGATTCCC GACTCGACGG GTACAAATCG TTCGAGATGT TCTCGCAGTA CTCCGGCGCC ACCGTAAGTG ACGCAAACTC GATCGACTTT TGTCGTGGAA AGTCTGCCGG TGCACGTGAT GTGTACGTGT CCGATACCAC AGGTTCTCTT CTCGACGGCG TTGCCTCTGC TCCTTCTACT CTGATGGTCA ACTCGCCGGA GCTGGCAAGC TTTGTCAATC CCAGTGCATG TACCGAAAAC GCAGCTCGAT GCTACACCTA CTGTAGCAAC ACTTGCTTCC GTACCGTTCA CTACTACGTC CCAGTGGGCC AAAGTCGAGA CTACAAGCTC AAAGTATGCG ACCGCAAGGA CGCGTCCGAC TGTACCGTAC TGTCCGGTTA CGTACACTCT AATCACCCCT GGCCTCGCCG ATTTGCCGTG CATGTCCCGT CAGGGCGGGA GTACGACACT TACTTCCTTG ACAAGGGAGT GCCTGTGTAC CCAACGAATG TCGAGATTGT GTTCCAGGAA AAGCTTTGCC CTACGGCACC GGATGATGAT GATATCGCTC TTCTCTACAA GGCTCGAGGT ACCACCTTTC CCCCCACGCC TAGTCCGACG ACAGCTCTCG CCGCATGTGG AAACTTGATC GCAAACTCGG ACTTTGAGCG TGGTTTCAAC GGATATTGGG ATGCACAAGG AGCCGGTACA TTGTCTACTA CAGCCGGCTA TGAATCTGCC ACAGCTATGT ACTACGCCTC TGGTAATCGC AACCGGTACT GGGTGGGACC ATCACACCAA TGGCGAGAAG GTCTGGACTT GAAATGTCTA AAGCAGGGTA CCATATGGGA GTTTTCTGCT CGCTTGAAGC TTGTCGACTC AAAGACTGGA AGGGGAGCCT CATGTGACCC AGGCTCTTCC TCGGGAGGCG AAATGTGCCC TCAGGTGCAG CTCATTGTGC GCGACCAATC TTGGACTCAG CATTCTTTCA GGATCAGCGG CTTCGACGGT GGGGACACTT GGGTAGCCAA TGGGTTTAAC GAGTTCAAAG GCTATTGGAC AATCCCAGCG AATGGTTCAG GATGGCAAGG GGGTGTCGCA AACATGCGAG TGATACTTTC CGAATTCCCA TTTGGTATGG ATTTGGTTGT TGACAATTTC AGCATGGTAC TGACGTTTGA CACGAACAAG AGTCTCGCAC AGTCTCCAAC CTTGAGTCCG GTGCTGGCCG TGCCTCCTCC AACCCAAGCT CCTATCCGCA TCCCAACTCA GGCTCCTGTC CGCATTCCAG CACAAGCTCC AATCCGCACT CCAAACGAAA GTCCGCTGAT TGGTCCCGGA ACGGACCTGA ATACATGCGC TAAGATGATC GCGAATTCCA ACTTTGATCT GGGGTATGAA GGATACTGGT CAGTTCCTAG CGGCGGAAGT CTGTCGAACA AGGAAGGCTT CAGTTCCTCG ACTTCCATGT ACTACGATTC TGGGAACCGG CGAAGATACT GGATAGGTCC TGAGTACAAG TGGCCAAATG GTGTTGACTA TGGGTGTTTG TTGCAAGGAA CAACATGGAA GTTTACCGCT AAATTTCAAC TTATCGACGC TGCAACTGGT AAAGGAAGCT CTTGCAGCGT TAATTCAAGT AAAGAAGGAG AAATGTGTCC CCGCGTTCGA CTGACTCTCC GAGACGATGG GTGGACACTT CGAAACGTGA GGATCGATGG ATTCAGCGCG GCGGATACCT GGGACGCCAA CGGCATGAAT TCGTTCACGG GCTACTGGAC TGTCCCAGAA AACGGCCCGG ACTGGCAGGG TCGTGTCCGT AATATTCGTT TGGCAATTGC TGATTTTCCC TTTGACAAGA ATCTAGTCGT TGATGATTTT CAGATGGCGT TTTTCAATAC GAACTAGGAA AACGCCAACA AACTCAGAAA CCAACAAACG GTTCCAACAA AATATAAACT ACTCTATAGT CG
|
Protein sequence | MYSHLVLGLA ILFLGASQCN SNLIRGNSSM SASEAGDSEA LTTTISRRLV ENFVPLRCNA AIGKAPCSSF VTMFGNGAVR SNRVIIPCGK CVTMDYPGPQ LTLNDGIDIR GKLVFPDGYR LTVRTNLVSV QGELEMRSSK PVDGNPDVRF VMQGTESASF APISSNAAKC GGNCDAGARS IAVAGGKVNL NGLPTSTPTW LHVHDVIGSS AIVVSNSVRN KWGAGATIVI TSEHQGYFGE QVRKITSISD VGSNSVRLNL DQPINRPVTL RDSPDFATEV ALLSRNIVFE GAPGTKGGHF WIMHTPRVRQ RIEGVELVNF GQEGLLGRYP VHFHMCGDVS GSVVVKNTIR NSNQRCVVVH GTNNLLVQEN VAYFTKGHCY MLEDGIETGN QFVRNIGIRT VKPEIIIPNM GSNGKESDMT ASTFWITNAD NSWIGNVVAG SEALGFWFEL LVRGNLANEH QDFDPMMVPT RKFEGNVVHS VYGVGMTYYL SGYIPETLQY FKNNKFFRNH HLALRIHRAR NIVLTGNKFS DNRYAIRIDR DEEIHVTDTT IVGYSDLFKD VVRRNRFAQA PCAQGISFQS TNPWKDKMDS ELNGVILDKV RFSGFSNAVC SSSTAIELDS RLDGYKSFEM FSQYSGATVS DANSIDFCRG KSAGARDVYV SDTTGSLLDG VASAPSTLMV NSPELASFVN PSACTENAAR CYTYCSNTCF RTVHYYVPVG QSRDYKLKVC DRKDASDCTV LSGYVHSNHP WPRRFAVHVP SGREYDTYFL DKGVPVYPTN VEIVFQEKLC PTAPDDDDIA LLYKARGTTF PPTPSPTTAL AACGNLIANS DFERGFNGYW DAQGAGTLST TAGYESATAM YYASGNRNRY WVGPSHQWRE GLDLKCLKQG TIWEFSARLK LVDSKTGRGA SCDPGSSSGG EMCPQVQLIV RDQSWTQHSF RISGFDGGDT WVANGFNEFK GYWTIPANGS GWQGGVANMR VILSEFPFGM DLVVDNFSMV LTFDTNKSLA QSPTLSPVLA VPPPTQAPIR IPTQAPVRIP AQAPIRTPNE SPLIGPGTDL NTCAKMIANS NFDLGYEGYW SVPSGGSLSN KEGFSSSTSM YYDSGNRRRY WIGPEYKWPN GVDYGCLLQG TTWKFTAKFQ LIDAATGKGS SCSVNSSKEG EMCPRVRLTL RDDGWTLRNV RIDGFSAADT WDANGMNSFT GYWTVPENGP DWQGRVRNIR LAIADFPFDK NLVVDDFQMA FFNTN
|
| |