Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48325 |
Symbol | |
ID | 7203793 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 121760 |
End bp | 124209 |
Gene Length | 2450 bp |
Protein Length | 670 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182775 |
Protein GI | 219124994 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGTCTCTTG CGTTACCGAC AATCTACCTT TCTGATCATC TCCCTTGCCC AACAGAAATA CGAATACGCT TATTATCTCA ATCCTCCTCC CACACTCACA ACTTTCCCCG TGTGCGCTCC GCTGTGACCA GTGCCTTCCA CAATTTGACG ATTCTCCCGA GACACGGGAA CCCCGCACTT ACGTTACCTT ACTGTTAATA CTGTTGGTGT TGTTACTTTC CTCGAACGCC GTTCGTCCCT ACCTACCGCA TGGATCCTTC GCAGTACCAC CATTTCTCCG CGCAGCAATT GCAAGGATTG ACGGGAGGAG ACTCCTCCAG TAGAGCTCCG CAAGACGCCC ACGGCGGTGG GAGTCACGGA AACAACGGTG GTAACCAAGA AACGCACATC AACAATCAAG AGAACGCGCA GCAGCTCTCG GCGAGCCTCT TTCAACAAAT GCAAAGCATC CAACAACAAC AGCAGCAACA ACAACAATCT GCGCATACGC AGGGAACTGC GCAAGGTGGT AGCATCAACA ATAGCGGAGT GCCATTACTG GTGGGAGCAA CGCCCTTGCA ACAGCAACTT TCGGCTCTGC AGAGTCTGAC TGCACATCCG GGATTCGCCG CCTTTCAAAA TCAGCAACAG CAAACCCAGC CTCCTCCCCA ACAACCAACG TCGCAGCTGC AGATTCCCCA AGGACTTCTC AATATTCCTG GTCTTACTGC CGAACAAGCA GCCTTGTTCT TTCAACAAGC CCAACAACAA CAGCAGCAGC AGCAAGCCCA ACAACAACAA ATGCAACAGC CGAATCAGCA AGCGCAACAG CTTCAACAGC AAATGCAGCA ACATTTACAG CAGCAAGCCC AGCAGCAGCA GAACCAAACT GTACAGCAAC AGTACCAGCA ATTTCAACAA CCACAACAAC AACAATTCCA GCAGCAACAG CCACAAGGTC TCATGAATGC CAATGCTCTT GGTATTAGCC TCCAGCAGCA GATCCAACAG TTGCAGCTGG CTCAACAGCT CATGGCGGCT GGTTTACCCG CCAATTTGTC CTTGATGGGC GCCGGGGCGG CTCCTGGAAA CGTCAATCTG GCGAGCTTTA TGAGTGGACA AACACAGCAG CCAGCCGTGA CACAGCCAAA TTCACAGTTG GCCGCTTTTC AAAATAATCA GTTGCTAATG GCGCAGCAGA ATCTGGGAAA GCAAACAACA CCGTCCACTG GAGCCTTTGG TGGTTTTTTG CCGGAAGAGT CGCAAAAACC GAGCGATTCG GCCGGAAGCG CGCCGACAGC TACAGAAGAG TGGGCGGAAC CGTTCGCGGG CAAGGGAAAG AAAGAGCCAC CGTTTCCACT CAAGTTGCAC CAGATTTTAG GCAATCCGGA ATTTGCCGAG TGCATTTGCT GGAATCCGCA CGGGCGTTCC TGGAGGATCT TGAAGCCACC CGTGTTTGAA CAGCTCGTCA TTCCACTCTA CTTCCGGTAC GTACCTAATT GTAGAAACAC GGAATTGTTG CGGATTTTCA GCTCTCACTT ACCCGAATTG TATTGCTGAA TATATTTCAG CCACGCTAAA TACGCCTCTT TCATGCGTCA AGTCAATGGC TGGGGTTTCA AACGAATTGT TTCGGGTAAC GATCACAATT CATACTTCCA CGAACTCTTC GTGCGCGAGT ATCCCCAACT GTGCATCAAA ATGAAGCGCA TCAAAAAGGG CGAGGGTGAG AGGAAGAGAA AGTCGGATGA TGGTAGTGAC GATAGCGATG GCGGCAATAT TGAAGGCGAG AATTCGGCAG CGAATGAGGC TGGCGATCAA GGCAGCAACG AGGGAAGTGA CGATAACAAA GGACAATCAC AGAATGATTC CTTGAGCAAT CTTCACCATC AATACAATCT TCCTTCACAA GAGTCGTCGA ATCAGCAGGG CTCTATTTTT TCGCAACTGG GCGGAGGGAG CAACAACACC CGCAAGTGAT CAGAATGCCT TGGCGGGTAT TTTGAACCAG GGACAGTTTA ATCTTGGACA ATTGAACGGC GGAGGAAGAC TTTCGAATAT GATGGGAAGC TCTGCTTCTC CCAACCCCGC GACTCCTGCA TCCAGCGCCC CTGTATCTTT GCCGAGCGGT CTTGCAGGTT TGCAAGGCGT TGACAGCGCC ACCCTCATGA AGTTGCAGGA AGCTTTGTCG GCGGCCGGTG GGGGCAATGC TATGCAGTCA TTACTGCAGC AGCAAGCTCC CGCTGCGCCT TCCGCTCCAC AACAATTGCA GCAAACGCCG CAGCTCGGCA ATTTTGCCTT CCTTAGCTCG CAGATTGGCG GTCCTAACGC AGCACTGCTG GCGGCGCAAC TCCAGGCAGC AACCCAAGGG CCAAGCGGAG GCAACGGTGG AGAAGAGCGT GATACAGACA AGGAAAGCGA AGCAGTCTGA ATGGATTTCG GATGATACGT AATCAGTGAA GTATTGTTTA
|
Protein sequence | MDPSQYHHFS AQQLQGLTGG DSSSRAPQDA HGGGSHGNNG GNQETHINNQ ENAQQLSASL FQQMQSIQQQ QQQQQQSAHT QGTAQGGSIN NSGVPLLVGA TPLQQQLSAL QSLTAHPGFA AFQNQQQQTQ PPPQQPTSQL QIPQGLLNIP GLTAEQAALF FQQAQQQQQQ QQAQQQQMQQ PNQQAQQLQQ QMQQHLQQQA QQQQNQTVQQ QYQQFQQPQQ QQFQQQQPQG LMNANALGIS LQQQIQQLQL AQQLMAAGLP ANLSLMGAGA APGNVNLASF MSGQTQQPAV TQPNSQLAAF QNNQLLMAQQ NLGKQTTPST GAFGGFLPEE SQKPSDSAGS APTATEEWAE PFAGKGKKEP PFPLKLHQIL GNPEFAECIC WNPHGRSWRI LKPPVFEQLV IPLYFRHAKY ASFMRQVNGW GFKRIVSGND HNSYFHELFV REYPQLCIKM KRIKKGEGER KRKSDDGSDD SDGGNIEGEN SAANEAGDQG SNEGSDDNKG QSQNDSLSNL HHQYNLPSQD DQNALAGILN QGQFNLGQLN GGGRLSNMMG SSASPNPATP ASSAPVSLPS GLAGLQGVDS ATLMKLQEAL SAAGGGNAMQ SLLQQQAPAA PSAPQQLQQT PQLGNFAFLS SQIGGPNAAL LAAQLQAATQ GPSGGNGGEE RDTDKESEAV
|
| |