Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_25218 |
Symbol | |
ID | 7197127 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 320709 |
End bp | 322698 |
Gene Length | 1990 bp |
Protein Length | 545 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177598 |
Protein GI | 219111693 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGGTTACG ATCTCTAGAA TTCCATCCTT CCCACATTTT CGAGGACTGT ACTCGATCTC CGTTGCCTTG ATCGACATCA ACGTGTACTG CTCTCGGTTT ATTTACGGTG CGTTGGTCTG AGATTGCAGC GTACTCTTTC CATATCTAAG AATCTACATT TGAAAGCACC GTTTCTGAAA TCGCCTCCAA CATGAACGAT TCTTCGTCGT CCCGGACCGA TCCCTTTGCC CCACGAGTCG GCAAACCACT GCTTTGGAGA AATGTGCAGA TGAAGGCTAC TGTGGTAAGA TCCATGATGT GTGTGCGTAT CGGAAAACTC ATCGTGCTAC ATGATCACGT TGCTGACGAT TGCGATGATT TTCTTCAGAA CAAGAAGGAC AGTGATGGTT CCAAGGTTAT TCTGAAGGAT GTGTGGGGGA GTATTCCAAA TGGACGTGTG ACTGCCATTA TGGGGCCCTC CGGAGCGGGA AAGGTATGTT CAAACATTCA TGTTGACATT CTGATTGCTG CTTCGCGTCT TCATGGCGCT ACTTATGTGT TTTACATATA GACGTCTCTT CTCAATATTT TGGCCGGTCG GGTCCAATCT TCCTCCAAGA TTGAAATTGA AGCCGATATT CGCCTCGGGG ACCAGCGTAT TGACCCTTCA CAACAAGCGG TGCGTCGCAC CATGGCCTTT GTAGCTCAGG AAGAGTCTCT GCAAGTCACT AGTACCCCTC GTGAGGCGAT TCGTTTCTCC GCTCGCATGC GCTTGCCGGT GGATCGGACC GACGAAGAAA TTGACGAAAT GACGAACCGT ATGGTGAAAG AACTTGGGCT CGAGAAAGCC GCCGATACAG TCATTGGAGG AGCACTCTTG AAGGGAATCT CGGGTGGCGA GCGAAAGCGC ACGGCAGTCG GTGTCGAGCT GGTTGTACAA CCCACTTTGA TCTTTTTGGA TGAGCCCACG ACGGGCTTGG ACAGCTATTC GGCAATTGCG GTTGTTCAAA TTCTCAAACG TGTTGCCACC TCTGGGTCGG CTGTCGCCTT GACGATACAT CAACCCGCAT CCGAAATCGT TCAACTTTTG GATGAGTTAA TTTTGTTGCG ATTGGGGAGT GTGTTGTATC AAGGTCCGGT CGCGGACATT CCGCGTGTTT TCGAGAATGC TGGTCTCCCA CTCCCATCTC GTTACAATCC AGCTGACTGG ATGTTGCACG TTGCCCAGAC GATTACGACG GATAAACTTG CCGAAGGTCT GTACCCCAAA GATATCCGGG AATTTCCGCC TTTGCCTCCT CCATCCCGTT CTGGATCTAC AGGCACGGGC GAAGGTAGGG GCAAGAAAGT TTCACAGTTA TCTCAGATTG CAGCCTTGAC AAAGCGAGAA TTGATCAATT TGTACCGCTT TCCTGACGCG ATCTTTATGC GCTGGGGTGG TGACACTATA CTGGCGTTGC TCATTTCCTT GATCTACAAA AGTGTTGGCG AGACGGATCG ATCGGTCCAG TCGAATATTC GCAGCATCTT CGGTGCCCTC GTATTCTCGC AATTGATGGG ACTCTTTGGA TGCGCCGAAC CGAGTCTCAT GTACTTTCCT CAAGATCGTC CAGTATTTTT GCGCGAGTAT GCCACCAATC ACTATTCTGT GACGTCCTAC TTTCTGAGCC GAACCTTCAT CGAGTCCGTC TTGGCCTTTG TGCAGATTAC GGCTCAATCT CTTCTTTACG TCTACGTGAT GGAACTAAAC ATACCATTCT GGGAATTTGT GGGAATCAAC TACGTTTTGA GTATGGCCGG GACAGGTGTA GCAGTCTTTA TCGGGTCCAT TGTTGAAGAC CCCAGGACAG CCACAGAGCT GCTTCCACTT GTATTGGTTC CGCAATTGCT TTTTGCTGGC TTTTTTGTGT CAATTGACAA CATTCCCTCC TGGCTACAGT GGGCTCAGTA TCTTTGCGCT TTGACGTACA GCATTCGCTT GGCGGTCATC GGGGAGTTCA GTTCCTGTGG
|
Protein sequence | MNDSSSSRTD PFAPRVGKPL LWRNVQMKAT VNKKDSDGSK VILKDVWGSI PNGRVTAIMG PSGAGKTSLL NILAGRVQSS SKIEIEADIR LGDQRIDPSQ QAVRRTMAFV AQEESLQVTS TPREAIRFSA RMRLPVDRTD EEIDEMTNRM VKELGLEKAA DTVIGGALLK GISGGERKRT AVGVELVVQP TLIFLDEPTT GLDSYSAIAV VQILKRVATS GSAVALTIHQ PASEIVQLLD ELILLRLGSV LYQGPVADIP RVFENAGLPL PSRYNPADWM LHVAQTITTD KLAEGLYPKD IREFPPLPPP SRSGSTGTGE GRGKKVSQLS QIAALTKREL INLYRFPDAI FMRWGGDTIL ALLISLIYKS VGETDRSVQS NIRSIFGALV FSQLMGLFGC AEPSLMYFPQ DRPVFLREYA TNHYSVTSYF LSRTFIESVL AFVQITAQSL LYVYVMELNI PFWEFVGINY VLSMAGTGVA VFIGSIVEDP RTATELLPLV LVPQLLFAGF FVSIDNIPSW LQWAQYLCAL TYSIRLAVIG EFSSC
|
| |