Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49357 |
Symbol | |
ID | 7195877 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 24338 |
End bp | 27060 |
Gene Length | 2723 bp |
Protein Length | 841 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184050 |
Protein GI | 219127662 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0047033 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAATACTTT GGGGAATTAC CTGCAAGCCA ACTATCATAC ATTGCAGGCC AAAGGAGCTT ACTGCATCAT GAAGTGGACG AATGTCCATA TTTGGCTCGG TGTAGGGACG CTACTCGCGG GACGGGAATC CAGCACCGTA CACGCCGAGT CGCACGAGCA CCGTTCCCGG AAGCTCGCAG TAGCCGTTGC GTCTCGGTAA GTCCCGAAGC ATTGCGTGTA CGATCAGGTT ATGTACCGGA ATGAATTTAC ACTCACCAGA AATGGCGCTT GTGTCCAAAT CGGGTGGGTC TTCTTTCTAT AACAGTGTCT TGCAGAAGTC AGTGAACGAC TTTGTGACGA ATTCAATTGC GCGACCGCAG CAGTTGAACT TGCCGCCCCA GATGCAGCAG CAATCGCCGT TGCATCTCCA CAGAACTTTA GAAGATGGAT CCAATGTAGA ATGCGATGCT GCCTTTATGG AGTGTATTCT CAACGACAAG TGCGTAGAGT GCTTTCTGAA AATGCAAACG GAAGGGATCG ATTGGGCGTT CGTCACCAAA GAAACGCCGT GCGATTCGGT CTTGGAGACG CTCTACGACA AAACGTTTTG TCTCAATCAG CGTGGTGATG AAGCCGCTGA GGGAATCTTT TGTAGTACGT TTGCTGCTTG TGTGGATACA ACACAGGAAG TCCCAGAAGA GGACGACAAA ACGATTGATT GCAGCAAGCT TACTAAATGT GATTGGCCCG GAATCCATCT TAGCTTTTTG GGGGATGGTG TGTGTCACGA GAATCTGGAG GGTTGCTACA ATACAGCCGT TTGTAATTAC GATGGGGGCG ACTGCTGCGA CGACACTTGT TCGACCGACA GCGACTACAA AAAATGTGGA CAAGATGGGT TTTCGTGCAA GGACCCTAGT AGTGCCAAAT GTGACTCCTC TCTTACCACG GGCTGTCCGA TAAGCGACGA CGACGATGGC GCGTTCCCTT TACCACCCCA GTGCTCTGAT TCCCAAGCGA TTTATCGTTT GGTCATGTAC GATTCCTTTG GCGATGGCTG GGACGAGACA GTTCTCACAA TTCGGGATGC TGATACCAGA CGTAGTGTTT ACACCGGATC TTTGCAGGAA GGGTCACAGG GGACTCAACT TATCTGCCTC AGTCGCTATG CCCAGTGCTA TCACGTCGAT GTCAACGGTG GCACCTGGGG CAACGAAGTC TCGTGGGAAG TCAAGCCCTA CACAGATGGT ACTCCGGCCT TGGCTGGTGG AAGCTCGCCT ATGAGTTGCG ACTTTTCCGT CTCTGGAGGC AAATGTGAAA AAACATGCAC CGGACGACCC AATATTCGTC CCGCAGACGA TCCAGAGTAC AAAGAATTCA AGGAAATGTA TACTTGCATC AACGAAAAGT GTCCGATTCA GGTAGGGGCA TGTCAGAAGG ACGAAGCTTG CAGCAACTGT TTTGTGGAAG AGGCTCCCGA TTACTGTTTT GGCATTGACA CATTTAATGC GGTAGTCGAT TGTACCTTGT GCAAGTGTAG TGGAGTCAAA GATAGTGATT TCTGCGACAA GAAAGCGTCC CCAGGTATAA TCATCCCGGA TGCAGACGCC AAGCCCGGTG GAGATTCTGT CAAGAAACCC TGCACACCAG CTGAGACACT CAAAGGTGGG CAATCTGTAA TAAAGTTTAG CCAGTGCACT GATTTGGATC AAGTTGGTAT GATGGTAACC GATTTCGATC AAAGCAACTT TGGGGATTTG GATGTGTTCG AAATGTGCGC ACATTCGTTC AACAAGGATG TCAACCATGG TGGACGCACG GCACAAGGAT GTATGCAAAT TCTTGCCAAC ACCATGAATA CTGCCGCGAG AGACTCGAAT TCCTCGGAGG ATCAACGCAC TACAGCAATT GCTGCCTTGG CCAAATTCCT GTACACAGAC GCGAAGAGTT TTTGCGATTG CGCAAAAGAT GCTTCTGATA ATTGCCCTCT CTGTCCTTCA TTCGCCAATT TCAAGACTCT CTTATACGAG TCACTAGATG CATGTCAATC TTTGGACGAG ATCGACTGCG ACGCTTGGGA CGAGTTTCAA AAACCGTGTA AGCAGAATCT ACAGTCCACG TTTGGAACGA TCGATTTCAC GAAGGAGGAA CAGTGTGATT TTGTCCATGA TAATAGCTGC GGTGGCGCTG GTCCATTTCC AGCCTTTCGT CGACTTGATT GCGGAGACGA GCTGGAATCT ACTGTTTGGA ACTTCTACCT TGACTTTGCC AAAGGCTGTT TAAGCGATGC CCCGCCGACC CCTGTAAGCC CAACTCCGTC ACCAGTGACT CCGCGGACCG CCGTCCCCGT TGCACCAGCT CCAACGCCGG TCAATCCTTC ACCCTACGAT CCATCTCCTG CGCAGCCGTC GGACAACAAA AAGCCATATG TGCCTCCAGA GGAACGTGAC AAGAAAAAGT CGAAAGACGC CAAACCCGAC GACACGAGCA GTAAGAAGAA ATCGCACTGG TTCCGAAATC TTTTCATTCT TTGTTTGCTC TGTGGAGGAG GTTATTACTA CTACAAACGC CGATCAGAGT TTAGCTTCGT TCGCTACCGC CGGGCTCGGA ACTTTGGCGG CGACGAAGAT GGCGTGTACA GCGGTCTCAC AATGGAAAGC TCCACATCAT TCGAGCCGCC GTCATTGCCA CCAACTCCTG CTGCCATGGG CAACTACACA TAACTAAAAA ACATTGCATA TCC
|
Protein sequence | MKWTNVHIWL GVGTLLAGRE SSTVHAESHE HRSRKLAVAV ASRVLQKSVN DFVTNSIARP QQLNLPPQMQ QQSPLHLHRT LEDGSNVECD AAFMECILND KCVECFLKMQ TEGIDWAFVT KETPCDSVLE TLYDKTFCLN QRGDEAAEGI FCSTFAACVD TTQEVPEEDD KTIDCSKLTK CDWPGIHLSF LGDGVCHENL EGCYNTAVCN YDGGDCCDDT CSTDSDYKKC GQDGFSCKDP SSAKCDSSLT TGCPISDDDD GAFPLPPQCS DSQAIYRLVM YDSFGDGWDE TVLTIRDADT RRSVYTGSLQ EGSQGTQLIC LSRYAQCYHV DVNGGTWGNE VSWEVKPYTD GTPALAGGSS PMSCDFSVSG GKCEKTCTGR PNIRPADDPE YKEFKEMYTC INEKCPIQVG ACQKDEACSN CFVEEAPDYC FGIDTFNAVV DCTLCKCSGV KDSDFCDKKA SPGIIIPDAD AKPGGDSVKK PCTPAETLKG GQSVIKFSQC TDLDQVGMMV TDFDQSNFGD LDVFEMCAHS FNKDVNHGGR TAQGCMQILA NTMNTAARDS NSSEDQRTTA IAALAKFLYT DAKSFCDCAK DASDNCPLCP SFANFKTLLY ESLDACQSLD EIDCDAWDEF QKPCKQNLQS TFGTIDFTKE EQCDFVHDNS CGGAGPFPAF RRLDCGDELE STVWNFYLDF AKGCLSDAPP TPVSPTPSPV TPRTAVPVAP APTPVNPSPY DPSPAQPSDN KKPYVPPEER DKKKSKDAKP DDTSSKKKSH WFRNLFILCL LCGGGYYYYK RRSEFSFVRY RRARNFGGDE DGVYSGLTME SSTSFEPPSL PPTPAAMGNY T
|
| |