Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37756 |
Symbol | |
ID | 7202294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 880331 |
End bp | 881722 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181821 |
Protein GI | 219122997 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0114774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATTA TTCGGAAAAC CCTGCAGCTC ACGATCTGGC TATCCGCGTG TCTCGTCACC TCCTATTCCT ACAGCAATAG CAACGTGCCG GTAGGCAAAT CCAGTGCCAG GAACACTGAG ACTTCCCCCG TCCCTATATC TTCGCATACC TTTTTGTTCC GATCCCACCC CATTGCATAT GAGACCGCTG TGGTCAGATT TCCCACCAAG ATAGCCGTGC CGCCGCAATC ACCAACCACA CCGTATCGAG ACGTCTCGCC GGTGCTGCTC CTGAACGGCT TTGGGGTCGG GTCCTTCCAC CAACACCGAC TCATCCAAGC CCTGCAACAA CAGTCCGACC AATCTACAGT AACTGACAAA AACAGCAATA GAGATGAACC CGCCAGTCTT GCTACTATTA TTTACACGCT TGATTATCTC GGACAAGGTC GCTCCTGGCC CGTGGATTCC AACGATGGAC AAAGTGAAGC GGAATTGGGA TTGCGCTACT GTGGACAAAC ATGGGTGGAC CAGATTGTAG CATTTTTGGA GACAATCGTT TTGCCTGCTC GTGAATCCTG TTTCTCGTCC ACGAGACACT ATACTGCTCC TCCGGAACGA GTCCATTTGG TAGGCAATTC TGTCGGCGGA CACTTGGCCG TATTTGTGGC TGCCTTGCGA CCCGACTTGG TAGCCTCCGT CACCCTGCTC AACGCCACTC CTGTTTGGGG ACTCAATTTG CCCGGCTGGA CCGGTCATTT GCCGGCTCCT TTTCTGCCCA AGACCATTGG TCGATTTCTG TTCGATCAGA TTCGCAATCT CAACACAATC GAACAATATT TGGCGGCGGC GTACGTCCAT CGGGAGGCGT TTGACGCCAC GCTCATGCAA CAAATCCGAG CCTGCACTGA AAGTCAAGGG GGACACGCGG CCTTTGCCTC GATTCTTTGG TCTCCTCCCG TGACCTTACC GACGAAACCA AATGATGCTC CAAGCAATAC CAAAAACGAC TACAAAAAGA TCAACGCTTT CGACGAAGCC CTTTCCCGGC TCGAGTGTGA CGTTTTGCTA TGCTTTGGAG CCGACGATCC TTGGTGCAAA CCGGCCTTTG CAGCGCGTAT GCTCCGAGCT CTGGGACAGC GTCCAACGGG TAAGGTCCAG CGATACGTGG AACTCTCCAG CGTTGGTCAC TGTCCCAATC ACGAGGCGCC AAACGCCGTA GCATACGTTT TGCTACCCTG GTTGCTTTCG TCAAATGCAC AACGCCAACA AATTGCATTG GTGCCAGCGC CACTCTCAGA AGACAAACGA ACGTCAGTAC GAGAAACCTG GGGGGTCACG GAATTGACCG AACGCCAAGC CGACGACATT TCTTTATCAT TAGTGGATCG ACTAGCCGTA CTATTTGTAT AG
|
Protein sequence | MKIIRKTLQL TIWLSACLVT SYSYSNSNVP VGKSSARNTE TSPVPISSHT FLFRSHPIAY ETAVVRFPTK IAVPPQSPTT PYRDVSPVLL LNGFGVGSFH QHRLIQALQQ QSDQSTVTDK NSNRDEPASL ATIIYTLDYL GQGRSWPVDS NDGQSEAELG LRYCGQTWVD QIVAFLETIV LPARESCFSS TRHYTAPPER VHLVGNSVGG HLAVFVAALR PDLVASVTLL NATPVWGLNL PGWTGHLPAP FLPKTIGRFL FDQIRNLNTI EQYLAAAYVH REAFDATLMQ QIRACTESQG GHAAFASILW SPPVTLPTKP NDAPSNTKND YKKINAFDEA LSRLECDVLL CFGADDPWCK PAFAARMLRA LGQRPTGKVQ RYVELSSVGH CPNHEAPNAV AYVLLPWLLS SNAQRQQIAL VPAPLSEDKR TSVRETWGVT ELTERQADDI SLSLVDRLAV LFV
|
| |