Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50641 |
Symbol | |
ID | 7199478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011701 |
Strand | + |
Start bp | 60897 |
End bp | 62379 |
Gene Length | 1483 bp |
Protein Length | 463 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185603 |
Protein GI | 219130926 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00738361 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTGCTCCC CAAACTGACC ATGGGGGTCG GTACCAAAGG CAAGACATCC ATCGCTTACA CCTTCGCCTT CGGAAATTCA CTTTTTTTCC AATGAAGATT ATTCGGAAAA CCCTGCAGCT CACGATCTGG CTATCCGCGT GTCTCGTCAC CTCCTATTCC TACAGCAATA GCAACGTGCC GGTAGGCAAA TCCAGTGCCA GGAACACTGA GACTTCCCCC GTCCCTATAT CTTCGCATAC CTTTTTGTTC CGATCCCACC CCATTGCATA TGAGACCGCT GTGGTCAGAT TTCCCACCAA GATAGCCGTG CCGCCGCAAT CACCAACCAC ACCGTATCGA GACGTCTCGC CGGTGCTGCT CCTGAACGGC TTTGGGGTCG GGTCCTTCCA CCAACACCGA CTCATCCAAG CCCTGCAACA ACAGTCCGAC CAATCTACAG TAACTGACAA AAACAGCAAT AGAGATGAAC CCGCCAGTCT TGCTACTATT ATTTACACGC TTGATTATCT CGGACAAGGT CGCTCCTGGC CCGTGGATTC CAACGATGGA CAAAGTGAAG CGGAATTGGG ATTGCGCTAC TGTGGACAAA CATGGGTGGA CCAGATTGTA GCATTTTTGG AGACAATCGT TTTGCCTGCT CGTGAATCCT GTTTCTCGTC CACGAGACAC TATACTGCTC CTCCGGAACG AGTCCATTTG GTAGGCAATT CTGTCGGCGG ACACTTGGCC GTATTTGTGG CTGCCTTGCG ACCCGACTTG GTAGCCTCCG TCACCCTGCT CAACGCCACT CCTGTTTGGG GACTCAATTT GCCCGGCTGG ACCGGTCATT TGCCGGCTCC TTTTCTGCCC AAGACCATTG GTCGATTTCT GTTCGATCAG ATTCGCAATC TCAACACAAT CGAACAATAT TTGGCGGCGG CGTACGTCCA TCGGGAGGCG TTTGACGCCA CGCTCATGCA ACAAATCCGA GCCTGCACTG AAAGTCAAGG GGGACACGCG GCCTTTGCCT CGATTCTTTG GTCTCCTCCC GTGACCTTAC CGACGAAACC AAATGATGCT CCAAGCAATA CCAAAAACGA CTACAAAAAG ATCAACGCTT TCGACGAAGC CCTTTCCCGG CTCGAGTGTG ACGTTTTGCT ATGCTTTGGA GCCGACGATC CTTGGTGCAA ACCGGCCTTT GCAGCGCGTA TGCTCCGAGC TCTGGGACAG CGTCCAACGG GTAAGGTCCA GCGATACGTG GAACTCTCCA GCGTTGGTCA CTGTCCCAAT CACGAGGCGC CAAACGCCGT AGCATACGTT TTGCTACCCT GGTTGCTTTC GTCAAATGCA CAACGCCAAC AAATTGCATT GGTGCCAGCG CCACTCTCAG AAGACAAACG AACGTCAGTA CGAGAAACCT GGGGGGTCAC GGAATTGACC GAACGCCAAG CCGACGACAT TTCTTTATCA TTAGTGGATC GACTAGCCGT ACTATTTGTA TAG
|
Protein sequence | MKIIRKTLQL TIWLSACLVT SYSYSNSNVP VGKSSARNTE TSPVPISSHT FLFRSHPIAY ETAVVRFPTK IAVPPQSPTT PYRDVSPVLL LNGFGVGSFH QHRLIQALQQ QSDQSTVTDK NSNRDEPASL ATIIYTLDYL GQGRSWPVDS NDGQSEAELG LRYCGQTWVD QIVAFLETIV LPARESCFSS TRHYTAPPER VHLVGNSVGG HLAVFVAALR PDLVASVTLL NATPVWGLNL PGWTGHLPAP FLPKTIGRFL FDQIRNLNTI EQYLAAAYVH REAFDATLMQ QIRACTESQG GHAAFASILW SPPVTLPTKP NDAPSNTKND YKKINAFDEA LSRLECDVLL CFGADDPWCK PAFAARMLRA LGQRPTGKVQ RYVELSSVGH CPNHEAPNAV AYVLLPWLLS SNAQRQQIAL VPAPLSEDKR TSVRETWGVT ELTERQADDI SLSLVDRLAV LFV
|
| |