Gene Information |
Locus tag | PHATRDRAFT_22901 |
Symbol | |
ID | 7195390 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 249364 |
End bp | 251438 |
Gene Length | 2075 bp |
Protein Length | 528 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183576 |
Protein GI | 219126673 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
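The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the reported 2075 bp) and that the coding sequence includes one stop codon; the function names `gene_length` and `min_coding_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus an
    optional stop codon). The stop codon is an assumption here."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
start_bp, end_bp = 249364, 251438
protein_aa = 528

length = gene_length(start_bp, end_bp)
coding = min_coding_length(protein_aa)

print(length)           # 2075
print(coding)           # 1587
print(length - coding)  # 488
```

Under these assumptions, 488 bp of the gene region are not needed to encode the 528 aa product, which is consistent with the gene being marked as spliced; the record itself does not say how that remainder divides between introns and untranslated sequence.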
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCGTACCAA CGATACCGAT TCGGTGAATC GCGCCTTTCT CGCTAATCTA CGTCTTTAGC GCCAACCATT GTTCAAAGTT CACTGTCAAA ATCATGTATC CTAGCTTCTT GTGGAATTCG GGGCATTCAC AGTCAATCTC AATCTCAGCG AGACATTAAT TCGACCATCT TCCAACAATG TCCAGACTGT GACTCTCAGA TATCCAATAG GAGTCTTGGG TAAAGCGAAA CACGAGTGAG CGTGGCGACT GCGTAAGTTT CGTTTCGCCG TCAGATAATG CCAATTTGTG GGCAAATCAG CGATAGGCAG ATTCCAAGAT TCTGCCGTTT ACAGAAAGAT CCCATCACAA AGTTTGTATC CCTTGTATCC ATTTCTCCGA AATCGTATGC TTTACTATGA AAGTAATTGG ATTGAATTTG CTTTGGTGTA GTGTCGGGGT CCTCCTCCCT TATGGGTCAA TGGCTGGACC CGAGACCGAC TTGATCGAAA ACCTGCCCAT GCATGGAAAG ACTAAGACGC CGCACTATTC AGGCTACTTG GACGCGACGG AAGGTTGTAA CTTAGAGGTT AATGGTCCCT ATTGCAAAAT TCATTATTGG CTGGCTATGG CGGAAGGCGA CTTTCTAAAC AAGCCTGTTG TGTTGTGGTT GAATGGAGGG CCGGGATCTT CGTCCATTCT TGGCTTTCTT CAAGAAAATG GACCACTTCT TATGAACTCT ACAGGAGGGT TGATGGAAAA TCCCTACAGC TGGACCAAGG TTTCCAACTT GTTGGTTATT GAATCACCGA TTGGGGTAGG GTATAGCTAC TGTGCGAGTC AACTCCTGGG AAAAGTGTGT GAAAACACAG ACAAGTACAC GGCATCGGCG GCTCGAGCGG CAATCGTGGA TTTCTTTGCC AAGTTTCCGT ATTTCGCCAG CAACGACTTT TTCATAACTG GAGAATCTTA TGCCGGAGTT TATTTGCCGA CCCTGGCCTA CGAACTACTA GAGCATGCGC CTCACATTTC GTTGACGGGT ATGGCTGTCG GAGATCCGTG TACCGATAAC ACGGCACAAG CCGACTCCAT GGACGCTCTC TGGTACGGAC ACAAGTACGG TCTCGTCGAC GACGCCATCT TTGACACACT ATGGAATCAA TGTGGTATCA GGGCTCCATC TTTTCTGATG AAAAGCAAAA TTCAATTGAC GCATAATGCG GATCTAGGAG AGGAATTTCT CAACGACATC GGGTATGATG GCGACTCAAA CGTATGTCGA TTAAGTATGC GCAAGTTTTT GATGTCCTCT AGTCGCGCCT TGTCGCAAAG CTGGCGAGGT ATGTTTATTG ACGACTATTC GCTCTTTGCT CCCGTCACGG ACTTGGAAGA CATACACATG ACCGCCTACA TGAACCGCCC CGACGTCCGT GAAGCCCTGC ATGTTATGGA TACTCCTATT CGGAGCTGGC CATACCCCAA TGTTGGTTTC GACTACACAA AAGAATATGA TGCTTGCAAT GCGGATGCCG ATGAAGAAGC TCTGTCCATG ATTGACTTTT ATCGCAAATT GGGCCCGCGC CTCCGGGCCA TTTGGATATA TAACGGCGAT ACGGATCCAT GCGTCTCCTA TGAAGGAACA CGCGTGGCTG TTTCTCGAAT TGGGTTCCCG GAACTTGATG GTGGCGGATA CCGCCCGTGG TTTTACAACC AAACGGCGAC CACTGGTGTG TACATTTCGT TTGGTTTGAA GCCTTTGATA AACATCACAG TTTTGCTAAC TCTATGAAAT TTGTTTTTAT TTGGAAACGT TTCGTGCAGT 
TGAAGTGTTG ATGGAAAAGC CAGCTTTGTT TGGTCCCGAC TTATTATTGC AAGAGTTAGG AGCACAATTA GGAGGCGAAG TTGTGAATTA CGAAAACAAC ATTTCATTCT TGACTTTCCA TGGATCGGGC CACATGGTTC CACAGTTCCG ACCGCAAGCT GCTTTACACA TGCTACGAAA GCTCGTCAAC TATGAAGCGC TTTCACCGAA ATTGCCTCTA AATGCTACAT TGGTTGATTT GAGCAACCAG GACTTTCGTG TGACTATGGA CATCT
|
Protein sequence | MKVIGLNLLW CSVGVLLPYG SMAGPETDLI ENLPMHGKTK TPHYSGYLDA TEGCNLEVNG PYCKIHYWLA MAEGDFLNKP VVLWLNGGPG SSSILGFLQE NGPLLMNSTG GLMENPYSWT KVSNLLVIES PIGVGYSYCA SQLLGKVCEN TDKYTASAAR AAIVDFFAKF PYFASNDFFI TGESYAGVYL PTLAYELLEH APHISLTGMA VGDPCTDNTA QADSMDALWY GHKYGLVDDA IFDTLWNQCG IRAPSFLMKS KIQLTHNADL GEEFLNDIGY DGDSNVCRLS MRKFLMSSSR ALSQSWRGMF IDDYSLFAPV TDLEDIHMTA YMNRPDVREA LHVMDTPIRS WPYPNVGFDY TKEYDACNAD ADEEALSMID FYRKLGPRLR AIWIYNGDTD PCVSYEGTRV AVSRIGFPEL DGGGYRPWFY NQTATTVEVL MEKPALFGPD LLLQELGAQL GGEVVNYENN ISFLTFHGSG HMVPQFRPQA ALHMLRKLVN YEALSPKLPL NATLVDLSNQ DFRVTMDI
|
| |
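The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The sketch below shows one way to strip the grouping and compute a GC percentage for a nucleotide string. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not state how its own GC content value was calculated or rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Raises ValueError on an empty sequence rather than dividing by zero.
    """
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence in the record above.
print(gc_percent("CTCGTACCAA CGATACCGAT"))  # 50.0
```

Applying `gc_percent` to the full gene sequence, and `len(clean_sequence(...))` to the gene and protein strings, gives a quick consistency check against the GC content, Gene Length, and Protein Length fields.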