Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41302 |
Symbol | |
ID | 7199142 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 68044 |
End bp | 70507 |
Gene Length | 2464 bp |
Protein Length | 747 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185319 |
Protein GI | 219130327 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTTT TCCCAAAGGT TACTGTTACC ATAGTAACAT TTGTAATTGC GCTAGCCTGG GGTTTCAATG ACGCCGTGGC TCAGACTGTA GCATGCCCTC AAGCAGAGGG TTTGACCGGC TACACAACAA TTGCATCGAT CAACAATGAC ATGAGGGCCG AGTTAGCACG AATCAGTGAC GGCGAAAAAC TTCCTGAAGA CAGTTACACC TATACGCTTT GTCCGCAGAC AATATTTGAT GTCTCAAATG AGCCTTTGCA GCCGATTTTA AGTGATGTTT CTTTCGCGTG TGGCTCAAAC GGGGAGGCGA ATGACAGTTG CGTTCTTTTC GGAGGTAGCC AGCAAGTCCT CATTGTAGAC TCACTTTTGA ATTCATATCC TCTAGAACGA ATTACTTTCT CTGGGTTGAC ATTCTCTGGT TTCAACCAGA ATTCTGAAAG TAGGGGTACT GCGATTGCTG CTTTTGCTTC GAAGCCCACT TCTGCGATTT TCCGTGACGC TTTGTTTCGT GTACGTTTTT TCCGTTTTGA ATGCCAACTA CAAATAAGAT GTCCAATCGC TTCTCACAAA TTCAAACACA AATCACAGGA CTTTATGAGC GATTTCGTCA TACTTCAAAG TACAGGGGAG CCGGGGTCGG AGCCAATGAT GATCGAAATC AATGACAGTA TCGTCAATGG CGGCACAACA GGAGTCTTCT TTGACAACGA CGGCGGCTTT CTAAATATCA GAAATATTCA GGTTGAAGGA TTAAATGCTG CCTCTTTCAT TGCCACGGCC AATGGAGGCG TTTCGCGGCT TAGGGAGTCC TCAATTTCCA AAGGATCTTT GGATTCGATA ACATACACAA CTAACTCGGC GGAGCAGCAA GTTGCAGATG TGAACATTTT TTCAATGAGC CGCCTTGCAG ACGCATTTTA TGCGGAACAA GAAGGAAGCG GTTTAGTTGT GAGAAATGTC AGCTTATTTT CCAACGATCT CAGTCCTATG GAGTGGACTG CAATATCAGC ACAATCCGGA GCAATCGTGG AAGTAGTAGG TTCAACAATC TCAGGGAATT CAGGTCTATT GTTTGCTTTA CAAGCTGGTA TTGGCTCCGC TGTCTCTATA ACTGATTCGT CCATAAACCA AAACACGGCA GCGGTAAGAA TACAGACGAC TCTCGTTTTG TTCACGTCAC TTCTTGCCTT AACCAGTTTC CTTTCGTAGA GCTCGACAAG CGCTTCCATC TTCGTAATTG GTGGCTCGGC AATTGTCGAA CGATCTGAGT TCACCCAAAA CTTTGGTTTT TCGGTAAGTT GTTCACAATC AGTGACGAAA CCTCAAATCA CTATTACGAT GCAACTGATC TTTTGCTTCG TCCATTTAGG GAGAAATTTT AGCTTTTCTG GGTGGCTCTG TTGAATTGAG TCAATCATGT ATCCAAGACA GTAGGTCGGA TTTTGTAGCG TTCGCAGATA GCCAATCTAC CGTTTCAGGT GAAAACGCGA TGAACTTTGT TGGTAGCTAC GAATCCTCGT TTTGTACATC TAGCGGGCCC CGACTCTTTC GCGAAGATGT CGGGGCTGGC TGCTTCGTCG GCGGCCTATG CACGGGAACT TGTCGGAATA TTGCCGATGC TTCTGAATGT ATGGCCCGCG CAGCAACCCC AACCTCAAGT CCGACTAACT TCCCCAATTC GGTGTTACCG ACAGTCACCT CAACAGGTCA ACCCGATACG AGTATTCCCT CGTTTACACC GGGATCCTTT GTGCCAAATA CCCAAATACC AATTCCAACT GTTGACCTTA CTTTTTCGCC AACTACAGAT GATACTCTGC TTCCGACAAG GAATAGTGTT CCTGAAAGCA CAAACCTGCC AACACTAGAA GTCCAACGTC CGACTTTGGC TTCAATTACA ACATCAGTAC CGATTACTCC GCCGAGCGAA CCAACAGTAA TTCCGACCTC TGCTCCACCG GGTACAACCG TCACGCCAAC AATGCTGCCC ACAAATGTGC ATGGCCAGAA CGGAACTCCG TCTCCAACAC AGAGATGCCG ACCAGCAGGA AGCGGGACTA TGGCCAATTC TATCGGAAAG GGCAAGGGGG ACAAAAGCTT CAAATCGTCC CGAGATTCTT CAAAGAATAT CCGAACACCA GAGTTCGGGA TTGAACAGTG GGTGGACAAC AAATTATTGG ACGGACGTCA CACATGGTTC AGTACCCAAT CGAAGGAAGA ACCTTTGTTC AACGAAAGCT TGCCCATTTG CCCTCCGGAA GAGTCTGCAA GGCCAACCGT TGGTGCCGTA TCCACAAAGT CCGGTAAAGG AAAAAGAGGT GCTTCAGAGA AATCTTCGAA GAAGGGTAGT ATGAGCGCTA AATCTGGAAA GGCAAAAAGC AGCAGTACCA AGAGTAAAAA AAGCAGCGAC AGGAGTAGCT TCAGCAGCAA GAAGAAAAAG GGTGGTGTAC GTCGTGAAAT TTGA
|
Protein sequence | MALFPKVTVT IVTFVIALAW GFNDAVAQTV ACPQAEGLTG YTTIASINND MRAELARISD GEKLPEDSYT YTLCPQTIFD VSNEPLQPIL SDVSFACGSN GEANDSCVLF GGSQQVLIVD SLLNSYPLER ITFSGLTFSG FNQNSESRGT AIAAFASKPT SAIFRDALFR DFMSDFVILQ STGEPGSEPM MIEINDSIVN GGTTGVFFDN DGGFLNIRNI QVEGLNAASF IATANGGVSR LRESSISKGS LDSITYTTNS AEQQVADVNI FSMSRLADAF YAEQEGSGLV VRNVSLFSND LSPMEWTAIS AQSGAIVEVV GSTISGNSGL LFALQAGIGS AVSITDSSIN QNTAASSTSA SIFVIGGSAI VERSEFTQNF GFSGEILAFL GGSVELSQSC IQDSRSDFVA FADSQSTVSG ENAMNFVGSY ESSFCTSSGP RLFREDVGAG CFVGGLCTGT CRNIADASEC MARAATPTSS PTNFPNSVLP TVTSTGQPDT SIPSFTPGSF VPNTQIPIPT VDLTFSPTTD DTLLPTRNSV PESTNLPTLE VQRPTLASIT TSVPITPPSE PTVIPTSAPP GTTVTPTMLP TNVHGQNGTP SPTQRCRPAG SGTMANSIGK GKGDKSFKSS RDSSKNIRTP EFGIEQWVDN KLLDGRHTWF STQSKEEPLF NESLPICPPE ESARPTVGAV STKSGKGKRG ASEKSSKKGS MSAKSGKAKS SSTKSKKSSD RSSFSSKKKK GGVRREI
|
| |