Gene Information |
Locus tag | PHATRDRAFT_49053 |
Symbol | |
ID | 7195302 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 438331 |
End bp | 440705 |
Gene Length | 2375 bp |
Protein Length | 688 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183609 |
Protein GI | 219126742 |
COG category | |
COG ID | |
TIGRFAM ID | |
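The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions, which is consistent with the reported Gene Length. It also assumes that the coding sequence ends in a single stop codon. The function names are hypothetical. Under those assumptions, the difference between the genomic span and the coding length estimates the non-coding bases inside the gene, which is expected to be non-zero for a spliced gene.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 438331
end_bp = 440705
gene_length_bp = 2375
protein_length_aa = 688


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Nucleotides needed to encode the protein plus the assumed stop codon(s)."""
    return 3 * (protein_aa + stop_codons)


computed_span = span_length(start_bp, end_bp)
cds_nt = coding_length(protein_length_aa)
# Bases within the gene span that are not accounted for by the coding sequence.
non_coding_nt = computed_span - cds_nt

print("span matches reported gene length:", computed_span == gene_length_bp)
print("coding nucleotides:", cds_nt)
print("non-coding nucleotides within span:", non_coding_nt)
```

A positive non-coding count agrees with the "Is gene spliced | Yes" field, although this check alone does not locate the intron boundaries.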
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.377912 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTATGGGA GGGGCTCTCC GCAAGAGATT GGTGTTGGCC TTGTTGGTAA AATTCTTTTC ATCACGGCCA TTGTCGTGGT CGTGTGTGTA TCTCTTACCC AACCTTGGTC GAAAATGAGT GCATCTGCTG CATCCCGCAA GGTGGTGCAA TCAGTCGCCC ATCGGGTGGA AATTGATTGG TAGGTAAAAC CAAAGGTTCC TTTCCTTGGC CGCGCCGCGG CGGACACTTC CGCCGCAATC CTCCAATTCC GTAGCGTACG CACTTACCGC ATACCCTCGC TCACACCGTT CCCCATCTCT CTACATATCA TCGTGCTGGT GTCCACATTA GGTCCCGCCA TGTCTGGAGT TGCTCTCCTC AAGTAGCCGC CTCCTTCAAC AAGCTCAAAT CGTGGGTGGC GCTGTCGGAT TCCATGGCGG AAAAGTACGC CACCTTGCCG ACGCCAATTG ATTTCGGCAA GGCCAAGAGT GCCGTTCGCG ATAAAAGTCT GGTGGACTCG CTCGAAGCCT TCTACAAGTC CAATCAAGCG CCGGCGGAAA CGCACGAGTG GGCGCCGGAA GAACAGGAGT TGACGGCCAA GAAAATCGCC TACCTACAGG AACTCGACGC CATGCACCAA GAACTCTTGC CCGTACTGGA AGCCGAGTTG GCTTTTCAAA AGAACAACCG AACAACCAGC GATACTACCG TCTTTGATAT GAAGGTGAAT TACCCCGTTA TTCACGAAGA AATCGAGGAC GAAATTGAAC GTCGTGAATG GTTCAAGGAT ACCGGAATCG GTCCCAACAA GTAGGCAGTG TAGCGTTACC TATAGTCGAA AACTAATTCT AGTTGGACTC GTCCGAGACT TTTTCTAGTT GTCCGATCAC GATTGGGCCT GTGAGTTGTC CGCAACGGCG CTCTCGCCGT TTGGAAACGC TGGGTACGAC TGGCTTCGCC CTGGAGGTGA ATCCGGAGCT TGTTCATTAT TTATTTTTTT CCAACCGGTC TCAGAGTCAC ACCATTTCCA GTGATGGAAA ATGGTTGGGA GCAGACACAA ACGCAATTCC AAAACCCCCA ATGTTGGGTG GTGCAGAGTT TGCGGAAACT AAAAGGGCTC ATTTTATGCC AGAGTGGAAC CAATCGTAGC GTTCAAAACC ACCATAATTT CACCATGGAT CCTTTACTTC TCGCTCGGAA ATTACGCCGA CGCCGCGGAC GCTCCATTCT TGCCTCGCAG CTTTCCAATG CAGCCGTTAC CTTGTGTACG GGAGCCGTGG TGGCAATGAT TTTAGCTGCC TTGCACGTGT CCCGACTAAA TACTGATCGT GGCGTTCCCA ATGCTCCCGA AAGCTTGCCT CGGAGTCGTC GTCCCGGTGC ACCCATGGTG TTTGGTCAGC GACAGCTCCG GGAAGCAAGG GTTGAGCATG CGCGGTCGGG AAACTCAGAC GCGTCTACAC CGCTGGATTT TGTCGTGGCG GGCTTTCCCA AAACCGGTAC CACGACACTA TTATACGCCT TTCGGGATCA CGAAGAAATG GACATTGCCA ATTCGGAACG CTGTAGTGTC GCGCACGTTC CACTATCCGA GAGTCGGGCG CAGCAAGACT TGAACGCGGC CGTTGCCGAA CTCTCTCCGT TGAGGAGCGT CAAACGCGGT ATCAAATGCC CCACGATGCT GAGCAACTCT GTATCGCTAT CGCGGCTGCA GAGACATTCA CCGTCGGCCA AACTCATTGT TGGATTGCGT CACCCGACGG AACTGTTGCA AAGCTTTTAT AACTACCGTG TAACGGAAGT TTACGACAAG CGGTCCCACC 
AGTCAATACC CAGCTTTGAT GAGATCGTTC GGACCGGCAA GGCTTGGAAA GGCGTCTCCT TGGAAGCGGT CCGTTTTGAC TTGTTTTTGA TGCAGCTCGG AAAGACCAAT ATGTCAACCA TTGACCTTCA GCAATACGCC GGTCGCAAAT ACATGGGGGT GAAGCCTAGC GAAATGCAAG TCTTTCTATA CACCCTTGAT CAAATTGAGG ACAGGGATCC GACGAACAGT TTGGTATTTC GGAAAGGCCT TGAGAGCTAC CTTGGTCTGG AGACTCCGTT CCAGCCCTTT GGCCGTGAGA ACACCAACCA CTTTGTCGGA AACAAAGCGT ATCCCGAAAC GCTCGATATT TGCCAACCCA AGTACAACAA ACTGAGAAAA AAGTTAGCGG ACCAGGGACG TAAAACCGCT CGTTGGATTC ATGAGAGGCT TTTGGAGAGT CCTGACGTCT TTGTCGGAAA CAAAGATGGC TTTCTTCAGT CGCTCGAATC GTGGGGGACT GATATTTGTC ATCTCTCATC CATAGCGGAC GTAAAAGTTT TGCCTCAGAG ACGGACTCTC AAGAAACCTA TTTAG
|
Protein sequence | MYGRGSPQEI GVGLVGKILF ITAIVVVVCV SLTQPWSKMS ASAASRKVVQ SVAHRVEIDW SRHVWSCSPQ VAASFNKLKS WVALSDSMAE KYATLPTPID FGKAKSAVRD KSLVDSLEAF YKSNQAPAET HEWAPEEQEL TAKKIAYLQE LDAMHQELLP VLEAELAFQK NNRTTSDTTV FDMKVNYPVI HEEIEDEIER REWFKDTGIG PNNCPITIGP VSCPQRRSRR LETLVMENGW EQTQTQFQNP QCWVVQSLRK LKGLILCQSG TNRSVQNHHN FTMDPLLLAR KLRRRRGRSI LASQLSNAAV TLCTGAVVAM ILAALHVSRL NTDRGVPNAP ESLPRSRRPG APMVFGQRQL REARVEHARS GNSDASTPLD FVVAGFPKTG TTTLLYAFRD HEEMDIANSE RCSVAHVPLS ESRAQQDLNA AVAELSPLRS VKRGIKCPTM LSNSVSLSRL QRHSPSAKLI VGLRHPTELL QSFYNYRVTE VYDKRSHQSI PSFDEIVRTG KAWKGVSLEA VRFDLFLMQL GKTNMSTIDL QQYAGRKYMG VKPSEMQVFL YTLDQIEDRD PTNSLVFRKG LESYLGLETP FQPFGRENTN HFVGNKAYPE TLDICQPKYN KLRKKLADQG RKTARWIHER LLESPDVFVG NKDGFLQSLE SWGTDICHLS SIADVKVLPQ RRTLKKPI