Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47726 |
Symbol | |
ID | 7202905 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 652068 |
End bp | 654205 |
Gene Length | 2138 bp |
Protein Length | 571 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181947 |
Protein GI | 219123262 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.347863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAAAATCCAT CTTTCCTTTG AGCGACTGTG ACGACCTGTG GAGGGAGAGT GATTTTTGTC ACTGTGAAGT TGTCGGTACT CCGGCACTCG GCAGGTAAAA TACTCGCGGT GGGTGCTTTG GTGCGTTCGG TGTAGGACGC ATCCTTCACT GTCACTGGGG GTATCGGCAA GAACCATACA CCGTAGAGAG AGCGAGACGA AAAGGACACA CCGTAGCGTA GAACAGTCGG ACGAAACGTA ACTCAGAAGA GTCGGTATTG GTACACACTC ACCATTGTTG GGAATTATGA GTGCGACGAC TCCGCCAGTC GCTTCTTCGA AAGTGGACCT GTTCGAGTAC ACAGAAGAGC AACTATTGGA AGAAAAGGTA CTCACACACT ACGAGATTCT CGGTATATCG ACCTTTTGTT CGCAAGATGG TACGTTTTAC CGAATCACAC TTTGGATTGG CGTCACGCTC GGCGTACCTC TGAAGGATTT ACTGGTCGTT CGCCTCGTCT CACCTCTTCC GTACCAACCC CGTGCTAACC TTCTTGATTC CAATCGCCGC GACAGATGTC AAAAAGGCCT TTCGGCGCTC TTCACTCAAG TACCATCCCG ACAAGCACGG CCACGATAAG GACTACGCAT TCCTGGCGCT CAAGCAGGCC CACGATACGC TCTACGATCA CGAAAAACGC CAAGCCTACG ACTCCACAAC CTTACCCTTT GACGACGCCA TCCCCCCACC GCGGGATAAG CTCCTGCAGG ACGATCTACT CCTCTACAAG GACAACGACT TTTACGAGCT CTATCGACCC GTCTTTGAAC GCAATCTGCG GTTCGACGCA AACTTGCGAC CGGACGCCGT CGGCAACGCC AAGAACGGGA ATCACAACGG CAAGAAGAAA AAAGCCGGCA AGGCCAAGGC GCCCCCGACT TTGGGCGACG CCGACACCCC CATTGCCCAA GTACACGCCT TTTACGAATA CTGGATTCAT TTCGAATCGT GGCGCGATTT TTCGGCCCAA GCCACGGACG AACTTCAGGT GGAGAACGAA CTGGAAAATG CCGAATCGCG CTTTGAAAAA CGCTGGATCC AGAAAGAAAT CGATAAGCGC GCCAAGCAGC TGAAAAAGAC AGAAATGAGT CGGATTCAAT TGCTGGTGGA ACGGGCGATG GAAGCCGATC CGCGATTGCG GAAGTTTCGA CAGGAACAGT TGGCCGCCAA GGAACAGGCC AAGCGCGAGC GACAGGAAAA GGCGGAACAA CAAAAGATAC AAGCGCAATT AGAGCATGAA CGGCAACAAC AACAAGAAGT AGTGGACCGG CAGCGGCGAG CGGAAGAAAA GGTGACGCGC GAACAACAAA AGAAACATAT ACGGAAAGCG CGACAGAGTC TGCGAAAAAT GGCGTCGGCC TCTTTTGAAT CTCTCGAATC GGAACAGAAA TCGTCCATTG TATGGGCGGA TACTTACGAC ATGAATCTGG ATGTCGAAGT GCTGTGTACA AATTTGGATT TGACGGGGTT GCAATCGTTG GCTCAAGAAT TGGAAAATAT CACTTGTCCG AAAGAATCGT TGACGATGAT TCACCAAGAA GTACTAGTGG CTAAGCAGCG GGAGACAGAC GGGGACTTTA GCAATGGCGA ACAATCGTCT CCATCTCACA ACGGAACTTT TTCGACGAAA GAAACGACAA CGTCGCCTGT TGTAACACCG GCGTTGAAGC CGAACCTCTG GACCAAGGAG GAATTGTCAG CCCTGGCTAA GGCAGTCAAG AAGTATCCAC CCGGTGGTTC CTCACGATGG GAACAGATTG CGTTGTTTGT GAATAATTTG TGCAAACAGG ACGAACCCCG ATCCAAGGAG GAATGTATCG AGAAATACAA CAACGTGGCG AAGACGCACA GCAAACCAAC CGAAAGCACG AACGGCGTCG CGGCAGCATC AGAACCCGAA AACTCTTCGC AATCCAACGA AGACGTGTGG ACGGCCGAAC AGGATCAGCA GCTGCAAGAT GGACTAGCTG CGAATCCAGC GAGCATGGAC AAAAATGAAC GGTGGACCGC AATTACAGAG TGTGTCCCGG GAAAATCCAA GAAGCAGTGC GTACAACGGT TCAAAGTGAT TCGGGATGCC TTGAAAAAGA AGAAATAG
|
Protein sequence | MSATTPPVAS SKVDLFEYTE EQLLEEKVLT HYEILGISTF CSQDDVKKAF RRSSLKYHPD KHGHDKDYAF LALKQAHDTL YDHEKRQAYD STTLPFDDAI PPPRDKLLQD DLLLYKDNDF YELYRPVFER NLRFDANLRP DAVGNAKNGN HNGKKKKAGK AKAPPTLGDA DTPIAQVHAF YEYWIHFESW RDFSAQATDE LQVENELENA ESRFEKRWIQ KEIDKRAKQL KKTEMSRIQL LVERAMEADP RLRKFRQEQL AAKEQAKRER QEKAEQQKIQ AQLEHERQQQ QEVVDRQRRA EEKVTREQQK KHIRKARQSL RKMASASFES LESEQKSSIV WADTYDMNLD VEVLCTNLDL TGLQSLAQEL ENITCPKESL TMIHQEVLVA KQRETDGDFS NGEQSSPSHN GTFSTKETTT SPVVTPALKP NLWTKEELSA LAKAVKKYPP GGSSRWEQIA LFVNNLCKQD EPRSKEECIE KYNNVAKTHS KPTESTNGVA AASEPENSSQ SNEDVWTAEQ DQQLQDGLAA NPASMDKNER WTAITECVPG KSKKQCVQRF KVIRDALKKK K
|
| |