Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21455 |
Symbol | |
ID | 7202204 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 865077 |
End bp | 868137 |
Gene Length | 3061 bp |
Protein Length | 815 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181459 |
Protein GI | 219122242 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACAGATCGTC CCCGGCAGGC TCGTCGTCCT TGAACGGCGG TAGCAGCAAT TCAAATGATT TTGAAATGCC TTCCTTTCAA ACAAACATGT TCCCCAGCAA GAATAAAGAC GGATTGTCGT CGGGAAAGGG AAAGGGGGCG CTCTACGATG CCTACAATCA GCTACATACA CTTGCGCAGG TGAGAGATTG AGATGGTTCC ATACTGAATG GAGACACTGA GAATCCAGGG CCTCGCTCAC ACACTTCACT ATTCTCTACA GGCCTACAAT AAGCCGTTCG ACTCTCCAGC TGTTGTTGTA GTAGGACACC AGTCTTCCGG AAAGTCAGCA CTTATCGAAG CACTCATGGG TTTTCAATTC AATCAAGTTG GAGGCGGAAC AAAAACGCGA CGACCGGTAG CTTTACGTAT GCAGTACAAT CCTCAGTGCC ACGAGCCCCG ATGGTTTCTT GTCGGAGAGG ATGGTGTCGA GCATCCGCAA ACACTGACAG AAATCCAGGC CTACATTGAA CGAGAGAACC AGCGTTTGGA ACGCGATCCG TTGAGGTCTT TCGACCCTCG AGAAATCAAC ATGCGAATGG AGTACAAGCA TTGTCCAAAC ATGATTCTGA TTGACACACC CGGACTTATC TCAGCACCAA GAGTTCCCAA GGGTCGGGCA GGTAGTGCCA ATGCAGCACT TGCTCAGCAA CGGGCGTTGC AAGCCTCGGC GAGGGAAGCA GAAAGACTAG TAGTTGAAAA GATGAAGTGC GAAGACTACA TCATTCTGTG CGTCGAAGAT ACGAGCGACT GGAAGCATGG ACAAACTCGG GAAGTAGTCC AGAAAGCTGA CCCAGACCTT TCACGCACGG TCATTGTTAA CACAAAGTTT GATACCAAGG TGCCGCAATT TGGTAACCCA GCAGACGTTG AGGACTTTTT GAGGGCTCCA GTGTTGGACA GCGTTTGTCC CAATAAGCTC GGAGGTCCCT TTTTTACTTC TGTTCCAAGC GGTCGCGTTG GGCGCATGAG TGATCATAAT TCCATGGATG GCGACTTTTT GTTTGATAGT GACGAAGACT TTGTAGTTGC TTGCGCGGAC ACTGAGCAGA CGGACAGAAA TGTGGTGCTC TCTCGGCTGC GACGACTCGG GAAGGTTGAC AAGAAGGAGA AAACGAGTCT AACTACGCGC GTCGGCTTAA AGAAACTTCG CACCTTTTTG GAAGAGCGGG TAGATGAGTG CTATCGTCGA AACGTTCGAA AAATTATCCC GATGTTGCAA GCTGAGTATT CCTCGACCGA ACGAAAACTC AAAGCTTGCG ATCGTGAATT GCAAGCGCTT TCTGTCGACC GCTTGAAGGC TGGCGCAGAC GCATTTTGCG ACGAATTCTG TGCAAATCTG CGGAAGGCCA TGCAGGGTAC CATTGTGGCT CCCGTCACTC TATTTGGAGA AACGTTGGGT CAAGAAACAT CTGTCTCGGG TTCATTCCAT GGTACGTGTC GTTTAGCTTG AATTCTCAAT TCCTTTATGC TTTGCTCACG TACAATGTTC ACATTCCGAA ACACTTAGAT GTACAAGGTA GTCCGATGGC TGTGTCCGAT CGTACGTGGG ATCGTCTTGT TGAGTCTGAA GTTGGCAACC GAGACCATCG GTTGTACGGA GGGTCTCAAT ACCACCGAAC TCTCAGAGAA TTCCACCTCG CCACAAAATG TCTTCGTACC CCGGTAATCT CAGAGGATGA GATCGCCAAT GCTGCTGGAA TAGGCGATAC TCACGACGGA GTGAATTTTC TACATGCTGC TTGTGTTATT TCCTTGGAAA AAGCTCGCAT TTCCTTCGAA CCCTTGCTCT GGGCCTTACA AATGCGGATG GCGCACGTTA TGGAACGACT GTGCCCAGTG ACAGAGTATA TGATTCGAGA AGGCCGAGAT CGTGCCAAGT TTAAGACGTA TCGGAATCAG GAGGAGGATG AGAAAGACCA AGACATAAAT AAGCTTGACT CGCTAGAGAA CGCCATGGAT ATTTCCCAGA ACCCTCAGTT TCGTCAGCTG ATTCGTACAA TATACGAGAA ATTTGTCCAG AAGTGCGCAG ATTCGGTAAG TATTCCTTTG TTTCATTTCT TGAGTCAAGC AAACAAAAAT TAACGCTGGG ACCTATTCAA TTTGACAGGC TATGTCTCGA TGTCGAGATG ATTTGACTTC GATTACTCGA TATGTGACTT GGAACCTCGA TGAGCGCAGT AGTGGTGCCC TTACTCGTGC ACTTCCTGAC CAGACGGATA TGGTTTCCGT CTATCAGGCA ACCCTCGAGA GCGCGAAGGG AGACTCAAAG GTGTCTGCAG TTGGAGACGA CGAGGATATC AAGTCATCGG GTCAACTGAC GGCACAAAAG CCTTTGTCCC CAATGGCAAG CAGAAAACGC AACGTTGATC GCGACTATCA AAATCTGCTG CAACTTATGG AAGAAGCAAT AATGTCTCGG GATGCCAATC GGACAAACTT AGTGGTGGGT GGACTGGTAC AACACATTGT TGGATCATGG CGGGAGCAAT TTTGCAGAAG CGTAACGACG AAGTTCAATT GCTACTTTAT GCTCCCGTTC GTAGACGAGT TCCACAGGTA TATTCGGAAC GAGTTGCATA AAGCATATGA TGGTGAAGCC GGCACTGCAT CAGATGTTTT TGATCTCGCG TCGGTGCGCC GATCGCTTGA GTCACACCGC GTCGAGCTGG AGAATGAATG TCTCGCCAAT AAGAGACTTC AGGAAAAGTT TCAGCTCTGT GCAAAGCTGA TGAACACCAA ACAAGAGTCC GTGTTCGACA AGCCGTCTCG ATAAACAATA TTTCAGGGGC TGACTGGCAA CAGAGGGGAA TAGGTCGATG GGGGAACTGC GGATGCTCAA TTCCGTGTGA TCATTCTGTA AAAACTTGCC TGGTGTTCCG ACCCGATATG GCATGGTCAT CGCAGTTTAT ACCTATCAAT AGTTACAATG GACTGAGCGC AAGACAAAAT CATCAGTTTG TCACTTTCGT CGATTGGTAG TGTCTTCGAA TGCATCTTAA AGAAGAAAGT AATTTTTTGT T
|
Protein sequence | MPSFQTNMFP SKNKDGLSSG KGKGALYDAY NQLHTLAQAY NKPFDSPAVV VVGHQSSGKS ALIEALMGFQ FNQVGGGTKT RRPVALRMQY NPQCHEPRWF LVGEDGVEHP QTLTEIQAYI ERENQRLERD PLRSFDPREI NMRMEYKHCP NMILIDTPGL ISAPRVPKGR AGSANAALAQ QRALQASARE AERLVVEKMK CEDYIILCVE DTSDWKHGQT REVVQKADPD LSRTVIVNTK FDTKVPQFGN PADVEDFLRA PVLDSVCPNK LGGPFFTSVP SGRVGRMSDH NSMDGDFLFD SDEDFVVACA DTEQTDRNVV LSRLRRLGKV DKKEKTSLTT RVGLKKLRTF LEERVDECYR RNVRKIIPML QAEYSSTERK LKACDRELQA LSVDRLKAGA DAFCDEFCAN LRKAMQGTIV APVTLFGETL GQETSVSGSF HGTYVQGSPM AVSDRTWDRL VESEVGNRDH RLYGGSQYHR TLREFHLATK CLRTPVISED EIANAAGIGD THDGVNFLHA ACVISLEKAR ISFEPLLWAL QMRMAHVMER LCPVTEYMIR EGRDQNAMDI SQNPQFRQLI RTIYEKFVQK CADSAMSRCR DDLTSITRYV TWNLDERSSG ALTRALPDQT DMVSVYQATL ESAKGDSKVS AVGDDEDIKS SGQLTAQKPL SPMASRKRNV DRDYQNLLQL MEEAIMSRDA NRTNLVVGGL VQHIVGSWRE QFCRSVTTKF NCYFMLPFVD EFHRYIRNEL HKAYDGEAGT ASDVFDLASV RRSLESHRVE LENECLANKR LQEKFQLCAK LMNTKQESVF DKPSR
|
| |