Gene Information |
Locus tag | PHATRDRAFT_47664 |
Symbol | |
ID | 7202854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 455339 |
End bp | 459241 |
Gene Length | 3903 bp |
Protein Length | 1300 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181911 |
Protein GI | 219123187 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
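The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, that the CDS is unspliced (the record says "Is gene spliced | No"), and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if includes_stop else codons


length = gene_length(455339, 459241)
print(length)                  # 3903, matching "Gene Length"
print(protein_length(length))  # 1300, matching "Protein Length"
```

Under these assumptions, 3903 bp corresponds to 1301 codons, of which the last is the stop codon, leaving the 1300 residues reported.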
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00840283 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCTAG AAGAATGCTC GACCAACAAC GAAAAACCAC ATATTCAACT ACTCCATGAT GAAAAGAACT GTGTCGGGCT TTCTCGTCTG CCTTTCCGAT CGTCGCTCAT TTCTTCGGCT GCGATGCCTC TTTTAGTCAT GATCAGCTTG CTTCTTTTGC AGCCAGTAGC CTCGCAGACA CCCTATCGGA CGTGTTATGA GGCCCTAAGT AGTGCCGATG GTGATCGAAA TAGTGTTCTG ACGCAGGCAG AATACGTCGA GTCCTTGAGG ATTTTGACCT TTGGTGCAGT CAGCGTCAAC TCCTACGAAA ATCTTTCCGA AGAGCTAAAA TCGGGTTTCA CCTTTCGTGC TCCAGACGGT TCACCGGGAG TCGACATAAG TGAAGTAGCA TCCAGTAGTA GCTCCAGTAT GTCCCTATTT TGCACCAGTA TATACGCTGG TCTCGTGGAA AGCCTAGGTA TTGCAACTTC TCAACAAGCT TGCTTCATTG CCATGTCCGT TGGTGATATT GGACGAGACG ATGCTCTGGC AGCCGAGCCG GATTTTGTCC GATATGCTAA CCAGATGGCG GGAGGGTCCT ACGGAATTTC CATTTCCTTT GCATCTTTAC CGCCACCCCT GCAAGCGGTC TACAACGACT TTTCAGACGG TGATAATGGG ATTCCGGTCA TCGGTTCCAA ACCGGGCACG ACGCCAACAA TCGAAGACCA GAGCTTTTTA ACCAATCTTT GTCGACAGAC CGCTGTGGCT GTTGTTGCGG GGGAACAGCC AATGGCTACA CCAGCAACCA TGGCTGGAAC GACTCCGGCC ACGTCAGCAC CTGTTTCTTC TCCAACAGAC GGCGCTGGAT CTATCACTCC CCTTTTTACG TTTTCCGATT GCACAACGGC GATGCTGGTG TCCGATTTGA ACCGTGATGA TTTCTTCGCC CAAGCGGAGT ATTTAAGATT CTTAAATCGA CTAACTACCA ACGCATTTTC TGATCAGACG TTCGGTACGC TTCCCAGTCG ACTCCAGAGT AACTACAACG TCTTGGCAAC TAAGGATGGT CAGATTCCAG TCAACGGGTC TAAGCCAGGT TCTGCGCCGA GTGCTGAGGA ATTAGCAAGC CTGGAAAATG TCTGCGATTC CACGGAAACA GCCTTACGTG AGGAAAACAG CGGCGGAGGC GTAGTTCCAA CGTTTGCTCC TACCATGACA ACGTCCGGGA TAACTATGGC TCCAGTGCCA ACTGAGCAGA CGACCGGTAG TCCGGATCCC GTTGCCATCC CAACACTTCG TCCAGTCGCA GCTCCATCAG TACCGACTAT TCCTTTTAAT ATCTGCACAC TCAATATGGC CACGTCGGAT TTGGATCGGA GTGGTAGTCT TAGCTCGAAC GAGTATTTTA ATTTTGTCAA CAAAATTGCT GGAAATACTT ATGATGGCTT AACATTTGAC ACCTTACCCG ATGCCGTGCA AGAGGCTTTT GATGGTCTAT CTGATGGCAA ATTGATTGAC GTTTCGGGCT CGTTTCCAGG TCAACGTCCC AATGAGGCAC GTGAAGCCGA ACTGGAGGCC ATTTGTGCTA CCTCCTTGGA GGCGATATAC GGTCCTCCGC TTGTGACCGC CACGACGACA CCATCAGTAG ACCCCGCTCC AACAACGTCA CCAAGCGCTG CGCAAGTCCA GAACATAACA GTGTTCAACG GCTTTTATAT CTTCAATGTC AAGGGTGTTC GAGCAGCTAC TTTGATTTCA GGTCAAAATC GAGATGGGCT GAACCGTGGG TACGAAGCTT TTGTCAGGAA CATCACGGCC 
GAGTTTATAG CAACTACACA GGTGCCCGGC GAGCCTCAGC GGCATCTTCG ATCGCGCCAT TTAGAAGAAC CAGGCCTGGC GCCCGAGGCG GCACGAATTT ACGAAATAGA TGATGTTGAC TGTCCTCCTG TAATCAACGT CGAGAGTACT TTTTGTCAAA TAGTGTACGC AGAGTTCGAG GTACACCACA TTCGAGAGCC GGCCGCTGAT GCATTTTTTG AATCTTTGAC GAGGATAACT CAAGACGCAA TTGTCGGCGA CAAGACTGGT CTTAATGCTT TTGTGCTCGC GGCAAATCCC TATTCGGATG TGGAGGTTTT GGGACCGGCC GAACAGCTTC GTCCGATCAA CCTACCCGGG GTTCAAGAAC CCGTTCCAGG CAACCCTGTC GGAGCAGAAA GCGGAGGCAA CCAAGCAGGC CTTATTGCGG GTATTTTCGT CGCGATTGCA GTCATCGTAA CCGTAGCAGG AATTGTACAC TACCGTAAAC GGAAAGGTGA CAGTTATGGT TTGAACTCAG GTTTCAGTAC GCCATCCTTT TTCAAGCGAA AACCAAAGTC AAATCCAAAT GTACCAGAAG TCAATTCATT GGGGAGCGCG CAAAATCATG GCGAGCATCA CGATTTAGAA GACGGATTCG GGCGCTTTGA AACCATTGAA ACCAAATCTC CTATGAAACC TGGCGGAATG GCAGGCTTCT TCGGGGGTTC GCATCATAGT AAATCTGGAT CGGACGACGA GGAAAGCGGA GGATTTAGTG TACAGGAACA GTCACCAATC AAACGAGATG CACGAAATGC TCTAGGAGAC ATTGAAGTAT CCGACAATGG CCATTTGAAA GGAAACGCTT TCGGTGGGTT CGGTTTTGGA AAGAAAAAAT TGAACAAACT TCAACACTCT ACCAACGAGC TCGATAGTAG TGAATACGAC GACAATGAAG ACGATTTCGC GAATTATGGA TTTGAAGAGC CGGAAGAGCA GCGCCATGAT TCTGTGGGGG ACATGTTTGA TATTCTCAAT CAAAATGAAG GAGAAACAGA CTGGGATCCC CAACACGTTT CTAGCGCAGC ATGGCATGGA GACGTCAAAG GTACCTGGGG GGCCCAAGGC CACAACCATG ACTTCGGTGA CGACCATACC GTTTCACAAA GCGGTTCGGA GAGCCAGAGC CACGATAACG AAGAGGAATC CGGAAGCTAC TCAGAGGATG ATGATTCCGG CTCGAGTGAT GACAACTCGT ACAACGGAGA CGATGGTGAT GACACGACTC GACGAACTCG AGAACCTTTA AATTCTCGTG CCATTGGCGC GAGCGTCACG GAGAACATGC GGCACCTCGA CGCAATGGTG CATCATGGGC ACTGGAATGG TTTTGTTCAG AAAACTGCGG AGCTTGCGGA AGATCGAAGC GAAGGCTCTG AAGACAAAAG CGAAGAAGAG TCGTTTTCCG GCTCAGGGAC AAGTAGGACA GACTCTTTTG TGGATGATGG ACTAGATGGG GGTGAAGATG GTCTGAGCTT GAACTCTACT GAGAAGTCTA CACGAGAAAA GTATAGACTT CAAGTGGAAC AATTGGTTCA AAAAGTTGCA CCAGAGGAGT CAGACAACAT AAATGCAATG TTCGACAAAT TTCTTGGCCG AGAGGCTGAG CTCTTACAGA CGTTAGAATC CATGAATGAT CGTTCCGCTT CGCAAAGAGC CCGAAAAGCA GTACACAGGT CGAAAGCTTT TCCTCAACAA TCTGGGCGGC TTTCTGCGGG AGGGTTGGAT GGTTCGGCCG CCATTGCAGC AGCTAGTACA CTTGGTGGTG 
GATTCTACGA TAAAGGTGAT GATGAGCACG ATGAAAATCG TAGCAGTGAC GAGAACAGTC ACTCATATGA AAGCAGTGAT GAATTTAGCG GCAATGCTAG TGGCAGCGGC AGCTACGAAG ATGGTCAAGG AAGCCACCGT TCAGACATCT ACAACAAGCC GGAAGGGAGC TCCCGCTCTG GTTCCGCAAG CTTTGACGTT GTTGATGGGA GCTACCGTTC CGAGTCTGGG AGTTATGATG ATGCAGGTAG CGATAGCTAT GCCTCTGATA GCGGCAGCGA TGAAAACTCA TAA
|
Protein sequence | MDLEECSTNN EKPHIQLLHD EKNCVGLSRL PFRSSLISSA AMPLLVMISL LLLQPVASQT PYRTCYEALS SADGDRNSVL TQAEYVESLR ILTFGAVSVN SYENLSEELK SGFTFRAPDG SPGVDISEVA SSSSSSMSLF CTSIYAGLVE SLGIATSQQA CFIAMSVGDI GRDDALAAEP DFVRYANQMA GGSYGISISF ASLPPPLQAV YNDFSDGDNG IPVIGSKPGT TPTIEDQSFL TNLCRQTAVA VVAGEQPMAT PATMAGTTPA TSAPVSSPTD GAGSITPLFT FSDCTTAMLV SDLNRDDFFA QAEYLRFLNR LTTNAFSDQT FGTLPSRLQS NYNVLATKDG QIPVNGSKPG SAPSAEELAS LENVCDSTET ALREENSGGG VVPTFAPTMT TSGITMAPVP TEQTTGSPDP VAIPTLRPVA APSVPTIPFN ICTLNMATSD LDRSGSLSSN EYFNFVNKIA GNTYDGLTFD TLPDAVQEAF DGLSDGKLID VSGSFPGQRP NEAREAELEA ICATSLEAIY GPPLVTATTT PSVDPAPTTS PSAAQVQNIT VFNGFYIFNV KGVRAATLIS GQNRDGLNRG YEAFVRNITA EFIATTQVPG EPQRHLRSRH LEEPGLAPEA ARIYEIDDVD CPPVINVEST FCQIVYAEFE VHHIREPAAD AFFESLTRIT QDAIVGDKTG LNAFVLAANP YSDVEVLGPA EQLRPINLPG VQEPVPGNPV GAESGGNQAG LIAGIFVAIA VIVTVAGIVH YRKRKGDSYG LNSGFSTPSF FKRKPKSNPN VPEVNSLGSA QNHGEHHDLE DGFGRFETIE TKSPMKPGGM AGFFGGSHHS KSGSDDEESG GFSVQEQSPI KRDARNALGD IEVSDNGHLK GNAFGGFGFG KKKLNKLQHS TNELDSSEYD DNEDDFANYG FEEPEEQRHD SVGDMFDILN QNEGETDWDP QHVSSAAWHG DVKGTWGAQG HNHDFGDDHT VSQSGSESQS HDNEEESGSY SEDDDSGSSD DNSYNGDDGD DTTRRTREPL NSRAIGASVT ENMRHLDAMV HHGHWNGFVQ KTAELAEDRS EGSEDKSEEE SFSGSGTSRT DSFVDDGLDG GEDGLSLNST EKSTREKYRL QVEQLVQKVA PEESDNINAM FDKFLGREAE LLQTLESMND RSASQRARKA VHRSKAFPQQ SGRLSAGGLD GSAAIAAAST LGGGFYDKGD DEHDENRSSD ENSHSYESSD EFSGNASGSG SYEDGQGSHR SDIYNKPEGS SRSGSASFDV VDGSYRSESG SYDDAGSDSY ASDSGSDENS
|
| |
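The relationship between the gene sequence and the protein sequence above can be illustrated by translating the first block of the CDS. This is a minimal sketch, not the database's own pipeline: the "Translation table" field in the record is blank, so the standard genetic code (NCBI translation table 1) is assumed here, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated in T, C, A, G order, matching the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons containing unexpected characters are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGATCTAG AAGAATGCTC GACCAACAAC"))  # MDLEECSTNN
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function would be the natural way to check the whole record.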