Gene Information |
Locus tag | PHATRDRAFT_47891 |
Symbol | |
ID | 7203103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 364354 |
End bp | 367296 |
Gene Length | 2943 bp |
Protein Length | 867 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182379 |
Protein GI | 219124162 |
COG category | |
COG ID | |
TIGRFAM ID | |
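The Start bp, End bp, and Gene Length rows above are consistent with inclusive-coordinate arithmetic. A minimal sketch of that check, assuming 1-based inclusive coordinates (the record does not state its convention) and using a hypothetical helper name `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Values taken from the Start bp and End bp rows of this record.
print(gene_length(364354, 367296))  # 2943
```

The result matches the Gene Length row (2943 bp). The strand (here `-`) does not affect the length, only the orientation in which the sequence is read.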
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC ATCCACGTCT TCAACGTTCG GTGTTTGACC CCCGCTATAT CGGGGCAATG ATTCTAGTTA CGATCCACGG AGTCTCTTCC GACGCCCCCG TCTGTGAGGG CGACCCGGGG CAGTGCGAAA GTGGCATTTT GGCACCGTGT GGTCTCTACT TGGCCCCGTC AACGAAGAAT CCGGAAACTC TTACACTCCA TTCGGGTGTA GACCGCGACG CACACGAGCT TGTGGGTGAG CCAGACATTG CTCTCCCCTT TTTCGATCCC AATAAGAACG AATGGAGCGC ATGGCACGAT ATGGTATGGA ATATAGACGT GCTAGATGGC CTTATTCTAG AGAACTCTTT TCTTTCGGAG CTGCTACTAC CCGGAGTAGG AACTTTACCA GCCTGCTCGG TCTTTCAGGG GGAAAACGTT CGACTGAAAA GAAATCACGT TATAGACAGT TTAGATGTGC ATCGAGCGAA GGATGCGACA GCCGGCTCGT TCTCGTATCA CCATGGCGTG ACGTACGAGA CTGTACGGTC CATGGCGGCA GGAGAGGAGC TTTTCTTAGA CTGCTTAGGT CCACCTCCGC CCTTCCGAAG AGACAAAGAA AAAGACGAGG ACGATGGTAA CGGTGACCTT GACAATGATG TTTACGAGGA TGAAAATGGC GGCGGTGAAG ACGATGAAAT CCGGTCACTT GAATGGCTGC AAGAAAACGG CGTATGCGTC GACAACATCT GGATTGGTCC TTCAACCAAA CTGGGAATCG GCAATGGCGC CTTCACCAAG CGAGCAGTAG CGAAAGGGAC TGTGATCGCT CCTTCGCCTG TGCTTCACTT GGACCGTTCC CAACTGCAAA TTGTCGAACA GCGTTTTCGG GAGGACCCTT TTCCTCCATT TTTTCGAGAA CATGGTGTTG AATATTCAGA TTACGTTGTT GGTCAACAAT TGGCATTGAA CTATTGCTAT GGTCATCCTG ATTCCAATGT TTTGTTGCTA CCTTTAGCGC CCGGGATCAA CTTCATCAAC CACGATGCCA TAAGTCCCAA TGCATTCGTT CGGTGGTCAA CTTCATTGAC GGAACCATCT GACTGGCTAG AGGAGACCGC GCACCAGCTG TTCGCAGAGT CTGTCGACGG GACACTGCTT ATCGAATTTG TAGCATTGCG CGAAATTGCG GCTGGGGAAG AGATATTTAT TGACTATGGG GAAACATGGA GCACAGCCTG GAATAGCCAT GTAAAGGAAT GGACATTCGA TGGTGCGAGC TACATATCGG CTGCTAAGTT TGAAGATCTG TACGGAAACG ATGCTATTCG CACGCATATG GAACAAAGCA AGAATCCCTA TCCAGACAAT CTGACAACTG CCTGCTACTT TGTCGCTATT GAGGTGGACG ACGAAGAGGA GCTAGTCGAG TGGGAAAACG AAGCGCTTCA TTGCTTGCGA CCATGCAGTA TCAAGACTCG ATATAAAGAA GATGGTATTA CCTTCTATAC CGCTATTGTT TATCCTTTGA AGAGCCCTGC TGAGCCACAG TATTGTGGTG AAATCCCAGA TTCGGGATTG TTCGTGACCG GGATACCTCT CCAGGCTGTA AAAGTTGTGG ACAAGGCGTA TTCATCTGAT GTCAATCAAA GAAATACTTT CCGACATGAA ATTGGTATCC CCAAAAGATT CTACCCTTCG AATTGGATGT CTGCCGACAG CCGACCTCTA GGCGATTTTT CTCCAGACCC GTTGAAGCCC GGTGAAATGG CCGAAATTCG TTGGGCTAGT TCTGGAGACG TAGCTACAAA ATGGGCCTAT 
CGCCTTGGTC TCCACGAAAG TATTCGCAAA ACACTGTTGG AATATTGCGA TAGAATGGGG ATTACCGACA TATTTCGGCA TGTAACTACA AGAGACAACG CACTCCTCCC CGGTGCCGAC AAAAATTTAG AGCTGAATGG ACATAATTGG TTCTTACAAC GGCCGGACAA GAAATGGCGG TCAAATCTCC ATTGGCTGAG TCCCGGTGAC AACGCTGCAC ATGAAGACTA CTTGCAAGCC TTGAGTGTCT CAGGGTTTGA CACAATTCTC AGGGGAATTG GAGAACAGAT GGGCATGGAC GGCCTAGTGG CATTCCACGT TACTTTCATT GCTGTGTCAT ATTCGACTGA AGGATACATG CACTACGATG TCACCGCGAC AGGGGGTAAA GCATACAACA TCATAATTCC TCTTATTTTG GCCAACGAGA CGGGTCCCGA ACTAGACTTA CGGAGCTCAT CAATACTAGG AGAAGATGAG ACCGAGTCTC TTGTCGGAAG ATATCGATAC GAGTACGAGG TGGCATCTAT GCTGGGTGAC GACGCTTACC ACGCTACGTC GGCGGTGGAC TATCGGGCCA GTAAGGAAAT GCGAATGGCA GCGACCATCT ATGTTGCGGA TGTCAACGAG GAGAATGCTG GTGCCATACT GAACGAGTAC ACACAGGCTT ATCCGCCCGA TGACCGGGAT CTTCTGATGA GTTGGTCTGG ACGACACTGG CGAAAAGACG ATACAACAGC AAAGCTACCT GCTCCTGTCA GCGGCCATAT TCTCCTTGAG GCGAATACGG ATAACACTAG CTAACTGGCC AAATCGCTTC ACAGTCAAGC CGCGAGCAAG CTGTTACAGA ATCCTTTGCA TCTTTACCCG AACTTAAGTA CGCACTGCAG CATTGTAAAT ATCTATCCTT GAAAATATGC CGCTGTATAT AGCTTCCTAA CCCGTTTATG GTGCACTTGC TTTTGCAAAG AACAAAGAAA CCTGCAGGCA TAGCTAGGAT ACATATGACA CAATCAAGAA CGAAGTGAAA ACAACAGAAG GGAGCACGTA TGAGTACTCA ATATCTCACA GTAACCACTT GTTCTCTTTG TGCTCTTGGA AGTCCTTACC TCACTGACCG ACGAAAAAAT TATTTATACT TTT
|
Protein sequence | MQQHPRLQRS VFDPRYIGAM ILVTIHGVSS DAPVCEGDPG QCESGILAPC GLYLAPSTKN PETLTLHSGV DRDAHELVGE PDIALPFFDP NKNEWSAWHD MVWNIDVLDG LILENSFLSE LLLPGVGTLP ACSVFQGENV RLKRNHVIDS LDVHRAKDAT AGSFSYHHGV TYETVRSMAA GEELFLDCLG PPPPFRRDKE KDEDDGNGDL DNDVYEDENG GGEDDEIRSL EWLQENGVCV DNIWIGPSTK LGIGNGAFTK RAVAKGTVIA PSPVLHLDRS QLQIVEQRFR EDPFPPFFRE HGVEYSDYVV GQQLALNYCY GHPDSNVLLL PLAPGINFIN HDAISPNAFV RWSTSLTEPS DWLEETAHQL FAESVDGTLL IEFVALREIA AGEEIFIDYG ETWSTAWNSH VKEWTFDGAS YISAAKFEDL YGNDAIRTHM EQSKNPYPDN LTTACYFVAI EVDDEEELVE WENEALHCLR PCSIKTRYKE DGITFYTAIV YPLKSPAEPQ YCGEIPDSGL FVTGIPLQAV KVVDKAYSSD VNQRNTFRHE IGIPKRFYPS NWMSADSRPL GDFSPDPLKP GEMAEIRWAS SGDVATKWAY RLGLHESIRK TLLEYCDRMG ITDIFRHVTT RDNALLPGAD KNLELNGHNW FLQRPDKKWR SNLHWLSPGD NAAHEDYLQA LSVSGFDTIL RGIGEQMGMD GLVAFHVTFI AVSYSTEGYM HYDVTATGGK AYNIIIPLIL ANETGPELDL RSSSILGEDE TESLVGRYRY EYEVASMLGD DAYHATSAVD YRASKEMRMA ATIYVADVNE ENAGAILNEY TQAYPPDDRD LLMSWSGRHW RKDDTTAKLP APVSGHILLE ANTDNTS
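The relationship between the Gene sequence and Protein sequence rows can be spot-checked by translating the leading codons. The sketch below is an illustration only: it assumes the standard genetic code (the Translation table row in this record is blank), and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence row.
print(translate("ATGCAACAAC ATCCACGTCT TCAACGTTCG"))  # MQQHPRLQRS
```

The output matches the first block of the Protein sequence row (`MQQHPRLQRS`). Passing the full gene sequence to the same function would be the natural next step, provided the standard code is indeed the right table for this organism.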
|
| |