Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47985 |
Symbol | |
ID | 7203215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 620992 |
End bp | 623881 |
Gene Length | 2890 bp |
Protein Length | 932 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182434 |
Protein GI | 219124277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCTC GCTTTTTTGT TTGTTTCGTC ATACTGTCCA GCCCGCTCAC TGTCGCCACG GCATTGTCGA CCTCAATTGT GGATAATGTT TCCAATGCAT CTCTTCGGCG TTGCAACCGT GCCAAAGGCA GTCGACTTTT TCTCTCTTCG GATTATGATG AAAATTTTAG TTCGGGGCAC GGTGGGACCA ATCGACGAGA GTTTCTCACT TGGGGGCCAG CGGCGACTAC GGCCTTGTCG GTCTTCGACT TTGGTAGCCC TCGCAACGCT TTCGCTGCAA ATTTGTTCGA TTCGCCTTAT TTGCCCAAAG CAAACGAAAA AGATGTAGAG CGAATACCGT ATTCCTCGGC CCGTAAGTAT AAGACAGTGA CTCTAGCGAA TGGCCTCCGT GCATTGCTTG TTAACGACAA ACGCGCATTT CGCTCATCAG CAGCTCTAAG CATTGCTGGG GCGGGACAGT TTGCCGATCC CCAAGATTTG CCTGGACTGG CGCACTTGAT GGAGCACATG ATCTTGAGCT TCAAATCAAG GTCAAGCTTT GGGAAATCCA AGGACTTTGA GGATTGGCTG GCTGACGTGG AAGGTGCGTC CAACGCATTT ACTGCATATG ATACCGTCTG CTTTCATTTT TCTTGTCCAG ACGTGGCATT GTCGGAAGCG TTAGATCGGT TCTCGGGATT GTTTTTGGAA TCGAATGTGG TCCAGGTCTG TCGAACAGAA AAGACTGTTC GACGTGAGAT TCGGCGCGTG GATTCAGAAC TTGATTTTGA GGATCCAAAT ATCCAAGCTC TTTATCTCGC GAAAGACTTC GTCAATCCTG AGAATCCGTA CTCACGATTC TCAGCGGGAA ATTTGAATAC GCTAGAGCGG ACGCCAAAAG AGCTGGGGGT TGATGTTGGT GAAAGACTAA TTGAGTTTTT CAGGAGTCGA TATCGTCCTG AGCAAGCCGT CCTCGTTGTT GTCAGCCCCC AGGATTTCTT TACAACTGGA AGATGGATTG CACCGTTTAG TAGCATACTT TCTCGTTGGA GAGTAACTGA CGTCACAACG TCAAGGAAAA ACATCTATCC TGGGGGTTTC CTGTTAGGCA ATAGGCTAAA GCATATGGTC CTCTATTGCA AAGGCGAGTC CCAGACCGAG AAGATATCAT TTGAGTGGAC CCTTGGTCTC GATTATGCTG ACATTGGATC AGCAAGCAAA CCTTTTGTGA CTGCCACGCA AATAGGATTC ATTTTATCAC AAATAATTAG TCGCAGAGGG CCAGGTTCTC TATATCAGTA TCTCCTACGT CGAGGCTGGG TCCCAAGCGG TAACACAGGC GTACCTCGGA TCACAATACC ACTCGAAGTC TCAAGATTCC AAATTATCAA GATCGAAGTG CTACTCACGA CGGAAGGCTA CTTAAATCGA TCGAGTGTTG TCGCCGCCGT ATATGATTGC ATTGATTTAA TCAAGAAAGG ACAAACGTTT AAGCTGCCTC GAGAGCTCAT AAGCCAGTAT GTGAATGTGG CCAAACTTTT TGGCTACACA CTGGCACCGA GACCACCCGA TGCCATAGAA TTAGCGATAG ATGCGCAGAT ATTTGGCGTT GGTGGTCCCA ACAATGTTGG CTCCGGAAAG TGGTACCGCT TCATAGCTAT CGACGAACGG GGCGCTACCG CTTTGCTCCA GAGAACCATA TCCATAGCTC TTGCAACAAT GAGCGATCCT TCCAACGCAC TGATTATCGC AAGCGCTGGA AATAACGCTT TAAAACAAGA GGGAATGCGA TTACCCCTCG ATGGAAAGCC TTCAACACCT TCGTCCAAGT GGCTATCTGA ACCTGTCACA GGTGGTCTTT TTTTCTTCGA CGACATGCTA CGTAGAAGGG CTGGGGTAGA GACTTTGATT CTATCAAAGC TGGTCTTTGA TGAGGAACTT GTTCGACCAG TTTTCAACCC ACTGATTCCC ATAGCGCTTC GACCGCCAAG GAATGTAATA GAACCAGAAG GCTTTTCCAA CACCAACGAG CGTCTGACTT TTGAAACGGA TCAAGACGTA TCGTTGTCGA AGGATGGGAA TTGGAAAATT TTGGGTGCCG AACCCGGACA GGTTGGCCTT CCGCTTCCTC GCTGTCCGCC TGAAACAACC TGTCGATGCG CTTTCACTCT CCAGTTACTG TCACCTCGTC CAGCGAGAGC AAATGTACGA CAGGCGGCCT TCGCCGAACT TTGGAAACTT TCTTTAGAGT TGGCGATTAC GGATTTGGCC GAGCTCGGAG CGCCGGGGAG CCTGGCATAC GATATCAGTT TTAACAAGTT CGGTCTCCGC TTGACATTTC TAGGTATCAG TCAAAATATT TCGTCCTACG CGCGGAGGAT TTGTCGTCGT TTGGTAAAAC ACCACTTCAC TCTTCTGACT GGCCCGGAGA TGCTGCCGCC GTCTCTAACA GCGATGGCAG TTTCGACTGC AAGTCGTGTA CCGAATCTTT CGCAACAGAG ACGAAGTCCT ATCGTCAGCA ATTTGCGTCG TGCGACGTCT TATGATGCTG CAACCGAAGG CATTGCTTTC CTGCGATCTT GTTCGGGCGC TGTTTGCTTT GCGGAGGGGG ATCTTCTTCA GTCCGAAGTG GTAGCTTTGC TTAGTGATCT TAATGATATT TTCTCCGAAA GCATCGGCTC TGGTTCGCGT TCTTCCGCTG CAATACCTAC AATCAATGAC CTTGTCTACA AACCGGTATG GAAGCCTCGC TTTGCTTCTT CATGTGCCGT CCCGGGAGTG GCGTTAGTGT CAGACGCGTG CGGCCGGTTA CCGAGATAGC ACTCTGGCAT TTGATCTAGT TTTTCAGCGT TTGGATGCTC ATAATATGTA TCGAGTGTAC AAAACATGTG AAAACAAGCC AGATCTGCTC
|
Protein sequence | MDARFFVCFV ILSSPLTVAT ALSTSIVDNV SNASLRRCNR AKGSRLFLSS DYDENFSSGH GGTNRREFLT WGPAATTALS VFDFGSPRNA FAANLFDSPY LPKANEKDVE RIPYSSARKY KTVTLANGLR ALLVNDKRAF RSSAALSIAG AGQFADPQDL PGLAHLMEHM ILSFKSRSSF GKSKDFEDWL ADVEGASNAF TAYDTVCFHF SCPDVALSEA LDRFSGLFLE SNVVQVCRTE KTVRREIRRV DSELDFEDPN IQALYLAKDF VNPENPYSRF SAGNLNTLER TPKELGVDVG ERLIEFFRSR YRPEQAVLVV VSPQDFFTTG RWIAPFSSIL SRWRVTDVTT SRKNIYPGGF LLGNRLKHMV LYCKGESQTE KISFEWTLGL DYADIGSASK PFVTATQIGF ILSQIISRRG PGSLYQYLLR RGWVPSGNTG VPRITIPLEV SRFQIIKIEV LLTTEGYLNR SSVVAAVYDC IDLIKKGQTF KLPRELISQY VNVAKLFGYT LAPRPPDAIE LAIDAQIFGV GGPNNVGSGK WYRFIAIDER GATALLQRTI SIALATMSDP SNALIIASAG NNALKQEGMR LPLDGKPSTP SSKWLSEPVT GGLFFFDDML RRRAGVETLI LSKLVFDEEL VRPVFNPLIP IALRPPRNVI EPEGFSNTNE RLTFETDQDV SLSKDGNWKI LGAEPGQVGL PLPRCPPETT CRCAFTLQLL SPRPARANVR QAAFAELWKL SLELAITDLA ELGAPGSLAY DISFNKFGLR LTFLGISQNI SSYARRICRR LVKHHFTLLT GPEMLPPSLT AMAVSTASRV PNLSQQRRSP IVSNLRRATS YDAATEGIAF LRSCSGAVCF AEGDLLQSEV VALLSDLNDI FSESIGSGSR SSAAIPTIND LVYKPVWKPR FASSCAVPGV ALVSDACGRL PR
|
| |