Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46850 |
Symbol | |
ID | 7204700 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 620020 |
End bp | 622396 |
Gene Length | 2377 bp |
Protein Length | 745 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185748 |
Protein GI | 219121033 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00101234 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGCGG TCGCATTGAC TGCGCTCGTC GTACTGACAA CGACCGAAGC CTTTCAGACA CCTCGCTTTC ACTCGCTCGA CGTTACGAGT AAGCTATTTT CCGCTCGAGT ACAGGATGCG AGTTCGGTGA TGCAAGATGT ACGGGCAGAG CTGGCCAAGA ACGAAGACGC CAATCTCATG CTGCAAGCTT TGCGGGGACA AAACTTGAAC GACGACGACT CCGCTGTCGC TGGACTCCAG ATGCGTCTGG TGGATATTGC ACCCAAAGAA AACGAGGCTC TACCCTTTGA CTACAACCCC CAGGCACTGA AAGAGTTCTT TTCGAAGCGA CCTCTTGCTG TTGCTACTCG AATTCTGCAG TTGCTGTCGG TAGGAGGAAT CTTCGCGTTC AACACCATTT TTGATCAGCT TCTGGGCCGT GTCAAGAACA ATCCGGATTT GGAAGTTCAA CGAGCGGCGG AACTTCGCGA TCTGATTACT TCTCTGGGTC CCTTCTTTAT TAAGATTGGG CAAGCGTTGA GTATCAGACC TGATGTGTTG AGCCCACGTT CAATGGTCGA ATTGCAGAAG CTATGCGACA AGGTCCCCTC GTTCGATTCA ACGATAGCCT TTGCAACAAT AGAAGCAGAG TTGGGACGTC CGGTGGAAGA CATATTTTCT GAGATCACCC CTGAACCGGT GGCGGCCGCT AGTCTTGGAC AGGTATACAA AGCTGTTTTG CGGGATACTG GCGAAACAGT GGCCGTGAAA GTCCAAAGGC CTTCGGTTTT GGAAACCGTT TCTCTGGATT TGTATCTTGC CCGCGAACTC GGTATACTTG CTCGGAACAT TCCCGCACTG ACAGATAGGT TAGACGCTGT CGGTTTACTG GATGAGTTCG CGTTTCGCTT CTATCAAGAG CTAGACTACA ATCTAGAATG TGAAAACGGT ATTCGTATTG AGAAAGAAAT GCGTGTTTTG CCGATGGTAG TGATTCCCAG AAACTACCCG CAGTACACAG CGCGACGAGT TCACGTGGCG GAGTGGATTG AGGGTGAAAA GCTTTCACAA AGCAAGGCCG ACGATGTCGG AGCTTTGGTC AACTTGGGTG TAATTACCTA CCTGACACAG CTTCTCGACT CGGGCTTCTT TCATGCTGAC CCTCATCCTG GTACGTTCTC TGAAAAGTTT CTGTATACAT CTGTTCATCA GTTTTCTCAT TTCCAAATTC TTGTATAGGG AATATGATGC GCACTACTGA TGGTAAACTA GCGATTCTCG ATTTTGGGCT AATGACTGAG GTCACAGACG ATCAAAAGTA CGGGATGGTA GAAGCTATCG CTCATCTTCT CAATCGTGAC TACACAGAAA TTGGACAAGA TTTCATCAAT CTCGACTTTA TTCCCAAAGG GACCGATACA ACTCCTATTG TCCCTGCATT GACGAAAGTG TTTGATGTTG CTCTGGCAGG TGGCGGGGCT AAGAGTATCA ACTTTCAAGA ATTGTCGGCC GATTTGGCAC AGATTACATT CGATTACCCA TTCCGCATTC CTCCATACTT TGCATTGGTC ATCCGGGCCA TTTCTGTTTT GGAAGGAATT GCTTTAGTCG GAAACCCAAA CTTTGCCATT ATTGATGAGG CCTACCCATA TATTGCTCGG CGCTTAATGA CTGATCGATC ACCGCGTCTC CGTGCTGCTT TGCGCTACAT GATATATGGT CGTGAAAATG AGTTTGATGC AGAAAATGTT ATTGATTTAC TGCAGGCAGT AGAAAAATTC TCTGCTGTAA GGAATCAAGG TGATGGAACA GCTTACAAGG TGGACGGTGT TCGAGGATCG AAGGCGGTTG GTTCTGCTGG AGATTTTCGT GGATCACAGC AGGTGGATAC TAGCGACCGA AACACAAATA TCGACGGCGG ACGATTTCGT ATATCGTCGG AAATGGGTGT GAATGATGTC GGCGAGCTGG CAACGGCTGA CTCCCGAGAT CCACTCCAGG TGGTGGAAGC CAAAAATGAT GAAAGAACGG TTCGGGAAGG CTTAAGATTC TTCTTTAGTC CGGAAGGTGA GCCCTTCCGG GAATTCATGC TGGAAGAAGT TGTCACAGTG GTTGACGCAT CAGGACGTCA GGCCGTACAG GAATTGTTTC GTCGCGTAGG TCTCGGAAAC GTGCCGGTTC CTGCTTTTTT CCGCAGACTC AGTCCGGAGC TTACCGATGC CGACCGTCGT ATAGTCCAAC AGATTGGCAA ACTAGTACAG TTTCTTTTGG GGGACTTTGA AGGTACCGTG AACAATTCTG ACACGAGCGC CCGGGTCCGT AAGCTTATCC CGGTGATACG GGAGTACGCC CCACAGTTAC GGGATTTTGG GACTTTGTTG GTAGCTCGGT TAACTGA
|
Protein sequence | MYAVALTALV VLTTTEAFQT PRFHSLDVTS KLFSARVQDA SSVMQDVRAE LAKNEDANLM LQALRGQNLN DDDSAVAGLQ MRLVDIAPKE NEALPFDYNP QALKEFFSKR PLAVATRILQ LLSVGGIFAF NTIFDQLLGR VKNNPDLEVQ RAAELRDLIT SLGPFFIKIG QALSIRPDVL SPRSMVELQK LCDKVPSFDS TIAFATIEAE LGRPVEDIFS EITPEPVAAA SLGQVYKAVL RDTGETVAVK VQRPSVLETV SLDLYLAREL GILARNIPAL TDRLDAVGLL DEFAFRFYQE LDYNLECENG IRIEKEMRVL PMVVIPRNYP QYTARRVHVA EWIEGEKLSQ SKADDVGALV NLGVITYLTQ LLDSGFFHAD PHPAILDFGL MTEVTDDQKY GMVEAIAHLL NRDYTEIGQD FINLDFIPKG TDTTPIVPAL TKVFDVALAG GGAKSINFQE LSADLAQITF DYPFRIPPYF ALVIRAISVL EGIALVGNPN FAIIDEAYPY IARRLMTDRS PRLRAALRYM IYGRENEFDA ENVIDLLQAV EKFSAVRNQG DGTAYKVDGV RGSKAVGSAG DFRGSQQVDT SDRNTNIDGG RFRISSEMGV NDVGELATAD SRDPLQVVEA KNDERTVREG LRFFFSPEGE PFREFMLEEV VTVVDASGRQ AVQELFRRVG LGNVPVPAFF RRLSPELTDA DRRIVQQIGK LVQFLLGDFE GTVNNSDTSA RVLTGFWDFV GSSVN
|
| |