Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47258 |
Symbol | |
ID | 7202349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 101936 |
End bp | 105334 |
Gene Length | 3399 bp |
Protein Length | 969 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181660 |
Protein GI | 219122662 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGTCTCCTCG GGAGAACGCT CGTCCCCGCA CCCGTCACGA CTCTTGGGTG GCATTCCCCT CCCTCCCCCC TCACTAGTAG GAGCACGAAA TACCCAAGCG TGCCTAGAGT CGTCCGCGTT TCTCCTCGTC ATCGTTGTCG CGTGGGTTGT CCACTAGTGA TTTGTTCTGT TGCGTGTCGT TCGTTCCGGG CCGCACCCAA AGGACGTGAT GGGGTTTGAG CTCACGCAAG TCCTATCGGA CGACAAGCAC TACAGTGTGT TTGTGTGCGG GTTCTGTCAG AATTTGGTCG ACCTCGATGC GGTCGTGACC GCTCCGTGTT CCCACGCGTC GTGCCGGCAG TGCTGGCACG TGTGGGTGGA ACAACACTGT CGGCAACACC GGCCGGCGGA ATGTCTCTGT CCCACGTGCC GCGTCAACGT GACGCTGCCC GACCCAGTGA CACCAGGGAC GACGGCGCTC CGGATACACG GGGTCAACCT GGCTCTGCAA CCACTCGCAC AAACGCAGCC TTTGGCGTTT CGAACACTCC AACAAGTGCA AGTCGCGTGT TTGCACGGCA ACCGTTGCGA TTGGAGTGGA GACTACGGGG ACGTGTCGGC ACACGCGGAA GGTCACGCGG CGGAGGAAAC AAGATCGGCC GACACGACGG TCGCACACAC ACCCGGACGG CCTTCGTTGA GGGACTCGCA GAGTCATTCC ATGTCCGCCT TGTTCTGTCG CCAAAAGGTA CAGGAGAGGA GGAACACAAA TCCACGACCG GTGGTGGCTC GGACGCACGC GCCGGCTACG CCGCAGCGCA AGAGCCGGAG CTTGCGCGGA CTGCTCGCGG GGGAACAGCT AGTAGACCCC ACCGCGTGGA TGGGTTCGAC GGAATCCGCG ACCAACGCCT CCGCGGAAGA CAGTATGCGG ATGGTACCAA ATGGGCACAA TGAGCTTCGA TCACCATCGC CGGTGGACAC GGATGTATCC AATACTCCCG ACGATCCGGT TACTCCCGGA AAGGCATCGA GCTGTGGTGA CCACGGAATG GGTGAAAGGT GGGCCACCAA CGAGGCCGAC CATGGTCTCC TCAATACTCC ACCGCGAGAA GATGTGGCGG ATCGGGTACC CGCCTCGCCT CAACCAGAAT CTCCCCGCTG GACCACCAAC GGAATCAACG ACTCCATTTC GACGAATCCT TCGCCGGCCA AAGCACCCAT TACTCCCAAA CGAGACGAAG CGCCGAATCT CGTGAAACGC ACCAAGTCCG AGGAGAAGAC CGATTTCAAG CGGAAATTGA ATCGAAGGCA ATCTCACGAC GAACAAAACG ACGACCTGGA AGTTTCTTAC AGTCAATCGA CGGACTGGAA TATGTCCATC AACAGCCTTG GTGCCAACGT TTCTGCTCTG ACCAACGATT CTGTATCGCC AATGGAAACC GTGAACGAAA TCGACGAAAT CGACTTCAAG CAGCTTTTGG ACTCTGCAGG GGTGGCGGAC GGGGCCCTTT TTTCTCCAGA GCGGGTAAAG AAAATGATCG ACAAGGCAGA GAAACTGAAG AAACAAGCCA ACGCCAAATT CAACAAGGGA GATTTAGTCA ATTCTCGTGA GCTGTATACG GACGGAATAA AAATCATGCG AAAGATTCCC ATGGAGTCGG AACAACACAA AGAACTAGTT TCACAGATGT ACTCCAATCG AGCAGTTACT TACTTTCGAG AAAAGCGTTT CGACAGTTGT GCGTTGGACT GTGACAAAGC GATCGAACTA CTGCCAACCT ACGAGAAGTC ATGGATTCGA AAGTGGAGGG CACTAATGGC TCTAGGTGAC TTTGAAGCGG CCTACAATTG TCTAGAAACA GGGTCGAGAG TTGTCCCGGA CTCTCGTCGT ATTCAAGCAG AACTGACCAA GACTCAAGGC GAAAAGGAAT TGCTCTTCGA GGCAAAACAG GCACTCGATA TTGGGGACTT TCAGAGGAGT AAAGACATTT TGAAGCCGCA CGCAAGGACG TCAGACAACA TAGGACTCTT GTTTCTCGCG GCCAAAGCCG ATGTAGGCCT TGGCAACGTG GAGTCTGCCT TGGAAAAGAT CAACAAGGCA CTGAGGTTCA ACCCTACTCA TTCGGATGGA CTGGAATTGC GAGGGCACAC TCTGTTTCTT TCGGGAGATA CTGAAAAAGG CGTCCACATT TTGCAAGAAA CATTTAACCG AGACAAACAG AATGGCAATC TGGAGACTGA ATTAAATCGA TGCCAGAATA CCCATGTCGC CATTACCAAA GGCCGCGCTT CTGTAAAGCG TGGCCGCTAT GCCGAGGCTG CCGATTTTTT TTCCGCTGCT ATTAAGGAAA CTGGACTGAT ACCAACTCGT TGTCCATTGT TCGAGATGCT TCGAACAGAA AGAGCAGAAG CGTGGCTTTT GTCAAAAAAG TACCTCGAAG CACTCAAAGA CTGCCAAGAA GTTATTTTGA TTCAACGAGA AAACGCTACT GCATGGACCG TTCGTGCGGA AGTTCTAGCT GCGTTAGGGA AGCCAGAGGA AGCTAGACGG GAGTTGTTAA AAATCAAACG GACCTGGGGA GCAGAGAACC CGACGATTGA GGAAGGCTAC AGACGTGTAG ACTTTGAGCT TCGTGTAACG ACGGCAGATA AGAACCTTAC GGAGTTTATA CATCAGCTTG AGATGGGTAA CACCAATGTT ATGTCCATCA CAGCAAATAT GGACTGCGAG TCAGAGAGCA AAGCTGATCC TCGAGCACCA ACGTCGAGAT CCGATCATCG ATCGGCGCGG AGTCGTAGCA AGGTTCGAAG CCAAAGTGAC GGCCATCGAA GTAGCAGTCG GGCACGAGGT GACAGTCACC AGAAAGATCG CCGAGGCGAG AGACGATTCA GCACGGGTCG ATCCTCCCCT TTCCACGACC GAAAGGCAAG GGAGAGGCGA AGATCTGCTG GATCGGGCGG TAAGGACGAT GAACGACATC ACAATGGCCA GTCACGGGCG GCTGTGAAAA TTTGCGACAG TGCAAAGGTC TCTCCAGAGG GAAAGGTCGC CAAGAGTTCA AAAGAGATAC TGGAGCGCAC TCAAAGGGAA TTGAACGAAG AACGGAAGAA CAGATCGCGA TCGCAAAGTA TCAAGTGAAC ATGACGCAGC GGCTTATCCT TTGCCCTGCC CAGACCGATC CCGAAGTATC AAGCGAAAAA AATGTAACTA ACTGACTTAC GCTGTCGCTT GACGCGTCTC TACCACAAGG TTTATTTCTA TACCAGAAGA TTCCTCCCAA TCCTTTTGCC TGTTTCGTAA GCAGGAACAG CGGTTTATAA ATCCCGCTTG GTTGAGCTCA CTGTCGGAAA GAAGCGTGCA GCTTAGTGTC TCCGCCATGG GTTTCGGCAA TTAATAGTAA GCAAGGTAGT TTCAATGTC
|
Protein sequence | MGFELTQVLS DDKHYSVFVC GFCQNLVDLD AVVTAPCSHA SCRQCWHVWV EQHCRQHRPA ECLCPTCRVN VTLPDPVTPG TTALRIHGVN LALQPLAQTQ PLAFRTLQQV QVACLHGNRC DWSGDYGDVS AHAEGHAAEE TRSADTTVAH TPGRPSLRDS QSHSMSALFC RQKVQERRNT NPRPVVARTH APATPQRKSR SLRGLLAGEQ LVDPTAWMGS TESATNASAE DSMRMVPNGH NELRSPSPVD TDVSNTPDDP VTPGKASSCG DHGMGERWAT NEADHGLLNT PPREDVADRV PASPQPESPR WTTNGINDSI STNPSPAKAP ITPKRDEAPN LVKRTKSEEK TDFKRKLNRR QSHDEQNDDL EVSYSQSTDW NMSINSLGAN VSALTNDSVS PMETVNEIDE IDFKQLLDSA GVADGALFSP ERVKKMIDKA EKLKKQANAK FNKGDLVNSR ELYTDGIKIM RKIPMESEQH KELVSQMYSN RAVTYFREKR FDSCALDCDK AIELLPTYEK SWIRKWRALM ALGDFEAAYN CLETGSRVVP DSRRIQAELT KTQGEKELLF EAKQALDIGD FQRSKDILKP HARTSDNIGL LFLAAKADVG LGNVESALEK INKALRFNPT HSDGLELRGH TLFLSGDTEK GVHILQETFN RDKQNGNLET ELNRCQNTHV AITKGRASVK RGRYAEAADF FSAAIKETGL IPTRCPLFEM LRTERAEAWL LSKKYLEALK DCQEVILIQR ENATAWTVRA EVLAALGKPE EARRELLKIK RTWGAENPTI EEGYRRVDFE LRVTTADKNL TEFIHQLEMG NTNVMSITAN MDCESESKAD PRAPTSRSDH RSARSRSKVR SQSDGHRSSS RARGDSHQKD RRGERRFSTG RSSPFHDRKA RERRRSAGSG GKDDERHHNG QSRAAVKICD SAKVSPEGKV AKSSKEILER TQRELNEERK NRSRSQSIK
|
| |