Gene Information |
Locus tag | PHATRDRAFT_44788 |
Symbol | |
ID | 7199747 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 245866 |
End bp | 249198 |
Gene Length | 3333 bp |
Protein Length | 967 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178733 |
Protein GI | 219115876 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACCGTGAA GCAAAAAGTC TCATACTTCT TACAACGCCT GATCCAATTG TCGCTTCAAG CCCATCCGTG AATAGAAACA AATATAAATT GCATCCTTTG CTTTCCATCA ACTCCCCATG ACTGGCGATG ACACCACAAC GACTCCTCCG GTTGTCCCAC GCGAAGCCGT GCGACCCGTA TCATCCACAG AGAACGCGGG GAACTATCGT CCGCGCTCCC GGCAAAACAG TCGCACACGT CAAAACAGCC GGGATGAAAT GATCACGTTC AATCCAACAA CGGCCGCTAG TATGAACAAT CGCCGGGTGC TCTTTTCGGA AGCCGCACGT CGTGATGGAG GCGACAATAA TGTTGCCGCT CCGGAAATGA TGGGACCAAT TTTGCGACGC AAAAAGGTCG TGTCCATGCC GCTTGGTGCG GGAGGTAGGT CAACCTGACT TCGAGGAGAG AACGGTAAAA ACCAAATGCT TTTTGGTTGT CTAACGGCAC ACCATGGCCT TCTGCATTGC AATTCACAGG AGATTTCTTT GTGAGCAACG ATGTCGGAGG TCCACATCGA CAAATGCTGG CTCAGTACAA AGGCGTACAC GGGGACAAGA CTACTTTCAA GGGTCGTACG AAGCAAAAGA GCGTCGGCGC CGCTTTTGTA ACACCTGCGG ATCCGGACCC AACTACTGTT ACTTTGCCAC AAGCTCGTAT CCGCCAAGAC AGCGGTACTC GTCGGGATTT GAGCCACGTG ATTAAAGGTG TTCTCCGACG GAAACAAGTC AGTTTGGACT TGCTTGGTCC CAATGACTTT TTTGCACTAC CGACGGATGA ACCAGAAGAA ATGGTGTTAT TGCCAATAAT GCCCGAAGAC GACACAGCAC AACTGGACGA CGAGCAAGGT AGTATGACGA CGGAGAAGCG TCTATTTCGT CCTTTGCGGT TTGCCCCTCG TATTGGTGTG CGCGGTAATC GTGGTGGCGG ATACGGAGTT GGACACGCCG TAGAAGCGGC TCCAGAAACA ATTCCTGAAG AAGCAGTCGA CGAAGAGATG GTAGATGCTT TTCACCGATA CTCCAAAACA AACCAGGACT TGAGGCTACA AGTTATTTCA CTCAAAAACA AACTTGCCAC CAAAAGCGAT TGCTCGGCGC TTATGGAAGA GGCCGACAGT ATTTTGCGAA CGCATTCAGA GCTCAACCCG GATGAACAGC TCCTCAAAAG GAAGATCATG AAGGTTCAGG GGTCAATTGA TGAAGAGCAT GAATGCAGCG AAGGCATTGA CAACAACATT AAAGACCTTG ACAATTTGGG AATTATTCGG GCTACTCGTC GCGATTACAT CAAAACCGGC ATTTTGTTTA TGATTATGGT TGCCTTGACC ATCACAGTCT CAACCTGGGA GACTCACTTG GACGAAGAGA GCTTCATTTT CCGGCACGTC GGTTTAGCCT GCGTCACGGA ATGCCGTGGA AACTTGCTTA CACGTGATTT CTTTCACGGC CACAACCAAT TCAACGACGG AGATGTCATT GAGTTAATCA TGCACATGGA TCCAAATTCA CTTGCCGAAA CAATGGGAGC GTTGGCATTA GTACAGATTG TGGGAACGGA AACTAATGAA ACAAAAGCAA TGACGACTTT CGGACCGACC GCGGAAAACG ACCGTGAAAC CTATGATCAT CGTCTGGTGG TAAATTTTGA CCGCCCGCAT GAGCCGCATA TTATTGTTGT GAACAGTACC AAGCCCAATT TTGAGTTGTC GTTTACTTTG ACGGCGCGCC TACTGGCTCC GTTGGCTGAC AACAGTGTTG 
CAATTGCAGC AGTGATTATG GTGGTTGTGT ACCTTTTTAT TCTGCTGGAA GTGATCCACC GTACGCTAGT CGCTATTTTT GGCTCTATGG TGGCCCTGAT GTTTCTCTTT GTGATGCAAA ATGGCGAAAC GGAAAGCATT CGTCAAATTA TGTTGAACTT AGAATGGTCT ACGCTGGGTC TTTTGTTTGG AATGATGCTC ATTGTTGGTG AGCTTTCTCA TACAGGAGTG TTTGAGTGGT GTGCTGTCCG TTTGCTCATG GCAAGCAACG GATCATTTAC TCGCTTGATT GTGTTGCTTT GTGCTCTAAC CGCAGTAGCC AGTGCCTTTT TGGATAATGT GACCACTATG CTTCTGGTTG CACCAGTGAC GATTGACATG TGCAATATTT TGGGTGTTGA TCCTCGACCA TATTTGATTG GGGAGGTGCT GCTTAGCAAT ATAGGCGGAA CAGCAACATT AATTGGTACG TTACGTGATC AAACAGAATA GATGCAATGT TTCATTCTAC TTACCCCCGG GCACCTTTTG TGTACAGGTG ACCCTCCAAA CATTATCATT GGAAGTTCGT TTGACGAAAT TGGCTTTGTT GATTTTATCG TGAATGTACT TCCTTGCATT TTTCTTCTTT GCATCCCAGT CTCTCTTGGG CTGGTGGTTT GGGTTTACCG GTACTATCTC ACTACAAGCA CCATGAAAGT CCTAGATACA GCGAAACTCA AGACTGCTTA TCCAATCTAT GATGAGCCTC GCCTTATGAT TGCTGGCACC GTTACTGCTT TTGTAATCAT AATGTTTTTC CTGCATCCGG TACATCACAA AGACACTGCG TGGATTGCCC TCCTTGGAGC GTTTATTACT ATTGCATTCA CCAATCCACA CGACGTGCAA GATGCGGTAT GTCCAGGGTA CTTAACGGAG TTTTTTGGGA TGGATTTCTC ACAGCATTCC CCATCTCCTT TCTTTTAGCT CCGAAACCAT GTTGAGTGGG ACACCCTTCT ATTTTTTGCG GGATTGTTTG TTTTAGTCGA GGCATGTGCA GCAATGGGCT TACTCGAGGA AATTGGAAAC TTGCTTGGTG ACTACATTCA GGCACAGGAA GAGAGCAAAC AGCTTACGCT GGCAATAACA TTACTTATGT GGGTCAGTGC GATAACGTCG GCATTTCTCG ATAACATTCC TTACACGGCG ACGTTGATAC CAGTGATTCA GATCCTTGCT GATAGTCTAC CTGATACGTT GCCAATCGAA ATATTAGCGT GGGCTCTTTC CTTTGGTGCC TGTCTAGGAG GCAATGGTAC TCTCTTAGGA GCAAGCGCCA ACATTGTGAC GGCAGGAATT TCAACAAACA AAGGATTTGA GATCTCTTTT TTAAACTTCC TTTATCCTGG TATGCTTTTC ATGATTGTAA CAGTGGCAAT ATCAAACCTG TATATGTTGG TGCGATACTC ATGGATCTAA GAGAAGGATG GTAAGCCGAC AACAATCCTC GTCTCTAAAT ACAACTAATT TAAAGACGCA TAACGCGCTG TCA
|
Protein sequence | MTGDDTTTTP PVVPREAVRP VSSTENAGNY RPRSRQNSRT RQNSRDEMIT FNPTTAASMN NRRVLFSEAA RRDGGDNNVA APEMMGPILR RKKVVSMPLG AGGDFFVSND VGGPHRQMLA QYKGVHGDKT TFKGRTKQKS VGAAFVTPAD PDPTTVTLPQ ARIRQDSGTR RDLSHVIKGV LRRKQVSLDL LGPNDFFALP TDEPEEMVLL PIMPEDDTAQ LDDEQGSMTT EKRLFRPLRF APRIGVRGNR GGGYGVGHAV EAAPETIPEE AVDEEMVDAF HRYSKTNQDL RLQVISLKNK LATKSDCSAL MEEADSILRT HSELNPDEQL LKRKIMKVQG SIDEEHECSE GIDNNIKDLD NLGIIRATRR DYIKTGILFM IMVALTITVS TWETHLDEES FIFRHVGLAC VTECRGNLLT RDFFHGHNQF NDGDVIELIM HMDPNSLAET MGALALVQIV GTETNETKAM TTFGPTAEND RETYDHRLVV NFDRPHEPHI IVVNSTKPNF ELSFTLTARL LAPLADNSVA IAAVIMVVVY LFILLEVIHR TLVAIFGSMV ALMFLFVMQN GETESIRQIM LNLEWSTLGL LFGMMLIVGE LSHTGVFEWC AVRLLMASNG SFTRLIVLLC ALTAVASAFL DNVTTMLLVA PVTIDMCNIL GVDPRPYLIG EVLLSNIGGT ATLIGDPPNI IIGSSFDEIG FVDFIVNVLP CIFLLCIPVS LGLVVWVYRY YLTTSTMKVL DTAKLKTAYP IYDEPRLMIA GTVTAFVIIM FFLHPVHHKD TAWIALLGAF ITIAFTNPHD VQDALRNHVE WDTLLFFAGL FVLVEACAAM GLLEEIGNLL GDYIQAQEES KQLTLAITLL MWVSAITSAF LDNIPYTATL IPVIQILADS LPDTLPIEIL AWALSFGACL GGNGTLLGAS ANIVTAGIST NKGFEISFLN FLYPGMLFMI VTVAISNLYM LVRYSWI
|
| |
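The derived fields in the record above (gene length, GC content) can be checked against the coordinates and the sequence text. The following is a minimal sketch in Python, not the method used by the database: the helper names `clean_sequence`, `gc_fraction`, and `span_length` are hypothetical. It assumes inclusive 1-based coordinates, which is consistent with the listed start, end, and length. The page does not say how its GC percentage was computed or over which bases, so a value computed this way may differ slightly from the listed 47%.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks used to group the sequence into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # Coordinates taken from the record: Start bp 245866, End bp 249198.
    print(span_length(245866, 249198))  # 3333, matching the listed gene length

    # First two blocks of the gene sequence field, used only as a short demo input.
    demo = clean_sequence("CAACCGTGAA GCAAAAAGTC")
    print(len(demo), round(gc_fraction(demo), 2))
```

To check the full record, paste the complete contents of the Gene sequence field into `clean_sequence` and pass the result to `gc_fraction`.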