Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49562 |
Symbol | |
ID | 7198228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 47901 |
End bp | 51461 |
Gene Length | 3561 bp |
Protein Length | 1186 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184292 |
Protein GI | 219128171 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.585326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGCCC AAGCAGATAC GGTTGTCGAA AAAGCCATTC GCGACGGCTC ACCTATGATG AAACGATCGA AGACGACAAG CATGACACCC ATTCCCCACG AAGAAGACCC CTCGAATTCT TCGTTGAAAG ATACAGTAGC GGGAGACTCA GATGCGGAAG AACCTGAAAA GGTGAAGATG TTTCTGGGGA GTTTTGACGA CCTCAACAAA AAGCCTCGCC GCTCATCGCC AGGTGGGGAA AACGTTTGTA TGTACGATGA AGACGAAGAT CTAGATAAGG CATCGATGCA TTCGACGCGG TCTTTTCGTA GAGCCTCAAT GGGTGGACTT CCAAGTGAAA AAACGTACAT CCGACGGAGT CTCGTTTCGA ACGATGATGA CGAACGATCA ATACACTCTG CGCGATCGTC CCGTTCCCGG CGCGCCTCGC TGAACGGTCC CATTCCTGCA AGCATGCTAG GAGGCCTCGG TCCAAGTGAT GGTTACGGTA CAGACGACGA TGATGATGAT ATAGGAGTAT CGCAGCACTC ATTGCGACGC CGTGCGTCCA TGAGTCGTCG AGGCTCGGCG GGATTCGGAG CCCTCCCGCC GGGGCGTGTG CACGACGATC CCTGCGGTCC TCCCCAACCG GTGAAACGTC GCACTTCAGC GGGGAACAAC CCGCGTCGTG TATCTCCAAA CTATAATGCA GAGAAATTGG AAGGTGCAGG AACCAATGCG TCCCGTCGCA GGTCGGGGGG GTTTGGTACG GAGCATCGTG GATCGACTGG TTTCGAGACC GGAGGTACTC CCCTTGAATA CGCTACCCGG CGGGCTTCGG CCGGAAGTAT AAACAATAGT AGGTACGAAA GTGTAGATAC AGAAGCAGGT CCGGTGAACC TTCAAGGGCG TCGGGGCTAC GTTGGAAGTT CGAGTCGCCG GGGCCCAATC AGTAACGGTC CATCGGAGAC GGCAACACGC ATGGGGTCCA CGGCAGGTGG ACTTCTTCAT CAAGGAGGGC ATCGGGGACA CGATGGGAAC ACAAGTCGGC GAGGCCCCAT CAGCGGCGTA CCACAAGAGC ACCTTATGGC CACTCAACGT AAAATCTTTG CCCGTACGCC AGTCACGCGC AATGACACCA CCAACAGTGT TGAACTAGCA AGGGGTCGAC GTGCATCAGC CGAGCTTGAT CGCAACGCCT TTGCTCGGGG AGCACCTCCC CGAACAACTA CCGCAGACAG CATGGAATTG GCTCCCGGGC GGCGAGCCCT GACCACATCG TCCGAAAAGA GTCAATTTGT TAGAGGATTG CCCGCCCGAT CCACGACGGC CGACAGTATG GAGCTTGCTC CAGGACGTCG AACAGTGAGG GTGTCCCCAG ACCACGTGGG TTTCTCGGAA GAACCTATGC TGTGCAGCCA AGCTACTGAT AGTATCGAAC TTACTCCAGG TGGACGGGCT TCTCTCGCTA TGGACAAGTC CGTCCGCATC CCGGTGATGC GGACCAACAC TGCCAACAGT ATGGATCTCA TTCCTGGACG CAGGGGGGTG AAGGATTCGC TGACAGAGGA AACGGAGATG ATGTATGAAC AGGTCAATAC TGATCGTCAG ATGCCGCCGA GTCGTAGTGC GGCTCTACGC CGTGGACGGA GAGTATCGTC CGGCTTCAAT GAGTCCGCTT TGGATCGTAT GAACCAGGCT GCTTCTTGTG ATAATCACAA TGAGGAATCG GAAGAATCAG AGCCCGATTA TGGCTACGGA GACGGGATGG CTCATCCATA CGGATATTCA GCAGATAATT ACCTATCCCG GAGGGAAAGT ACTGCCTCCC AAGTCTCCAG CAACGCAGTC ATGAGCCGTC GAGGCAGCAA CTATTCTCAA TCTAGCGGTA TTGTCAGCCG TCGCGGAAGC AATTATTCCC AGTTGTCGGC ACAGCGCACC CGACGTCGCA ACAGCTATTT GGTGCGCCCA GAACGAGATT TCAATATGGC GGACGAATTG TTGGATAGCG CTTTACGCAT GTCGGATAGC GACCTTTCCG AAGACGAGCA CAAAACTTTT TCCAAAGATG TTCCTAAAGC CTTCTACAAC TACAATGGAG AGCAGTCATT GAACACACAA AGTGCATTTG GATATGATTC CGACGCCCAG TCCATGTCCA GTGCTGACTC GCAAAATTCA TCCAAATCCT ACTGCCGCAA CAGCTACTTG GTCCGGCCCG AGCAAGATGC GGCTTTTGCA AGTTCGCTCT TGGCAGCGAC GCCACGGATG TGTGAAAGGT TAGTCGCTAC TGAGCATGCA TCAATTGAGG AGCATCTAAC CGAAGATGAT CAATCCGAAG AAGAAGAAGA AGATTTCAAA AATCCGGACT CCCTTCTCAG TCGCAGTGGT TACCTTAAGG TACACGGAAC GCAACACCAT TCACACTTGC TGGAAGCGGC CGTTGAGGTA GAACTAAAGG AGTTGAAAGA CGGAGAGCAC GAGAATGACC AGCCTATGAG GTCTGTCCCT ATCGCGGCTG GACAAACCTC GGGAACGCTC GTCGGAGCGC GACTCCAGCC TACTAACGAA CAAGTAAAGC ATCAACCACA GAACGAATCC AAGGAAAAGG ATGCTTGGAA TCGTCGCACT CCCTTTACCA CTTCTCTTGC TGAGTCTTTG ACTGCGGATA AGATTAACGA ACTTCAGAAG ATTCAACCCA AGATCTCTAC GGAGAATATA CCAAAGCAGT ACATGCGCTC TTGCAAAATG AATTGGGGAG AACTCACCCT CGAGAGTAGC GATGAGTCAT CAGAAAATTA CTCCGAGTCT AGATTTGGCT CTAGATTGTA CGAAGATTCT CCCTCAAATC AAGAACTAGA CGGCTACAAA CAGCGAGCGC GTCGTCGTGA ATCAATCGAA CGAACCGTTT TAAATATTGA AAAGTCCGCT TCCTCCTCTC TCCTGCATGG CAAGAACACT CCAGATTTCA AGCCTGCAAC TGGCTGTGTA AACGCTTCCG ACTTCATTGT CCGTTGCTTC TCTGCTCGCC TCCGAATGGG GATCACAGTG CTCAAGCACA ACCGATCTCG CTGGAGTAAG TCGTCTAATC GTGATTTGGT ACTCCTTGAT GGTCGTACAC TTAGCTGGAA ACCTGTCGGC GGAGAGCAGG ACAAAGGAAA GCGTCCTCGA TTAGATATTT CCAAGTGCAG GGAGGTTCGT CATGCCTGGA GTCGGGATCC ATTGACACGA AAGCAGACTG GTACGATTAC TCTACGGAAG CGTTGTAAAG ACGGCATGGC GAGCAAGTCG TTCTCTCTTA TATTTGGAAA ACGCACTTTA GATATCACAG CCATGACAAA CGACCAATGT AAGGTTTTGA TGGAAGGCTT TTCGGCCCTT TGCTACCGAT TGCAATTAGA ACAGCTGGAA GAAGGCGACA CTCACTCGGA ATCGCGTGCT GCCCGTGGGG GCGATTCCAA GTCCATGACA ACAGACGACG ACTGGAACTC TACTGTATTT GGTGACTCAA CCATGAGCTT GACGCAAAGC AACACAACCG GTCGGACTCT TCCAATACCA GCGTCGCCAT GGGGGCTATG A
|
Protein sequence | MYAQADTVVE KAIRDGSPMM KRSKTTSMTP IPHEEDPSNS SLKDTVAGDS DAEEPEKVKM FLGSFDDLNK KPRRSSPGGE NVCMYDEDED LDKASMHSTR SFRRASMGGL PSEKTYIRRS LVSNDDDERS IHSARSSRSR RASLNGPIPA SMLGGLGPSD GYGTDDDDDD IGVSQHSLRR RASMSRRGSA GFGALPPGRV HDDPCGPPQP VKRRTSAGNN PRRVSPNYNA EKLEGAGTNA SRRRSGGFGT EHRGSTGFET GGTPLEYATR RASAGSINNS RYESVDTEAG PVNLQGRRGY VGSSSRRGPI SNGPSETATR MGSTAGGLLH QGGHRGHDGN TSRRGPISGV PQEHLMATQR KIFARTPVTR NDTTNSVELA RGRRASAELD RNAFARGAPP RTTTADSMEL APGRRALTTS SEKSQFVRGL PARSTTADSM ELAPGRRTVR VSPDHVGFSE EPMLCSQATD SIELTPGGRA SLAMDKSVRI PVMRTNTANS MDLIPGRRGV KDSLTEETEM MYEQVNTDRQ MPPSRSAALR RGRRVSSGFN ESALDRMNQA ASCDNHNEES EESEPDYGYG DGMAHPYGYS ADNYLSRRES TASQVSSNAV MSRRGSNYSQ SSGIVSRRGS NYSQLSAQRT RRRNSYLVRP ERDFNMADEL LDSALRMSDS DLSEDEHKTF SKDVPKAFYN YNGEQSLNTQ SAFGYDSDAQ SMSSADSQNS SKSYCRNSYL VRPEQDAAFA SSLLAATPRM CERLVATEHA SIEEHLTEDD QSEEEEEDFK NPDSLLSRSG YLKVHGTQHH SHLLEAAVEV ELKELKDGEH ENDQPMRSVP IAAGQTSGTL VGARLQPTNE QVKHQPQNES KEKDAWNRRT PFTTSLAESL TADKINELQK IQPKISTENI PKQYMRSCKM NWGELTLESS DESSENYSES RFGSRLYEDS PSNQELDGYK QRARRRESIE RTVLNIEKSA SSSLLHGKNT PDFKPATGCV NASDFIVRCF SARLRMGITV LKHNRSRWSK SSNRDLVLLD GRTLSWKPVG GEQDKGKRPR LDISKCREVR HAWSRDPLTR KQTGTITLRK RCKDGMASKS FSLIFGKRTL DITAMTNDQC KVLMEGFSAL CYRLQLEQLE EGDTHSESRA ARGGDSKSMT TDDDWNSTVF GDSTMSLTQS NTTGRTLPIP ASPWGL
|
| |