Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43785 |
Symbol | |
ID | 7197057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1476032 |
End bp | 1477798 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177841 |
Protein GI | 219112179 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000378872 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTTTC GTCCTTTTGT GATGGTCATT TGGCATTCTT GTTTCTTCTT TATGGAGCTA ATTTGCTCTG TATCTAGCCT GCCCCCAACA ATAGTCCGGC ATCGTCCTAT GGGTATTGGC TTGTTGAACG AGCATCGTCG TAGTCATCTT CGTCGGACAG CGCGTGACGT GCACTCAAAG GATTCTACGG GAAAGTACTA TTCAGATATA TTGGGCCGGC AAGAACCTAA CTTGCCTGAA AAGCAAATAA GCGACCGTCT AAAGTACGCG GAAGGTATTC AGGAGGTTCT GGCAAAACCC ATGCACGTGG GGACTTCGCT AAAATTTCGA CCTACTGCGT ACGATTTATT TCAAGCTGGA ATTGTAGGGA TTTTCACGGG ATTTTCTGTC GCACTGTTCA AGCTTTCAAT CAATGCCGTG AAGAGCCTGT GTTACCGTCA AATTTTTTTT CAAACAAACC CAGTCTTGAT GGTCACTGTG CCGGCTATGG GTGGTGCAGC TGTCGGAGTC TTGATGCTTC TCGGGGATTT CCCTCCTGGT CTTCGCGGAA CGGTTATTGA AGTAGATAAA GAATCCCAAG GTACCGTCCA GAAGTTGCGA GATCGTGTAC AGACTCAATT TCGCTTTCTG CGAAAGTCTG CTGCGGCCAC TGTCACTTTA GGAACAGGGT GTAGTCTTGG GCCTGAAGGG CCGTGTGTCG AAATTGGAAT GGATGTGGCG CGCAGCTGTA TGGATATTAG CCGACGCACA GCAGAGCGCC AGCGCCTATG GAACCGTATG CTCTTGTCTT GCGGAGCAGC TGCTGGCGTT TCGGCAGGCT TTAACGCACC GATCGCGGGT ACTTTCTTTG CGCTGGAAAT TATGCACCGA ATGTTTTCTT CAATTGATGG TGAGGAAAAC GCAGACAAAG ATGCCTCCGC CGGGCTGAGT TCTCTGAACA CGGCGACCAT TGCTCCCGTT CTAATCGCTT CGGTCATGTC AGCTTTATGC GCAAGAACTC TACTCGGAGA CCATCTCGTA CTAGCCCTAG GCGGTTCTTA CTCTCTCAAA AAGCCTCTGA TTGAATTACC GTTGTACATG GTTCTCGGGC TCGTATCCGG AACCGTTTCC TTTGCTTTTA GCCGAGCTGC CAACCTCAGC CAAGCTGTGT TTGTCGGGGA TTATGGAAGC GATCGCTTTC GAATGGGAGT GCGTAGCCTG TCACCTGCGT TCAAGCCCGT CATTGGTGGC ATTCTTTGTG GGCTCGTTGG AATCAAGTTC CCGCAAATCC TTTTTTTTGG ATATGATTGC TTGAACCCAC TTCTAGCCAA CAACTCTTTG CCAACACCCC TACTTCTTTC CCTCCTGGCA GCAAAGATAT CTATTACAGC AATTTCCGCT GGCTCTGGAC TAGTCGGCGG CACTTTTGCT CCGTCGCTAT TTTTGGGAGC AGTAACTGGC GCTGCATTTC ACAACATTGT TTCGAGCATT CTCTATTGTG GCCTTGGCCT GAGTGCTGCT TCAGGACCTT TACTTGCCGA CGTCCCGGCC TATGCCATGG TAGGAGCGGG ATCTGTACTT GCTGCTCTCT TTCGAGCACC TTTGACAGCT TGCTTGCTTC TCTTTGAAGT AACCCGCGAC TATGACGTTA TTCTCCCATT GATGGCGAGT GCTGGCTTTG GCAGTGTCTT CGCGGATGTT TTAGATGGAA AGTTCAGCAG AGCTCAGAAG AGAAGAAGGC TTCGTCGAGA TAAAGATGCA GTATCTTGGG GCGACCTGTC AAGCTAG
|
Protein sequence | MYFRPFVMVI WHSCFFFMEL ICSVSSLPPT IVRHRPMGIG LLNEHRRSHL RRTARDVHSK DSTGKYYSDI LGRQEPNLPE KQISDRLKYA EGIQEVLAKP MHVGTSLKFR PTAYDLFQAG IVGIFTGFSV ALFKLSINAV KSLCYRQIFF QTNPVLMVTV PAMGGAAVGV LMLLGDFPPG LRGTVIEVDK ESQGTVQKLR DRVQTQFRFL RKSAAATVTL GTGCSLGPEG PCVEIGMDVA RSCMDISRRT AERQRLWNRM LLSCGAAAGV SAGFNAPIAG TFFALEIMHR MFSSIDGEEN ADKDASAGLS SLNTATIAPV LIASVMSALC ARTLLGDHLV LALGGSYSLK KPLIELPLYM VLGLVSGTVS FAFSRAANLS QAVFVGDYGS DRFRMGVRSL SPAFKPVIGG ILCGLVGIKF PQILFFGYDC LNPLLANNSL PTPLLLSLLA AKISITAISA GSGLVGGTFA PSLFLGAVTG AAFHNIVSSI LYCGLGLSAA SGPLLADVPA YAMVGAGSVL AALFRAPLTA CLLLFEVTRD YDVILPLMAS AGFGSVFADV LDGKFSRAQK RRRLRRDKDA VSWGDLSS
|
| |