Gene Information |
Locus tag | PHATRDRAFT_49209 |
Symbol | |
ID | 7195518 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 231167 |
End bp | 232964 |
Gene Length | 1798 bp |
Protein Length | 526 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183834 |
Protein GI | 219127213 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
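The Gene Length value above is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That convention is an assumption here, not something the record states. A minimal sketch of the arithmetic, using a hypothetical helper name `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp taken from the record above.
print(span_length(231167, 232964))  # 1798
```

Because the gene is marked as spliced, this span covers the whole gene region and is not expected to equal three times the protein length.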
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0370698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGCGTTCGA GCTTCGCTGG TATCCGAACC ATGACTTCCA AATTTCTATT CTCCGTTCTC GCGCTTAACC ATGCGTTTGC TTCGGCAAGT CCTGATTTTC TTATGATCGG AAACTCATAC TACTTTACGA AACAATTTGG TGAATCAACT CAAATCACTT CTTCGGGAAG GGCTTTCTGA TAACCGTGTG GCATCTAGTT TAACTGCCAG CGGAGGTAAG GCGCTGACAC AACATCTTCA GGATGCCCAG GGCGCTAACG GCAATGACAA AGATCTGCGT CAGTGGCTTT ACCTTGATCC GAAGCCCTTC GAGTGGGTCA TTCTACAGGA GCAAAGTCAA ACCCCAGGCT TTTATGGATA CAGTAGCTTC ACGACAAGTT TAAACGCCGC GGTTGGCCTG AACGAAATGA TTTCCGACGT TGGCGCCAAA ACTGTATTTT ATCAAACTTG GGGACGTCGT GACGGTGACA GCCGCAATGC CTGGCTCTTC CCAGATTTCT CTACCATGCA GGACCGTCTT GATGAAGGAT ATGGTCGCTA CAAAGCGGCG ACGAAAAACT CCAAATTGGC ACCGGTAGGC CCAGCGTTCC GTATCATATA CGATACTTTG ATTGAAGCGG AGATAGACCC TTTGAAATCA GAAAGTGCCT TTCACTCTTT GTACAGCTCA GATGGCAGTC ACCCTTCTGT CACTGGATCC TACCTTGCCG CTTGCGTCTT GTACTCCACC ATGACTGGCA AGGATCCACA AGGACTGAGT TACCGGCCTA GTGGCGTGTC AGAGGCCCAA CAGGCTATGC TCCAGAATGT TGCAGCTCAC ACTGTGCGAG ATGCGCCGTT GGTGAAGGCA CTGATTGCTC CTACCGTCTT TGCACCACCA GACAATGAAG TAACTGACGC TCCTACTAAT TCGCCAGTGA AAACTAGTGG TCCTTCACAG GCTCCGGTCA AAACCAGCAC TCCTACTCAG ACGCCCGTGG AGACTGACGC TCCTACTCTA GCCCCCGTGA AAACAGACGA TCCAAGTCAC TCTCCAGTTC GCCAGCCGCA ACCAGTGCCC GTAGCAGGAC CGTCTTGGGA CAAACGATGC GATCAAATGG TATCTGATAG TGACTTTGAG TCCGGCCTCG AAAGCTGGAC TGCTCAAGGT GCTGGTAAGA TCGAAAGTGT GTCTCCTGGC TACAAATCTG ACAAAGCCTT AGCTTCAACG GGAAGACTCC GTTATTGGAA TGGTATTGGC CTCGGCATTT CGCGCCGAAA CTACAATGGA TGCGTGGAGG CGGGATCCAA GTGGGAAGTG AGTCTGCAAG TCCGTCTGGT CAATCCTGAA ACCGGCAAAG GTGTTTCCTG TGATCGCAAT CCCACAAGAT TTACACCAGC GGACAAAAGA GGCTCCGGTT TTTGCCCCGC GGTGACCCTT TACCTACGTG ATGGATCCTG GCGACTAGGC AAGTTTACAC TGCGTGACTA TACCTCTAGC TGGAATCCCA GCGAGTTCAA CGAGCTACGA TCTGTATTTG AATTTCCGGC AGGCTCTTCT CAATGGAACG CTGATATTCG GAATTTTATC ATAAAGATCG ATCAGGCAGA CTACGACCTC GAGATGATTG TCGACGACTT TTCGATGAAA CGTATCGGCT AAAAAGGAGC CGCAACTCTT CTTTTCCATT CATTGTCGTG TATGGAAAAC AGCCATGTAC AAGTAAAATC ATCGCATCTT TCAAGGAATA GTGCTATTGT GTCTCAAACC GTTTTGACTA ATTGCTCAAT AGTGCCTTTT TCATTACCGT CCAAGAAA
|
Protein sequence | MTSKFLFSVL ALNHAFASAS PDFLMIGNSY YFTKQFGLSD NRVASSLTAS GGKALTQHLQ DAQGANGNDK DLRQWLYLDP KPFEWVILQE QSQTPGFYGY SSFTTSLNAA VGLNEMISDV GAKTVFYQTW GRRDGDSRNA WLFPDFSTMQ DRLDEGYGRY KAATKNSKLA PVGPAFRIIY DTLIEAEIDP LKSESAFHSL YSSDGSHPSV TGSYLAACVL YSTMTGKDPQ GLSYRPSGVS EAQQAMLQNV AAHTVRDAPL VKALIAPTVF APPDNEVTDA PTNSPVKTSG PSQAPVKTST PTQTPVETDA PTLAPVKTDD PSHSPVRQPQ PVPVAGPSWD KRCDQMVSDS DFESGLESWT AQGAGKIESV SPGYKSDKAL ASTGRLRYWN GIGLGISRRN YNGCVEAGSK WEVSLQVRLV NPETGKGVSC DRNPTRFTPA DKRGSGFCPA VTLYLRDGSW RLGKFTLRDY TSSWNPSEFN ELRSVFEFPA GSSQWNADIR NFIIKIDQAD YDLEMIVDDF SMKRIG
|
| |
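The sequences above are shown in space-separated blocks of 10 characters, so the spaces need to be removed before lengths or base composition are computed. The sketch below shows one way to do that. The function names `clean_sequence` and `gc_percent` are hypothetical, and the GC calculation is a plain count of G and C over the total length, which may differ from how the database derives its GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (simple count, an assumption)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the protein sequence from the record.
protein_prefix = clean_sequence("MTSKFLFSVL ALNHAFASAS")
print(len(protein_prefix))  # 20

# First three blocks of the gene sequence from the record.
print(gc_percent("CTGCGTTCGA GCTTCGCTGG TATCCGAACC"))
```

Applying `clean_sequence` to the full Protein sequence field and taking its length gives a value that can be compared against the Protein Length field.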