Gene Information |
Locus tag | PHATRDRAFT_49500 |
Symbol | |
ID | 7195840 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 424978 |
End bp | 428197 |
Gene Length | 3220 bp |
Protein Length | 1011 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184250 |
Protein GI | 219128080 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
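The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that a single stop codon follows the last amino acid; the variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
start_bp = 424978
end_bp = 428197
protein_length_aa = 1011

# Assumption: inclusive coordinates, so both endpoints count (hence the +1).
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is the protein's codons plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that do not encode the protein
# (for a spliced gene, presumably intron sequence).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)  # 3220 3036 184
```

Under these assumptions the computed span matches the reported gene length of 3220 bp, and the difference from the coding length is consistent with the gene being flagged as spliced.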
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.268799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGGAAGCTA CTGCAGATCC TTTCGCCATG GGAATGACAA TGCTACCGCA CATTCCTGTG CATAACAGGA ATACGGAGTT CCTATCGTAG TCACATGGTC GACGGACGAG AGGATGGTTT ATCTACAGGG TCCGTCGGAG CAAGCGGCAA TCGTTTGGGA TCCTTCCCTG ATCCACCAAC GTTCCGAGAC CGCTGTATAC TGGGATGATG ACATGCTTCA ACCCTGGGTA AAGCGCCACG CATCCCAGCC GTCACTGAAA TTTCTGTCGT ATCATGCCTG TCAGGAAATC CATTGTTGTA TAGTGAACAT TGTTCGTCGG ACGCTTGAGG ATTTACTACA GAAAGTAGGA GACGATTGTG CTATGGAAAG AAGCCAAAGC GAAATACATC CATTGTCGGT TCCGGTCGCC GTAGCGATTC CGGAAGGGAT TCTACTGCCA ATAGCGATCG AAGCCGTCTG CTGCTTGAAC GAGCCTTTTT TTATTGGAAA CCGTTCGTGC TCTGTCGTTT TGGTTCCTTT GGAACCAACC GAAGGCCGCG AGCGACTCCG GGACATGATT TGGGATTGTC GCCCCGCCTT AATTCTCACA ACGTCCGTCT GCGACACCGA TCGTCTCAAT AACATAGTGT CGACGGATTG CCGACCCAAT GCATCCGTGG CGGATGAGGG TACAGCTACA TTGTCGCACC CCGCTCTTTA CCGCGCCAAA TCGATTCAGT TTCTCAATTT GCAACAACAT ATATTGGACT CGGTCGGTGA CGGCAACGAG AAAGCACACG CCGCTTCTAC AGATCTTCCG TCCGAAAGAG TCACGGAATC TCTGGATCGT ATTTCCCATA TTGTTTATAC CAGTGGAAGT ACTGGGGCTC CCAAAGGCTG TGTTTCGTCA ATTCGGGCTT TGCGTTCCTA CCTTTCTTCA AAAAATACCG TACACAACGT GCTTACCGCG TCAACAGTCT TATTGGCCAG CACCATTTCA TTCGATCCCT GCTTTTCCGA CATTTTGGCC ACTTTTCAGA TTGGTGCAAC CCTAGCGATT GCGCCAAGAC GCACGTTACG GGAATCGCTC ACGCACGTAT TGCACTCGCT CCAAATTTCT CACGTCTTGT GTACACCAAC CTTGTGGAGT ACCCTGGCCT TGACAGGGAC CCGGCCAGCT GATCTTCCCA GCCTCCGCAT GATTGCCCTG GGTGGCGAAC CTATTCCACT AGCCATTGTT CAGGCCTGGG CTCGTGCTTT GCCGGATGAT CCCGTACACT GTCGTTTACT GGCCACGTAT GGAGTGACGG AAGCTTGTGT ATACCAATCA GCCGGAGAAG TATTCCGGTT GGACTGTGGA CAGTCAAAGG GGCAAGATGT TGGTTTATTG CTCCCGGGAA TGCGCGTTTC TATTTGTGAC GAATCGATTC AGGAAAGCTT GACCGAGGTT TTGCCGGCAG ACATTGCCGT TGGCGAGGTA GTTTTGTCCG GTAGCCAGCT TGACAGCGTT TGCTCGTATT TGAACCGTCC CGCACTCTCG ATCTCAAAAT ATCTGAAGTC AGAACGTCAT TGGCATTACC GTACAGGTGA TCGTGGATAC ATTGACAGCA AAACCTTACG GTTACACATC ACAGGGAGAA TTAATGGTGA AGACGGCATG GTAAAGATCA ATGGTATTCG AATCGAATTG GGTGAAATAG AAAACGCGTT GGTCGGCTCA ACTGCAGCCC TCGCTACAGT TTTGGACGCC ATGGTAGTTC CTCATGTACA CTGCATAACA GCCACGGATC TCGTTGCCTA CGTTGTCTTG GGAGGAGACT 
GTCGACAGGA AATGGGTGTA AAGGGCACGA TATCTTCGGA TGGAGTACTA CTCCCTCCAT GCCCATTGAT GGTTTTGTTA CGACATCGTT GCAAACTGAA AGCAAGAATG ATTCCGGCAT TTTTTATTAT AATTCCAAGA ACACCGCTGT CGCCGACCGG AAAACGACAT CGTGCTGGAT TGCCGCCTCT TGAGGCTGCT GTGCCGTTCT TTTCCATACT GAGACAGGGA GTAGATGCTA TCTCCCAGTA CCTCTCGGTG CGTACGGCAA GTCCGGTTCC ATGGTTGCGA GTCACATCGT AGACTGTTTG AATCTACAAT ACAACCAGCA AGCCTTGCTC ACGACGGACG CATCGTTTGC CATGCTCGGA GGAGACTCGC TAGCTGCCAC TCGCGTTGTT CGTGCGCTGT ATGCCGCACA TCACTGCGTC CACAACAGTC GCCATCTGGG AGGAGAGTAT GGCGTAATGG AGGGACCTTT TGATGCAGTT TACTTGATTG GCGCGGACAA TCTGGGAAGC TACGTAGACT GGCTAGATCA AAACAAGGTG TGTCAATCTC CGAACGTGGT CACAAAGCCG AGCTGTGACG ATCCCGTCCG GGATGCGATG CCTACTTCAT CCAACATTTC CCCACCAACA GTCCTAGAAC AGGAAGAATC GCAACTCTAC GACGCCTTGT TTCAATCCGT TACTCAAGGA CAAGTAGCCA TTGCAATGGC GTTGCTTTCG GTTGGCGCCG ACCCAAATCA AGGAGGACAC GATGGACGGC TGGGTAAAAT TTCGAGTCGC AACGATCAGA AAATAATCTT TCGCTCGAGT CCGTTACATC TTGCTTGTGT TAAGGGTATA CCGCTTTTGG TGGAAGCGTT GCTCGCTGAA GGCGCTAGAA TGAATTCACC GGACGCTTCC GGCTTATTTC CGTTACATTT GGCGGCTGCT GGCGAAGCCA GACGCGAGCT AGAAGGTGAA GGTCAGGCGG CGGATGATTG TCGTCGTCTG GAATGTGTCA AACTATTAAT TGCCGCGGGG ACACCACTGT CCATGAAAGA CGGCAGCAAG CAAACGGCTA TACATTGCGC GGCTCGAGGG GGTCATGTTG CCACATTGAG TTATGTGCTG AAAGAATGGC ACCGTCTTTA CGGGACGGAT CCCGAAAAGT GCCACGGGGT GAACTGGCGA GATCGATGGC TGCGAACTCC GGTGCACTGG GCAGTCTTGA ACGGCCACGT AGATGCCTTG GTTGTGCTTC TGCAGCACGG ATGCGATTCC AATCCTCCCC AACCCAAAAT GAACAAACGG TCCAGTGCGG CCATTGAAAG CCCCCTACAG ACGTGTGAGA GGTTGTACGG ATCGACGCCC TTGGGCGAAC GTATCAGAGA GCTGCTACTC GCTGGTAAAC GAGGAATTTT GCGGAGGTGA
|
Protein sequence | MVYLQGPSEQ AAIVWDPSLI HQRSETAVYW DDDMLQPWVK RHASQPSLKF LSYHACQEIH CCIVNIVRRT LEDLLQKVGD DCAMERSQSE IHPLSVPVAV AIPEGILLPI AIEAVCCLNE PFFIGNRSCS VVLVPLEPTE GRERLRDMIW DCRPALILTT SVCDTDRLNN IVSTDCRPNA SVADEGTATL SHPALYRAKS IQFLNLQQHI LDSVGDGNEK AHAASTDLPS ERVTESLDRI SHIVYTSGST GAPKGCVSSI RALRSYLSSK NTVHNVLTAS TVLLASTISF DPCFSDILAT FQIGATLAIA PRRTLRESLT HVLHSLQISH VLCTPTLWST LALTGTRPAD LPSLRMIALG GEPIPLAIVQ AWARALPDDP VHCRLLATYG VTEACVYQSA GEVFRLDCGQ SKGQDVGLLL PGMRVSICDE SIQESLTEVL PADIAVGEVV LSGSQLDSVC SYLNRPALSI SKYLKSERHW HYRTGDRGYI DSKTLRLHIT GRINGEDGMV KINGIRIELG EIENALVGST AALATVLDAM VVPHVHCITA TDLVAYVVLG GDCRQEMGVK GTISSDGVLL PPCPLMVLLR HRCKLKARMI PAFFIIIPRT PLSPTGKRHR AGLPPLEAAV PFFSILRQGV DAISQYLSQA LLTTDASFAM LGGDSLAATR VVRALYAAHH CVHNSRHLGG EYGVMEGPFD AVYLIGADNL GSYVDWLDQN KVCQSPNVVT KPSCDDPVRD AMPTSSNISP PTVLEQEESQ LYDALFQSVT QGQVAIAMAL LSVGADPNQG GHDGRLGKIS SRNDQKIIFR SSPLHLACVK GIPLLVEALL AEGARMNSPD ASGLFPLHLA AAGEARRELE GEGQAADDCR RLECVKLLIA AGTPLSMKDG SKQTAIHCAA RGGHVATLSY VLKEWHRLYG TDPEKCHGVN WRDRWLRTPV HWAVLNGHVD ALVVLLQHGC DSNPPQPKMN KRSSAAIESP LQTCERLYGS TPLGERIREL LLAGKRGILR R
|
| |
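The sequences above are displayed in space-separated groups of 10 characters, so they need to be ungrouped before any analysis. The sketch below strips the grouping from a short excerpt of the gene sequence and translates it from its first ATG. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is blank; `ungroup`, `translate`, and `CODON_TABLE` are hypothetical helper names, not part of any database tooling.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1),
# written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def ungroup(seq):
    """Remove the spaces and line breaks used for 10-base display grouping."""
    return "".join(seq.split()).upper()


def translate(nt):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(nt) - 2, 3):
        amino_acid = CODON_TABLE[nt[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Four consecutive 10-base groups copied from the gene sequence in this record.
excerpt = "AGGATGGTTT ATCTACAGGG TCCGTCGGAG CAAGCGGCAA"

nt = ungroup(excerpt)
start = nt.find("ATG")          # first ATG within the excerpt
peptide = translate(nt[start:])

print(start, peptide)  # 3 MVYLQGPSEQAA
```

The translated excerpt matches the first 12 residues of the listed protein sequence. Because the gene is marked as spliced, translating the full gene sequence in one pass would not be expected to reproduce the whole protein; introns would have to be removed first.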