Gene Information |
Locus tag | PHATRDRAFT_50061 |
Symbol | |
ID | 7198750 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 278469 |
End bp | 281915 |
Gene Length | 3447 bp |
Protein Length | 886 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184936 |
Protein GI | 219129522 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.29853 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTGG TGGACTGTGA GTCCTGTATC ATTACTCGTA GATCAAATTC ATAGTCCTCG TTTTGACTTG TCGGACTTCT CGAAGAAATC TTGCCAGGGT GGTCCACCCG TCAGCCGACC TAAAACCTTA AATCTTTTTC TGTAAGGTTA TGATGGTTCG TATCCACTGA CTCCATCTCT CCGGTTCGAT GACGGTGACT TCGAAGAGGT ATTCCACGAC CATGAAGGAA CGACCAAAGC GATAGGACGG GAGTTTACTG GAACTCTCTT TCCAACGCCA ATCGCAGTGT AAGAGAGAGA GAGAGAGATC CCCCGTGCCC GCTTCCAACA ATTTACGTGC CCCGGTTAGA ATCTGGAATG AGGCGAGTTT CTCATTCATT ACTGCGCAGT ACTGGGACCA GCTGTAGACG AAGAACCATT CAGCAATCCT CTCTGCGCTG GGCTCAATTG TCTACAGATG CATCGTCTTT AACCGACGTC TCATCTCTAC AGAACAGACT AGAATACAGT GCCCGTCGTC AGCCAAAAAA AGTACGGTCA TCCACAACGA TTAACGAAGA AACTCGCGCA TGGAACGTTT CTACCTCGAG TGGGCGGACG TTCCCTCATG TCGAAGGACA AAATACAGAA AATTTACTCG ACATAGAGAC GTATCCGATA GGTAGCTTGA CACCGACATT GTGGAGCCAA GGCCACGAAC TCTTAGCCTT TTGGGTAAAG CAACGTACGG CTGACTCTGT CGATTTCTCG TTCCGGCTGT GGGATCGACT TTGGCAGGAG CAAGTGCATC TCGAGACGGA CTGCGTGCAG CGCTCTACAG GCGATAGTCA TCCCCCATCC ACTTGGCTCA CAACGCCTCT CAATTCGGAA ATGCTCTGTA CATTGGTCAA CAATTGGCGC GTCTGGTCTT GGAATCAGCA CCCAGACTTG AACAAGCAAA ACCCACAACT GACAGTGAAA GAGAGATACA ATGCTCCGTA CGTGCAAGCT CTGGTGGAGA GGTGTTCCTC GCTTTTGAAC GAAAAAGTCT ACTATCTCAT TCTGGACGGA ACTCGCCGCT CGGGAAATCC CGCACAGGTT GCGGACTTGG CATCCGACCT ACTGGCAGCT TCCATACACC GGTGGAAAAC TCAAATGAAC CCCTCGTGTC GTCCGTCGAC GGAATTATAC AACTCGGCGT TGCTCGCTTG GTCTCGATCT AACCGCAATA ACGCCATTAC CATGGTAGAA AAACTATGGG CGCAAATGCA AGAACACAAT ATAGCTCCGG ATTCTCGTAG TTACGAGCGC ATTATTGCGG CCCACGTTTC CTCTACGACT CCCTCGCGTT CCGAAAAGGC CGAATATTGG TTGCGCAAAA TGGAACAGGA TCACACTGTT TGCATATCGA CACGGGCGTA CACGAGCGTC ATAGCTGCAC AAGATGATTG CGCGAAAGCT GAGCAACTCT TGGAAGAACT CTTAGACCTC AAAGCACAGA GCATTGATAG TCCGAATAGT ACGGAGGATT CCCAGTTTCA TCCTGGCGAT CAGGATGTTC CCGGGGCAGT CAATGCCACG CTGCAATGCC ACATCAGAGC AGCGAATGTA GAGCGAGCAC AGGATATCTT GTACCGTATG CGGGACTTGG GTTACATCGA TGTACTTTCC TACCGCACAG TTATGCTGGG ATGGTTGAAA GCGTCGCAGC CCGTCCTATG CCAGGCAATT TTACAGTCAG CTTTGGAGTC GTATCGGGAT GGGCTCTGTG GCGTTTGTCC GTCCGACGAG CTCGTTGCCT TGGCTGTTTC GGCGTGGGCA 
AAGTCTTCGC ATCCAAAGAA TGTGGAGCAA GCAATGGCGC TACTCAAGAG TTTGACCTCG TCTGAGTGGG ATCATATTGA TATGCAGGTA ACGACATCGA CTATGAATGC ACTGCTGGAA GTTTTGATTC GGTCGAAGGA TTATGGCGAT ATTCAAAAGG CGGAAGAACT GCTCCGCCAT ATTAAGGGTT TCTCCAAGAA GGATGCGAGC AAAGCCGCGA TGGCTCCAAA CGAGTCGAGC TACAATCTCA TGATTGGCGG TTGGGCGCGA CTCGGTCAGC CGGTGAAAGC TAGAGCATGG TTGGAAGAAA TGTACAAAGA CTACCAAGAG GGAAGTATTG AAACAATGCC GGGCCTGAAA ACTTTTAATA CTGTATTGGC TTCGTATATT CGTTCCCGGG ACCGACACGC GGCGGAGAAT GCCTGGGAAT TCTTAAGGCT ATTCAGAAGC AATGCGACGA GGGTCGTCTA CCGTTTGAAC TTGACGTTTA TTCCTACACG TCCGTTTTAT CTGCGCTCAG TAACGCCTGG GATCTGAAGA AGTACGGCGA GACTGCTCAA CGAGCTGAAG ATCTTTTAAA CGAAATGAAT CATCGTTACG GACAAGGTCA AGCAAGCGTA CGACCTAATA CGATTTCGTA CAACGCCGTC ATGAATGGAT GGGCTCGTGC AAAGAATCCC GAGAAGGCGG CGGCTGTTTT GCAAAAAATG TATGCCGACG TAAAGAAGGA AGGCAACATT AATGCATTGC CCGACGACAA AACTTTCAAT ACACTAATTA AAGCGTTCGC GTTGTCACAA GACCCAGGCG CGCCAGAAAA GGCGGAAGAG ATTCTTCGAC ATATGGTGGA GCAATACGAA TTGGGAGCAT CGAAGGTAAA ACCGACGGTG GTCACGTACA CGACAGTGAT TTTGTGCTAC GGACTTTCGA AACACCCCAA GGCTCCTTAT CGAGCTGATG AGCTGTTACA GCTTATCAAG GGGTTGTATC AACGGGGAGA GTTGGATGAC GGTCCCAGCC GTAGTACGTA CCAGGTTGTT CGTAAGGCTT GGGAGTTCTC GACGCATGCT AGAAAAAGGG ATCGCATTTC CGAATTGGAT CGGGAATACG TTGCGTTGTT TGGGCAGACT GACAACAGCC CGCGTTCCGG AAGTGGCCGT CCGAGAAAAG ACTACCTAAA CCACAATACG AAAGGGAAAC GTGACAAACG ACTATAACCA GACAAGATTC GCGACGAGTA ATTTCACATT CGTCGACGTG AATCATCCAT CAAGGAAGAA TAGAAGTTAC ATTCTAGAGC TTATTTGAAC AAAAAATCAC TGTAGTATTT GATTTTGATT TTGTCCAAGA TGGCCCAAGG ACGCCTTACA CCTAACCCAA AGCTTCAAGA AATGCGACTC GCGTGATCCA CCCAGGACTC AAATCCCTGT GCGATCTTTC TGTGACAGCT ATCAAATTCT GCCTACTGTA AATGGCACGT ACGCACGCAG TTTCTGTCGA GTCTCTCCAG ACTTTGTTTC TAACTGACTG TGAAGACCCT TTTTGCTGAG CTAAGAGTAA ATAGTTATAT CGCAAGGAAC CGTCAAGTTT ACTGAGAGTT CGCTGTCCAA AGTGTAACTA CATCTATTAG TAGTGTTGAA TACTGTG
|
Protein sequence | MNLVDYQIHS PRFDLSDFSK KSCQGGPPVS RPKTLNLFLR RTIQQSSLRW AQLSTDASSL TDVSSLQNRL EYSARRQPKK VRSSTTINEE TRAWNVSTSS GRTFPHVEGQ NTENLLDIET YPIGSLTPTL WSQGHELLAF WVKQRTADSV DFSFRLWDRL WQEQVHLETD CVQRSTGDSH PPSTWLTTPL NSEMLCTLVN NWRVWSWNQH PDLNKQNPQL TVKERYNAPY VQALVERCSS LLNEKVYYLI LDGTRRSGNP AQVADLASDL LAASIHRWKT QMNPSCRPST ELYNSALLAW SRSNRNNAIT MVEKLWAQMQ EHNIAPDSRS YERIIAAHVS STTPSRSEKA EYWLRKMEQD HTVCISTRAY TSVIAAQDDC AKAEQLLEEL LDLKAQSIDS PNSTEDSQFH PGDQDVPGAV NATLQCHIRA ANVERAQDIL YRMRDLGYID VLSYRTVMLG WLKASQPVLC QAILQSALES YRDGLCGVCP SDELVALAVS AWAKSSHPKN VEQAMALLKS LTSSEWDHID MQVTTSTMNA LLEVLIRSKD YGDIQKAEEL LRHIKGFSKK DASKAAMAPN ESSYNLMIGG WARLGQPVKA RAWLEEMYKD YQEGSIETMP GLKTFNTAIQ KQCDEGRLPF ELDVYSYTSV LSALSNAWDL KKYGETAQRA EDLLNEMNHR YGQGQASVRP NTISYNAVMN GWARAKNPEK AAAVLQKMYA DVKKEGNINA LPDDKTFNTL IKAFALSQDP GAPEKAEEIL RHMVEQYELG ASKVKPTVVT YTTVILCYGL SKHPKAPYRA DELLQLIKGL YQRGELDDGP SRSTYQVVRK AWEFSTHARK RDRISELDRE YVALFGQTDN SPRSGSGRPR KDYLNHNTKG KRDKRL
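The length and sequence fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive (which is consistent with the stated 3447 bp), and the coding length assumes one stop codon after the 886 residues. The record does not say how the remaining bases of the span divide between introns and other non-coding sequence, so the difference is only an upper bound on intron length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a (possibly grouped) nucleotide string."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Values copied from the record above.
span = gene_length(278469, 281915)   # 3447, matches "Gene Length"
cds = coding_length(886)             # 2661 bases for 886 aa plus a stop codon
non_coding_upper_bound = span - cds  # bases in the span not accounted for by the CDS

# First two ten-base blocks of the gene sequence, used only as a small demo input.
demo = "ATGAATTTGG TGGACTGTGA"
demo_gc = gc_percent(demo)           # 8 of 20 bases are G or C
```

Pasting the full gene sequence into `gc_percent` in place of `demo` gives a figure that can be compared with the GC content field; the result is not shown here because it has not been computed for this record.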
|
| |