Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45221 |
Symbol | |
ID | 7200102 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 522772 |
End bp | 526565 |
Gene Length | 3794 bp |
Protein Length | 1064 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179450 |
Protein GI | 219117311 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAGAGACAC ACATACACCT TGCCAAGTCC TCAACGTACC AAGGCAGTCT GTCTCCACTC TAAACTTCCA GCCGAAAATC CTACAGAAAT AGCTCACAGT CACTGCGTTG ACCATGGCAG AAGACGACGA AAGGAGAAAG GTAGACAAAA GCGTAGAGCG AGAATTTTCG CAACAAAGGA ATGACCAATC CATTCTATTG GACGCGTTGG AAGGATTGAG TGCTGGAATG TCGCCTGTAC AGCCCTCGCT GGCCATTGCC AAACCTCGTC GTGATCGGTA AGTTAGGTGC TCGCACTCGG AAATGAATTC AGTATCACTG CACTTTAATT CATAAGTCTG ATATCTGCTC CCGATCTGGT TATAGAACTG TTTCCTGGGA CATCAATGTT CCAGCTGACC TCATCGATGC ACCACCCAAC CACGACTTGT CTCCTGTGAC TCCTCCGCAA TCGAGCAACG TGACCACTGG TAAACTCTCT TTGAAGGACA TCAAGGATTC GTCGCCAATA GAAGATGAAG CGGTACGTTA AAAGATTTGA GTGCTATATA ACCAATAAAG GGACAAAGTC AAACCTTTCC ACTAACGACG CTTTATATCA TTTCCGTTGT CAGGAAAGCT ATATTCATCG AGCGTTGGAG GAAAGCGATC CAACCAGATC TGGCGAGAGC GAAGCCCAAA AGAATCTTCT TGGTGCGGAA GTCCCAAAAT ACCGCTTTGA AGATGCGCCG TCCTCTCCTG TTCCTTCGAC GTCTTCAAAG GCGTCCCAAA AATCCCATGT ATCAAAATCT CACCGCAAAA CGCTTAGCAC TGGAGACGCC CTCTTTAATT TAGCGGCAGA AATGCGTCAG ATGCAGAATC AAAGTGTTTC CGAGTCCTTC GACGGGGTGA AAAACGGAAC AACAAGCGCG GATATGTTTG CGACCAACGC CATCGCTCTC ATGAAGAGAA ATAAAGCAAA AAAAGAAGAG GAAACAAAAA GCGCAGCCGC CGTCGCTGCG GTGAGAGAAA CTTTTGATCC CGCCAGCAAG TGGAAAAAAC TTCGCACGGC TGTGCGAGCA TCAAACATGG TCGATTCTAA AAAGACAGAC GAAGCAGTCG ATCCTTTGGA AAACGATGAT GGCGATCAGC AAGTCGATCT TGAAAGTGGT GTTTCAGGAA AGGGAGAGAC GAAGCAAAAT AGTTCTGATA ATTTACGACG CTCGAAAAAG AAAAGTCAAA TCAATGAAGA TTTTAAATCA GATTTTCAAG ACTTTGAGGA ATGGTTGAAG TTCCGCAAGG GAAACGCACT TCGCTTCGTT AAGAACGTGC TTTTCTTTAT CATCTTCCCG GCAACTGGGA TTGCTGCCAT TCTGTTCTAC CTGGCGGATA ACCCGCCATG TGGCTCACAA GAGGAATGTA TTGCAGAGCA GCACCCTGTG GATGTGAAAA CTCCCATGCC TGTGGCAAAT ACTACTGACC CGGGAGATTC CGGTCTAGGA TCCATCGGAA ACTTCTTTCG TGTGGACCGT TCAAATCAAG CTAGTGCATC GTTTTGGATT TTGTTCATTG GAGTCCGGCA AGTGATTACT TTTTCATTGG CCAAAATGTC ACAAGCGGTC GTAATCGACT TCTTCGCCTT ACGAACCCGT TTGTTTGTGA AGGTGCTGGG ACCTTACGCA ACTTTATGGA CCGTACAATC ACGTGGCTGG CCATTCTTAC TCACATGGTG GGTTTTGTAC GATTTCTTTC TACTTTTCGG AAACAACAAA TTTGCTCGTC ACTGGTAAGT GTGACTGTGC TACAATCTAC GATTTTGACA TGATATATTG CTAACCGGTA GCACTGCCCA ATCCAATCCA GGCTTTTTTG GCAGGACACA ATTGGCCTTT TTAATTTCGA AAACCCAGCC GGCGGTGTTT TGAACAGCAA AGCGTACCAA ACAGTTCTCA TTTTGGCCGC ATCCATTGGC GTTGCTATCA CTGCGAAGCG TTTTTGGGTT GGACTGTTCC TGGGCCGACA GACCTTTGAT CGATATGCGA ATGATCTTGC GGTTATTATG CGAAAAGCTT TGCTGGTTGG TCAAATTGCT ACGCTCGCGA GAGATATGGA GAAATACGAT TTCAGTATGA ACGACTATCA TGTGGAGCAT TCTGTCGCAT ATAAGAAGAC AATGGAACAG AACGTTGCGG AATTAGACGG CGGCGAATCA GTGGGTAGTG GTGGTAATGC ATCCAAAAGC CAACGGTCAA TTCTTGCCAG CACGGTTGAT TACAATGCTT CTATTCGCGT TAAGATCAAC GAAGTCCTTG GTGCATGGGA GGAGCCGACA CTTCTGGATA AGAGTAATGT ATGTGCTAAA GGATTTGGAC GATGATATGG GCTTTTCCGA CTAATCTTAT CCTTCCTCTC TTGACATAGG ATATCGTCAG CATTAGCTCG ATTATACAAT TTCGTCAATC CCTTTCCTGT CTCAACACAA CGTTCCCATT TTCAGTTGCC TTCGGACCTG CAGACAGTCG TACAACGTGC ATTTCTTCCG TGGAAACGGT GTATCAGCGA TTGCTTGGAC GCACTCCAAG TAATGAAGCT TTAAACTTTG ATGTGCTTGC ACTAGTTGCA GTCGATCGCG ACGGATCACT AGACGAGGCG AAATTGAAAG AAGTGGTGAA AATATTTCGG CCCGACCGCG AAGGCAATCT CTCGCTGATT GATTTTGCCA AGTCAGTTGA CAGTGTTTAC AAGGAACTAC GATTGCTGCG CGCTTCGGTC GCTAATTCGT CCAAAATGGA CAAGGCGTTC GAGCGTATCA TCAACATTTT GTTCTACTTC ATAGTTGGAT GTATTTCGCT TGGGGTAATG GGTGTGGATC CTCTCGCTCT TTTCGGATCT GTCTCTGCCT TTGTTCTGGG TTTTGCTTTC ATGATCGGAG CGGCATGTTC TAAGTATTTT GAGGGTCTCC TGCTTATTTT GGTACGCCGT CCCTTCGACA TTGGCGACCG CATCCACGTG AGCGACGTAA ACAATGACAC CAGCTTCTCT GGCTCTCCGA CTTGGTTTGT GCGGGACGTC ACTCTATTTG CTACGACAGT GGTCTTTGCG GCTACCAACG AAGTCGCTAC CTACTCGAAT GGGTCGCTCG CCAGTAGCCG CATAATCAAT GCTGCGCGTT CGCCACAGGC TGTCTTGTAT TTCAACCTCA AATTTCCCAT CAACACACCA TATTCAAAAT TCAAAATCTT CAAGGCTGCT TTGGAGAAAT TTGTCAAGGC GCGTCCTCGG CAGTGGTTGA GTTTTTCAGC CTTCCGCGCT ACTCGCGTGG AGGCGGATGC TGGATTTGTT GAGTACATTG TAGTGGGACA GCATCGCGAA TCGTGGCAGA ATGTGGGAGC CTTGCTAGAC AGCAAGGCAG AGTTATCCAG CTTTGCTTTG GAGCTGTCAA AGCGTATGAA TATGCGCTAT CGCGCGCCAC CTTTACCGGT GGACCTCAGT ATGCGAGCTG CTGGAAATGG AGGTCCACTC AACGACATGC TTGCACAACA GATGCAAGCA GGCGACCAAT TTGGCTCTTC AGATGGAGAC GGTGCCAATG AAGACGGGTC CCAGAGCACG TACGATATCG GAGCGATCGA GTCCATGTTT GAAAAACCTA AATAAGGTGC ATCAATCCAA AAGCATACAG CGTACCCACT TCATATGACT TCCCCACGTG AAGCCACATG CAGCACATTT ATATTTAAAT AGAATGCTAT GACAAGAAAA GGACAAGACA CAGTAGAAAA GAAAATACTA GTAACCTCAA CAGCAATCGA TTGT
|
Protein sequence | MAEDDERRKV DKSVEREFSQ QRNDQSILLD ALEGLSAGMS PVQPSLAIAK PRRDRTVSWD INVPADLIDA PPNHDLSPVT PPQSSNVTTG KLSLKDIKDS SPIEDEAESY IHRALEESDP TRSGESEAQK NLLGAEVPKY RFEDAPSSPV PSTSSKASQK SHVSKSHRKT LSTGDALFNL AAEMRQMQNQ SVSESFDGVK NGTTSADMFA TNAIALMKRN KAKKEEETKS AAAVAAVRET FDPASKWKKL RTAVRASNMV DSKKTDEAVD PLENDDGDQQ VDLESGVSGK GETKQNSSDN LRRSKKKSQI NEDFKSDFQD FEEWLKFRKG NALRFVKNVL FFIIFPATGI AAILFYLADN PPCGSQEECI AEQHPVDVKT PMPVANTTDP GDSGLGSIGN FFRVDRSNQA SASFWILFIG VRQVITFSLA KMSQAVVIDF FALRTRLFVK VLGPYATLWT VQSRGWPFLL TWWVLYDFFL LFGNNKFARH WLFWQDTIGL FNFENPAGGV LNSKAYQTVL ILAASIGVAI TAKRFWVGLF LGRQTFDRYA NDLAVIMRKA LLVGQIATLA RDMEKYDFSM NDYHVEHSVA YKKTMEQNVA ELDGGESVGS GGNASKSQRS ILASTVDYNA SIRVKINEVL GAWEEPTLLD KSNDIVSISS IIQFRQSLSC LNTTFPFSVA FGPADSRTTC ISSVETVYQR LLGRTPSNEA LNFDVLALVA VDRDGSLDEA KLKEVVKIFR PDREGNLSLI DFAKSVDSVY KELRLLRASV ANSSKMDKAF ERIINILFYF IVGCISLGVM GVDPLALFGS VSAFVLGFAF MIGAACSKYF EGLLLILVRR PFDIGDRIHV SDVNNDTSFS GSPTWFVRDV TLFATTVVFA ATNEVATYSN GSLASSRIIN AARSPQAVLY FNLKFPINTP YSKFKIFKAA LEKFVKARPR QWLSFSAFRA TRVEADAGFV EYIVVGQHRE SWQNVGALLD SKAELSSFAL ELSKRMNMRY RAPPLPVDLS MRAAGNGGPL NDMLAQQMQA GDQFGSSDGD GANEDGSQST YDIGAIESMF EKPK
|
| |