Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44591 |
Symbol | |
ID | 7197610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 987391 |
End bp | 990639 |
Gene Length | 3249 bp |
Protein Length | 972 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178620 |
Protein GI | 219115649 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGTAAGGGC TTGCTTTCCT AAATTTTGGA GGCCATCACA ACCAAAACGA CTTTGTACAT CTCAGTCTCA CAGTCAATAT CCTCTTCGGC ATTATACCTT CTCCTAGGCT ATCAACGAGT GGAAAAATAA AGTCCTCGGA AAGAATCTAT TCGCAAAAGG CTCCCAACCA TAGCTAAATC GCCTTTGGAG TATTTGTTCC TGTGACGCAT CCTGTACAAG AAAGCAATCA TTCAAAACGG TCCGTATCGA GTCCGAGGAA ACAAGCACCG TAGAACAATA GCAAGATTTT TCCCTCTCGC CGTCGCCAGC GAAGGAAAAA ACTCTCCGTC ATGACGAGCG CGCGCGAGAG GTCCAGTAGT GCTGGCGCTG GTGTCCCCCA CCAGCGTGAT CTACCAGTTT CTTTTCATCT CGCAGGGGAT GCTCAAATTG ACGAGACTAT TAGTAACGGT CGTATGCGAG CAAATACGGC ACCATCGATA GTTGATGAAG AAAGTGACAC AGTCCAGCAC CAGGAAAAAC AAGACATTCC TGAGATACCG CCTTCAGGGA GTTCGGAAGA TGAAGAGCTG AAGCTGGCGC AAACAATGGC GCTTGCGATA CAAAACAATC CCAATCTGAC TCGAGATGAA ATTCGTAAAT TGATTCAGAC AACTACAGGG GGTACCAAGC CAGCTTCTTT GATGAGTACT AACAATGGAA GTGAGACTTC TGGTTCGGCC GTCCTCCAGC CATCGTTGTT GATTCCGGGA AGGCTGATGT CGGGGTTTTT TCAATCGGAC GAGGCCGAAA AAGAGGGAGA TAATGTAAGT TCTGGGAAAG ATGGTAAGAA AAAGTCGTTT CGATTTGATG GAAGCTTGCG TGAACGCGCC AAAGGAAGGC TTTCAGAAAT TTGGACGTCG AGCGGGGCCA ATAGTAATAG TATCCGTGGT AAAGAGGAAG GTGATATCAA GGACGCTCAG GCTTCCGGTC ACTCTAGTCT TTCATCCACC ACGGGCAGAG ACAATTCGTC GCGGTCTTCT TTTCGTAAAG AGAGGAATCA TGTACGATCT CCACCTGTCA GTCCCTTGAC TACGCCAATG GTCATAGCTG ACGATAACAA ATTCAAGAGT ATCGTTTCCC CTGCGAGTGC TTCGATGACG ACAAGTAAAT CCGTACCGAT ACGTATGATA GGAATTGCCT GGAAGCGGCG CGGCGGCATG GGAAAATACT CTGCAAATGG GGCCTGGGAG CGTCGTCGTA TCGAGTTGCA AGGAACCAAA CTGCTTTACT ATCGGACCGA CGGAGACACG ACTTATGAGC AAGTTTCACC TTCAGCATCG CGTGACGATG CTTCCTTGAC CGCCTCCGCC GTTATGCCGT CTATGGATGA TAACAACAAT CCAGAAGGTG TAGTCATTGC CCGTCGCCAG ACTTGGCTTG AATCTGCTAC TTCCAAGGCG GCCTCGAGTT GGGTTGTCAA CGATCAAGAC CACACTACTC CTCGTGGACA CATTGATTTG GCCAAGGAAA AGGCGACAGT GAATGCTGCC TTTGGGCATT CAAATGCACC TTCGCCCTTC GCCATCTCGA TCAAAGTCCG CGCGGAAACA AAATGGAAAT TGTGTTTCGA CTACCATCGG ACCCAAATGG AATGGTTAGC TGCGTTGACA GATGTAGTGG TTCAAGGCAG TGTAGACTCA TACAATTCCA ACATCTTGAT AGCAGCCGAT CCCTCGAACC AGACAGAATC AGCTCTCTTC CATCCGCCGC AGGTGAATCA TCCTCCCACG GACAGCAACG AACCAGCTGT GTCGCGGCGT TTGTGGATGA TGGAGAGTTA CACAGTTTCT ACAACGCAAA ATGTAGATAA TGATCACTCG GACAGTGAAA ATGAAGACGA TAGCGAATCT TCGTCCCTAG ACGAAAGAGA CGATGATGTG GAGGAGTCTG TCACAATTTC CCGGGGAGCA TCGGATCCGA TGACCTTATC GGCATCACAC ACAAGTATTG TTGAAGGAGC GAAGAAGGTA CTTTTCATTC CCGAACGATA TCTCGCTTTC GTTTCAGGAA TCGTCAATGC ATCATTGGCC TATGCTCGAG CGTCTTCCAC TTCCACAGAA AATTTTTGGT ACGCTGTTGT CGTCGTAAAT TTCTGCGTTT TGTGGTGTTT GGTGAAAGAA CCCGATTGGA GGGGCATGGT TTCTCAGCCA GAAATGAGAA GCGCTCAACC TTCGCGTTGT GACAAGCAGC AACGGCGTGA AAACACGAAG CGAAAAGCAG CAGCGATAGA GCCAAAGTCT ACTGTAACAG GAAGATCAAC GGATCCATCT GCTTATATAC CCATGGCCGG ATCAACAACA GTGAAGCTCA AGCACACAAC GGATCTTCCC GTAAACGATA AAAATGAGGT TTTTGCTGGA TGGTGTGATC CGCCAGGAAA TATTTTGGCT GTTCGGTCCC ATGGATATAG TTTGACGAAG AAGAAAATTC CGTCGCCTGG TAGTCTCTAC AACTGTGCAA GGGTGGATAT ATTTGAATCC CCAAGTCGGT ATCCCGATAT GGCGCTTCGC GTCAAACTCC CATCGGTGGA CTTCAAAGAC GATGACAGGC CGAAAACGTG GAGGACACCA GATGTTCTGA TTGTCAGTAT TGCTTTGCCA ACGGACCCGC CCAAGCTTGG TCGGTCCAGC AGCGATGGAG GCGGATACAC AGTTACGTGT TACTTCACAA TGACACAGGA AACACGCGAT ATTCTACGTC GTGTTACGGC AGATGACTAT GATCCTTCAA AAGAGAACAT CGATGACATT CAAAAGAGTA CAGTCAACGC TGTTCGACTA CTAGAAGAGT GGGTTCGAAG GGCTCCAACG GATCCGTCAT GGTTTTCTCG TTTCAAGGTT GTCCCCAATG CTCACAACTT GAAGGAAATT GGCATGCCGG CATGGATATC AAAATACAAC GGTAAGCCTT TTTTGATCAA ACGCCCTGGA ACAACTGGTT TTATTTTCGA ACATCCCGAG CTGTCTTGTT TGGAGTTTGA CGTCTCGCTG CACCCTTTTC CGTATATTGC CAAGCAGGCG ATATGTTTTA TGAAGGAAAG CTATTTCAAA AAGGTTCTTG TCAGCTTTGG TTTTGTTATT GAAGGGAAAA GTGACGATCA ACTACCGGAG TGTGTCATTG GGTGCATGCA GCTTTGTTAT CCTGACCCAG CTCACGCTAT CTCCGGGGCA TCGTTTTTCG ACGGCACTTC GAAACGATCT TTCCAGTAG
|
Protein sequence | MTSARERSSS AGAGVPHQRD LPVSFHLAGD AQIDETISNG RMRANTAPSI VDEESDTVQH QEKQDIPEIP PSGSSEDEEL KLAQTMALAI QNNPNLTRDE IRKLIQTTTG GTKPASLMST NNGSETSGSA VLQPSLLIPG RLMSGFFQSD EAEKEGDNVS SGKDGKKKSF RFDGSLRERA KGRLSEIWTS SGANSNSIRG KEEGDIKDAQ ASGHSSLSST TGRDNSSRSS FRKERNHVRS PPVSPLTTPM VIADDNKFKS IVSPASASMT TSKSVPIRMI GIAWKRRGGM GKYSANGAWE RRRIELQGTK LLYYRTDGDT TYEQVSPSAS RDDASLTASA VMPSMDDNNN PEGVVIARRQ TWLESATSKA ASSWVVNDQD HTTPRGHIDL AKEKATVNAA FGHSNAPSPF AISIKVRAET KWKLCFDYHR TQMEWLAALT DVVVQGSVDS YNSNILIAAD PSNQTESALF HPPQVNHPPT DSNEPAVSRR LWMMESYTVS TTQNVDNDHS DSENEDDSES SSLDERDDDV EESVTISRGA SDPMTLSASH TSIVEGAKKV LFIPERYLAF VSGIVNASLA YARASSTSTE NFWYAVVVVN FCVLWCLVKE PDWRGMVSQP EMRSAQPSRC DKQQRRENTK RKAAAIEPKS TVTGRSTDPS AYIPMAGSTT VKLKHTTDLP VNDKNEVFAG WCDPPGNILA VRSHGYSLTK KKIPSPGSLY NCARVDIFES PSRYPDMALR VKLPSVDFKD DDRPKTWRTP DVLIVSIALP TDPPKLGRSS SDGGGYTVTC YFTMTQETRD ILRRVTADDY DPSKENIDDI QKSTVNAVRL LEEWVRRAPT DPSWFSRFKV VPNAHNLKEI GMPAWISKYN GKPFLIKRPG TTGFIFEHPE LSCLEFDVSL HPFPYIAKQA ICFMKESYFK KVLVSFGFVI EGKSDDQLPE CVIGCMQLCY PDPAHAISGA SFFDGTSKRS FQ
|
| |