Gene Information |
Locus tag | PHATRDRAFT_35055 |
Symbol | |
ID | 7199995 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 968830 |
End bp | 971321 |
Gene Length | 2492 bp |
Protein Length | 740 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179328 |
Protein GI | 219117067 |
COG category | |
COG ID | |
TIGRFAM ID | |
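
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed gene length. It also assumes the gene span runs from the start codon through the stop codon, so that whatever the coding sequence does not account for is intronic. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally with a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
start_bp, end_bp, protein_aa = 968830, 971321, 740

span = gene_length(start_bp, end_bp)   # genomic span of the gene
cds = coding_length(protein_aa)        # coding nucleotides incl. stop codon
non_coding = span - cds                # estimate of intronic bases (assumption)

print(span, cds, non_coding)  # 2492 2223 269
```

Under these assumptions, the 2492 bp span leaves 269 bp that do not code for the 740 aa protein, which fits the "Is gene spliced | Yes" entry. The 269 bp figure is an estimate derived here; the record does not state it.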
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAGAG CGAGTCCGAG ATTGCAAGAA CCGTCGCCAT CTGCGGCGTC CGTGTCGGCT CTGGCGGCTC CGGCTCCATG GTTTGGGCTG CATCCTGCCG GACCCACCAC GCGTACCAGC CGCGCGGCGG CCTTGTCCTG CCTCGGACGC CTCTGTAGTG TCGTCGTGCT TGCCATATTA GCTAGTAGTC ACGTACATCG TGTTTGGCAG CAGTATCCGG AGTGGCTTCC CACGTCGTCG CGCAATGCCA CTGCGTCCAT CACTCTGGAC TCCGCATCAA TTCACGCACA CACCGGACTC GTCCAGGAGT CGCACCACCC GAACGACGGA TTCCAGCGCG AATCGGTCTC GTCGTCGGTT TTGGGCGATG CGTACGACGA ACAAGGTCGG GCGGGGTACG TGGCGGATCC CACCGCCTTG CGGCGCGAGC GGCAACGGTT TCGACACGCG AGGACCCGGG AGGCGTCGGA ATCCGGGGCA ACGGAACCAT CAATCGAAGA TTCGTCGGAT TATTGGCACA TTCTGGAAAA TTTCGTACCG TTCCGAGCAG ACCAGGATCC GTTGCTCAAC TCGCGGAATC ATCCGTTGCG TGCCAACGTA TCTTCCGATT ATGTGTGTGC GTTCCCCCCG GGTCGGGGAC TCGAAGAGGA AGGTGGGTAC AAGCTCTTGA CGGAAAAGAT ACGACTCCAG ACCGTGCACC GTCGAAACAC TACCACCACG CCGCGACTGT TGTGTGCCGT GTACACGTAC CCACGAATGC GAGATTTGGC TCGAGCGTCG GCATTGTCGT GGGGATACCA ATGCGATGGT TTCTTGGCCT TTACGACGGA AACGATTCCG TCCTTGGGCT TTGTCCATCT ATCCCACGCT GGCAAAGAGT CCTACCGGAA CATGTGGCAG AAGACGCGGT CCATCTGGTC CTACATTGCT CGACACTACG CGGACGATTA CGACTACTTT CATTTGGGTG GGGACGACAT GTACGTGCTG GTGCCGAATT TGCGAGCCTT TTGGCAAGAC GAGATTATCC CAGACAGCAT GGGTACTGGT TCCGGTGCCG GCCAGGACCA AGCCATCTTT ACGGGTCAAC AAGTAGTGTT GCGTGAGGGG CAGCGTCCCT ACGTCTCGGG TGGTCCGGGA TACACAATGA ATCGTGCCGC CTTGCATCGT TTGGTGAACG AGGCCTTGCC GGAGTGCGAG GTGGATACGA TTGCCTCGCA CGAAGACCGT CTAGTCTCAC AGTGTTTCGA CCGCATCGGG ATAAAACCGT GGGACAGTCG TGATGTACCC ACCGGATCCC AGCGCTACCA CGACTGCTCA CCACACCACT TGTACACGTT CCGTGCGGTG ACTTCGTCCG GTCGCGGCCG CGGTTCGTTC CACGCGCGTG CGGCCGCCTA CTGGGCGAGC CAGCCGCGGT TGGGTTCGAT GGTCGGAACC AACGAGACGA CGGGTCCACG GTACGGCTTG GCAGCGGCCG CCACCCACAG TATTTCCTTT CACGACGTGC ACAATCCGCT GTACGTGGCA CGCATGTACG CGTTGCTGCA TCCCGGGAGT TGTCCGTCCG CGACGGCGCT CGGCCGTGGA CTCGCGCAGA GTCACGGGCA TCGTGCGGCG GGTTTCTAGA GGACGCTTCC AGAGGAAAGT GTCTGTGTGT TTAATGTGTA AGTGTGTGTA TAATTAGGTG GTGGTGTCGA TACATGTATG TATAGGGGGG TACTACAGAG ATCGAGAACG AACGGTTGGG GTTGTACGTT TGGAATTAAC AACGGTAGTA TTAGTCTAGA GGAGTACCTA 
TTTTGTAGTA TGGTTGGGCC GGTTCTCTCT CTCGGTAGCT CGGGCGCGAG GGTGGGGGTT CCATGCGATC GTCTCCTTCC CTGGAGCCCG TAGGGAGGGT TGGCGCGTCG GCCGAAAATC GAGTCTCGCA CGGCGCGCGG CCCAATGACG GCCGGCGTAC ACGACAGACC GGTTTCGGCC TACCGCGAGA CATACGCGCC ATCTCGGCAC ACCGCGCGCG TACCAACAAA CCCTCACACC CCAACTGACA AAACGAACAG GGCAGCACTC TCATATACCA CGACAACACT ATCGAGACCT GACAGGGAAC ACGGTCGGTA CGAATCCCAA GATATATACA CACACTAGAC AAACACTTGC TTGCAGAAAC ACTTTACTAC AAGAGAAAAG GGATGAATGC GTTGGAGGAG AGGCTTCCAA CGTCGGGGAA CGCATCGGAC GACATGGACG ACGCGTCGCT GGACCGCGGC ATTCCGGGAA CGGAGGACGA GGACGCGTCG GTGGCACGAG CTCGAGCACA CGTTGCCGTC CTCGATGAAC CTACGGGTGA TCGAGTCCCA CGCGATACGA CGGGAGATCA AGCCGTCTTG GGAGCAGCTG CGGCGCAGGA CGGGGTAACG GCCTTTGCCG TGACGGATGA AAGTGGACAA CTCGTCCTCG CACGTTTTCA ACAGTTTCTC GGACAATTGT GA
|
Protein sequence | MRRASPRLQE PSPSAASVSA LAAPAPWFGL HPAGPTTRTS RAAALSCLGR LCSVVVLAIL ASSHVHRVWQ QYPEWLPTSS RNATASITLD SASIHAHTGL VQESHHPNDG FQRESVSSSV LGDAYDEQGR AGYVADPTAL RRERQRFRHA RTREASESGA TEPSIEDSSD YWHILENFVP FRADQDPLLN SRNHPLRANV SSDYVCAFPP GRGLEEEGGY KLLTEKIRLQ TVHRRNTTTT PRLLCAVYTY PRMRDLARAS ALSWGYQCDG FLAFTTETIP SLGFVHLSHA GKESYRNMWQ KTRSIWSYIA RHYADDYDYF HLGGDDMYVL VPNLRAFWQD EIIPDSMGTG SGAGQDQAIF TGQQVVLREG QRPYVSGGPG YTMNRAALHR LVNEALPECE VDTIASHEDR LVSQCFDRIG IKPWDSRDVP TGSQRYHDCS PHHLYTFRAV TSSGRGRGSF HARAAAYWAS QPRLGSMVGT NETTGPRYGL AAAATHSISF HDVHNPLYVA RMYALLHPGS CPSATALGRG LAQSHGHRAA GGYYRDRERT VGVLGREGGG SMRSSPSLEP VGRVGASAEN RVSHGARPND GRRTRQTGFG LPRDIRAISA HRARQHSHIP RQHYRDLTGN TVETLYYKRK GMNALEERLP TSGNASDDMD DASLDRGIPG TEDEDASVAR ARAHVAVLDE PTGDRVPRDT TGDQAVLGAA AAQDGVTAFA VTDESGQLVL ARFQQFLGQL
|