Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_43936 |
Symbol | |
ID | 7204167 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 472499 |
End bp | 476260 |
Gene Length | 3762 bp |
Protein Length | 1223 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186354 |
Protein GI | 219113541 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGTA GTACATCGAT CATGCCGTCT CCGCCGCGCA ATCCTCCTAC AAAAACGCAG CAACTTTGGC GCTGTGATGT CTGCCACACC GCCTCTTTCG CATCCTTTCA AGAAGCTTGC ATTCATGAAG ACCTCTGCCG TGAGGAACAA ACGCGCGAGG GGGAAGAACA TCAAAATCAT GAACCCCCCA CGCACGCCAA CCCGAAACGC CACCCCTTCT TCGGCGGTGT ACCGAAGCCT CGCGGGGCTA GCACTACGCC TGCTTTCGTC ACAGCCCACA AAATGAAAAG GAAACCCAGC AAAGCAAATT TGTCGTTACC TCACAAAGTT GTTGAGCGGG CAGCAAGAAA AGACGTTAGC TTGACTACAG AAGTGACACC CGTTCGCCAT TCCACACCCA ACCCCCGCAC CTCCTCTCTC GATGACGGCA TCGGGAAGCG TAGCCATCCA CCACTTGACG CGGATATGGA TTCCGTCGGT CAGATCTACG ATGTCAAGGA TTATGATACG CACGTACCTA CTTCCAAATC TCGCGCTAAA AAGACTTGCA AGCACAAAAC GGAATGTAGA CTTTCACCAT TGGTGTTTGG TAAGACTAAA GAAGACTCGA AGATATTGAT GGCGGAACAG CGTCAGGCAG AATTTCAAGC CAAGCGGCGT CTCAAAACTC TAAAGGATCG GGAACGCCAG CTAAAACGTC AACGTGCAGC CGATGACGCC ACCGGGTCGA CCGCGTCCTC CCATGATCCG ACAGCCAAGC AATACACAAA CGAACGACAT CCTTGTAACC AACAAAAGAA TGTAGTTGCT CACGCGGCAC GCTTTCCTAC ACCGTCGCAC GTGATTCCAG CGGACAGGAA GAACTCGCAA TCTGCTTCTG ATTCGAAAGC TCCTTATTCT TGGTTGTCGC TGGATGCGCT GAAACAAACA CGTGCGGCAA TTTCTCAGCT TCAGTATGCC GGATGCGATA GAAACATGAG TGAAAATTTT GTGCAGACAA AGCATTCGGA AAATGAAGCC AAACTTCGGC TGTTACCGGT AGATGGAAAC TCTTTTGCTG GAAACTTCAG GCCAAAAGAT CCAATTAAGC TTACTTTACA AGCACATCTC GTCCCACCCC TTGAGCCTGC CAGCAACACG GACACATCTC TATGGAGCGA CAGGTACGGT ATTCGAAGGT TACCGGAAGA TTTGTGTGGC GCCAGTATTG AAAATGCTGC TCTGGATCTG GCGAAGTTCG TCCAGGAGTG GAAACTGGAA CGCCAAAAAG CTCACGAATG TCGCGCCGAA AACCAGCGTC GTTTCCAAAA GAATGTATCT CGTGCCAACA AAGTTGTTTA CAAGGACGAC CGCGATCTTT GGGAGGATTC AGATGAAGAA GGACCGTCCT TACCGAGTTT GTGTTTGATA ACGGGACCAG TTGGAAGCGG TAAAACGAGC TTGGTACATG CTGTTGCTCA GCAACACGGC TGTCCCATTT TGGAGATTAA CACAACCGAA AGGAGAGGAG CGTCTGCCTT ACGCAAAACC ATTGAAGAGG CAACACAAAG CCACTCCACC ATGGACTTGC TGCACACACA CCAAGCCAAT GTCTTTGCCA CTGCGAAAAA GGGGCCAAAT ACCGATTGCA ATGACAAGTC TGTCAACTCC ATGATGAGGG GATCTACCCT GACGGTAATT CTTATAGACG AGGTCGACTT GCTGTTCGAG GCTCAAGGTG ACAATGGATT TTGGTCAGCT TTATGTGATT TACAAAAACG GAGCAAGTGC CCTATCATTC TGACTTCCAA TCGATTTCCG GATTGTCTCA ATTCTCCTTC GTTTCGCTTC CATCATATTG TCGTAGATAT GCCCCATCCT CGGGAATGCA TGTCCAAACT ACGCCGAATT CTCGAAAATG AACGTTTCTC GCTGTGCCGC AATACGTCGG ACAGCAGCTT GGAGAAGATG CTTTTAGAGC TTGCTGAGCT CTGTCGATCA GATATGCGAC GTATGTGTCT AGAACTGCAG CTATTTGCCT CCTCAGATGC TTCCTCCTTT CTCGAGGCGG CTACTGCCAT CTCTATCGAG CCAGAGGAAA CCTCATTTGC GTCCCTTTCT TCGCGCAGCC CCAAAATCGA TGCTATTCAA CCTAGTTGTG TTCCTTCTGA TACTTTTTCA TTCGTAACTG TGACAGGCAA AAACTTTTTA TCTCTTGTTG CGTCTCCTTC GTTGGGCAAT GGAGGCTTTC CAGTTGGCGT GAAAGTTGGC GACCAGGTGT GCTTTTCCTC GCGAATTATG AACGATTCTA CTATCCTATC GGTTGTCCCA CCATGTCGTC TACCACTATG TGTAGGCTCT TGCGGAATGG TGGAAGGCTG CAACAACCGG AGTCTTGAAA GTCGCTATGC TCCGGTTGTA GTCTATAGTC TTGCTGAACT TGGCTGTATA TCAACCACTT TCGGCCACCT CATTGTCGAT GAACTCACAT TGGGGACAAT ATCTTCTCTG CATACAATTT GCAATCTTGA ATATTGTTTA CCAACTCCAA TGAAAACAGA GCAAAATGAA GACGAAGGGG GCGGCCTGGA AGACGAGCAA CATAATATTA CATCCATCGA GAATGTACCT TCGAGCTTGG ATTGCTTGAT GCTTCCGAAT ACATTACCTA GAGTCTTTGA GGAGTCAAAG CTTCCGGCAA GGCTTTTGAA AGAGGGCCTA GACGCTTGGA ATTTGAAATA CGAGTTGTCG AGAGCAGTTG AAAATTACAG ACCTGTTTTA CCGGCGTTAG AAATGGAAGA AGCCTCGTTT GATGCAGAAA TTAATAGTGA TTCAAGACTG TTGGAGGACC TTGGAAGCTT GAGCACTCCC TTTTTGTCTG GTGCAGTTCG TGGCTTTGGT TTTGACCTAA CCGAAGCATT CCCAAAGTAC AACAACGAAA AAAGCAAGCC GTACGTAGAA TATCGACCGA TTGTTGCTCC ATTGTCCATA AATCTCATTC GGAATTTTTT TCAGGCCACC CGAAGACCGA CTCTTTTCAC TAGGTTGGAA GGAAGATGCG TACTTTTTTG GAGGTAGCGA GAGCTACGTG ATCCTGCCGT CGCGGGTCGA AAAGCGACGA CTCTTACATT ATTCTCTATG CTCAGAGGAA GGAGTTGGTT TGCTTACTGG TTTCTCAGCT CATCACGAAG ACGAACACAG CGCTGCAGGT ACCGGCGTGG ATGTTTTCAT CTCGGAACAG GAGAACATTG TTTTATTTGA CCAAACACAA GCATCCTTTC TTGCCCTTCC GAGTCTCCTC AGGACTCGAA TGAGAACCCA GTTGTTCAAA GACAAAGAAA GCCATCAGTG TCTCTATGAG CGAAAACGGA AAGCGTATTT TGATTTGACA ATCGATCGTC TTGTGAAAAC TGTTTTTACA AAAAATGCTG GTAGATTTCT TTTCTCACGG AATCTGACCG GCGCTATGGT TGACGGCATA CAGGACTCAG CACTTGTCCT AGACTATCTG CCAACGCTGA GATGCATTGC TCTGTACGAA AATGAAGTCA ACTATGCTCA TGAAACGGAT TGTTACAGAC AGACAACCAC GTCCCATGGT ATCTCCCGGC ATACTAGGCG TTCCAAAAAA AGGGACATGC GGAATCATTT GGAGTTGTTA GCGACGGAAT CCGATAGCAA CTTCAAGGGG CTGACTCCAA AACAGCTCAG CGACTTTTTG GTTGCAAGTA CGTTAAACTT TTGGCCGGCG CCGTAGGTTA ATCTTGCCAA ATTTTAGGCC TT
|
Protein sequence | MAGSTSIMPS PPRNPPTKTQ QLWRCDVCHT ASFASFQEAC IHEDLCREEQ TREGEEHQNH EPPTHANPKR HPFFGGVPKP RGASTTPAFV TAHKMKRKPS KANLSLPHKV VERAARKDVS LTTEVTPVRH STPNPRTSSL DDGIGKRSHP PLDADMDSVG QIYDVKDYDT HVPTSKSRAK KTCKHKTECR LSPLVFGKTK EDSKILMAEQ RQAEFQAKRR LKTLKDRERQ LKRQRAADDA TGSTASSHDP TAKQYTNERH PCNQQKNVVA HAARFPTPSH VIPADRKNSQ SASDSKAPYS WLSLDALKQT RAAISQLQYA GCDRNMSENF VQTKHSENEA KLRLLPVDGN SFAGNFRPKD PIKLTLQAHL VPPLEPASNT DTSLWSDRYG IRRLPEDLCG ASIENAALDL AKFVQEWKLE RQKAHECRAE NQRRFQKNVS RANKVVYKDD RDLWEDSDEE GPSLPSLCLI TGPVGSGKTS LVHAVAQQHG CPILEINTTE RRGASALRKT IEEATQSHST MDLLHTHQAN VFATAKKGPN TDCNDKSVNS MMRGSTLTVI LIDEVDLLFE AQGDNGFWSA LCDLQKRSKC PIILTSNRFP DCLNSPSFRF HHIVVDMPHP RECMSKLRRI LENERFSLCR NTSDSSLEKM LLELAELCRS DMRRMCLELQ LFASSDASSF LEAATAISIE PEETSFASLS SRSPKIDAIQ PSCVPSDTFS FVTVTGKNFL SLVASPSLGN GGFPVGVKVG DQVCFSSRIM NDSTILSVVP PCRLPLCVGS CGMVEGCNNR SLESRYAPVV VYSLAELGCI STTFGHLIVD ELTLGTISSL HTICNLEYCL PTPMKTEQNE DEGGGLEDEQ HNITSIENVP SSLDCLMLPN TLPRVFEESK LPARLLKEGL DAWNLKYELS RAVENYRPVL PALEMEEASF DAEINSDSRL LEDLGSLSTP FLSGAVRGFG FDLTEAFPKY NNEKSKPPPE DRLFSLGWKE DAYFFGGSES YVILPSRVEK RRLLHYSLCS EEGVGLLTGF SAHHEDEHSA AGTGVDVFIS EQENIVLFDQ TQASFLALPS LLRTRMRTQL FKDKESHQCL YERKRKAYFD LTIDRLVKTV FTKNAGRFLF SRNLTGAMVD GIQDSALVLD YLPTLRCIAL YENEVNYAHE TDCYRQTTTS HGISRHTRRS KKRDMRNHLE LLATESDSNF KGLTPKQLSD FLVASTLNFW PAP
|
| |