Gene Information |
Locus tag | PHATR_33124 |
Symbol | |
ID | 7204255 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 108080 |
End bp | 111525 |
Gene Length | 3446 bp |
Protein Length | 1108 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186280 |
Protein GI | 219113393 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0372289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCTC TCGAACATGT CCTTGTGAAC CTCTTGGGAG CAACAACGTT GGATTCGTCG TACCGTCGGT TCTTTGAAGA GTATGGGATT ACTCAGGCCA GTGAATTGGC CTCAATCACT GAACATTGTC TTGCAATGGT GTCTTATGGC GTCTTGACCC CTGCTGTGGG AGACGGCCCT GCTTCAATTG TTCGTACATT CCTTCCGCCT GCGCAACAGG ATCGGATTTT GAAGATTGTA CAATGGTTCC TTTTGAAAGG CACCAATGTG ACAAACAACA CCTGGCTTGA ACTTACCTCT GATGTTCTTG AGTATTGGCA ACCAGCCTCT GCTATTGTTG CCCCCGCTAC TCCTGTTGGA TCTGATGCTC GGAGTTCCTT TGCCGAAAGT GCTGCTGCAA AATTTCGAAA GACGATCAAG AACCATTCCG TCCCGTACCC AAAGTTCAGT GAGGACCGTT TTTGGGTTAC TTGGAATACG AATATTCGTA TTAAACTCCG TATCCACGGT GTTCAGTTTG TACTTGACCC GGACTACTTG CCCCAGACCG TTGATGAGAT GGACACGTTC GTTGAGATGC AGAATTTTGT CTTCGGTGTA TTCAACGACA TTTTGTTGAC CCCTCGTGCG CGTGGGATCC TCCACAAGCA TGTGGATGAG CTGGATGCTC AGTCGGTCTA CCGCGACCTT GTTGCCTCGT ACGGTAAAGG TATTAACGCG CAGATCACGG CTACATCCAT TGAAACGAAG CTCACCTTGT ATTCGTTTGC GACTTCAAAG AGCAAAACTT GCGTTACTTT TTTGACAACT TGGCGCAATT TGATCTACGA TCTTGAACGG ATCAACAAGT TCCCTTTGCC AGATCACCAA AAGAGCGTGC GACTGAAGTC AGCTGTCCGT TCCCATCCTC AATTGAAACT TTTCCTTGGC AATGTGCAGC TTTACTCTTG TACCCATGTG GGAAAGAGTT CCGACGACTC AGACTTCGAG TACGTCTATG ATCTGATGCT CGAGCATGCA ACCAATATTG ATCAGACCGA TTTTGAAGAC CGCGGTAATA ACCGTGGTGG CCGTTCTGCA AACAACGCGA AGTCCCAGTC TTCTTCAAAG AAGAAAACTA ACAAGCCGCT TGGTAAGAAG CACAAGAATT ATGTGCCTCC TGAGAAATGG AATGCTCTCT CTCCTGAAGA GAAGCGCACC ATTATGGACC AACGAGGACC TTGTCCTGCT GCAGCTCCAG CCTCTGCTCT GTCCGTGAAT GCCGCTGCTA CTCAGCCTCC TCCCACCGTG TACGTGAGCG ACTCGACGGC TGTGGATAAC CAAAGTCTTG CTTCGACTCA AGTTCCAAAT GCTGCTGCAT CCGGACACCT GCTACGGTCG CTAATTTCAA ATTCTGCCGC TCGCCAGCCG TCCAACGGCG CAACCTCTGA TTCTTTTTCG GTGAATGGTA CCACGTACCG CCGTGAGGTA AACCATGCCT CCGTCAGATA CCGTCTGTCT ACTCACGATG TTTCTTTGAC TAAAGATTCT TTGATTGATG GTGGTGCCAA TGGTGGCCTT AGCGGCTCGG ATGTGACCGT TATTTCCCAA TCTCTGTCGC AAGCTACTGT TTCTGGGATT GGAAACTCGG AGCTCACCAA CCTTCGTCTG TCGACGGTTG CCGGACTCAT CCACACCACG GATGGTCCTA TCATTGGAGT GTTTAATCAG TATGCTCATC TTGGTACTGG TAATACCATT CATTTGTGCA ACCAAATGCG CTCCTGGGGA GTCACAGTTG ACGACGTCCC TCGTACTTTT 
GGTGGCAAAT AGAGTATTGT CACGTCCGAT GGCCGTTTTG TCATCCCGCT TTCAGTTTCT GGCGGACTCA CCTATTTGTC TATGCAGGCC CCCACCGAGG AGGACCTGGA CAATTTCGAG TGGGTTCATT TTACCGCCGA CAACGAGTGG GACCCGAATG GCGTGTCTTC TCTCCTGCTG CGACCGATGA TGATCTCAGT TTGCAGCTTT CTGCCGACCA TGTTCCGGGA TGAACGCTTC AACAACTTTG GCCTTCTTGC GCACTCCACG GTTGTCAGTC GTTCCCCCTT GAACGCCGAT GTCTTGCAAC CCAATTTTGG ATGGGTCCCC AGCGCTCGAA TCTCCCGCAC GTTCGAAAAT ACCACACAAT TTGCTCGTGC CGATGCTCGT TTGCCTTTGC GCAAGCATTT CAAGTCGCGC TTCCCTGCTG CCAATGTTTC TCGTTTGAAC GAAATTGTGG CAACCAATAC TTTTTTCTCG GATACCCCTG CGGCCGATGA CGGCATTTTC AACCATGGGG GGGGGGGGGG GGTACAATGG CCCAACTTTT CGTAGGAAAA AGTTCGCAAA TCACCTCTGT CTTCCCAATG AAGCGCAAAT CTCAGTTTGC CCATGCTTTC GAGGACTTTA TTTGTACCCA TGGTGCTCCC AATGCACTCC TCAGCGACAA TGCTCGTGCT CAGATCGGTA AGCAGGCGCT TCAGATTTTG CGAATGTACG CAATTGACGA TATGCAGTGC GAGCCGCATC ACCAACACCA GAACTACGCG GAACGCCGCA TTCAGGAAGT GAAAAAGATG GTAAATACCA TCATGGATTG TACAAACACT CCTCCGGAGT ATTGGTTGCT TTGCTTATTT TATGTGACCT ACTTGCTGAA TTGCCTTGCA GTTGAGAGCT TGAATTGGCG TACCCCGTTG CAAGTTGCGT ACGGCCAGCG TCCTGATATT TCCGCTTTGC TCCTTTTTCG TTGGTTTGAG CCGGTCTACT ACTACGACCC TGACCATGCA TCTTTCCCGT CGCAATCTCG CGAGAAGACT GGTCGTTGGA TTGGTGTTGC CGAACATAAA GGTGATGCTT TGACTTATTG GATTTTGACC GATAATACTC ACCAAGCCGT TGCCCGTTCT GTTGTTTGTT CAGCCAACGT TGATAACGGT CTGAAAAACC ACCGTGCTGC GAATTCCTCT CCCGATGGTG GGGAGCCTTC GAATCCTAAG CCTATTGTGT TGGCTTTGAG TGATCTACGC AATCCTGCTG CGATCAACCC ATCGCTCTTT GAATCCCCTG CGTTTTCTCC TGACGAATTA ATTGGTTGAT ACTTGGTTCG TGAAGCCCCT GATGGCCAGA GCCACCGAGC CCTTGTTGCT CGTAAAATTG TTGATGCCGA TTCCGACAAT CGCCAAGCAA TCCGTTTCCT ATTGCAAATT GATGAGAAGG ATGCCGACGA GATCATTTTG TATAATGAAC TCTGTGACTT GATGGAAGCT CAGCAAACCG ACCGTGTCAC GAATGGAAAT GTTGAAGGCC ACTTCAAATT TACTGGTGTC ATTGGACATC AAGGACCGTT GCAACCGACT GATGTAAACT ATAAGGGATC GTCGTGGAAT GATTTGGTTC AATGGGAAGA TGGTTCCCAG ACCTAG
|
Protein sequence | MDPLEHVLVN LLGATTLDSS YRRFFEEYGI TQASELASIT EHCLAMVSYG VLTPAVGDGP ASIVRTFLPP AQQDRILKIV QWFLLKGTNV TNNTWLELTS DVLEYWQPAS AIVAPATPVG SDARSSFAES AAAKFRKTIK NHSVPYPKFS EDRFWVTWNT NIRIKLRIHG VQFVLDPDYL PQTVDEMDTF VEMQNFVFGV FNDILLTPRA RGILHKHVDE LDAQSVYRDL VASYGKGINA QITATSIETK LTLYSFATSK SKTCVTFLTT WRNLIYDLER INKFPLPDHQ KSVRLKSAVR SHPQLKLFLG NVQLYSCTHV GKSSDDSDFE YVYDLMLEHA TNIDQTDFED RGNNRGGRSA NNAKSQSSSK KKTNKPLGKK HKNYVPPEKW NALSPEEKRT IMDQRGPCPA AAPASALSVN AAATQPPPTV YVSDSTAVDN QSLASTQVPN AAASGHLLRS LISNSAARQP SNGATSDSFS VNGTTYRREV NHASVRYRLS THDVSLTKDS LIDGGANGGL SGSDVTVISQ SLSQATVSGI GNSELTNLRL STVAGLIHTT DGPIIGVFNQ YAHLGTGNTI HLCNQMRSWG VTSIVTSDGR FVIPLSVSGG LTYLSMQAPT EEDLDNFEWV HFTADNEWDP NGVSSLLLRP MMISVCSFLP TMFRDERFNN FGLLAHSTVV SRSPLNADVL QPNFGWVPSA RISRTFENTT QFARADARLP LRKHFKSRFP AANVSRLNEI VATNTFFSDT PAADDGIFNH GGGGGRKSQF AHAFEDFICT HGAPNALLSD NARAQIGKQA LQILRMYAID DMQCEPHHQH QNYAERRIQE VKKMVNTIMD CTNTPPEYWL LCLFYVTYLL NCLAVESLNW RTPLQVAYGQ RPDISALLLF RWFEPVYYYD PDHASFPSQS REKTGRWIGV AEHKGDALTY WILTDNTHQA VARSVVCSAN VDNGLKNHRA ANSSPDGGEP SNPKPIVLAL SDLRNPAAIN PSLFESPAFS PDELIAPDGQ SHRALVARKI VDADSDNRQA IRFLLQIDEK DADEIILYNE LCDLMEAQQT DRVTNGNVEG HFKFTGVIGH QGPLQPTDVN YKGSSWNDLV QWEDGSQT
|