Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45913 |
Symbol | |
ID | 7201002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 649885 |
End bp | 653015 |
Gene Length | 3131 bp |
Protein Length | 1012 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180287 |
Protein GI | 219119041 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATGATTGGC ATGATGGATA GCGAGGAAAC GGAGACTGCT GAAAGCGAAA CAATTGTAAT AAATGAGGCT GGTTCAATCA CCCCCAATGT GAACGGCACA TTGACTGACG AAGATGCAGG CGCGCAAGAA AATGCAAAAG GGTTGCCACA GACAGCTTCT ACACAGTGCT CCGGTGATCC CCAGTCGAGA GCGCCGGTGA TGTTGGACGC GAAAGGCAAA AGCAATACTC TACCAGTCCA AACGCAAATA TATCGCACGC GTATACCCGC GATCGCTTCA GAGGCTGACG TATTTGGCTC GCAACGTGAT ACAATCGACG CACGAGAGAA AGCACCGCGC CCTTTTGAAT ATGAAATTGG AATGGAAGAA GGGAGTGCTG CCTTTGCCTT GCCATCTCCA TTACTTTGTG AAGGAGCATC TAGACCAGGC GCATTTCCAA TGGGCCTTAC TCGTCCCTTA GATGACAACG AAGATTTAAG CGCACACTCG AATCTTTTGC AGTCTGAATC ATTTTTACAT ACGTCGTCCT TGTTGAGCAT GGATGAGTCC AGCGGGATAT TGGTGGAAGC AAGCTTGGTT CCTGATGATA GTTTTATTGA GACAAGTACA ACAGACAATA AACCAACTGC CCTTGTCCAG GCAAACCCTC TGCCAGACAC GGCTTTTTTC TTTCGCCCGA AGATAATTTG CACTGTTCTT GGCACACTCA TAATAATTAT TCTTGCGTTG GCGTTGGGTA TAGTCCTTTC AGCAAGATCC ATTGATGAAG GAGTCAGTGC GCTAAACGAG GGAAACTTAT CAGCATCATC AAGCGACGCA CCAACCTCAG CAAATACATT AATTGAGCTG TTTCCTATTG AAGCTTTGCC TACCTACACA CAGATGTCAC TGCAGGATCC CTTGTCTGCT CAGTCAAGAG CCGTAGCTTG GCTTAAGGAT GACCCGCTTT TGGCAAGCTA TTCCAATTCT CGTCGACTAC AGCGATTCGC ACTAGCAACG CTTTACTATT CGACGAGAGG AGAAATGTGG AGTACAAATG ATGGCTGGCT GTCGGAAACG GATGAGTGCA CCTGGTTTTC TGACCTCGGG AGTGCGTCTT CAATTTGCGA CGCAGGCATC TACAGGGCCA TTTCGTTGGT AAAGAACAAA CTGCGTGGGA CTATCCCGCA GGAGATGGAA TTGCTGGACT CACTTGTACA CTTGCGAATG AATTTCAACT TTGTTTTCGG CCCCTTGGTA CCTCAAATTG CAAACCTTAT CGCATTGGAA GACCTTCAAT TAGCAAATGC TGGTTTACAG GGTCCATTTC CAATGGAACT AGCCTCCCTT ACAAAATGTC GAAAAGGCTT TTTCCATTCC AACGATTTTA CAGGAGGGTT ACCTTCTGAT TTATTTGCGA GCTGGACTGC TGTCAAGGAT TTAGATTTTG GGCGAAATAT GTTTGAAGGC TCCATTCCAG TGGAAATTGG TGCCATGGTG AACATTGTTT CATTGGGATT TGATGACAAT GTATTTTTGA GTGGAGAGCT CCCCTCAGAG TTTGGGCTCT TAACTGCTTT GGAGTTTCTC TGGCTTCAAG GAAACTCGCT CACTGGAACA GTTCCTTCCA GCTTGGGAGC TCTCACTCGT TTACGAGAGT TGCAAATACA CAACAACTTT TTCACGGGGG CTTTGCCGGA AGAACTGTGT GAGCTGGTCA AGAAAAACAA CCTGATTGTA GTTGTGGACT GTTTTCAAGT CAACTGTGAT TGTGAGTGTG AATGTGTGGA AAGTTCACGA CCTCTTCCAT CAGGACTGTT GTCAATGCCG ACATTAAATC CAACTATGAT TCCATCAAGT GTTAACGTAA TCTCGCCAAC TATCACATCA GCCAATCGCC TCCAAGAACT TCGGGACAAT AGCTTACCTG AATTTACACG CTTAGCTCTT GGCGATCCAA CCAGCCCACA AGCGCGTGCT TTTTCTTGGT TAGAGCAAGA TCCCAATGTT GATGATTTCT CCGAAAAACG TTTATTTCAT CGTTTTGTAC TGGCAACCTT GTATGAATCA ACAAATGGAG AGGCTTGGAT CAGAAACGAC GGGTGGATGA CATACACTCC TGAATGTGAC TGGTATTTTG ACACAACATG GACTGCAAGT CCGACATGTC TGGGAGACAC ATTTGAGTAC CTGGTACTGG AAGACAACAA TTTACAAGGA TCTCTTCCTT TAGAATTAGG CTTGTTGACG GGATTGAAAG CTATTGTGCT GTCGCAGAAT TTTCTTTCCG GTGAAATTCC ATCAACTCTG GGATCAATTT CTGGCTTGAT AGAGCTTGAG CTGAGTGAAA ACAATTTGGA GTGGTTCATT CCTACAGAAT TGGGTCTTTT GACCTCACTG ACTGTACTTA ACTTGCAGTC TAACAATCTG AGCGGCTCAA TACCAAGAGA AATTGGCAAT ATGTTAAAGC TGGAGTACTT ATTCTTGGAC ACAAATATCT TGACGGGAAC ACTACCAATG GAACTTGGAA ATCTCGTCAA CCTACTTTCA ATATGGATCT TTCGCAATGA CTTAGATGGT AGCATTCCAT CTTCTCTTGC TGACATCTCT CGACTAGAAG ATTTGCAGAT CGACCGAAAC TTATTGAATT ACAGTCTTCC GTCACCATTA TGGAGGGCTC TGAGTCGGGC TGCATTTATT AATGTTGCTG ACAATCTGCT ATCAGGAACA ATTCCATCTC AAGTCGGTTT GCTACGTCAA GTAGTCATGA TCGACTTCTT TGATAATTTG TTTTCTGGCA CAATTCCAAC AGAGTTTGGC TTGCTCACCA ACTTGGAGGA GCTCAGTTTT GTGGACAATA TCTTCTCAGG GACAATACCT ACTGAGCTTG GTCTTCTTAG CAATATGAGG ACTTTATTTC TGCATGACAA TTACTTTCAT GGAAGTGTTC CCAGTGAACT TTGCAATTTA GTCCACTCAC AATCCTTGGA TCTGTCAGTT GATTGTAATA TGGTGATCTG TACATGTAAC TGTGAATGCG ACTTATCCTT CAGGCTTTGA GTTCTGAGGA TTCATCAAAG TCTGTTGGAA GTCAGTAGAG TTAGCAGCGC ATAATTTACT TTTAACTGGA TATGGCGAGA AACAGTTCTT T
|
Protein sequence | MIGMMDSEET ETAESETIVI NEAGSITPNV NGTLTDEDAG AQENAKGLPQ TASTQCSGDP QSRAPVMLDA KGKSNTLPVQ TQIYRTRIPA IASEADVFGS QRDTIDAREK APRPFEYEIG MEEGSAAFAL PSPLLCEGAS RPGAFPMGLT RPLDDNEDLS AHSNLLQSES FLHTSSLLSM DESSGILVEA SLVPDDSFIE TSTTDNKPTA LVQANPLPDT AFFFRPKIIC TVLGTLIIII LALALGIVLS ARSIDEGVSA LNEGNLSASS SDAPTSANTL IELFPIEALP TYTQMSLQDP LSAQSRAVAW LKDDPLLASY SNSRRLQRFA LATLYYSTRG EMWSTNDGWL SETDECTWFS DLGSASSICD AGIYRAISLV KNKLRGTIPQ EMELLDSLVH LRMNFNFVFG PLVPQIANLI ALEDLQLANA GLQGPFPMEL ASLTKCRKGF FHSNDFTGGL PSDLFASWTA VKDLDFGRNM FEGSIPVEIG AMVNIVSLGF DDNVFLSGEL PSEFGLLTAL EFLWLQGNSL TGTVPSSLGA LTRLRELQIH NNFFTGALPE ELCELVKKNN LIVVVDCFQV NCDCECECVE SSRPLPSGLL SMPTLNPTMI PSSVNVISPT ITSANRLQEL RDNSLPEFTR LALGDPTSPQ ARAFSWLEQD PNVDDFSEKR LFHRFVLATL YESTNGEAWI RNDGWMTYTP ECDWYFDTTW TASPTCLGDT FEYLVLEDNN LQGSLPLELG LLTGLKAIVL SQNFLSGEIP STLGSISGLI ELELSENNLE WFIPTELGLL TSLTVLNLQS NNLSGSIPRE IGNMLKLEYL FLDTNILTGT LPMELGNLVN LLSIWIFRND LDGSIPSSLA DISRLEDLQI DRNLLNYSLP SPLWRALSRA AFINVADNLL SGTIPSQVGL LRQVVMIDFF DNLFSGTIPT EFGLLTNLEE LSFVDNIFSG TIPTELGLLS NMRTLFLHDN YFHGSVPSEL CNLVHSQSLD LSVDCNMVIC TCNCECDLSF RL
|
| |