Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50457 |
Symbol | |
ID | 7199264 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 123145 |
End bp | 126883 |
Gene Length | 3739 bp |
Protein Length | 1065 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185383 |
Protein GI | 219130461 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0351497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACGA ACACGGCGCC AGGTCGCGAA GAAAAGGATG ACCCGCAACG CATCGACGGT TCGGAATTTT TGGCCCAAGT ACGGGGAACG GGTGGAGGCA ATCCGTCACG GGATACAGCG ATCGCCACCG CTCACAGCGA AGAAACCTTG CGAGCCCGCA ACGGATCTTT GGGACAACGA CAGAATCTTG CCGCGTTGGC CGACCTTTTC GTCCAACCAA CATCGGTGTC TAGTCATCCT ACACTGGCTA GCTCGGATGG CGGCAGTACT TCCGCGACCT CGACAAGTAA CTCCCCACAG CCAGAGCAAG TGCATACTTT GCCGGCCAAT AATGGAGCCT CGCAGTTGCC AAATTCCTTG CCACCACGGC ATCCCCCACA TCACCAACGC AAGGTCTCAT GGGGGAGAGA TGTTCCGCAG GGAGAAGAGC AAACTCAACA AAGTCGTCAG AAACGCACGA GCAGTGGTGA TCTCGGTTTG AACTTGCCCG AGCTGGACTC GACGTCTTCA GCTCCACCGG TTCTCGAACA ACAGGTTGGA GACTCGTGGA TGCCTCCGCC TAGGCTTCGC CTGACCTCTT CCAATTCCGT CGGGGGTGCT CCACAGGGAG GCGATCGCCT TCGCCTGGAT CACATCCTCC ATGCCGGACC ATTTGAACAG GAAGCCGAAA CCCATATTCT CAAAGCTTTG GAGCAACAAG CCGCGGAGGA AGGGGCGATG CACCGCGACC GAGCGGACAC GGGAACCTCC ACGATTCTGA GCCACATTCC CGATTCCGTT CACGGTTTCC GTCTCGAATC GCCTAGGAAC TCTGTAGATT CTAACCAGGC GAACGCTACG ACGTCGGACA CCGGCGATCC GGTAACGTCG GAGTCACAAT CGGCGAGTCG CCAAGAGCAA ACACCCCTGG TACGAGAGCG TCCCGTCCCC CTACACAAAC GTAACCAGAG CGTGGAGCAG ACGTTGTTTG GTCTCACTAC TGCTTTGTCA GCCTTACACC ACGGAGAGCC CTTACCTCAT CAGGCACCGA CGCACAATCA GCGGCGACAG TCGCATCCTC CTCCTCCTGA TACGAACTCT TCTCCGGAAG AAGAACCTTT GGGGTCCGCG GATCAATTGG CCAACAACGC GGTACGTATT GCCAAAATGG AAGAAGAGGG GACGAAAAAG CAGAACAAAG GTGCAGAACG GTGGGCATCA ATTCAGCAAA ATTTGCCCAC TTTGTTGGAG GGAAAGCTGG AGAAGGTAGA GGCACCAGAA GGAGGCGACG TAGAAATAGG CAAGCAAGAA CCCAGTTCCA ATTCTGTGAC GAGCAGCAAT AGTGCATCAG ACGGAGTAAC GCAACGCCAT CGTTTCCGAA AAAATGGCCG TGGCGCTGCT GTCTTTGGAA ACGCCAACAC CCGTTTTAAA GAAGACGTAG AGCTGTGGCG CTCCTTTTTC CAACCCCGCC GAGGAGCCTT GAGATCTTAT GTAAAAAATG TGCTTCTCTA TTTGATATTG CCAGCGGCAG GTGTTGCAGC AATACTATAC TACTTGGTGG AGAATCCACC CACAGGAAAG TCAGAGGATG GCTCCAGCGG CGATCATGCT TCGGCGTCTT GGTGGATCTT ATTTATCTGC ATTCGCCAAA TTATTACTTT TTCGATGGCT CTCGGTGCCC AAACACTGAT TATTGACTAT CTCGCTCTGG GCAGCCGCAA TCTGCTCCGC CTCGCGGGTC CAGTTGTCAC CCTGCTCATC GTTCAAAGCA AGGGATGGCC CTTTTTGTTC TTCTCTTGGT CGTTGTTTGA CTTCGCGATG CTGTCTGGTA CGGGTCCTTT TGCTGCGCAC TGGGCCTTCT GGCAGAATAC TGTCGGGCTT TTCAACGAAA AGAATCCAAG CGGAAATGTG GTCTCAAATG AATGGAATAT TCGTTTGTTG GCTGTTGCGA TGTGCGTAAG CTTGGTTGTG GCCGTCAAAC GCTTTTTGAT AGGATTATAC CTGGGTCGAC AAACCTTTGC TCATTTCGGG AAGCCTCTGG CGAAAGTTAT GAAGAAAATG CTGCTCGTAG GTGAAGTGGC TGCACTGGCA AGGGATATCG AAAAAAAGGA ATCAAATCGA AACCAATCAA ATCGAGAAGG GAACTCACAG TTTGGTTATA GGCTTGATAG TCATGTTAGC TTGAGAGGCT TAGCATTGTC TTCCGACGAC GAAGATGGAT CTACCACCGA ATCGCCAACA TCGCGTAGAG GAAGTGTGGA CACTTTGCAA TCCGACAGGG TGATCAATAT GAACGATCGC GACCCGCTGA CGGGAAACCT GCAGTATTCC GAAAAGATGA AGTTGAGACA ACTGTTAGAG CGGTGGGAAG AACCGGATCG GGAATTTGAT CAGCAGAGGG TGAGTCCTTT TGTCGCTAGT AAGGAAGCTT CGATAGTTCA AAATTCTCAC GAAGATAAAC TTGCAATTCC ACAGGATCAA AAAGCCTCCA TTTCCGCTGT TTTAAACTTT CGGAACGCTC TCACCTTTAT TCAGACAACC TACCCGTTCT CTTTTGCGTT TGGCCCCGCA CATTATAGAG AGCAGTGTAT CGATTCTGCA CAAGAAGTCT ACATGCGCCT TTATAAGCAC AATGCGGAAG AGCTGGTCCT TCACTTTGAG ACTATTGCCC TTATTGCTCT TCGTGAGGAC GGCAGCGTTG ACCAGGACAA AGCGAAGGAT TTGGTTAAGC TCTTTCGTCC AGACCGCGAG GGAAATTTGA CTATGCTGGA TTTTGTAAAA AGCATCGATG CTGTTTACAA AGACTTTCGG CTTTTGAGTG CTTCGATCGA GAACTCTACA CAGATTGACC GTGCATTCGA AAACATTTTC AATATTGGAT TCTATGCCGT TGTGATCACG GTAACATTAT CACAGCTGGG ATTCGATCCT TTGGCCCTTT TTCTAAGCCT TTCCTCTGTC ATTCTCGCAT TCGCTTTTGC GATTGGCAGC GCATCTGCGA AATATTTTGA AGGGGTTCTG TTTATTCTTG TCCGCCGTCC TTATTCGATC GGTGACCGGG TGCATGTGTC GAACGTCGAA GCCGATACCA GCTTTGACGG ATCGCCTGGC TGGGTTGTGG AAAACGTAAC ACTTTTTGAA AGTAAGTTGC CGTCTGGGAT GGCCTGTTCA CATTTGGCTC CATTGGTCTG ACTAACGCCG TTGCGTTTCT TTGTTGATAG CCACGGTTAT TTGGGGGCCC ACCAACGAGC GTGCTAGCCT GTCCAACGGC TCGCTTGCTA ACAGTCGTAT TATCAACCTT GCACGATCGC CACAAGCCCA ATTATTTATC TATCTCAAGA TCCCAATTGA CACTTCCTAC GAAAAGATTC TGATTTTCAA ATCAGCCGTG GAGGAATACA TGAAGGCCCG TCCCCGGGAA TGGCTGGCAC TCAATGGTTT TCGGGCGAAT CGTATTGCTG CCGACTTGGG CTGGACGGAA TATCTTATTA TCATTCAGCA CCGTGAAAGT TGGCAAGAAG TGGGTCAGGT CCTCGATAGC AAGGCCAATT TGAGCAGTTA TTGTCAGGAA GTGGCCAAAC AGCTCAACAT TCACTACAAA GCGCCACCAC TACCGGTCAA TCTCAAGTAC GCACAAGCGG CCAGTCCAGA GGAAGTCCTG GAAAGCAGCG ATGCGATTGA CACCGATTTG GACAGTGTGG CTCGGACCCA AGAATTTCGG TCGATGGCCT TGAGCAAGCA CAACATTCGA TACAGTTAG
|
Protein sequence | MNTNTAPGRE EKDDPQRIDG SEFLAQVRGT GGGNPSRDTA IATAHSEETL RARNGSLGQR QNLAALADLF VQPTSVSSHP TLASSDGGST SATSTSNSPQ PEQVHTLPAN NGASQLPNSL PPRHPPHHQR KVSWGRDVPQ GEEQTQQSRQ KRTSSGDLGL NLPELDSTSS APPVLEQQVG DSWMPPPRLR LTSSNSVGGA PQGGDRLRLD HILHAGPFEQ EAETHILKAL EQQAAEEGAM HRDRADTGTS TILSHIPDSV HGFRLESPRN SVDSNQANAT TSDTGDPVTS ESQSASRQEQ TPLVRERPVP LHKRNQSVEQ TLFGLTTALS ALHHGEPLPH QAPTHNQRRQ SHPPPPDTNS SPEEEPLGSA DQLANNAVRI AKMEEEGTKK QNKGAERWAS IQQNLPTLLE GKLEKVEAPE GGDVEIGKQE PSSNSVTSSN SASDGVTQRH RFRKNGRGAA VFGNANTRFK EDVELWRSFF QPRRGALRSY VKNVLLYLIL PAAGVAAILY YLVENPPTGK SEDGSSGDHA SASWWILFIC IRQIITFSMA LGAQTLIIDY LALGSRNLLR LAGPVVTLLI VQSKGWPFLF FSWSLFDFAM LSGTGPFAAH WAFWQNTVGL FNEKNPSGNV VSNEWNIRLL AVAMVINMND RDPLTGNLQY SEKMKLRQLL ERWEEPDREF DQQRTTYPFS FAFGPAHYRE QCIDSAQEVY MRLYKHNAEE LVLHFETIAL IALREDGSVD QDKAKDLVKL FRPDREGNLT MLDFVKSIDA VYKDFRLLSA SIENSTQIDR AFENIFNIGF YAVVITVTLS QLGFDPLALF LSLSSVILAF AFAIGSASAK YFEGVLFILV RRPYSIGDRV HVSNVEADTS FDGSPGWVVE NVTLFETTVI WGPTNERASL SNGSLANSRI INLARSPQAQ LFIYLKIPID TSYEKILIFK SAVEEYMKAR PREWLALNGF RANRIAADLG WTEYLIIIQH RESWQEVGQV LDSKANLSSY CQEVAKQLNI HYKAPPLPVN LKYAQAASPE EVLESSDAID TDLDSVARTQ EFRSMALSKH NIRYS
|
| |