Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38462 |
Symbol | |
ID | 7203386 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 170723 |
End bp | 174107 |
Gene Length | 3385 bp |
Protein Length | 1089 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182628 |
Protein GI | 219124684 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACCGG CCACCAGGCA AATGACTAGC GGAGCTGCTT ACTCGCATTT TTTGGATAAT GTATTTTCAC TTCCTCAAGG GCACCCAATC CGACTTAGTT TCGAACAACA AGGGTATAAT TCTGTTGATG ATCTCCTCAG TATTTTTGAG AACGAACTAG ATGCCCTTGG ATATGTGCCT CCAGCGAGTC CTGACACAAA TGAAGACCCT CAGTGGACCC CATTGCTCAT GGCGCACCGA CAGATTCTTC GACATTTCCT GCGTTGGCAG GCATCACTTG AACGGCAAAA GGGAAGTCCT TTGGAAAATT CGGAGCTTGT TGCATTGACA AGTGGAGATT TCATTTTATA TCGACGCTCG GCACTCGGAC AAGTCTCTAA TGTTCCGGCC ACCATCAGTC CTTCTCTGAA CAACCAGTTA AGTACGTCCA CGAAAGCTCG ATCGGCAGTC GACGAATTCA AGCGAGGAGT CAAGCGTGAC AAGACCCACT ACCCTATCCT TAAGGATGAC CGATACTGGG ACAATTTCTA CCGGTCTTTC GTGGTCACCG CGGTATCCCA TAACGTCGAA AAGGTACTTG ACCCATCCTA TGCACCAACG GACCCCTCAG AGAAGTCTCT CTTTGAGGAA CAGAAGAAAT TTGTGTACTC TGCTTTGGAA CATACACTTC AGACTGATAT GGGGAAAAAC CTTGTTCGCG AACATAGTTT TGACTTCAAT GCCCAAGAAG TTTTCCGTAA GGTTGTCAAG CACTACACAG AGTCTGCCAG TGCCAAGATT GGGTCCTCCA ACACTTTGGC CTACCTCACT ACGGCAAAAT ATGGCACATC CTGGACAGGA ACGGCGGAAG GGTTCATCCT TCACTGGAAG AACCATCTTC GTATCTACAA TGATATGGTC CCTATGGCAG AGCAGTTGCC TAAACAGCTT TGCCTCAGTT TGCTTGAAAA CGCTGTACAC GACATCCCTG AACTCCGTCA GGTCAAGATC ACCGCTACTT TAGACTTAGC TAAAGGAGGC ACTCCCCTCA ACTACGAAGG TTACCTGAGT CTATTGCTTG CATCTGCTTC TCTATACGAT AAAGGGAACA ACCTTTCCAA TTCTCGTAAT GTCAAGAGCA AGCGTAGCGC CTTTCTGACC GACCTCTCGT ATGATCAACC TGACTTCACC GAAGACAATG GAATTGACTA TGATATCGAT CTCTCTCCTG CAGTGATCTA TGAGGCCAAT GCTCACAACC GCAAAGTCAG TCCATCTGGC CACCGTAATC GCGATCCGGC AACCAATCGA GAGCGTCCGT ATATCCCTCG CGAGATGTGG AATCAGCTTT CAGATGATGC CAAAGCCATT CTCCAAGGCC TGTCAGCACC CGACAAAGGC CCTACTCGAT CCGGCGATGT CTCGCAACGT GCGTTGGAAG CGAATACCCA CGCCAAGATA TCGAACGATA ATGGCGAGTT CAACCGTAGC GAACCAGACA ACCAGCAAGC TGAAGCATTC TATGACTGTG ATCAAACAAC GGAGCTCCTT GCACACTTGA CTGACCGTGT GAGTCACATG GGAGACGGCG ATATCCGAAA AGTTCTTGCT GCATCCCGCC GTACACCAAT CAATTGTACC CAGTCATCGG ACAATTGACA ACAGTCTGTT CAACTCAACG TTCTGGAATA TCAAGTCTCT CGTCATTCCG TTGAGAACAA AACTGCTGCT CTAGTCGATC GAGGTGCCAA CGGTGGACTT GCTGGCTGTG ATGTCAAAGT TGTGAACAAG ACAGGACGGT CTGCTAGTAT AACGGGTATC AACGAGCATA CCCTGTCAGA TTTGGATATT GTCACTGCCG CTGGGTTTGT TGAGTCTCAC AAAGGCCCTA TCATTGTGAT TATGCACCAA TACGCCTATC TTGGCAAGGG AAAGACCATC CACTCCAGTG CCCAACTTGA GCATTACCGA AACACAGTCG AAGACCGGTC CCGCAATGTT GGAGGACAAC AGCGGATTGT TACCTTGGAT GATTATATCA TTCCTCTTCA TGTTCGACAA GGCCTCCCAT ATATGGATAT GCGGCAGCCT ACCGATAGCG AGTTCGAATC TCTTCCGCAT GTTGTGTTGA CTTCCGATAT TGACTGGGAC CCTTCTATTC TAGACAATGA AGTTGACATG GTGAACGACT GGTACGATGC AATGCAAGAT CTTCCGGGCA ATGCCTATGT TGAACCACGA TTTGACAACA CAGGTCAATA CCTCCACCGC CATATTGCGT ACTACGATCT CGATCGCGAG GACGCTATTG ATTGCATTAT CCAGTGTTGT AAGCACAATG TCAAACGCAA TGAACGGGAC TATGAAGCAT TACGTCCCTG CTTGGGATGG GTATCCGGTG ACACTGTCCG AAAAACCATC ATGGCTACGA CACAGTACGC TCGCGAAGTC TACAATGCGC CGCTACGAAA GCACTTCAAA TCGCGATTCC CGGCTCTAAA TGTGCATCGG CGCAACGAGG CTGTTGCAAC GGATACTATC TGGTCAGACA CACCTGCTGT TGACAACGGA GCCAAGTTTG CACAACTGTT TGTGGGGAGA CGTTCCTTAG TCACCGATAT TTATCCCATG AAAACAGACA AGGAGTTTGT CAATGCCCTT GAGGACAATA TTCGCCATCG TGGAGCTATG GATAAACTTC TGAGTGATCG AGCCCAAGTC GAAATCAGTA AGAAGGTTGC TGATATTACA CGAGCCTACA ACATTGACCA ATGGCAAAGT GAACCTCATC ATCAACATCA AAATTTTGCC GAACGCCGTA TTGCTACTAT TGAAGCTAAT ACCAATAACG TTCTTAACAA AACCGGTGCT CCTGATTCCA CTTGGCTCTT GTGCATTGCC TACATCTGCT ATGTCTTCAA CCATTTGTCC CATGAATCTT TGCATGATCG TACACCGCTC GAGACTCTTC TTGGTAGCAC CCCTGATATC AGCGTACTTC TCCAGTTTCA TTTTTGGGAA CCGGTGTACT ACCGGATCGA AGATCCATCT TTCCCTTCCG ATGGTACCGA AAAGAGCGGT CGCTTTGTTG GCATTGCTGA ATCTGTTGGG GATGCTCTCA CTTACAAAAT CCTCACAGAC GACACCAACA AGATCTTATA CCGCTCTAGT GTGCGTTCCG CATTGAAATC CGGAGAAACC AACCTACGCC TTACGCCACA GGATGGGGAG AGTAATTCTA AGCCTATCAA CTTCGTCAAG TCGCGTAGAA CTGAAAACAA AAATTCCTAT GCCTTAAAGA ATCTACCCGG TTTCACCCCT GAGGACCTTA TTGGACGCAC GTTCCTAACC GATACTCAGG ATGATGGGGA GCGTTTTCGT GCACGTATCA CAAGGAAAAT CTTAG
|
Protein sequence | MVPATRQMTS GAAYSHFLDN VFSLPQGHPI RLSFEQQGYN SVDDLLSIFE NELDALGYVP PASPDTNEDP QWTPLLMAHR QILRHFLRWQ ASLERQKGSP LENSELVALT SGDFILYRRS ALGQVSNVPA TISPSLNNQL STSTKARSAV DEFKRGVKRD KTHYPILKDD RYWDNFYRSF VVTAVSHNVE KVLDPSYAPT DPSEKSLFEE QKKFVYSALE HTLQTDMGKN LVREHSFDFN AQEVFRKVVK HYTESASAKI GSSNTLAYLT TAKYGTSWTG TAEGFILHWK NHLRIYNDMV PMAEQLPKQL CLSLLENAVH DIPELRQVKI TATLDLAKGG TPLNYEGYLS LLLASASLYD KGNNLSNSRN VKSKRSAFLT DLSYDQPDFT EDNGIDYDID LSPAVIYEAN AHNRKVSPSG HRNRDPATNR ERPYIPREMW NQLSDDAKAI LQGLSAPDKG PTRSGDVSQR ALEANTHAKI SNDNGEFNRS EPDNQQAEAF YDCDQTTELL AHLTDRSVQL NVLEYQVSRH SVENKTAALV DRGANGGLAG CDVKVVNKTG RSASITGINE HTLSDLDIVT AAGFVESHKG PIIVIMHQYA YLGKGKTIHS SAQLEHYRNT VEDRSRNVGG QQRIVTLDDY IIPLHVRQGL PYMDMRQPTD SEFESLPHVV LTSDIDWDPS ILDNEVDMVN DWYDAMQDLP GNAYVEPRFD NTGQYLHRHI AYYDLDREDA IDCIIQCCKH NVKRNERDYE ALRPCLGWVS GDTVRKTIMA TTQYAREVYN APLRKHFKSR FPALNVHRRN EAVATDTIWS DTPAVDNGAK FAQLFVGRRS LVTDIYPMKT DKEFVNALED NIRHRGAMDK LLSDRAQVEI SKKVADITRA YNIDQWQSEP HHQHQNFAER RIATIEANTN NVLNKTGAPD STWLLCIAYI CYVFNHLSHE SLHDRTPLET LLGSTPDISV LLQFHFWEPV YYRIEDPSFP SDGTEKSGRF VGIAESVGDA LTYKILTDDT NKILYRSSVR SALKSGETNL RLTPQDGESN SKPINFVKSR RTENKNSYAL KNLPGFTPED LIGRTMMGSV FVHVSQGKS
|
| |