Gene Information |
Locus tag | PHATRDRAFT_40731 |
Symbol | |
ID | 7198529 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 257176 |
End bp | 258930 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184761 |
Protein GI | 219129154 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
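The Start bp, End bp, Gene Length, and Protein Length values above agree with one another if the coordinates are read as 1-based and inclusive and the last codon of the CDS is a stop codon. Both conventions are assumptions here, since the record does not state them. A minimal sketch of that arithmetic, with illustrative function names that are not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(257176, 258930)
print(length, protein_length(length))  # prints: 1755 584
```

The strand ("-") does not affect either value, because both lengths depend only on the span of the coordinates.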
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.247124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCCCA GTACTCGCAT CGATAGCGAG CACTCGACAC GAGACCGAGA TCCGTTAGCT TCGCCGCTAC CGTCATCTTC AGATGATCAC GGAAGTGCTT TGGACACGGA CGCTTCGCAA GTCTACAGTC TCTGGCTCGA TGGGGCCGAA GAGCATCTCA GAGCACCAAA TGGCAGCAAC ATGTCCGAAT CATTTGAGTC GAATCTGGCA ATAATTCAAC AAGCCAGCGG GGCATCCTTC ATGGGATCCG TCGCGAACTT GTGCAGCGCT ACGCTCGGCG CGGGAGTCTT GGCCTTGCCG TACGCTTTCT ATCAGGCAGG AATCGTGTTG GGTCTTTCTT TACTGTTGAC GTCGGCTGTT GCCACAGCTG TTTCCATCAA GCTTTTAGTC CAAGCTTCGG AACACTACCA ACTTTTCACC TATGAATTAT TGGTCGAAGC CTTGTTTGGC AAGCATTGGC GAGTTTGCGT TGAGGTGTCG ATTGTTGTTT TTTGCGGGGG ATGTGCAGTA GCATATGTGA TTGCTGTCGG CGATATCCTC GAACGATCCA ATCTCCTTTG GTATAACAGT CGAGCACTAT CCATGACTGC CGTATGGATG ACTGCAATGC TGCCTTTGAG TTTGCTGCGA CGTATGCAGT CCCTTCAATT TGCTAGTGGG GTAGGAATTG CTTCCATCGG GACTCTTGTC TTTGCAGCTT TCATACATTT GCTGGAAGGT AAAGGAGCCT CCACAAATGC GACAAATTAT ACACTGGCTG AATTTACCTT GCACCGAGCC TCCAATACGA TGAGTATGCA CAATGATTTT GGCGACTTTT TATGGCCCGC CCATGGGTCC GTTTCAGTCT TGACAGCCTG CCCTATTGTA CTATTTGCGT TTAGTTGCCA AGTCAACGTC TGTGCAATAT ACCAGGAGCT CGCTATTCCG CACATCCCCG ACACCAATCG ACACACTTTG CGACAGGACC GTATGCGCCT CGTCACATTG ACGGCAGTTG CTATTTGCGC GACACTTTAT TGCAGTATCT CGATTGTGGC ATTGGCCGAC TTTGGCAAGG ATGTGACTCC CAACATTCTT TCCAGCTACG AAATGCATGG CATTATGCAA GCAGCGGCAG CCTTCATGGG AGTCGCGGTC ACGTTTGCGT TTCCCCTTAA TGTCTTTCCA GCACGGGTTA CGCTCCAGGA CATTTTCTTT CCGAAAGTCT TATTGCACCC GCCTGTGAGA AACGAAACCT TGACAGCGGC ATTATTATTG GACCAAGATG AGGTCACTGA ACCTCGCCTT CCTATGAGTG CCGCTAGAGA TGTTCTCGTT GATGAAAGCG ACGAGAGAAC GCCACTACAA CCGCAGATAA ATTTTGCAAA TGGCGAAGGA GGGCTCAGGA ATGAAGATGG TGCTCAAGTT GACGCGATCG AGACGCCGTT GGATGAAGGC ATAGCGTCTC GACCGGCCGG AATTGAATCT GAATGGAACA TGCGACAGCA CGTTGGATTG ACGATAGGGA TTGCTGGCTC GGCATTGTGC CTAGCCCTTG TGGTGCCCGA CATTTCCGTC GTCTTTGGAG TTTTGGGAGG TACGGCTACC AGCATGCTTG GGTTTTGCGT ACCCGGCGCT CTGGGTGTGC GGCTGGGTCG GGACCTGGAC GATTGGTCCT TGTCAGTGCC TTCGTGGGTA CTGTTGATTG GAGGGGCTGT GTTTGGAACG GTGACGACAG CTGTAACGGT TTGGGACACT TTAGAAGCTC TGTAG
|
Protein sequence | MSPSTRIDSE HSTRDRDPLA SPLPSSSDDH GSALDTDASQ VYSLWLDGAE EHLRAPNGSN MSESFESNLA IIQQASGASF MGSVANLCSA TLGAGVLALP YAFYQAGIVL GLSLLLTSAV ATAVSIKLLV QASEHYQLFT YELLVEALFG KHWRVCVEVS IVVFCGGCAV AYVIAVGDIL ERSNLLWYNS RALSMTAVWM TAMLPLSLLR RMQSLQFASG VGIASIGTLV FAAFIHLLEG KGASTNATNY TLAEFTLHRA SNTMSMHNDF GDFLWPAHGS VSVLTACPIV LFAFSCQVNV CAIYQELAIP HIPDTNRHTL RQDRMRLVTL TAVAICATLY CSISIVALAD FGKDVTPNIL SSYEMHGIMQ AAAAFMGVAV TFAFPLNVFP ARVTLQDIFF PKVLLHPPVR NETLTAALLL DQDEVTEPRL PMSAARDVLV DESDERTPLQ PQINFANGEG GLRNEDGAQV DAIETPLDEG IASRPAGIES EWNMRQHVGL TIGIAGSALC LALVVPDISV VFGVLGGTAT SMLGFCVPGA LGVRLGRDLD DWSLSVPSWV LLIGGAVFGT VTTAVTVWDT LEAL
|
| |
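The gene sequence above is given in coding orientation (it starts with ATG and ends with TAG), so it can be translated directly, even though the gene lies on the minus strand. The Translation table field is empty in this record, so the sketch below assumes the standard genetic code (NCBI table 1). The helper name `translate` is illustrative. It is applied to the first 30 and last 15 bases of the gene sequence, with the display spaces removed:

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAGTCCCA GTACTCGCAT CGATAGCGAG"))  # prints: MSPSTRIDSE
print(translate("TTAGAAGCTCTGTAG"))                   # prints: LEAL
```

Both results match the first ten and last four residues of the protein sequence listed in the record.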