Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38095 |
Symbol | |
ID | 7203039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 61044 |
End bp | 63377 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182151 |
Protein GI | 219123686 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.167042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAGG TGGCAATGCC GTGGACGGGA GCTGCTGTTT CCAAAGAACT ATACGAATGG GAGAAGGAAT TTCTTGCTAA TTCTATCGAG ACATCCCCAC TTTCCGCAGC ATCCAGACGA ACCGGGAATT ATGGACTTAC TTCCAGTATC GGCAGTGTTG CCGAGGACGA TAAAGTATAC AAACGTGACT CGTCGGCCAT CGCATGGCTA ACATTATCTG ATCAAAGCCC GCGGCTGCGA TCCTTCTTCC GCACACTCGT GTTTGATCGG GAGACTTGCG AGTTCACACC ACTACAATCA CACTTCTGGT CCGGTGTACA AGGCCTACTC ATCGGTACTT TAACTTTCGT TTGGAAGAAC AGTATTGAGT TTGGAATCGA ATTTTTCTGG GTCATTTTGC CCAAGACGCT GCGCAATTGT GGAGTCTTCA CCGATGACAA CGGTTGGCTT CCGATATGGC ACTACACCTG GATATTTTCC GTGCTCACGG CTACCATTTT AGGCTACTTT GCTGACTTGT ACAAGGTTCC CGGCCAGGAC TCGTACAACG ACAGCGTCCA CCAAACTGGA TTGGTTGACT TTCGCACCGC CCTGAGCGTC TTGGTGTTGT CCACTTGCGG ATTGTGGTCC GGTTTTAGCT TGGGGCCCGA ACTTCCGTTG GTAATTCTGG GTGGTCAATT TGGATCATAC ATTGGCTATA CGCTCAACCA AAGTGTCCTC CATTGTCGCG TCATGACATT GGTAGGATCG TCCGCAGCCG TGGCTGGATT CTTCAAATTA CCACTTGCCG GAGCCTTTTT CGTGTTGGAG ATCCTGCACC GAGACGGGCT ACAATACTAT GAAGCCCTCC GTCCTGCATT GTTCGCTTCC GTTGTGGCAG TGGAAACCAA TCGATTTTTA GCCCATAGAA ACGAGCACGT CTTTTTCCAG TACCCCGGCA CGGAAGAAGA AATGCCAAAC TCCCTATATC TTGGTGTAAT ATTCCTCGCT CTTTTTGGAG CTCTAGCCGT TGGGATTCCG TACATTATTG GCGTGAATTT TTGCAAAAAG CTAATTGACT CAACCTACGA CTGGCTCGAA GACGAATTTG GACAAGACTC TGAAAGGAAG AGCTTGAATG AGCTGCGTCG GCTCAATAAT TTCAAAAGTA CAGAACCAGA ACCTGAAAGC GAGTATCTTT GCGGCGTATT CTCGACAGAA GCCTTGCAAG CAGCCGGGAA AGCGGGATTG GCAGGCCTGA TACTCGGGTG GATCGCAATC TACCTTCCTC ACACCATGTT CTGGGGCGAA GCTCAGCTGC AGACCATAAT TGACCGAGGA GCGACACCAC TTCCTATTGT TGGTAGCGAC TACCCCAGCG GTTTGGGCTC GCAGGGGTTC TGCATGACGG ACACCCAAAG CTCCAGTCCG GAACCACTCA GCTTGACCTG CTTGGCTACC ATTGGTGTCG TCAAAGTGGT CCACGTTGGG CTGAGCCTCG GCACACACAT TATCGGTGGA CACTTTTGGG CTCCCTTGTT GGTGGCAGCC CCCGCCGCGC ACTTGCTCAC GGACGCCATG GGCGGTGTGG CCCGGGCCTT GGGCCACGCG GGGGGTTTAG AAGCCTACAA AACGATTGTG ATTCTGGTGG TCATGGGTGG CGCGCACGTG ACGGTGTTTC GCGCATACAC AGCCATTGCG TTTATTCTGT TCTTGACGGT CGCGGGACAC TTGACGGAAC TCTTGACCTT TCTCATTACC GCCCTCTCGG CGGTACAAGT GCTGACTACC GCCTGGATGG AACGCTTCGT CATGTACCGA TCTCAAGGTC CACGTTGCGA CGTTGTGGCC GCTCCCGAGG TTGTCGAAAA GCCCGACCGC TTTGACGATT TCGACGACGA CGATGAGGAA AGCGGCAACA GTGCCGCGGA TGCGAGTGTG GAGTCGGAAA CGAACCCCGG AGACTATTTG CGGGTGGAAA AAGCTAAATT CTACGGGGGC ACCACGGACT CTGTGGAACA ATGCTTTGGC GAAAGTGATC GTAGTATTCC CACCCCCACA AAGAGTAGTG CCGGAAAAGG CAACACCCAC AAGGTACAGG GCAATCGACG TCAGTTGCGT CGAATGACGT CCAGTCAACT GGATTTGTTG GAACAGCCCC GTCAAGCCGC GCTTCGGAAA CAAAAGTCGT TGACACTGTC TGGTTCCGCT CGGAGCCTGT TATCGAGTGG AAGTTCAGGT CGCAGCGTCA GTACTACTAG TAGGAGTATC AGTAGCAGCA TCAGTAGTAG CAGCTGCCTG GTGACACAAT CACGAGACCA TTGCGGCGAC CCAACATGGA CCCTGTATGT TTAA
|
Protein sequence | MQKVAMPWTG AAVSKELYEW EKEFLANSIE TSPLSAASRR TGNYGLTSSI GSVAEDDKVY KRDSSAIAWL TLSDQSPRLR SFFRTLVFDR ETCEFTPLQS HFWSGVQGLL IGTLTFVWKN SIEFGIEFFW VILPKTLRNC GVFTDDNGWL PIWHYTWIFS VLTATILGYF ADLYKVPGQD SYNDSVHQTG LVDFRTALSV LVLSTCGLWS GFSLGPELPL VILGGQFGSY IGYTLNQSVL HCRVMTLVGS SAAVAGFFKL PLAGAFFVLE ILHRDGLQYY EALRPALFAS VVAVETNRFL AHRNEHVFFQ YPGTEEEMPN SLYLGVIFLA LFGALAVGIP YIIGVNFCKK LIDSTYDWLE DEFGQDSERK SLNELRRLNN FKSTEPEPES EYLCGVFSTE ALQAAGKAGL AGLILGWIAI YLPHTMFWGE AQLQTIIDRG ATPLPIVGSD YPSGLGSQGF CMTDTQSSSP EPLSLTCLAT IGVVKVVHVG LSLGTHIIGG HFWAPLLVAA PAAHLLTDAM GGVARALGHA GGLEAYKTIV ILVVMGGAHV TVFRAYTAIA FILFLTVAGH LTELLTFLIT ALSAVQVLTT AWMERFVMYR SQGPRCDVVA APEVVEKPDR FDDFDDDDEE SGNSAADASV ESETNPGDYL RVEKAKFYGG TTDSVEQCFG ESDRSIPTPT KSSAGKGNTH KVQGNRRQLR RMTSSQLDLL EQPRQAALRK QKSLTLSGSA RSLLSSGSSG RSVSTTSRSI SSSISSSSCL VTQSRDHCGD PTWTLYV
|
| |