Gene Information |
Locus tag | PHATRDRAFT_47779 |
Symbol | |
ID | 7202941 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 7827 |
End bp | 9963 |
Gene Length | 2137 bp |
Protein Length | 622 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182142 |
Protein GI | 219123667 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
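The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The Python sketch below recomputes the gene length from the inclusive start and end positions and compares it with the coding length implied by the protein length. The variable names are illustrative, and the assumption that the coding sequence is one codon per amino acid plus a single stop codon is ours, not something the record states.

```python
# Values copied from the Gene Information table above.
start_bp = 7827
end_bp = 9963
protein_length_aa = 622

# Coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per amino acid plus a single stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that do not encode protein
# (introns and/or untranslated sequence; the table does not say which).
noncoding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, noncoding_bp)
```

Under these assumptions the span works out to 2137 bp, matching the listed gene length, and the 268 bp not accounted for by coding sequence is consistent with the gene being marked as spliced.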
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGCTTTTTC GAAAAGTCAC GACCGCGATA AACACCACCA CCACCATGAA GACTTCCTTT GTCTTTCTTA CTCTCTTTTC CCCTTTGCTC GTGCTTGCGG CTGTAGAGGC GCAGTCTGCT GACGTCTCGG AACAGCCTGA CAAGTTGGTG AGTATATCTG TACTGTAGGC ACTCAATCCA AGCCTATATT TGTCCCCTTT TCCGAAGCTT GGCAACAATT TCCTCTTGGT AAAACTCTCA TCCTATCTCT CTCTCTTGTG CCTCTTCCAT TTCAATAGAA CAATGGTGTC CCATCCATCC TTGAGGTCAC TGGTCGCCGC CTCAAGAAGG GAGGCTCCAG CTCGTATGGC AAAGGCGGCA CAGCTTCGTC CAGCGCGGAC AGCAAAGGCA AGGGCGGCAA AGGTGCGTCC GGCTATACGA AAAGCACCAA GTCCTCCAAG TCTCGCAAAT CGGCAAAGGA TCCCAAATCA GAAAAGCACC CCAAGGCTAC CAACGCACCT GTACCCGTCG CAGCTCCCGT AGTCGCCCCT GTTCCACCTC CCACCGTCGC TCCCGTCCCA TCTCCCATTG TTCCTTTTAT GGAGGTTGTG GATGTGGGAT ACAGCATCTC GGTGGAAGCC AACATGGGGA CTGATATGAT GGAGTTCTTT ACAGAGCTTG AAACAGGACT TGTTGCCGCC ATGGATATCT TGGCGCCCGA AATTGTCGCG GAGATTCTTG CCCCCGGAGG ACGATACTTG AAGGCTAGTG TCCTGGCCAT TCACAATCAG CGCCGTCTGA CTGCCGTGGT GCAGTTGCCG ACCTCCATTG AATCCTTAAT GATGATGGGT AAGTGGAGGT TTGGATGAAG CATGAGATGT GCCGTTGCTA GACGTGCTAT GGTTCTAATG GTTGTTCTGG CTTTTCATTT TGTTTGTCAG AGTGCGGCAA TTCTGTGATT GGCGATGATT TACTTTGCTT TGACATTGTT CATAGCATTG CGCTAGAAAT CAGCGGTGAC ACTGTCCCAG AGGAATTTAC GGGGCTCTTT GAAGAAGACA TGGACACTGC AATTAAAGAA GATCTGCTAC GAGACAAATT GCTTGAGACC AATCCTAACT CTACAGTGAT TGTCTTTCCC CCTCCTGTCC AGCCAACTCC CGAAGTGTCC AGCCCCCCGA CTTTGGCTCC GACTACGAAG CCCGCCGGCG CTCCGCCAGG AATGCCAGTC AATGACCCCA ATTTGGTGCC GGCTCCATCT CCAACTGCAG CCCCAACTTT GATGCCGTCT ACGATGCCCT TGGCAGCACC CCCCACTGGA ACCGACTGGA CAATCAGACC CATTACGGCC GACGATTATT GGCAAAGCGT CACGTACGCG AATAACATGT TTGTCGCAGT CTCATTCGAC CGGGTCATGA CCAGTCCCGA CGGCATAAAT TGGACGCCTC GTACCGCTGC GGAAACCGGC GAATGGCGGG ACGTCACTTT TGGAAAAGGA ATGTTTGTTG CAGTGGGATG GAGTATCGTC ATGTACAGTT CGAATGGCGT GGATTGGACC AGTGCAACCA AGGTGCCCGC CCAAGAATGG CTTTCCGTCA CCTACGGAAA CGGATTGTTT GTCGCCGTGC CCTATGGTGG CAATCGTATC ATGACCAGTT CGGACGGCAT GGCTTGGACG AGTCGTACCA GTGCGGCCGA CGCTTTCTGG CAGGGTGTCG CGTACGGGAA TAATATGTTT GTGGCAGTTT CTCAGAATGA TGGAACGGTC ATGACCAGTC CAAACGGCAT CAACTGGACG GCTGGTACCG CTGCGGCAGC 
GAATAATTGG ATCAGCGTCA CGTACGGCAA AGGGAAGTTT GTCGCAGTGT CTTGTTTTGG CAGCGGCAAA AGTGTCATGA CAAGTGCAGA TGGCATAACT TGGACTGGCC ACACCGGTGT GCCAAACACT TGTTGGTGGC GTGTGACGTT TGGTGGCAAC ACGTTCGTTG CCGTGGCGTC TCGTGGCGAC GGCAGTCGGA TCATGACTAG TTCGGATGGC GAAACCTGGT CCTCGCAGAT CACTCCCGAA GCGAACACCT GGAGGGGCGT GACTTTTGGC GCAAATACAT TTGTGGCGGT AGCGGATAGT GGAACCAATC GAGTCATGAC CGGGTAG
|
Protein sequence | MKTSFVFLTL FSPLLVLAAV EAQSADVSEQ PDKLNNGVPS ILEVTGRRLK KGGSSSYGKG GTASSSADSK GKGGKGASGY TKSTKSSKSR KSAKDPKSEK HPKATNAPVP VAAPVVAPVP PPTVAPVPSP IVPFMEVVDV GYSISVEANM GTDMMEFFTE LETGLVAAMD ILAPEIVAEI LAPGGRYLKA SVLAIHNQRR LTAVVQLPTS IESLMMMECG NSVIGDDLLC FDIVHSIALE ISGDTVPEEF TGLFEEDMDT AIKEDLLRDK LLETNPNSTV IVFPPPVQPT PEVSSPPTLA PTTKPAGAPP GMPVNDPNLV PAPSPTAAPT LMPSTMPLAA PPTGTDWTIR PITADDYWQS VTYANNMFVA VSFDRVMTSP DGINWTPRTA AETGEWRDVT FGKGMFVAVG WSIVMYSSNG VDWTSATKVP AQEWLSVTYG NGLFVAVPYG GNRIMTSSDG MAWTSRTSAA DAFWQGVAYG NNMFVAVSQN DGTVMTSPNG INWTAGTAAA ANNWISVTYG KGKFVAVSCF GSGKSVMTSA DGITWTGHTG VPNTCWWRVT FGGNTFVAVA SRGDGSRIMT SSDGETWSSQ ITPEANTWRG VTFGANTFVA VADSGTNRVM TG
|
| |
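The GC content field can in principle be recomputed from the gene sequence listed above. The following is a minimal Python sketch, assuming GC content is simply the fraction of G and C bases over the whole sequence. The function name `gc_content` is hypothetical, and how the database itself derives the 53% figure is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    # The record prints bases in space-separated blocks of ten; drop the spaces.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above, used as a small demo.
demo = "CAGCTTTTTC GAAAAGTCAC"
print(f"{gc_content(demo):.0%}")  # prints 40%
```

Passing the full gene sequence instead of the 20-base demo gives a figure that can be compared with the GC content field, provided the database computes it over the same span.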