Gene Information |
Locus tag | PHATRDRAFT_42556 |
Symbol | |
ID | 7196260 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 409983 |
End bp | 412960 |
Gene Length | 2978 bp |
Protein Length | 964 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177084 |
Protein GI | 219110665 |
COG category | |
COG ID | |
TIGRFAM ID | |
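The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene span includes a single stop codon. Both are assumptions, not statements from the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 409_983
END_BP = 412_960
PROTEIN_LENGTH_AA = 964


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per residue, plus a stop codon if requested)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
non_coding_bp = gene_bp - cds_bp

print(gene_bp, cds_bp, non_coding_bp)  # 2978 2895 83
```

Under these assumptions the span reproduces the listed Gene Length of 2978 bp, and the 83 bp not accounted for by the 964 aa protein plus a stop codon would be consistent with the "Is gene spliced | Yes" entry.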
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.162025 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTTAC CGGAGCGCAA CAAAGAAGAG ACGGAGCGGA CGCCGCTCGT GTCGAGTCGG AACACAGTGG CCAGCGGTAC TACTCCCCGA TCGAACGTTC CCCAACACGG CCGATCGCGG TCCAACTCAT TTCGCGAACA TCACGGTGTC ATGCCTCCGA TTGCATCGAA TTACCCGTTG CCACAGAACA AACACCGAGT AACGGCTTCC CTGAATGCGG AAGGATTCGC CCCGAGAGCG GTACCGTCTT ATACGCCCGG AGTTCCCGTC ACCAACGCCA GTTCTTCCGC CTATACGCTG GGACGTCAAC CCAGTCTTCC ACTACCCCCG CGCCAACGAC GTCAACCCAG CCAGGATTCT ACCAATGGAC CCAAAAAGGG AGGTTTCTTT TACTACATTG TCTACGCCAT GGTCAACGTC ATTATCAGTG TCCCGGGTTT GTATGGCTAC GCCGCCGTCA TTTTCAATCA CGAAGCCTAT AATGATCATA TGAATGCTCT CAGCAAATTG GTAATATTTT CTTCGCTCAT GCATCAGCTG GGATTTTGCT TGTTCTCGTC CCTGCCGTTT GCGATCGGTA CCGTACAAGA CGCCGGGCTC ATCTTTCTCA GTGCCATGAG CAACATCGTG GCCAAAGAAA TTCTGGCAAA CGGGGGAACA GTGGAGGAAG TTGTGTCCAC GACCTTGGTT ATTCTGGCAA TGAGCACAGC ATGTCTAGGA CTCGCTCTGG TGGCCATGGG CAAATTCCGA TTGGCCGATA TTGTTTCCTA CTTGCCCATG CCCGTCGTAG GGGGCTATTT GGCCTTTATA GGCTATTTTT GCCTCCAAGC CGGGGTTGCC TTGTGTATTT CGGAACCCAT GGTCGGGCTG GCGGATTGGC GATATTTATT GGAGTCGCAA AATGTGCTTC TGGCTGCACC TGGGCTGGGT GCGGGGCTAG CGCTGACGTA CATTTCTCGC AAGGCCGAAA GCGATGCCGC CTTGCCAGCG GCAATGATTG TCATTCCGGT TTTGTTTTAC GCTCTCATCT TTGGGACGGG TATGGGCATT GAAGGTGCTC GCGAAGATGG GTGGGTTGGC CAAACAGCAC CACCCGTGCC AGTGCAGGAT CTTTTTCATT TGGTGGACTT TAAGCTTGTT CATTGGACCC TGATCTCCAA GTGTTTGGCA ACATGGTTGG GCATGGTTTT TGTCGTTTCA TTTGCGTCTT GCCTAGATGT GGCGGCTATT AGCATGGATA TGGGAGAAGC TTTGGATACA AATCGAGAAC TGGCCACGGT TGGGATTTGC AACGTCATGT CAGGCTTGAC GTTTGGCTTC ACGGGCTCGT ACATATTTTC ACAAACAATC TTTACCTATC GGACTGGCGT ACACTCGAAA TGGATCGGCG TCATTATCAT GATTGTTTTT CTGGCCGTTG TTCTCTCAAC CGTGAACATG TTACAGGTGG CACCTTTATT CTTCCTTGGA TCCACACTCA TTTTCATTGG ATACGATCTG CTTTACGAGT GGCTCTTTGA AATTCGACAC AAGATCTTTT TGAGCGAATA CGTTGTCTTA TGGCTGACTT TTTTAGCAAT TCAGGTCGTC GGTATCAATG CCGGTATTGT TTTTGGTGTA GTTGTGGCCA TGGTCGATCA TGTCATGACG ACGGCACGCG TTTCTGCTCT CAATCGAGTA CCGAAGCGAT CGCGTGCTGT TTGGTCTCCA GAACATTGGA AGATTTTACA GACGCACGGA TACCATTCAC AACATCCCAA AATTGTAACC TATGAGATTA TCGGCTCGGT CTTTTTTGGA 
ACAGGCCAGC AGCTGTTATC GACTATTTCG GAAGAGATTG GCATAGATGC AACATTGGAA GAGGTCACTG AAGAAGCTGC TATTATGAGC CCTCATCGAG CTGGATATCT GATGACAAAG TCTCCAGGAT CTGGTGGAGC TACAAAGAAA CCAAAAAGTC CCCTTATGAG ACCTCGTCCG CATTTCTTAG TTCTCGACCT TGCTCAAATG CCGAACCTCG ACGCCTCCGC CGCTCGAGGA TGCTTTCTTC AGTTGGCAAA GATGTGCTCG AAACGACACA TTCTTGTCTG TGCTGCTGGG CTGTGCCCAC GTGTGGACTG GATGCTCCGG GCTCACGATG TCGCATACGA TGAAATTGAA GGAGAAAGAA TCAAACAAGA CATGGAAGGT GGTATTTTGC CAACTGGAAC GTGCGACAAG ATACTGCCGT TCCTAACTAT TTATGAAGCT CTTGAGTTTT GTGAAAGTCA GTTGATCCAG CAGTTGGATC GCCTGAATCG GTCGCCATCC TTTATCGGCC TCAAGGACAT TGCTCCATCG ACAGTGCGCA GAAAAGGGAA AGCAACTCTT GCAGAGGTCT TCTCTTTCAT TTTGGGCTTG AGGGAAGAGG ACAAAAAGCT TCTTGATAGT CTTTCTGACG AGACTTACCA TCAGGAGATG GAATACAATG CAGGGGACTG TATGTGAGTA TGATTTCAGT TGACTGCACA ATTCTTACAT GGTCCCTTGC TTTCTCACTC ACTCTCTCTC CATCTATTTA CCTTAGTTTC CCCAAAGACA CTCACTCTGA CTCATTTGGA GTTGTGCTCA AAGGTGCTGT CGCAAACGTT CGAGAAGAAC TCAGCTCGCA TTTGACAACT CACATCGTAT CTGGAGCAGG AAAAGTGTCC TTGACAGGTA CCGGAAGGAG TACTTCAAAT CTCATGGATC AAGGTGATAT TGGACATGTT CGTTCCTTCT TGTCGGTCGG AGGAATTTTT GGGTTCGTTG ACTTCCTTTT GGAGCATCAC CGAAGTTTTC GTAGCGTCGC ATCTCGCGAC AAGACGGTTG TTGCGAAGAT AACACGAGCA GGGCTGGATC GACTGCAAGA AGAGCACCCT GAGGTTGTGC GAATTGTACA GAGCGTCCTG CTCCAGGCTA GCGCCATGGA GCTTTCGAAC TGCACGTGCA GTGACTAA
|
Protein sequence | MYLPERNKEE TERTPLVSSR NTVASGTTPR SNVPQHGRSR SNSFREHHGV MPPIASNYPL PQNKHRVTAS LNAEGFAPRA VPSYTPGVPV TNASSSAYTL GRQPSLPLPP RQRRQPSQDS TNGPKKGGFF YYIVYAMVNV IISVPGLYGY AAVIFNHEAY NDHMNALSKL VIFSSLMHQL GFCLFSSLPF AIGTVQDAGL IFLSAMSNIV AKEILANGGT VEEVVSTTLV ILAMSTACLG LALVAMGKFR LADIVSYLPM PVVGGYLAFI GYFCLQAGVA LCISEPMVGL ADWRYLLESQ NVLLAAPGLG AGLALTYISR KAESDAALPA AMIVIPVLFY ALIFGTGMGI EGAREDGWVG QTAPPVPVQD LFHLVDFKLV HWTLISKCLA TWLGMVFVVS FASCLDVAAI SMDMGEALDT NRELATVGIC NVMSGLTFGF TGSYIFSQTI FTYRTGVHSK WIGVIIMIVF LAVVLSTVNM LQVAPLFFLG STLIFIGYDL LYEWLFEIRH KIFLSEYVVL WLTFLAIQVV GINAGIVFGV VVAMVDHVMT TARVSALNRV PKRSRAVWSP EHWKILQTHG YHSQHPKIVT YEIIGSVFFG TGQQLLSTIS EEIGIDATLE EVTEEAAIMS PHRAGYLMTK SPGSGGATKK PKSPLMRPRP HFLVLDLAQM PNLDASAARG CFLQLAKMCS KRHILVCAAG LCPRVDWMLR AHDVAYDEIE GERIKQDMEG GILPTGTCDK ILPFLTIYEA LEFCESQLIQ QLDRLNRSPS FIGLKDIAPS TVRRKGKATL AEVFSFILGL REEDKKLLDS LSDETYHQEM EYNAGDCIFP KDTHSDSFGV VLKGAVANVR EELSSHLTTH IVSGAGKVSL TGTGRSTSNL MDQGDIGHVR SFLSVGGIFG FVDFLLEHHR SFRSVASRDK TVVAKITRAG LDRLQEEHPE VVRIVQSVLL QASAMELSNC TCSD
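The sequences in this record are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be joined into one string and its GC fraction computed is given below. The helper names are hypothetical, and the sample is only the first two blocks of the gene sequence, so the result does not by itself confirm the 50% GC content listed for the whole gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence from the Sequence table.
sample = "ATGTACTTAC CGGAGCGCAA"
sample_seq = join_blocks(sample)

print(len(sample_seq), gc_fraction(sample_seq))  # 20 0.5
```

Applying join_blocks to the full Gene sequence field should give a string whose length can be compared with the Gene Length entry.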