Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42501 |
Symbol | |
ID | 7196682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 233806 |
End bp | 236993 |
Gene Length | 3188 bp |
Protein Length | 755 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176550 |
Protein GI | 219109591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAAAATTTC CTACGGCACC CAAATCCATC CTCTTTGAGT AAGCAGTAGC ACTCTCTCTT TTACCACTGC CTCTACCACT ACGGTCAGCG TTCTTACTAT CGCTTCCCAC ACCAATACCA TTATCAAGCA ACAACGGTAC ACACACAGCT AGAAGCAGCA GGATCATTCC AGGATCATGA CGAAACAGTC TTCAACACTC GCTCGTCTAT TGCTCTTACT CGCGCTGTGG TGCGATCCCA CGTACGGTCA GAACGACTGT GCTCTGTCCT TTTGGTTGTT CCCTTCACAG CAGGACTGTC GACTCCGCTC GGGCGATCCG TCCGCCATCG AAACCTTCGT TGCCGACGAC GTTTGTCGTG TGATGAGTAA CCCATCTCCA TTCTTGTTGG GGCGCTACAC GGCGAGTTGC GTCTCCTCGG ACACCGTGCG GATATCGCAG TCCGGCTGTA CCCGATCGGA CTGTTCGAGC ACGTCAGGTG GCAGCGTTTG CGATCGCGAC CTCACCAGCG TTTCCTCCTT TTACTCGCTT CTCAGTACGC CTGAGTACAA CGTTCAGGAC CCCGCGACAC AATCCGGAAC TTACCAGTGC TTTACGCTTC GCGGAGATTC GGAAGCCGTG ACCTTCGCCA TTTTTGGAGA TTGCGGCGCG TGTTTAGGGG ACGGAGGCGA ACCCATGGAA TCGTCACCAT CCCTCGCCCC CGTGGTCATG CCCGTAGTCA TGCCGACCAA TCCACCAACA CCGACAGGAC AAGCGTCGGT GGCCCCCGTA GGCATGCCGA CGCCTCGCCC GACGGCAGTT CCCGTAGCCA ACCCAGTGAG TGCCCCCATC AACACTCCCA GTTCAAGCCC TGTCGGGACC ATGCCAGATC CAACGATGGC ACCTTTCGGG ATGACCCCGC GACCGACGGC TTCACCGGTC GCATCACCCA CGCGAGCTCC GACGTTCGCC GAAACGGAAG AACCGACGGA AGATGACACA CCGGTCGGCA TGTCGATTCC GCCGACAGGT ACAACCCTAC CACCGACCGG ATCGACACCA CCGCCATCCG TTGGTGGGAT GAATACATCG GCACCTCCGA CCTCGAGTGT TTCGATCCCC GACACTCCTA CAGAGATTCC AAATATGACT GTTTCAACTC CACCGGTCGC GACTACTCCC ACTCCATTAC CTACCCCTCA AGGAACCAGC AGCGACTTGG TCGGACTTGT CATGCTGTTG ACCAATACGA ACGGCGTCCT GGGGGAAAGC TCCACGATAG CCTGGGAATC CGCCACTGCT GCTCACATCA CCAACACGCT TGCGAGAGAG GCCCCCCTCT CCAAGGCCAC GGTGGAGGTT GATGTTGTTA GCCAAACCCG AATGTCGCCC AACTCGTCGC TCAACGGTGT CCGTCGAGTC CAAGAAGTCG ACGAATTACG GCCACTTCGG GTGGCCTTTG ACGTCGTGGT CCGATTTCTC GCAACGACTT CGGCGCCCAA CGACGCTGAA GCCGCTACTT TGTTGGGGGA AGCCTTTAAT AGCCAAGAAG ATCGGGAGCT CTACATAAAT AGACTTAGGG CTACCGATAA TACTGCGTTT GAGTCTTTGG ACAGAATACA AGTTCTTGTG GATGGTTTCA CTCCCGCGGA GGAAGGATTT CAAGGAAGTA ATGACAGTGG TGGTTCAAGT ATGGGTATTA TCATTGGGGC GGCCGTCGGC GGTATCGCGA TTCTCGTGAT AGCGGCACTG GCAGCATTCT TCGTCTTTCG CAATCGCGAC AATCAGAACG ACCGCGTCAA TAAGAGACTC TCGGACGACG ACGAGCACAT GCAAAATGAC GCGCGAGAGC TTTACTCACC GCCCACAGAA TCGGCCTCGC CCTTAGGCGC TGCCAACGAA ATTTCCGTGG ACCGGCAAGA CGACGTCAGT ACACTAGGGG ATTCCATCTT GGCCGGCATG GCGGTATTGC AGGACGGCGA AGATGAAAGG ACGGCCAGCA TTGATGGTGG CTACGACTAC GCACAAGAGC AATTTCGTGG CGACGGACCC TTGTCGGTCT CACGCGGTGA AAACTCCACA ATGCTGTCGT CATACCCAAG CGTAGGACAA ATGGGAGGGC CTTCTCTTAT CGAGGACGAC GCCTCGTTTG AAGAACTGTA CGGGGACGTC GATGATACAT CGCACCCAGA TCGCTTCGAA GTCGATGTGC CACCCGGAAA ACTCGGCATG GTAGTGGATA CGCCCAATTA TGGAATCCCA CAGGTACACG CAATTAAAGA AGATAGTGTT TTGGTCGGGC GCGTAAAAGT CGGCGACCGA CTAATGCACG TGGACTTGAT CGATGTGACT CGAATGTCTG CCATTGAAGT GTCAAACTTG ATCCATAAAA AATCGAAGAG TGCCCGGGTG CTCGGGTTTG CCCGAAAAGC ATCGCCATCG AACGATCCGT ATGCGCTATC GTAGAACCTG AGGTTGCGCT ATGCCTGACT TGTTGACACG ATCTTTCTCG GTTCTCTTCA AGACAACAAC TTTGGATCTG GAATCCGTAC ACTGTTTTCT TCTGTTTTAC ACTATATGCC CCCGACTCCA ATAAATCCAT TCGCTTTCTA ACTTGTTCGC GCTGCTACTG GCGCGAAAGC GAACATCATC GTACAAAATA ATGTTAATTC ACACTTCTCC GCCACACATT CTTTATATCT GTCCTCTTGT GAATCCAATG CAGGGACAAG CCTTATGGTT CCAGGAAATA TACATCAAAG CCTAGTTACG GCACATTCCT CCGGACGACT GAGCTTTTCT TTGAACCCTG TAGTCTATAG AAATGATTTC GCTTGAGCCA GAAAATGGGA CAAGTTTTGT GAACGCAACA CGCGTTCGTT TGGAGGTACT CAATGCCGCG CATAGCAACC TGAAATCCTT TTCACTCGTC TTTCTTGCCT TCAATGGATG TCACCGTCGT TCTCTATAGT CGGAAAGTTG TAGGACTTAT GGCAACAATT TTATTGCAAG CACAGTTAGC GTCAGCTAGC GATGAAGAAG GCATCCCAAG AAGTCTTTTC CTTTGGACAA CAACTATCAA GAAAACAATG AACAATGCAG ATCAGAATCA TTAACTGTAA TAGCGATAAA CTTGAATTAG GCAACGTCGT CATTTGCTGT CCCTGGAATA AAAAAAAGGA ATTTGGATCG CCCAAATCAC AAATCGTA
|
Protein sequence | MTKQSSTLAR LLLLLALWCD PTYGQNDCAL SFWLFPSQQD CRLRSGDPSA IETFVADDVC RVMSNPSPFL LGRYTASCVS SDTVRISQSG CTRSDCSSTS GGSVCDRDLT SVSSFYSLLS TPEYNVQDPA TQSGTYQCFT LRGDSEAVTF AIFGDCGACL GDGGEPMESS PSLAPVVMPV VMPTNPPTPT GQASVAPVGM PTPRPTAVPV ANPVSAPINT PSSSPVGTMP DPTMAPFGMT PRPTASPVAS PTRAPTFAET EEPTEDDTPV GMSIPPTGTT LPPTGSTPPP SVGGMNTSAP PTSSVSIPDT PTEIPNMTVS TPPVATTPTP LPTPQGTSSD LVGLVMLLTN TNGVLGESST IAWESATAAH ITNTLAREAP LSKATVEVDV VSQTRMSPNS SLNGVRRVQE VDELRPLRVA FDVVVRFLAT TSAPNDAEAA TLLGEAFNSQ EDRELYINRL RATDNTAFES LDRIQVLVDG FTPAEEGFQG SNDSGGSSMG IIIGAAVGGI AILVIAALAA FFVFRNRDNQ NDRVNKRLSD DDEHMQNDAR ELYSPPTESA SPLGAANEIS VDRQDDVSTL GDSILAGMAV LQDGEDERTA SIDGGYDYAQ EQFRGDGPLS VSRGENSTML SSYPSVGQMG GPSLIEDDAS FEELYGDVDD TSHPDRFEVD VPPGKLGMVV DTPNYGIPQV HAIKEDSVLV GRVKVGDRLM HVDLIDVTRM SAIEVSNLIH KKSKSARVLG FARKASPSND PYALS
|
| |