Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_18335 |
Symbol | |
ID | 7197229 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1159193 |
End bp | 1162459 |
Gene Length | 3267 bp |
Protein Length | 995 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177765 |
Protein GI | 219112027 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACCG CGAACTTTTT GGATATTTTG AAGCCTCCAT TGGATGATCG AGAATATGTT GCGTACACAT TAGAAAACGG ACTTCGCGTC TTGTTGTGCT CGGACGAGTC TTCGAACGAA GCAGCCGTAG CTATGGATGT GCATGTCGGT GCATGCTCCG ACCCGGCGGA AGTTCCAGGA ATGGCACATT TCAATGAGGT ATGAGTCGTA AAAGTAAATT GTCCTTTGCC ATAGTCTCCA TGTTGAGTAA TTGCTCACAG TTGTCGGTTT TACATAGCAC ATGCTGTTTC TCGGGACGAA GAAATATCCA AAGGAGGACT CCTTTGAAGC CTTTTTGGCT TCAAACGGTG GTTCTTCTAA CGCTTACACG GCAAGCGAGG ATACGGTATA CTTCTTTGAT ATGGCAGCGG AAGCCAATGC AAAATTCGCG GAAGGACTGT CTCGCTTCGG TGCTTTCTTT ACAGCTCCTT TGTTTACAGA AGGTGCAACG GGTCGAGAAC TCAACGCTAT TGAAAGCGAG AACGCGAAGA ATCTGCAGTC AGATACTTTT CGTATTTTCC AAATCGATAA ATCCCGAGCA AATCCAGACC ACCCTTACAG CAAATTTTTT ACTGGTAACA AAAAAACTTT GTTAGACGAT ACCAAGGCAA AGGGCCTAAG CCTTCGAGAG GAGCTCATCA AGTTTTACAA CAACTACTAT TCGGCCAACC AAATGACGTT AGCTATTGTT GCTCCGCAGT CCATCGAAGA CCTGAAAAAC ATGGTTACGG AAGCATTTTT GGATATTCCG AATCGAAATG TTGATACGCC TGAGTCCTCA TGGGCCGGCA TTCCTCCTTT CATAGACGAG AGTTCGATCC CATCTTTCAA AAACGCGATC GAGATAGTTC CTGTGCAGGA TCTTCGACAA ATTATGATTT CATGGCCAAT TGTGTATAGC TCAGAGGATC AAAGGCAGGA TGACTTACTA AATAAGCCGA CAACGTACAT CGCACATTTA CTTGGGCACG AAGGACCCCG CTCTTTGCTT TCCTACCTCA AAAGTAGGGG GTGGGCAAAC TCTGTTGGTT GTGCCAACAG CGAGGAACTT TCTGACTTCG AGGTTTTTGA GGTGGTAGTA GGACTTACGA CCCAAGGCTT GGCGCAAGTG GATGAGGTGG TAGAGTCAGT GTACGCCTAT ATCAACATGC TTCGTGACCG CAAGATTCCG AACTATGTGT TTGAGGAAGT CTTTCGGCTT GAAGAACTGC AGTGGCGATT TTTGACAAAG GGAAGCCCTC GGAGTTATGC TTCGTCCCTG TCTACTGCAA TGCAAAAGTA TCCGCCAGAA CTGTACGTTG CTGGACCGAG GCGACTAGCG TTGGATGAAT TTATCATCGA GAAAAGAATG AACGGGCTCG CTCGCTCCGA GTTTGTATCT AGGGAAGCGC TAGAGCGCTC CCGGAAGCAA GCAGAGCTCT TAGCTGACAA TCTGACTGTA GATAATGCGC TTCTAACCGT GATGAGCAAA GACTTTGACA ACAAAACGGA TCGCAAAGAA AAATGGTACG GGACGGACTA CCGGGTCCGC CCTCTATCCG TTGAAACCCT CAGCCGATGG AGACGTGGTA TACGAGCGGA GCAAATTAAG ATCGACTTTC CAAGACCCAA TCCGTTTATT CCTACCGAGC AAGGTTTGCG CGTTAAAATT TCACCGTCCG GCTCAATGAA GGCTGCGAAG AGGTCTTTTG AATCCAGAAT GATGCCCGTC CCCCCTCCGT CTCTGCTTCG AGATGATGGA CCGGACGGTC GATGGAAGGT TTACTTTAAG GCTGATGATC GTTTCGGGTT GCCAAAAGGT TATATTGTCT TTCAGGTAGT CACTGGTGAA GCGTTCGCTT CGCCTAGAAG TGCAGCCTTG TCGAATCTTT TTGAAGTCAG TATTGCGGAC AAAATAGGGG AATACGCATA CGATGGTACG TAAAAGCAGA CACGAAATAT GGTTACAAGC ACAGCGTTCT AGACTAATTG TGTGAGAAAA TCGTCTGTTG TATCAAGCCA GCCTTGCCGG CTTAACGTAC GATGTAAAAA TTATGCCAAG AGGAATTCGA TTGACTTTTG GGGGCTACAA CGACAAACTG AAACGCTTCG CTTCGTACAT TTCGTTGAAG CTGACGACCG AAATACGTGA TGTTCTTCCG ACGAGTGAGA GTGTGTTTGA TCGATACAAG GATCAAGTAA TGCGGGGATT GTCTGCATTT GATGTCAAGC AACCGTACTT TCATGCGTCT TATTATTCCC AGATTGCTCT TCAGCCGCCT CGGTTTCAGT ACGACAATAC CGCACTAAGG GAAGCTATTA GAGAAGTAAA TTTGAGTGAT TTGATTGAAT ACGTCAACAC TCTTTGGAAG TCGGGCCGCG GCGAGGCTCT TATACAAGGA AATTTTGATC AAAAAGAAGC CATGGAACTC GTCAAAAACA TTGGTGATGT CTTGCCGTTT CGACCGATTG TCCAGGAGGA ATACCCTTCA CGCCTGGAGG CACTGCCTTT GCCTGCTTAC GGCCCAAAGA AGCTGCCAAC CAAGCTAATC GTTGCCGAGC CAAACCCTGA CAACGAAAAC TCTGTTGCCA CAGTAATGCT ACAAAGTCTC GGCACGTCAG AGAAGGATCA CGTACTGATC GAATTGATCA GCTCCATTGT GCAGGAGCCG TTTTACAACG AACTCCGTAC AAAAAAGCAG CTCGGCTACA TTGTATCGTC AGGAATTCGT GCCGTGGGTA ACAGCCGAAC GCTCTCATTC ATAGTCCAGT CCAGCGTGGC GCCGGCAGAC AAGTTGTCCA TCGAAATTGT CAAGTTCTTG AATACAGTGG AAGATCGTTT TCTCAACAAG CTCCTTAAAG CTGACCTCGC CGTGTACGTC AAAAGCCTGA TTGATCGCAA AACGGAACCC GACAAGGAAC TCGCTACAGA AGTGACTCGC AATTGGGCGG AGATTGCGAG CGGACGATTT CAGTTTGATC GCATCCAAAG GGAAGCTGCC GCGCTGCTCG ATGTACAAAA GGAGGATTTG CTAGATTTTT GGAGACGAAT TTATACCGGG GACAATTGCC GTGTATTGGT GACACAGGTA GTTCCTCGCC AAGGGCCAGC GTCTTCGCCC GTCCCAGCCA AGAGCACGGG ATACAATGAC AAGGATCCGC TACCCGAAGG ACTAGTCCTC GGGATTGACG ACTTGGATCA ATTCCGCGCC GATAGGCAGA TGTCAACTTA ATGCTAGGTC TACGACT
|
Protein sequence | MATANFLDIL KPPLDDREYV AYTLENGLRV LLCSDESSNE AAVAMDVHVG ACSDPAEVPG MAHFNEHMLF LGTKKYPKED SFEAFLASNG GSSNAYTASE DTVYFFDMAA EANAKFAEGL SRFGAFFTAP LFTEGATGRE LNAIESENAK NLQSDTFRIF QIDKSRANPD HPYSKFFTGN KKTLLDDTKA KGLSLREELI KFYNNYYSAN QMTLAIVAPQ SIEDLKNMVT EAFLDIPNRN VDTPESSWAG IPPFIDESSI PSFKNAIEIV PVQDLRQIMI SWPIVYSSED QRQDDLLNKP TTYIAHLLGH EGPRSLLSYL KSRGWANSVG CANSEELSDF EVFEVVVGLT TQGLAQVDEV VESVYAYINM LRDRKIPNYV FEEVFRLEEL QWRFLTKGSP RSYASSLSTA MQKYPPELYV AGPRRLAEAL ERSRKQAELL ADNLTVDNAL LTVMSKDFDN KTDRKEKWYG TDYRVRPLSV ETLSRWRRGI RAEQIKIDFP RPNPFIPTEQ GLRRSFESRM MPVPPPSLLR DDGPDGRWKV YFKADDRFGL PKGYIVFQVV TGEAFASPRS AALSNLFEVS IADKIGEYAY DASLAGLTYD VKIMPRGIRL TFGGYNDKLK RFASYISLKL TTEIRDVLPT SESVFDRYKD QVMRGLSAFD VKQPYFHASY YSQIALQPPR FQYDNTALRE AIREVNLSDL IEYVNTLWKS GRGEALIQGN FDQKEAMELV KNIGDVLPFR PIVQEEYPSR LEALPLPAYG PKKLPTKLIV AEPNPDNENS VATVMLQSLG TSEKDHVLIE LISSIVQEPF YNELRTKKQL GYIVSSGIRA VGNSRTLSFI VQSSVAPADK LSIEIVKFLN TVEDRFLNKL LKADLAVYVK SLIDRKTEPD KELATEVTRN WAEIASGRFQ FDRIQREAAA LLDVQKEDLL DFWRRIYTGD NCRVLVTQVV PRQGPASSPV PAKSTGYNDK DPLPEGLVLG IDDLDQFRAD RQMST
|
| |