Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49461 |
Symbol | |
ID | 7195816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 307416 |
End bp | 309258 |
Gene Length | 1843 bp |
Protein Length | 476 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184108 |
Protein GI | 219127784 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.660908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTGTG TGTGTGTGTG TGTGTGTGTG TGTTTCGTTG ACTGTAAATG CTGAGATAGT CACAAGATAC TACTATCAAG CACAGTTCAA GAAACAAGCA AACATACTTC AGTCAACATC CTTTCAACAG GCTACCGTGT CCTCTGTTCC TTGTATAACT CACAGTCAAG CGCGAGTCCG TTAAGAAAGA CCACGGCAGT TAAACACTAT GCTTTCGTTC GTTCCAGGCC CACACCGGCA GCGATTAGAG ATTCGTGATC CGCACAGAAT CACTCGTTGT GCGGCTCCCG GAGACGAAAT CCGGCAATCA TCGTTTGGAT CAACCGTACG CAAGGATGAG ACTTGTGGAG AAACCAAGGA AATGCCGCAT CAATTTTTAT CGGATACCGC CATTCCTCCG CCTGGTCCAA TCCTACACGG TACGGGTAGC GGCGGTGGTA CATCCGGTTT CCGTCGCACC AAGTCGGTCT TTTCCATGTC TTCGGTACAA TCTAGTGTGT CCGATTCTTC CGAGACTTCT TCTTCTCCCT TGCATCTGCA GCCACCTACC CAAATTCAAG AAGCCGATCG TTTTCAAATT CAAAACGTAC CGCACTCGTT CGGCCGAGAC GTTCGAGCCT ACACGGAAGA TTCTCCTTGG GCGCCGGTCT TTGATGCGGG GATGCGCAAG GTCCTGGAAC CGCATCTGCG TCGAGCCCGC CTAGATCGTG AACACGCTCG TCAACTCACG GTCCATTGCC TCGGGCAACC GGTGGAGAGC TCCATCGGCA GCTCCGCGCA CGTGCCGTAT ACCTCACTAC GATACGAACG GCGGTTCTTG TTTGACGAAC AAATGTATCC CTTGCATCTA ATTCTGGCGC AGACACTCAA CGTGCAAGAC CTGTCGCAGG TGCATCAAGT TGTAAATGCG GATCTGATGA AACCGCTTCT TTGTCGTGAC ACAAGAAGAG CCTTCCACGT TGCCTACGAC AACTTCATCA CGTCCTTTTG TTTACCCCTA CTACATGAAA TCGCCATGAT CAAGAACATT GTGCACAGTG CCTCGCATCG AGTCACATAT CGCTACCAAG CCTTTCCCAA TATTAGCATT GTTCGTCCCG GTGACGAAGC TACCCTACCT ACCTGCCAGA CAGCACAAGG CAAGAGTATC GGATGCCTGT ATTTTCACAT TCCGCTCACA CCGTCCCACG GAACCAATGC GTTGTACGCC GAATCGTACC CCGGAAAGGA AGATTGGCAC CCCCTACAAG CCAAGTCCTT CGGTCTCGGA TATCTGTTTG ATGGTGCCCG GTGCTTACAA TTCGGCCTCG AGAACACTAC AAAATCGTCA CGCGTGTCGC TCGATTTTTC CGTCGCCCTC TATTGCGAGA GCAGCGTGGC TAAACCAAAT CACTGCCGTC ATCAAAATCG TGCGGAAGAT CGTCACACTG TACTCTGTCC ACCTGAGCTC TTGCACGATG CGTACTCGCG GGGCGGACCA GGGTACTACG AGGAAGCCGT GATAGATTTG AGTCGTCCTA CCGCAACGTC ATCGGCTCGG CACAACGCTG GTACCAAGCG TTGCACCAGC ATGGTGCAAC GTAAACGTGG CTGCAGACTC ATGGAGCCCG ATGGACGCAT GGGAGCGCCA TTTGTGTAAA AGAAAATGCC ACATCCTAGT TTGGGTGACA TCGGACATGC TGGATGCAGC GCTAGCGCTA GCAATACTCG CGTCTGATGT TCCAGCTTAC CTACACATTC CAAGTTTTTT TTGTTTGGCT GGCATGTCCA AATCTCTACG CCAAGAGAAT CTTCATATTG CGTAGCTGTT ACAAGTCGCA AAGCTAGTGA AAGTGATACC TGC
|
Protein sequence | MLSFVPGPHR QRLEIRDPHR ITRCAAPGDE IRQSSFGSTV RKDETCGETK EMPHQFLSDT AIPPPGPILH GTGSGGGTSG FRRTKSVFSM SSVQSSVSDS SETSSSPLHL QPPTQIQEAD RFQIQNVPHS FGRDVRAYTE DSPWAPVFDA GMRKVLEPHL RRARLDREHA RQLTVHCLGQ PVESSIGSSA HVPYTSLRYE RRFLFDEQMY PLHLILAQTL NVQDLSQVHQ VVNADLMKPL LCRDTRRAFH VAYDNFITSF CLPLLHEIAM IKNIVHSASH RVTYRYQAFP NISIVRPGDE ATLPTCQTAQ GKSIGCLYFH IPLTPSHGTN ALYAESYPGK EDWHPLQAKS FGLGYLFDGA RCLQFGLENT TKSSRVSLDF SVALYCESSV AKPNHCRHQN RAEDRHTVLC PPELLHDAYS RGGPGYYEEA VIDLSRPTAT SSARHNAGTK RCTSMVQRKR GCRLMEPDGR MGAPFV
|
| |