Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37952 |
Symbol | |
ID | 7202692 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 499075 |
End bp | 501718 |
Gene Length | 2644 bp |
Protein Length | 775 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182077 |
Protein GI | 219123533 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTCG ACTCGCTAAT GTTTCCTCTG TCACTCGCAA TCTTGACTCA TAGCAAATCA TCCGCGCTTC CTCCTTGCGT CATCGCATCA AGGTCAACAT TGGTACCCTT CGACATCAAG GAACTAAAAT CGATCGAGGC AGGCATTACG CTTGCAAAGT TTCGTCTCCG ATTGTTTTCC TCATTTATTA CCAGATCTTC GTATCCAAAG TACACGAGCA AAAAATCAGA AGTACCAATT GCATACTACA CACTGCTTTT GCGCATCAAT ATGTACTTTT GTTCGAGCTG CTTCAAAAAT CCTTTTTGCC GCCATTACAC TCTTTTGTTG GAAACAACCA TCTGCATTTG CAACACGTTG CAATTTTTCT GATATTTCCG GACTCGAAAC TACCTACGAG TTTTCCTTCG CCAGCAAGCT TTCTGTCATC GGCTTTTCCG CATCCGTCAA CTACGTTTCG CTACCTCTTG GTACGAAAGA AATTGAGAAA GAAGGCAATT GTCACATCCA TTTTTCCCGG TCGCTCACAT TTTTTCCCTC TTTGTACCCA GATGCTTCAC CCTTCCTCAA AATTAGTGGA GCTAGTGAAC AGCCTTCTGT ATCAACCGAG AACGGAGTTG TGATCATAAC CCCAGACGAC TCATCCTGTA CAACACCTGA GAGTACAGCG GGTACTTCTT CGTCTTCTCG TATCATGCCT GGACTGTTTC TGACCTCGCT AATGGCTCCG CAAAAGGCCA AGGGATCCTT TGTGCTTCTT CTTTCATTGG CTGCTTCCCT AAATGGGGTC CATAGTCAAA CAAATGACGC ATGCACCCCT GCATTGGAGA TTGAAATTAG CCTTCCTACC GGAACAATCG TTTCTTCTGT CTTTGGTGAT ACAGACCATT ACCTAGCTGC AACGCTTGAA ACGGTAACAT GGGGCTACTA TGATATCAAC AAGCCATCGC AAATTTCAAT GGAATCCGGA GAAACGATCA CGGTTGAAGT AATTACTCAC CACTCTGGTC ATGACTATGC TAAGATGATC CGTGGTGATA TGGCTGTTGA GGAGATCTTT TACTGGGCCA CCAATACGTC ATTGAGCGAA AAACCCGAAC CAAAATTGGA TGGCACGGGT GTTCACTTGG TTACTGGACC AATTGAAGTG ATTGGTGCGG AGCCTGGTGA TGTGGTAGAA GTGGAAATTC TTGAATTGGA CCCTCGGTAC AACCCTATAT CGGGCAAGTG CTATGGCACC AACTCTCAAA AGTTTGCCGG CTATCATTAT AATGTATTGA CTGGATTTGG TCGTGATGGA ACACCTTACG TCCGTACCGG CGGTACTGAA GCCATTACTG TATTTGAGTT TGTGGAAACA TCTGAAGGTA AAATGGCCTA CGGAAAGCCT GTATATATGT ACCGCTTCCC AAACATGACT GCACCAGATG GATCAAACCG CACATTTGAC AACAATCCAG CAGTCATGAT CCCGCATGAA TTCAATTATG GTTACAATGG AGAACTACTA GAACTGGATC CAATTTTGTA TCCTGAAGGA TTTGATGGCA CCACGGTACG AGCTTTTGCC TCAATTGTTG GTGGAGGGGA TCCCATTCAT ACATCCACTA ACTTTGTTGT TTTTGTAGGT TACTGATGCT GGCGGTATCC AGTATCTTTC TCCAGAAGAA GCTGGTCTAG CCTGGAAAGT ACCGTTGCGC CCGCATATTG GTACATTGGC AGTTATGCCA AACAATACAG AGAACTACAT TGACGAAGAA GCCGAGGGTG GCGCAAACAC AATTCCTCCT GCACGTTTTG GCGGCAATAT TGATGACTGG AGAATTGGCA AAGGAGGAAC CATGTTTTAC AGAGTTGAAG TTCCTGGAGC CCAGATTGTT GTCGGTGACA CACATGCCGC TCAAGGCGAC TCAGAACTTG CCGGTACTGC AATGGAAACA TCCATGACTG CCAAGCTTCG TGTAACTTTA CACAAAGCAG GTAGTCTACC AACCAAGGTT GCAACACTAG ACTTTCCGCT CCTGGAGACA TTGGACAAAT TTGTGGTGCA CGGTTTTGCC TATGACAATT ATCTTGATCA GTTGGCAGAT CCCTCAGATA TCTTTTCGGA AGGTACCTCA TTGGACTTGG CAATGGCCGA CTGCTACATT AAAACCCGTA ATTGGATGAT GGATGTTTAT AGCTTGACTG AAGAGGAGAC AATTGCTCTC ATGACCACCT CAGTCGATTT TGGCATCACC CAGGTTGTTG ATGGAAACTG GGGAGTCCAT GCTGGTGCGT TATTCTCACT CGGTAATTCC GCTCAGATTA TTTTTTTGAC ATCTCATCGT CTTCTTATCA CTTTCCAGAT ATTGACAAGT GGGTTTTCGA CCAGACGGAC GCACCATACG ATTACCCCTG CACAACATCA AAGTCGGCTC GTCGCCGCCG TATCCTGAAG ATTGACGAGC GTCGGCTAAT CTTGGATTAT CATAATGTAA TGCTTTCCCC CAGCGAATAC GCTGACGAAC TGTTTCGCCG TGTTACCGGT ATTGAAGCCT CGTCGGAAAA AGTTGATACT TTTGCCCGCA ATCGTCTAGC GGAGCTTTTG ATGGAGTCCA AGCTCCAGTT TGCGAAGGCA CGCATGTCAA AGGGTATGGT CTGA
|
Protein sequence | MPFDSLMFPL SLAILTHSKS SALPPCVIAS RSTLVPFDIK ELKSIEAGIT LAKFRLRLFS SFITRSSYPK YTSKKSEFSF ASKLSVIGFS ASVNYVSLPL GTKEIEKEGN CHIHFSRSLT FFPSLYPDAS PFLKISGASE QPSVSTENGV VIITPDDSSC TTPESTAGTS SSSRIMPGLF LTSLMAPQKA KGSFVLLLSL AASLNGVHSQ TNDACTPALE IEISLPTGTI VSSVFGDTDH YLAATLETVT WGYYDINKPS QISMESGETI TVEVITHHSG HDYAKMIRGD MAVEEIFYWA TNTSLSEKPE PKLDGTGVHL VTGPIEVIGA EPGDVVEVEI LELDPRYNPI SGKCYGTNSQ KFAGYHYNVL TGFGRDGTPY VRTGGTEAIT VFEFVETSEG KMAYGKPVYM YRFPNMTAPD GSNRTFDNNP AVMIPHEFNY GYNGELLELD PILYPEGFDG TTVTDAGGIQ YLSPEEAGLA WKVPLRPHIG TLAVMPNNTE NYIDEEAEGG ANTIPPARFG GNIDDWRIGK GGTMFYRVEV PGAQIVVGDT HAAQGDSELA GTAMETSMTA KLRVTLHKAG SLPTKVATLD FPLLETLDKF VVHGFAYDNY LDQLADPSDI FSEGTSLDLA MADCYIKTRN WMMDVYSLTE EETIALMTTS VDFGITQVVD GNWGVHADID KWVFDQTDAP YDYPCTTSKS ARRRRILKID ERRLILDYHN VMLSPSEYAD ELFRRVTGIE ASSEKVDTFA RNRLAELLME SKLQFAKARM SKGMV
|
| |