Gene Information |
Locus tag | PHATRDRAFT_267 |
Symbol | |
ID | 7202966 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 157419 |
End bp | 160313 |
Gene Length | 2895 bp |
Protein Length | 952 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182336 |
Protein GI | 219124072 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
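The Gene Length value above can be reproduced from Start bp and End bp, assuming the coordinates are 1-based and inclusive. That convention is an assumption, though it agrees with the values listed. The sketch below uses a hypothetical helper name, `feature_length`, which is not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 157419
END_BP = 160313

gene_length = feature_length(START_BP, END_BP)

# Upper bound on the number of codons if the whole span were coding.
max_codons = gene_length // 3

print(gene_length, max_codons)
```

The codon upper bound for the full span is larger than the listed Protein Length of 952 aa, which is consistent with the gene being marked as spliced, since the span on the replicon would then include non-coding sequence.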
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0986316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGACCTTAG TCTTGGCCGA CGACGACACC TTTGTCAAAC CCGCACGCGA TCCTCGGCAG TACCGCGTCC TAAAGCTGGC CAACAACCTG CAAGTCTTGA TCGTCAGTGA TCAGCTCGCA TCCGGTGCCG TGGGTGTTGA AGCGGCATCC GTCCACGTCC AAGCCGGACA CTTTGACGAT ACCATTCCAG GCCTCGCACA CTTTCACGAA CACATGCTTT TCCTGGGGAC GGAAAAGTAT CCCGACGAGG ACGAGTACGA AACATTTCTG TCGCAATTCG GTGGGTTTTC CAACGCCTAT ACGGATATGG AAGATACCAA CTACTTCTTT TGCCTGACTA CGCCCAACAC CAATCCCAAC GTCACGTCCG ATGCTCTCTC CGGAGCGTTG GATCGCTTGG CCCAATTCTT TGTCGCCCCC CTATTCGATC CCGACGCAAC CGAGCGGGAA TGTAAGGCGA TCGATTCGGA ATACCGCAAC GGCAAGGCCA GTGACAACTG GCGTAACTAT CAACTCATCA AGAGTACTTG TAACGATACC CACCCGTTCG CCAAATTTGG TTGTGGTAAT TACGATACCC TCAAAACACA AGCTGGGTTG GAGCATCTGT TGGGAGAGCT GCAACGGTTT TGGGATCGCT ACTATCAAAC GTACAATTTA CGGTTAGCGG TAGTGGGACA CGCATCGCTG GATGCCTTAC AAGCGACCGT GGAAGAAACC TTTGGGACCC TGGCCTACAG TGAAGGAGCT CCCCGTCGTG TCAAACGTAG AGTGGGGAAT AAGGAGGATG GTCAGGATTT GTTTGTCCGT GAGAATGCCG TATATGGAGT TCCCGCCTAT GGTCCCGATC AACTTGGGGT ACTCCGTCGG ATCATTCCCT TTACCGAGTC TCGCACGATC AAGCTCCTAT TTGGTGCCCC GCCTTTGGAC GATCCGGCCG TTACAACTTC CAAACCGTAC CGCGTTTTGT CACACATTCT GGGACACGAA GCACCTGGTT CGTTGCACGC TGTCCTGAAC GATGCTGGCT ATCTGACCGG ACTTAGTTCC GGTATTGGTA TTGATACGTC CGATTTTGCA CTTTTTTCCC TGTCCATGTC GCTGACCCCA CTCGGCATGC GGAATTATCC CGAGGTCTTG GACTTGACTT TCCAATGGAT CGTACTGGTA CGATCGCGAT ACGAGAGTGA CCCACAATGG TTCGAAGCTC ATCACGAAGA GCTGCGTCAA ATCTCGGAAG TGAACTTTCG ATTCCGGGAA AATGGCGATC CTACTGACTT TTGCTCGAGT GCATCAGAGC TCTTGTTTGA CGAACAAATG GAGTACTCGC GTATTCTAAA GGGCGGTTCC GAAACCTCTC TACTCGATCC CGTCGTAACC AAAGCCTTTT TGGATCGATT TCGTCCAGAA AACGCAATGG TGCACATCGT CTCGTCCGAC CTGAAAACAA CGTCATCCGA CGACTCTAAC GGCTCGATTT GGGAAACAGA GCCATGGTAC GGTGCGCAGT TTCAGGCAGA GCGCTTGTCG AACGAACAGA TAGAAACGTG GGGAAGCTAC TCTCCCGAGA CGATTGATGC GCGGTTGGCG CTACCGGGTC TAAACAACTA TATCCCGACC GACTTTTCTT TACGATGTGA CGAAGAAGTC GACGCTAAGA AAGAAACCCT CACCAGCGAC GAAATTATGG TACCTCCCGT TTTAGTGCTG GATCGTCCAA ATTTACGATT GTGGCACAAA ATGGATCGTT ACTGGCGTGT CCCAAAAGCC TTCATTCGTG TTGCCATTCT TTCTCCCAAT 
GTGTATCGAT CACCACGATC AATGACATAC AATCGCATCT TTCAACGAGT TTTGAGTGAT GATCTCAATT CCTTTGTTTA CGATGCCTCC ATCGCAGGAT GTAATTACCG CGTCAGCTGT GCACCTAGTG GATACCGCAT TTCGGTTCGT GGGTACTCAG AGAAATTGCC CTTTTTGCTA GAAACGCTCA TGTCCCGGAT ATTGAGCCTA ATTCAAGAAA TGAAGGGTGG CGACCCAGAT TTGCGCAAGC GCTTTGCCAA GGCACAGGAA AGTCTATTGC GAGAAACGAA GAATTACCGC TTGGACACGC CGTACGAAGT TGCCAGTTAC AATTCGCGAT TGTTAATTGA AGAAAATGTT TGGTATTTGG ACAACTACGT CGACGAGATG GAAGGAGATG CTGCTTTGCA CGATCCCTTG ACCATGGAGG AATGCGCCCA AGTTGCCGAG GATTGTGTCA TGGGACGTTT GAAATGCGAG GCCCTATGTA TGGGGAATAT TGATCAGAAG CACGCACTGG GCATTTCCGA GGTTTTGGAC CGCGTCTTCT TGGACAAGTC ACGCACCATT TCAGAAGTCG AGACACCGCG TTTCCGATCG CTCAAGTTGC CGACGCGGGA TGAAGCCTCA CTAATTTTTG GTGACGCCGT GGTGAATCGG ACGTTGCCCA TGATCTATGC CGATCTCGCT CATAGCGCTT CGGAGGAAAA TAACGCGGTA GAAGTCATTC TACAAGCCGG TAGCGAGCTT GAACTGGGCT ACGAAGGTCT TGCCACTCTT GATTTGATCA CCCACATGGC TTACAATTCT GCCTTCAATC AATTGCGTAC CAAGGAACAG TTGGGTTATA CAGTGAGCGC GTTTCCGCGT AAGACTGCCG GTACCGCTTG GGGCCTGTCG GTTGTTGTCA TGGGCAGTGC CGCCCTCCCC GAGTACATGG AAGAACGATG TGAAGCTTGG TTGGTGCAGT TTCGTCGAGA GCTGGAAGCC ATGACGCCGG ATGCCATGGC GGTGGAAGCG TCGGCTATCG TCGCGCAACT GCTGGAGGAA GAAACCAAGC TGTCCCAAGA GGTTTCGCGA GTATGGGGAG AAATT
|
Protein sequence | ETLVLADDDT FVKPARDPRQ YRVLKLANNL QVLIVSDQLA SGAVGVEAAS VHVQAGHFDD TIPGLAHFHE HMLFLGTEKY PDEDEYETFL SQFGGFSNAY TDMEDTNYFF CLTTPNTNPN VTSDALSGAL DRLAQFFVAP LFDPDATERE CKAIDSEYRN GKASDNWRNY QLIKSTCNDT HPFAKFGCGN YDTLKTQAGL EHLLGELQRF WDRYYQTYNL RLAVVGHASL DALQATVEET FGTLAYSEGA PRRVKRRVGN KEDVPAYGPD QLGVLRRIIP FTESRTIKLL FGAPPLDDPA VTTSKPYRVL SHILGHEAPG SLHAVLNDAG YLTGLSSGIG IDTSDFALFS LSMSLTPLGM RNYPEVLDLT FQWIVLVRSR YESDPQWFEA HHEELRQISE VNFRFRENGD PTDFCSSASE LLFDEQMEYS RILKGGSETS LLDPVVTKAF LDRFRPENAM VHIVSSDLKT TSSDDSNGSI WETEPWYGAQ FQAERLSNEQ IETWGSYSPE TIDARLALPG LNNYIPTDFS LRCDEEVDAK KETLTSDEIM VPPVLVLDRP NLRLWHKMDR YWRVPKAFIR VAILSPNVYR SPRSMTYNRI FQRVLSDDLN SFVYDASIAG CNYRVSCAPS GYRISVRGYS EKLPFLLETL MSRILSLIQE MKGGDPDLRK RFAKAQESLL RETKNYRLDT PYEVASYNSR LLIEENVWYL DNYVDEMEGD AALHDPLTME ECAQVAEDCV MGRLKCEALC MGNIDQKHAL GISEVLDRVF LDKSRTISEV ETPRFRSLKL PTRDEASLIF GDAVVNRTLP MIYADLAHSA SEENNAVEVI LQAGSELELG YEGLATLDLI THMAYNSAFN QLRTKEQLGY TVSAFPRKTA GTAWGLSVVV MGSAALPEYM EERCEAWLVQ FRRELEAMTP DAMAVEASAI VAQLLEEETK LSQEVSRVWG EI
|
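The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be normalised and its GC fraction computed is shown below. The function names `clean_sequence` and `gc_fraction` are illustrative, and the sample string is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the 51% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence shown above (not the full gene).
first_blocks = "GAGACCTTAG TCTTGGCCGA CGACGACACC"

sample = clean_sequence(first_blocks)
sample_gc = gc_fraction(first_blocks)

print(len(sample), round(sample_gc, 2))
```

Applying `gc_fraction` to the complete gene sequence would be the way to check the GC content field of the record.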