Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49588 |
Symbol | |
ID | 7198246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 129515 |
End bp | 131851 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184401 |
Protein GI | 219128398 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAACT TCCGGTTGCA ACGAACAATC CTCAAGTCAA GGTACACGAA TGCTCGCGAG GTCGCCGTTA TTCTCTCCCT GCTCGTAGCT TTGCCGTACT GCATATTCCC GTTGACGGTC GTATCCGCAC AGCAAACGTA TACACCAGCA GGGATCCAAT CCGGTCCTGA TGTAGTGGGG GGTTCGTTCG CGAGTGCCGC AATCTTCAAC CCGGCCGACA ACACTGTGGT GGTGACGGGC ACCACGTACG GACGCTTTTG GCAGGATCGT GTCGACGAGG CTGATAGTTC TCCAGGATGT TTTCTTGTAA CGGCGCGCTT GCCCGAAGTG CACCACGGAG ATATTGCGTG GGTCGATTCG GGCAAAATAT CCACGATCGC TCACAAGGAA GGCTGCAGTA GTATTGCACT CCGTGGATCG AAGCTCTTTT TGTCGGGGCA CGCCGACACG GGTGGGGTTT TGGAAGAACT CCGCAACAGT GCCTCCATCG TTCCATCCAC GCAGTACGGG ATGCTGCTGG ATATGGACTA CGACGGTGGC AGTAGTCGGG TTGATATTGT GGGAGGTCGC GTTCTGCAGG ACAGTCCCGT CGTCTTTCCA GTGGCCGTGG TGGCATTGCC GGGAGAGTCG GAAGTGTACG TGGCGAGTAT GGCAACGGAT AGCACTGTCG ATCACGACTA CAGCAATGAA CAGCTCGACC CACAACGCTT CTTTCCCTAC GGCTCAAAGT TTCATATGAC GGTGTCCAGG CTCAGTCTTA ACGGTGCTCG CTTTCAAGAA GGCACGGTGC TCAATACTCT CGATGAGCAG TGGACTAAAC CATACTCGAC TAAAGAAGCC GGAGACGTAT ACGTTGCTGG TATGATCAAA CTTTCGAATG AGATTCTCGT TGTCGTCGGA TCGACAACCG GCTACGGCGG AGGGGTGGGC GGCCTTGTTG ATACGGGTCC CGATATGGAC GGCTTTGTTA CCAAATTGCA TCCCGTAACG GGAGGTGTAC CTGTTGAACC CGGTAATCCT AAACACCAAG GTACCGTGCG GATACAATCG GTCGATGGCA AGGATGACTT TGTGGCGGGA GTCTGTCACG ACGAATATCG TCACGATCCC GGGCACATTT ATATTGTTGG TTCCACTTCG GGTGATTTGG ACCGCTCTGG ATCCGCAGCG GCACCGCGCG GTTATCTAAT GAAACTGGAC TTCGAGTCCT TGCAGCCTGT ATGGACCAAA GTGCTGGCAG CAAATGCGGC CACGACAACC ATTGCAAACG CCACTAGCGG TGGACGGCCC TCCGTTCGGG GTGTCTCGTG CGTCGTGACT CCGGACGGTG AAGCCGTCTA CGCCGCTGGC GTTGTCGAAA ACAGTGGCGT CCTACCCCTT TCCGGTACCA TGACCTCGTT TGGGGGGAAA GACATCTATG TGGTCAAATA CGATACTGTG GACGGGAAAG AGGGATTTGT CCGCCAAATC GGATCGTCCG AAGACGATGA AATGGCCATG CGCGGGGGTT TACTGACGGA TTCTCAAGGA AACGCCGTTT TGGTTGGTCA TACGGCTGGT TCCTTGTACC GGACGCGTGA GGAAAGCGAA AAGGCAGGCG TATCGGATAT TTTCTTGCTT ACTATTTCTC GTTATGATGG AAAATACCAA TTTCCGATCG AACATCCGGA ATATGGACAG CAGGTTGAGG AGGACGTTGT GTCGGGCCCG GGGTCTGCGC CACAGACACC GGAAATGTCT GGTACCACGA TGAACGCCAA AGCACCCTCA ACGACAGGAA GCGACAGCAG CGGCGTATTG GGCTTGGAGA AATGGACTGG TTTGACTATC TTGTTGGTCG GGATTATTAC CTCGCTACTG TTGGTTGGCT TATTATTGGC GCGTCGCTCG AATCGCGAAG TCTATACGAG CCGCAATCAA GTGCTGGACT ACCTCCACCA GTTCGATGTC GAAGACGTTG ATCTCAAGCA TTCCGCCACG GGTGGATGGC ATTGTTCGTA CACCAACGGG TTGGCACACG GAATCAATAT TCGGGATGGA TCCTCCTCCA GCAACGACGA TATTGCGGCT GGTCCTTCTC AGGGTGTTGG ACTGCTGCCA GTACGAACAG GCGGATTCGA TCCGTTGACT ACTCCTTTGA ATTCGTCGAC ACTGCGCGAC TCTCTCTTTG TGAACGACGA TAGTGATGGT AGATTAGGTC GGCCAGACGG CCACCACTCG TTTGGAGATG ACAGTTCAGC GGATAGACGC CACACCGCAA TCCATGAATC CCGTCGTTCC AGCACGAGAA AAACACGTCG AAAGAAGTTG GATGCGTGGG GACGTGAAAT CGTTTGA
|
Protein sequence | MKNFRLQRTI LKSRYTNARE VAVILSLLVA LPYCIFPLTV VSAQQTYTPA GIQSGPDVVG GSFASAAIFN PADNTVVVTG TTYGRFWQDR VDEADSSPGC FLVTARLPEV HHGDIAWVDS GKISTIAHKE GCSSIALRGS KLFLSGHADT GGVLEELRNS ASIVPSTQYG MLLDMDYDGG SSRVDIVGGR VLQDSPVVFP VAVVALPGES EVYVASMATD STVDHDYSNE QLDPQRFFPY GSKFHMTVSR LSLNGARFQE GTVLNTLDEQ WTKPYSTKEA GDVYVAGMIK LSNEILVVVG STTGYGGGVG GLVDTGPDMD GFVTKLHPVT GGVPVEPGNP KHQGTVRIQS VDGKDDFVAG VCHDEYRHDP GHIYIVGSTS GDLDRSGSAA APRGYLMKLD FESLQPVWTK VLAANAATTT IANATSGGRP SVRGVSCVVT PDGEAVYAAG VVENSGVLPL SGTMTSFGGK DIYVVKYDTV DGKEGFVRQI GSSEDDEMAM RGGLLTDSQG NAVLVGHTAG SLYRTREESE KAGVSDIFLL TISRYDGKYQ FPIEHPEYGQ QVEEDVVSGP GSAPQTPEMS GTTMNAKAPS TTGSDSSGVL GLEKWTGLTI LLVGIITSLL LVGLLLARRS NREVYTSRNQ VLDYLHQFDV EDVDLKHSAT GGWHCSYTNG LAHGINIRDG SSSSNDDIAA GPSQGVGLLP VRTGGFDPLT TPLNSSTLRD SLFVNDDSDG RLGRPDGHHS FGDDSSADRR HTAIHESRRS STRKTRRKKL DAWGREIV
|
| |