Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50480 |
Symbol | |
ID | 7199320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 175795 |
End bp | 177441 |
Gene Length | 1647 bp |
Protein Length | 514 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185440 |
Protein GI | 219130580 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTAATAGTAA AGCCCAAGTA TGTTCGAGGA CACTAACTTG AAAGTATATG CGGTAGATAA ACGAAACGCA TTCTAGGAGA CCTTAGGGTA AAAGCAAGGC CAATGCTGCG CTTTGTCAGC CGGAGGACTG TGCTAACATC CTCTTGCAGG CTAGTAGAAA CACACGCATC GAGTTGTCCT GAGGTCTCTT CAAATCTACT GAATGTCCAC AACAGCAACG ATCGAGCGCA AAGTGGCATC ATCCGATCAT TCCACGCGTC GGCGAAGCAT GAGATTTTGC CCCTACTTGC GGTTGGTACT ATAGCGCTGG TTGGAAGATA CAGCTGGAAG GCACTCCGTC GAATGGACGA AGACTGGGAA GAATATGAGT ATCTCATGGA ACAGCACGAA AAGCAGAAAG CACGGGAAAA CCCCCATGAA CACGCATCTC GAACGATGGC TGTTGATGTT GGTACCGTCT ACACGAAGTT AGCCGTCAGC TATCCGAAAC CGGAAGTGAT AGTGACGCGC GAAGGAGATC GTTTCTTCTT TAACGGCATT TTCTATACTG ACGAATCCAT GGATAGAAGC TCATCTCTAA GCGGAAGGGC GGCTTTAGAA CGCTACTTCT ATAGTGAGAA CACTGTTGAG GATCGAGCGA ATTCAAAAAC AATTTGGAGC GTACTAAATC CCCCGAACGA CGGAGTCGTG GACGAAGAGG AAGCATCGAA GCTTATTAGC GATCTTTTGT CGCCCGCGAT TGCTGAAGCC ATGGATAGAC TTGATTATCA AAGCAAGGAA GACCTGCTGC GTACAGTTGT TACCCTACCA AACCAGCTTC TTGTTTCAGT TCCAGCTCTG ACGTCGTTTG TCAAGTCTCT GGACACGAAG ACTCGCACAA CCTTTTTGCC CAACCCCGTG GCGGCCGTTT GGGGCGCACA ATCTAAGAAT TTGCTGCCGG AAGATTACGA TGGAAAAACC AACAAGACCG TTGTCATTGA TGTTGGCGGT GAAACAACAC AGTTTTCTAT GGTGGAACGA AATTTGGTAA AGTACACGAT ATCAGTGCCT TGGGGTGGCG AATCCTTGGT CCGATTGGTT GTGGATCTGC TGAAGAAGGA GTCAAGCGTC CCTTTGCAGG ACGCCAGATC CTTATCGGCG CTGCAAGCCC AGGCGCGTGC CGCAGTAGCA GAGCTTACAT CACAAACGCG CGTTCCTGTC CATGTTCCGT ATCTGTTTCC AGATCCTGGT AAACATCATT TGGATGCGGC TTTGTCTCGA TATGTAGTGG AGCAGGCGGT CAACCACCAT ATTCGCAATA CACTGAGTGA AACGCTGCCG GGTGAGAACT ATCTTTCAAC CCATATGCCG CCGCCGACGA ATTTGGAAAG TTTGTTGATG TCGGTATTAA CGCAACTCTT GGAAGCCAGT AACGAAACTC CCATGAGCAT CGATTCTGTT CTGGTGGTGG GCGGGGCAAG CAAATTTCCA CTGGTGGACT CTTCCATCCG ATCTGCCTGT TTCGCTTTTA TGGGATCAGA CGCGAATACC AAGCTTGTCA TTCCAGAGGT CTCTTTACGG ACTGAATTAA CCGTATTAGG CTGTACGACG TTGTTGCCAT CGTACGACTA CGAGTTGGGG GAAGGTTTGA AACGCACGAA TAGCTGA
|
Protein sequence | MLRFVSRRTV LTSSCRLVET HASSCPEVSS NLLNVHNSND RAQSGIIRSF HASAKHEILP LLAVGTIALV GRYSWKALRR MDEDWEEYEY LMEQHEKQKA RENPHEHASR TMAVDVGTVY TKLAVSYPKP EVIVTREGDR FFFNGIFYTD ESMDRSSSLS GRAALERYFY SENTVEDRAN SKTIWSVLNP PNDGVVDEEE ASKLISDLLS PAIAEAMDRL DYQSKEDLLR TVVTLPNQLL VSVPALTSFV KSLDTKTRTT FLPNPVAAVW GAQSKNLLPE DYDGKTNKTV VIDVGGETTQ FSMVERNLVK YTISVPWGGE SLVRLVVDLL KKESSVPLQD ARSLSALQAQ ARAAVAELTS QTRVPVHVPY LFPDPGKHHL DAALSRYVVE QAVNHHIRNT LSETLPGENY LSTHMPPPTN LESLLMSVLT QLLEASNETP MSIDSVLVVG GASKFPLVDS SIRSACFAFM GSDANTKLVI PEVSLRTELT VLGCTTLLPS YDYELGEGLK RTNS
|
| |