Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_974 |
Symbol | |
ID | 7196285 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 542625 |
End bp | 545069 |
Gene Length | 2445 bp |
Protein Length | 603 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176605 |
Protein GI | 219109702 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000875731 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCAGGGCTTG GAAACCTCGG CAATACGTGC TTTATGAATT CCACATTGCA ATGTCTGGCT CATACGCATC CCCTGAAGCG CTATTTCTTG TCAGGAGAAT ATGGAGATGA CTTGAATCGT AACAATCCGT TGGGAACGGG TGGGGACTTG GCGACTCAGT TTGCGCAATT GATGATGGAC ATCTGGATTA CAACGGCTCC TCCGCGCAAC TTTCTTTCGG AGGCCTCTAC CGGTTATGAT TCTTCCTCAT CACTAGCTAA CAACGTGGTA TATCCGCGAA ACTTCAAATA TACGCTTGGT AAGCATGCGG AGCGGTTCGT AGGGTATGAC CAACACGATT CGCAAGAGCT CGCAACCTAC CTGCTAGATG CTCTGCATGA AGACACAAAT CTTGTAACAA AGAAGCCATA TGTGGAAAAG CCGGAGCAGA AGGACACTGA AAGTGACCAG GAGGCCGCAG ATAATGCATG GGAACTGCAT TTGAAACGAG AAAATAGCAA GGTTCTTGAA AATTTTATGG GACAGATCAA GAGCCGAGTA CAATGCTGCA AGGCAAACTG CAATCGGGTA TCGACCACCT TTGACCCATT TATGTATTTA TCCGTACCGA TTCCGGGATC AATGGAACGC ATACTCAAGG TTACATATGT CCCTATGAAC CCTCAGCAGC GTATGCAGAA GCTTGAGATC ACAATCAGCA AAATGGCGAC GATGAAAGAG CTCGTCGAAA GAGTATCAGA GCAATTGCAG AAGTTTGGTC TTCTAAAGAC CATAGACGAC CTCGCATTGG AGGACACGTG TGCTGTGGAC ATCTGGCATC AAGAGGTTTA CAGCTGGTAC GAGCCTAAAA ATGAAGTGGA TCGTATCAGA GATAACGATT ATACGTTCTT GTACGAATTG GCGCCTCTGG CAATGGTCAA GGCGATGGGA CAGACTGTGC TAAATGGAAT TGAAACTGAC GACGATGTTG GTCTTGGAAC AATTACGAAG CATTACCAGC TCGATTTGGC CACAATGACT AAGTTGAATA GCGGCGATGA TTGGTCAGCA AAGCTGTCGA CATACCTTCA GAACCCCACC ATGTTGCATA ATTCGTTTGA CCCAAAAAGT GGATTATGTA GCGAACGTCG TCTGATTTTT CTGCGTTTAA TGAATTTCAT TGATTGTTGC TATCGAGAAG TTGACGATGA CATATCAGGT CAAAAGAGGT CGCGCGACAA CTGCTCTGAA TTGGTGAATA CTTCGACGAA CGAAGAGGAT GTGCTTGATA TCATTGAGCG ATGCGAATCC TCTGCTTTCC TTGAGAACGT CAGATCAAGG CATGATTTAG CAATTTTGGA GTTTATAGCG TCAAAAATGA GAAAAGAAAT TGTCAGTCTT GAGAATCAAC AAACAACTAT GTACCCTGAT GGAGTGATAA TTCAAATCCA TATTCGGAAA GCTACCTTGT TATCTGGTAA CCATCGCACG TCTGATACTG TCCCGCTAAT TATGCGGGTT TCTGGTAGCA TGACTGTATA TGAGCTTAGA GAAGAGCTTG CAAGACGCCT TAAGAGATGC TTAAATACAG GACGTGGGAA TGTGTCAGGG GAATCAACGT CTGAGCAGGA CGAGGCCGAG ATGAGCAAAC ACGCTGGCAA CGGTAGCTTT GCATCGCCAG AGCTTTTGAT AATAAGGCAG ATTCCGCTTT CATACCAACG AAAGAGTCTG AGTAATTACC GATCGACGGC AACTTTAGCA GCCAGTAGAC AACTGGGGTC ACTGGAACTT GTCAGCAACA AAAATAGGCT GTTAAGGCCA CGGTCTCTGG CATCGAAGTC GAATGAAGAT GAGCGCTCAA TTTTAGCAAA TCTAGTTGGG CACCATGGGA CGGTTTTTAT GGAATGGCCA GAGACTCTTT ACAACAATGC TTTTGATCCA AAAGAATATG AGCATTTTGA TTGCCCAAGT GATCTGTACA GCGATGAATG TGTAGCAAGG AAGTCTTCGA ATGCAACGAC AGTATTGGAT TGTATCGAAA AATATTGCCA GATGGAGCAG CTGGAGGATA CTGAGATGTG GTACTGCAAC AAATGCAAAG ATCATGTCCG AGCGTGGAAA CAGACTCACC TCTACCGCTG TCCTCCAATA TTGATAGTGC ACTTAAAACG CTTTCAGTAC TCGTCATCAA CGCACCGCCG AGACAAAATT GGGCTATTTA TTGATTTCCC TCTCGAAGGA CTTGATCTTA CGAGCGTTGC ACTTCATTGG ACCGGAGCAG AGAAACCTCT TTACGATTGC TACGCCGTAA GCAATCACTA CGGGGGGCTT GGTGGTGGAC ACTATACTGC TTATGCATTA AATGACGACA AAGTTTGGTG TCATTACGAT GATAGCCGAG TGACCGCCAA TGTGGAACCC GAAGAAGTTG TGTCAGAAGC AGCCTACGTA CTGTACTATC GGAGA
|
Protein sequence | SGLGNLGNTC FMNSTLQCLA HTHPLKRYFL SGEYGDDLNR NNPLGTGGDL ATQFAQLMMD IWITTAPPRN FLSEASTGYD SSSSLANNVV YPRNFKYTLG KHAERFVGYD QHDSQELATY LLDALHEDTN LVTKKPYVEK PEQKDTESDQ EAADNAWELH LKRENSKVLE NFMGQIKSRV QCCKANCNRV STTFDPFMYL SVPIPGSMER ILKVTYVPMN PQQRMQKLEI TISKMATMKE LFGLLKTIDD LALEDTCAVD IWHQEVYSWY EPKNEVDRIR DNDYTFLYEL APLAMVKAMG QTVLNGIETD DDVGLGTITK HYQLDLATMT KLNSGDDWSA KLSTYLQNPT MLHNSSRDNC SELVNTSTNE EDVLDIIERC ESSAFLENVR SRHDLAILEF IASKMRKEIV SLENQQTTMY PDGVIIQIHI RKATLLSGNH RTDECVARKS SNATTVLDCI EKYCQMEQLE DTEMWYCNKC KDHVRAWKQT HLYRCPPILI VHLKRFQYSS STHRRDKIGL FIDFPLEGLD LTSVALHWTG AEKPLYDCYA VSNHYGGLGG GHYTAYALND DKVWCHYDDS RVTANVEPEE VVSEAAYVLY YRR
|
| |