Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48470 |
Symbol | |
ID | 7203758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 589580 |
End bp | 590874 |
Gene Length | 1295 bp |
Protein Length | 319 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182989 |
Protein GI | 219125439 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00321209 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCATTTTCAT CATTTATGAA TCTCTGCCTT CTTTAGTATC ATGTCTGAGC GTAAAGCGCA GGGAGTCGCG ACGAAACTTT CACGTTTATC TTCATAGCAC GGCAGAATCG GTCAACCAGA ATCTTCATGC CACACCTTCT TGTCCATTTT CAAAGGTCTT TCCCCGATAT AGGATTGATC TCACAAGTAT CCAAAGCGAG AGAAAAAGAA ATTTTCAGGT ACCATTTTTA TCAGATTGGA TTAAGGCAAC AACCCGAAGC AAATTTCTGC AGTCTTCCTC TCCAGGAACC GACGTACTAT GGTTGGAGAA TCTGTCTGGG GTACAAGCAA CGGCAATGCT CTGGAGAAGC ACTACAAATG TTTCACAGGG CCATGTGAAT GCGACGCTCT TGGCTTTTGC CAATACCAAA GCGCAAACAA TTCGGCAGTG GGTTGAAATT TTTTACTGGC TTCAAGATAA ACGCCCTGGC GATAGTTGTA GAGGAGTCAC CGTTGAGTAC TTGGCTGAGC ACAAGGTACC GGTTGTCCTC ATCGAACGAC GAATGGAAGA GGTTGTATCC TCTGAGATTC CAGACTCGAA CACTTCGAAA GTGATCAAGC AGCATACACA AGCTTGGGTA AAGCGAATTT TAGTTGAGCT TGGGATTTGT CCTTTCACGA AAACTACGAA AATGAGCGGG CAAGGTCTTA TGGACTTGGG TATTTCACCG GGGAGCATTG CCTACCACAG CTCTTTTGCG AAAGTGGATC AAATTTGTTT CTTGATGGCC GATACCTGGG AGGCGATTAG TGACATGATC GCTGCGGGAC CCTCTGGAAA GGAGGGAGTC AGCAGTATTC TCTTGGCCGC TCCGGATTTC GACGACGATT TCGATCTCTG GTCTGGTCCA ATATTTGCCA TGTTGGAGGC CGGTGTTTTG GCAGCTAGTG CCGAAAAGGA AGTTGGCATT GTCTGTTTTC ATCCCAAATA CGCAACACCT GACGGAAGTA GCTGGCCAGG TTTCGGCCAC ATGCATTCAG TCCCACGATT AAAAACATGG TTGCTCGAAG ACGAACCAAG TTGCCCATTA TCCGACGAAG AAATTGCTGC TGGAGGAGCC TGGCAGCGAC GAACACCCCA CGCAACCATC AATGTCCTCC GTGCTGATCA ATTATCAGCT GCCGAAGGAC GGCGAAAATC TATCCATCTC TACAGCGAGA ACATCCGTAA GCTCGTTGGA GGAAACGGGA TCGGGTCGGA GAAGCTACAA GAAGATTTGG ATCGAGAACG TTCTATCCAT CTTTCAGAGT CGTGA
|
Protein sequence | MLWRSTTNVS QGHVNATLLA FANTKAQTIR QWVEIFYWLQ DKRPGDSCRG VTVEYLAEHK VPVVLIERRM EEVVSSEIPD SNTSKVIKQH TQAWVKRILV ELGICPFTKT TKMSGQGLMD LGISPGSIAY HSSFAKVDQI CFLMADTWEA ISDMIAAGPS GKEGVSSILL AAPDFDDDFD LWSGPIFAML EAGVLAASAE KEVGIVCFHP KYATPDGSSW PGFGHMHSVP RLKTWLLEDE PSCPLSDEEI AAGGAWQRRT PHATINVLRA DQLSAAEGRR KSIHLYSENI RKLVGGNGIG SEKLQEDLDR ERSIHLSES
|
| |