Gene Information |
Locus tag | PHATRDRAFT_43354 |
Symbol | |
ID | 7197398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 206134 |
End bp | 209172 |
Gene Length | 3039 bp |
Protein Length | 909 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177885 |
Protein GI | 219112267 |
COG category | |
COG ID | |
TIGRFAM ID | |
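The Gene Length value can be reproduced from the Start bp and End bp fields, assuming the coordinates are 1-based and inclusive (a common convention for such records; the record itself does not state it). A minimal sketch, where `feature_length` is a hypothetical helper name:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of this record.
gene_length = feature_length(206134, 209172)
print(gene_length)
```

Under that assumption the result agrees with the 3039 bp listed in the record.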
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0648253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GACAACAAGC GCACACCCGA TACAATATAT TTTTTCATCG ACTGTGAGCA GCATTGAACA GAACATAAGA CGGACTTTGA ACACAAAGAA CAAACAAAAG GAGATAGAAA GCAGAAATGA ATCATGCAGT TTCGAACGAT TATTCCTCCT CGTTGTCTCT GTCTACCCCT TTGCGAACAT CTAGGAGCGA ACACAAACAA ACTCAATCTC GCCATAACCG CACGAATTCG GAACCTCTAG AATTGACGGC ATCAATTGCC GGTGGAGTAA CAGCTGGATC GAATACCTTC GATACGAGTA GTACTGCTCC TTCGGCTGCT GTTGCTTCCC CTGCTGCAGC TACTTCTGCC ATCACCGACA GCAGTGTCGA GGAAGTGGAT CACGAACACA ATGTGCTCGT CAGTAACGAT CGTCGTGATT TGCTGGATAG AGGTTGGTCG ACGTCGAATC TTACCACAAC CACGACACAA CAGCGACAAG GGCTCGTTCT GGGACACGCC AGTAGGAGTA GCGGGCGCGC CGTCCGAGGG ACCATTTCTA TCGACGTCGC ACACTTTCAC GAATCTTGGA CCAATAGCGA CGGTGCCGAT TTCTTAGGCG CCTCCTTTGC TATTCCCGCG ACCACTGCCT CGCCTTCACC GCCCTTTGGT TCCTCGGCGC ACGCTGCCGC ATTGCGCAGT CGCAGCTCGG GATGGGGCCT GGCGCTGCAG CCAAGCGTGT CCGTGGGGCT CGATTTGGGG GGAGGCGAAT CCTCGGGATC CTGGGCGCTA CCAGCCGCGC CCGAAATGAC GCGGAGGTAT TCTTCGCTTT CACCAAAAGC GAGCACTTCC CTGGCCGATT TCAACCGCAA TGGTAGTCCT TCCGCGGTAC CATCAACACC GATTGCCGGA CGCTCTGTTT CGGGGTTCTC GTCCGTCATT CACGATGCTG CACGTATTAC CGATTGGAAT CGGGTTTTGA CGGCCTGTCA ACGAGATCCG CAGGATGCCG CCTATACGGG AAGGGACGGC TGGACAGCCT TGCATCACGC TTGCAATCGT CGGTGTCCCT ACCCCGATGT CGTGGAAGCC TTGATTCGGG CCTATCCGGA GGCCCTATTG AAAGAAGTCG ACAACGGGTG GTTGCCCTTG CATTATGCGT GTCGCTTCAA GGCGCCACGC GATGTTATCC GTCTACTGCT GAGTTGCTGT GACGAAAAAT CGCGTGTTTC TGTATCCAAG CGCGATCGGC AAGGGCGAAC ACCCTTGTAC TATGCAGTAC GCTATGATGC CCCACCAGGA GTGGTGGGAT TGTTGCTACA GGTGGACCCA TCCGCTGTTT TGGAAGAAGA CCAAAACGAA GACTCGCCGC TCGCTCTCGT TTGGGATTCG TGGGCGGAAA AGTTGGAAGG CAAGAAAACG CTTCTACCAT TCTTGACGCC AACCGCAATC CAGGGTGATA CGGAAGAAGA GATGGCGGCA TCGCTGCGCA GCAAACTGAA ACAGCAAACA AAGCTTCATA AACGCTGGAA GAAGGTGAAC ATGCTGCTGA AAGCGGCGTT CGGATTTGTC GTAGACGAAG AAGACGAAAT GAATCTCGAC GTAACAAATG CCACTAGTCC TAAAAGTCGC CAGTGGCGTG TCGTTCACGC GACGGCCGCC GTCAAATGCC ACATCTCACT ATTTCTGTTG GCATGTGCCT TGTACCCGGA ACAAGCACGA GAACTTGATG AGAGTGATCT AAGGCGTCCC GGGGATAGTA CTGTCCGGAA TACGAAGCAG ACGGCTTTGC ACTTGGCTGT CTCCTCCAAC GCTTGTGGCG 
AAACAAGCAA ACGCGTGATT CATACGTTGC TCAGTCTGAA CCGACAAGCT GCACACATCC CTGATGGAAT CGACGGTAGT TTACCGCTCC ACAGGATGGT AGAAAACGAA CGCAAACAAG AATGGGCTGA TCAAATCTTG ATTTTGTATC ATGCGAATCC ACGAGCGGTC CGCGTTGAAG ACGCTAACGG CAAACTACCA CTACATCGCG CTGCGTCCAA ACTTCCACAC CTCGTCGAAG AAGACAACGC TGTGGCTACG CAGTCAGTCA TTTTGAACCT AGTGGACAAG TATCCACAGG CTGCAGCACA TTTGGACCGA TCCGGCTGTT TGCCGTTGCA CATGATCGCA ATGTATGGGG AATTCTGGGA CGATCAAGTC GAAGCTGTGT ACACCGCCCA CCCCCAGGCT GTGCAAGTGC GTGCTGGTCC TAGTTGGGAC CGACGTTTGC CTATTCACAT GGCGGCCGCC AATTTGGACT CGGGAGAGTC TCTCTTGACA CGATTGGTAG AATTGCATCC CCGGGGACCG TCTATGGCGG ATCGACAAGG AAAACTTCCG TTGCACCTTG CATGCGACTT GGGAAAGGAA TGGGATGCTA TCAAAGCCAT ATATGATGCC TTTCCTGATG CGTTAGAAAA GGCCGAGGAA AATGCTCGTG GCTGGTTGCC GCTGCACGTT GTGGCCGCTT GTGCCAATGC GAGCTCAGAC TTACTGCATA AACTTATCGA GCTGTATTCA GAAGCCGCCT GTGTTTGCGA TCGGAATGGT CGCTTTCCGT TGCACCTAGC TTGCCTTTCC GCCAAGTCAT GGAAGGGCGG GCTGGAATGT TTGTTTGATG CAAACCCCGC AGCTGTTGAT ACGAGAGACA AATGTGGGTT GTTACCCTTG CACGTGGCAG CCTTGCGGAT GTGCACATTG TCTTCAGATA ATGCGACGGA GAGTACTCTG GCTGCAACAA CGTCAAAGCA CGCTAAGAAT GAAGTGGCTG ATTTGGATAT TCTTTTCGAA TTGTTGCGAG CCGATCCAAC CACCTTGACG AATTGAAAGT CATGTTTAGT GTTTAAGATC AACAGGACGG TGTGCATTTC TATTGTGCGT GTTGACTGAA TATGCTTTTC GCTACCGAAG TTAACGGATA CGATGTATTG ATCATAAATG TGATAGAGAA AGATTTTCCG AGTACTCTAC TAGTAATGTC AGGTTGCGAT GAGAAATAGG GATTCAATTT CCGTGGCTC
|
Protein sequence | MNHAVSNDYS SSLSLSTPLR TSRSEHKQTQ SRHNRTNSEP LELTASIAGG VTAGSNTFDT SSTAPSAAVA SPAAATSAIT DSSVEEVDHE HNVLVSNDRR DLLDRGWSTS NLTTTTTQQR QGLVLGHASR SSGRAVRGTI SIDVAHFHES WTNSDGADFL GASFAIPATT ASPSPPFGSS AHAAALRSRS SGWGLALQPS VSVGLDLGGG ESSGSWALPA APEMTRRYSS LSPKASTSLA DFNRNGSPSA VPSTPIAGRS VSGFSSVIHD AARITDWNRV LTACQRDPQD AAYTGRDGWT ALHHACNRRC PYPDVVEALI RAYPEALLKE VDNGWLPLHY ACRFKAPRDV IRLLLSCCDE KSRVSVSKRD RQGRTPLYYA VRYDAPPGVV GLLLQVDPSA VLEEDQNEDS PLALVWDSWA EKLEGKKTLL PFLTPTAIQG DTEEEMAASL RSKLKQQTKL HKRWKKVNML LKAAFGFVVD EEDEMNLDVT NATSPKSRQW RVVHATAAVK CHISLFLLAC ALYPEQAREL DESDLRRPGD STVRNTKQTA LHLAVSSNAC GETSKRVIHT LLSLNRQAAH IPDGIDGSLP LHRMVENERK QEWADQILIL YHANPRAVRV EDANGKLPLH RAASKLPHLV EEDNAVATQS VILNLVDKYP QAAAHLDRSG CLPLHMIAMY GEFWDDQVEA VYTAHPQAVQ VRAGPSWDRR LPIHMAAANL DSGESLLTRL VELHPRGPSM ADRQGKLPLH LACDLGKEWD AIKAIYDAFP DALEKAEENA RGWLPLHVVA ACANASSDLL HKLIELYSEA ACVCDRNGRF PLHLACLSAK SWKGGLECLF DANPAAVDTR DKCGLLPLHV AALRMCTLSS DNATESTLAA TTSKHAKNEV ADLDILFELL RADPTTLTN
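The two sequence fields are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that layout and check that the start of the listed gene sequence encodes the start of the listed protein. The helper names are hypothetical, the codon table holds only the eight codons needed for this check, and two assumptions are made: the standard genetic code applies (the Translation table field is empty), and translation starts at the first ATG in the listed sequence.

```python
# Partial codon table (standard genetic code, assumed): only the codons used below.
CODONS = {
    "ATG": "M", "AAT": "N", "CAT": "H", "GCA": "A",
    "GTT": "V", "TCG": "S", "AAC": "N", "GAT": "D",
}


def join_blocks(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(blocked.split())


def translate_prefix(dna: str, n_codons: int) -> str:
    """Translate n_codons codons, starting at the first ATG found in dna."""
    start = dna.find("ATG")
    if start < 0:
        raise ValueError("no ATG found")
    codons = [dna[i:i + 3] for i in range(start, start + 3 * n_codons, 3)]
    return "".join(CODONS[codon] for codon in codons)


# First 14 blocks copied from the Gene sequence field.
gene_prefix = join_blocks(
    "GACAACAAGC GCACACCCGA TACAATATAT TTTTTCATCG ACTGTGAGCA GCATTGAACA "
    "GAACATAAGA CGGACTTTGA ACACAAAGAA CAAACAAAAG GAGATAGAAA GCAGAAATGA "
    "ATCATGCAGT TTCGAACGAT"
)
# First 8 residues copied from the Protein sequence field.
protein_prefix = join_blocks("MNHAVSNDYS SSLSLSTPLR")[:8]

print(translate_prefix(gene_prefix, 8) == protein_prefix)
```

Because the listed nucleotides translate directly into the listed protein, the Gene sequence field appears to be given in the coding orientation even though the Strand field is "-".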