Gene Information |
Locus tag | PHATRDRAFT_47778 |
Symbol | |
ID | 7202752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 809316 |
End bp | 811691 |
Gene Length | 2376 bp |
Protein Length | 777 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181981 |
Protein GI | 219123333 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.950731 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACTT CGGTCGTTGA CGTCGATGCG GCAAATCCGT ACCGGTACAC TTGCCGCACC AGCCTTTCCA AAGAGCAATA TCAATCTTTG ACCGCTGTGC ACAAGGAACT CGCGAATAGC AAGAATGCTA CTAACTCAAA GGATCGGCCT TGGCAGAACA CTATTTTCCC AGTAGTATCC ACACAGGACC CCCTGAAGCA GCCTAGCCGA GACTCTTCAA CAAGCGGCGC GCTCCGGAAA AGGACCCACC AGCATCGCAG CGAGAAAAGG GATAATACGG ATCCTTCCGA CACTATCGTA TCCACTACTG CGGTCAATAA ATCTTCCAAA CGTCGTCGTC GCCAAACGAA GGATCCGTCG GCCCCGAAAC GACCCATGGC ACCATTCTTG CGTTTCCTGC ATCAGCATAT TGGACAAGTC AAAACTCTCA GGTCGATTTC ATACCGAGAA GCCATATCAC TGTTGGGAGA GATCTGGTCG AGCGAGACTC CTGAAGCTCG ACGGCCCTAC TTGGAAGCGC ACGAGGCGGA CCTGAACGCC TACAAAATCG CCTTGACTTC TTGGAACGCA TTGCAAGCGG CGACCACTAC TCCCACGCAT ACATCGGAGC GCCGATTTGA CAAAACCCGA CGGATGGACA CAGCCCAAAC TAGTAGAGCG GCCCTTGATA ATGGAACTAC CACGCTAACT GAGAAGTCTT TGGCCGAGTA CGAGTCGAAA AACGAGGCGG CGACAACGAA AGCCATTGAA ATACTGCCGC CTGTGAATCC ACGCGACTTT GTGCCTCCCA AGACGCGAGT CCAACTGGCG GACGGCTACT ATAAGCGACC CCGAGGAGCT CCGCCAAAGG GATGTTCCTG GGACAAACGC CAGGGAGTTT GGATCCAAAA AATTCCATCG ACAGAAGTCG ACGCCTATCT AGATACTGCC TCATCCAATT GTCGGACGTC ATTGAAGAAG GTCAAAGCTC GTGGCAATCA CTCCAAAACT TTGCAGCCCA ATCAACTTGT GCGCTTAGCT CCACAAGCAC GACCTGAACG TTTAAGCGAC GGAAGTTTTC GGCGACCTCT AGGAAGACCC CCCAAAGGAT ATTCATGGCA TTCGAAGAGC GGTGTGTGGC TATTGAATCT AGCTGTATCA AGGGGGGGCA AAGGTGCACA CTGTTTGTCA GCTCCAATTC AGAGCTTCAG GAGTGCCAAA ATCTCTCCGG GTGCGCAGCA GATCAGTAGA AAAGCATCGC AACCGGATCC GCATGTGCGC GAAACAGAAT TGAGCTCTCC ATTATTTTTT GGGACGGATG GCTGTTACGG CGGACCTTCT CTGGCTTCTC CTTTGGAACC ACCTGCTCCA TACATGAAAA AGCCTCTCAC AACACTCCCT AGTTGCACAA GACAGAGTTG GCAGGAATAC GAACAGCAGG ATATGATGGG CGCGACCAAA TCTCCTTTGC TCTGTAGTTC AAAAATCACC CAGGAAGAGA TTTATCTAAA AAGATGTATC GCCCCGAAAA GCAAACCCTT CGCCTGTGCA GATGGAAGTT TCCGGCGACC AAGGGGGAAA GCACCAGCAG GGTACAGCTG GGACCGTCAC CAGGGGGCTT GGCAAAGATC TGATTCTGTT GCCACCACCA TCGAAGCCAA GGAAGACGAC GCAGTGTCAG ACATATCAGA AATTCCAATC AAGGAGGTTT TCATTAAAAG CGGGTGGGAC ATCCCTCGAC GAGTAAGTAC TCTCTCGCTT TCTTGCGAAA ATGCAGATGT GCCGGAATTC AATTCACCGT CAGTTCAGTA TGAAAGACCC 
CAAAACCCAT CCAAACCCTT AGAGACTCTT CGAAGCAGCG AACTGGCAAA AAAGCCAGGA TTACATCGCA AGCCTGTGTA CATGCATAGC AATAGTGCGT TATTGCAAAC AAGATACTCT GCATGCGGAA GTTGCCGGGC CTGCTGCGAA CCCGTCGCCT GTGGGACTTG TCTGAACTGT ATGCAGCAGC TTGACGATCA TGTTCCGACA TTGTTTGTCC CAACGTGCGC GCGTTGCATT TGTATTGCTC CCATCCTGCG TGTTCCATCG GCACAATTAG AGACAATGTC CGTGTGTAGC TTGTCAAATC CCAACTCCTT GGTTGAAGAT GAACTTAGTG ACCTTGATTC GAAGGTCAAC GATCATGGTA CTTCGGCATT CTCATTCTCA CAATCAGCGT CTCAGTTCTT GAAGCCGAAA AGTGAGTCGA ATTTGAGTAT TTCTGGGGCT GATTTAGCAA GAAAGTTGGA TGGAATCAAT CGTGCCCATG ACGACGAATG CGCAGGAAGC ACCGACGATT CTGTTGTCGT ATAGCTCAGC TCAGGGAAAT TTCTATTAAC ATTATAGACC CAATAT
|
Protein sequence | MSTSVVDVDA ANPYRYTCRT SLSKEQYQSL TAVHKELANS KNATNSKDRP WQNTIFPVVS TQDPLKQPSR DSSTSGALRK RTHQHRSEKR DNTDPSDTIV STTAVNKSSK RRRRQTKDPS APKRPMAPFL RFLHQHIGQV KTLRSISYRE AISLLGEIWS SETPEARRPY LEAHEADLNA YKIALTSWNA LQAATTTPTH TSERRFDKTR RMDTAQTSRA ALDNGTTTLT EKSLAEYESK NEAATTKAIE ILPPVNPRDF VPPKTRVQLA DGYYKRPRGA PPKGCSWDKR QGVWIQKIPS TEVDAYLDTA SSNCRTSLKK VKARGNHSKT LQPNQLVRLA PQARPERLSD GSFRRPLGRP PKGYSWHSKS GVWLLNLAVS RGGKGAHCLS APIQSFRSAK ISPGAQQISR KASQPDPHVR ETELSSPLFF GTDGCYGGPS LASPLEPPAP YMKKPLTTLP SCTRQSWQEY EQQDMMGATK SPLLCSSKIT QEEIYLKRCI APKSKPFACA DGSFRRPRGK APAGYSWDRH QGAWQRSDSV ATTIEAKEDD AVSDISEIPI KEVFIKSGWD IPRRVSTLSL SCENADVPEF NSPSVQYERP QNPSKPLETL RSSELAKKPG LHRKPVYMHS NSALLQTRYS ACGSCRACCE PVACGTCLNC MQQLDDHVPT LFVPTCARCI CIAPILRVPS AQLETMSVCS LSNPNSLVED ELSDLDSKVN DHGTSAFSFS QSASQFLKPK SESNLSISGA DLARKLDGIN RAHDDECAGS TDDSVVV
|
| |