Gene Information |
Locus tag | PHATRDRAFT_39471 |
Symbol | |
ID | 7194967 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 598903 |
End bp | 600484 |
Gene Length | 1582 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183515 |
Protein GI | 219126544 |
COG category | |
COG ID | |
TIGRFAM ID | |
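
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene span runs from the start codon through the stop codon (the gene sequence begins with ATG and ends with TAG). The function names `span_length` and `cds_length` are hypothetical helpers introduced for illustration only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(598903, 600484)   # compare with "Gene Length"
cds_len = cds_length(491)                # derived from "Protein Length"
non_coding = gene_len - cds_len          # bases in the span that are not translated

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span length matches the reported 1582 bp, and the difference between the span and the coding length is consistent with the record marking the gene as spliced.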
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGAAT CAATATCGTC GTCGTTAGCA ACGACGCGCA CGTCAACTTC CTTGGCACAG ATTTTCTGCC TACTGACGGT ATCTTCATTC ACCTCTTTCG AGCTTTCGGT GGCCTTTCAA CCAACGCTAC CTTCATCTTC TTTCTCTTTA TTGAGAAAAG ACGCGCATGT CGCAACAACC ATAGTATTTG AGACCCCTCG CATTCGCAAA GTATTAAGTC GAAGGCCCTG GAGTGCTTTA TCTGACAGGA ATCAGGAGGA GGAAGACGAG GAAGACGATG ATGAGGATGA GATTGATCCT GATTCTCTCG GAGACTGGAG AGCCTTTCGC CGGAATCTTT CCTTTTCTTC CGATGGCGAT TCCACGGGTT CGGATTCCAG CGAAGCTGCC GTATCCAAAA AGAATAAAGC CAGGGCGGTT AGTAAGGAAA ATGAGGAGCT CCTAAAATCT CAGAGTAGCC AGCTGGGACA GGAATACGTG TCTGGTGTTT GGGCCCACGA GACATCTACG GTAAGAACGA TATTGCAGCT AAAAGAAATT TTGTTTTATA CGTCCTTTTA AGCTCACTCC TTCTTCTACT CTATATTGGC TCATTGTTAT GTTTCAATCG TGAAAGCCGG AAGTAGGCGG CTTGGTGCTA CGAATGCCAC TAGAAGTTGA ACTCTTTCGT AACTACAAAC ATTCCGTCAT GGGGACTTTA TTACGGAAGA AATTAGACAA TGAAGTAGTC GAGCCCTCCA CCTGGTACGC GAAAGCTCAG ACGCTCGTCG AAGAGCATAT GCTCACCATT GCTGCAAAAG CCGGGGACGA TGGACAGATC GATCCAACCT CACTGGACGA CGACGCCTCC GAAATGCTCA CGCTTTATTT GGACAATCAA GAAACATGGC AAGAGGTGTG TTTGGTGATG GAACGCAACG AAAGCAACGG AGCCGCGACA ACTCTAGTAT TGAATCGTCC CATGGCATTG AAACTTACTG ACAGTCTCGG ACAATTGGTA CTTAACGGTG CATACAGAGG CGAAAAGACG AAACCAAAAA AGGATGTTAC ACGATTCATG CGCGCATTCG GTGGAGAATG TGCCGTTTAC ATTGGTGGAC CGGATGATCA GGACCAACCG GCCGTACTAG TACACGGACT TGCCGACTTG GCCGGTGCGA ATGAAATTTC GCCCGGAAGC GGTATTTATC AAGGTGGGAT CGAAGCGGCG GTGGAAGGAG TAATCTCAGG CAAGTATCAA CCACTGGATT TCCGGTTTTT CGTAGGACGA CACGTTTACG TGGAATCCAC CTTGGATCTA TCGGTCGTTT TGGGAAAGTA CCAGCCGGTT GCCTGTGCGC GGTCCGTAGC TCTGAAACAG TGTTTGAGTT TGCCCAAACC GCTATGGCAC GAGGTTTTGG AATTGTGCGC AGGAGAACTG GCGGATATAT CTGAGTTGGA AATGCTCAAA CGCGACGATC TAAAATTTGA AATTATCGAC GAAGATGATG AAGACGAGGA CGATGATGAT GACTCACCAG ATGAGCTCGA TGAGTTAGAT AGATTCGATG ATGAAGACGA TGAATACTAT TCCAGTACAT AG
|
Protein sequence | MGESISSSLA TTRTSTSLAQ IFCLLTVSSF TSFELSVAFQ PTLPSSSFSL LRKDAHVATT IVFETPRIRK VLSRRPWSAL SDRNQEEEDE EDDDEDEIDP DSLGDWRAFR RNLSFSSDGD STGSDSSEAA VSKKNKARAV SKENEELLKS QSSQLGQEYV SGVWAHETST PEVGGLVLRM PLEVELFRNY KHSVMGTLLR KKLDNEVVEP STWYAKAQTL VEEHMLTIAA KAGDDGQIDP TSLDDDASEM LTLYLDNQET WQEVCLVMER NESNGAATTL VLNRPMALKL TDSLGQLVLN GAYRGEKTKP KKDVTRFMRA FGGECAVYIG GPDDQDQPAV LVHGLADLAG ANEISPGSGI YQGGIEAAVE GVISGKYQPL DFRFFVGRHV YVESTLDLSV VLGKYQPVAC ARSVALKQCL SLPKPLWHEV LELCAGELAD ISELEMLKRD DLKFEIIDED DEDEDDDDDS PDELDELDRF DDEDDEYYSS T