Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47561 |
Symbol | |
ID | 7202627 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 116465 |
End bp | 118179 |
Gene Length | 1715 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181848 |
Protein GI | 219123056 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACGCTCATAC ACAAGGCTTC CGGTTATACA TAAGGCTATA TCGTACCAAG ACAGCAGAAC CAATGAAATT CTCTATTTCT TCTCTCGTTG TCACAATTGG TTCCTTAGGG AACCTGCCTC TTGCCATTGC ACAAGGCGTC AGGTATGTCG AATAGGAAAG GTGCCATCGT CACCGTCACC AAAGAGGACG CAGAAACCTT ACAGATTCAT AATTGCCTTC ACCATTCAGT GCAACTACTC CTGTGGTCCG ATATGTGGAT CCAATGAGCG ACATGTCTTT ACTCCTTCCG TCGCTAAGCG CGCGACACGG TGATGAGCCC AGCCTCAACA TGGATTTTTG GCTTCCAGCT TCCCGGGCTC TGTCTTCCTC TCTCGCCACC GATATTCCCG ACCAAACATA TGCCACCCTT CCGTCGAAAG CTTCTAGCAC TGCGCAAGTT GCTGTATCAA CGATGCTCCA AAGCAATGTA CCAAGCGACA TCCCCAGTAG CCAGCCTTCC TATATACCAA GCTCTTTGGA AGATGGCCCG CCTAGTGATT CCCCAAGTCT CACGCCATCT CTACAATTGG CTTCATCATT TTCGGACGTA CCGAGCAATG TTCCTAGTAA CCAGCCGTCA GTCACCTCGA GTTTGCCAGG GGCCAGCGAG GTGTCAGTGA CTCAATCTGT GACGCTTGCA TTGGGGTCCA ATACAATTTT GGATGACGCA TCCATTGACA TTTTCGAAAG AGTATGTGCT TTTTCGTTTC TACCCATGTA TCTTTCCACA ATCTACGAAG CTGAGTATAA AAGCATTCGC TGTAGTGTAT TGGATCAAAA CTTGGTAGAT GAGTCTTCTA AACGACGTTT GCTAGACGAA GATTATACGT TGGGAGAAAA ACATTCAACC TTATCCTTGC TCCTTCGCGT CTCGAGTTTG GTATATCTAC GTTCCGGCGT CGAATTCGGA GATATAGTGC AGCAAACCTT CACTACCCAC GTGGACACCT TTCTGAGTCT CTTGTTTGAT ACTTTACCTT TTTTTGCACC GAAATCCAGC TCTGGTAGTG GTAACTCACA GGCAATCACC GGAGGACAAA CAGAGGTCCA AAACCAGGAA GCTAACCCAT CCCCAATCAT CATATCCGTC GCTGCAGTTA TGGGAGGTGC GATTCTAGCA GCTATTGCCG CCTTCTTTGT GCTAAATAGT CGCAGAAACG CAATTTTGAA TAGAGAAATG CCAGACGGAA CTGATGTATC CATTCCCATC GACTATTTTG AATCGAGTGA CGAAGATCTG GAAAGTGCCC CATATGATGT CACAGACATC TCATATTCAA CAATGGGAAT GAATATGATG CCTCCATCCC CTCTTGGAAT AGATTCGATC CCACGAGCCC TGAACTCTGT CTCGATGATA CATTTTTCCG AGGATCTCTC GACCACCTCA ATCGATCCTG AAAGTGGAAT CACACCTTCT TCGACGACAT TAAGCCCAGG TCCCTTGATT CCAGCCTATT GGGAAAGCTA CGAATCCAAA ATGATATGGA AAATTCGAAA TTCTTCATCC CACAGTCTTT CCAGTCAAAT GTCGACAGAT ATACAGTCCG TAAACGGCGC TTTCGAGAGT TCTCCTAGCC TTCTCACTGT CAATTCACGC CAAGAAGATG TAGGAACTAC GTACAGCGAC GGAGTCAAAG ATCAATCGTC GACGACAGAC ACTTACGAAA AGTGA
|
Protein sequence | MKFSISSLVV TIGSLGNLPL AIAQGVSATT PVVRYVDPMS DMSLLLPSLS ARHGDEPSLN MDFWLPASRA LSSSLATDIP DQTYATLPSK ASSTAQVAVS TMLQSNVPSD IPSSQPSYIP SSLEDGPPSD SPSLTPSLQL ASSFSDVPSN VPSNQPSVTS SLPGASEVSV TQSVTLALGS NTILDDASID IFERVCAFSF LPMYLSTIYE AEYKSIRCSV LDQNLVDESS KRRLLDEDYT LGEKHSTLSL LLRVSSLVYL RSGVEFGDIV QQTFTTHVDT FLSLLFDTLP FFAPKSSSGS GNSQAITGGQ TEVQNQEANP SPIIISVAAV MGGAILAAIA AFFVLNSRRN AILNREMPDG TDVSIPIDYF ESSDEDLESA PYDVTDISYS TMGMNMMPPS PLGIDSIPRA LNSVSMIHFS EDLSTTSIDP ESGITPSSTT LSPGPLIPAY WESYESKMIW KIRNSSSHSL SSQMSTDIQS VNGAFESSPS LLTVNSRQED VGTTYSDGVK DQSSTTDTYE K
|
| |