Gene Information |
Locus tag | PHATRDRAFT_44398 |
Symbol | |
ID | 7198058 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 428798 |
End bp | 431776 |
Gene Length | 2979 bp |
Protein Length | 760 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178507 |
Protein GI | 219115423 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
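The coordinate fields above agree with each other: if Start bp and End bp are read as 1-based, inclusive positions, their difference plus one equals the listed gene length. The following is a minimal Python sketch of that check. The function names are illustrative and do not come from any database API, and the inclusive 1-based convention is an assumption that the listed values happen to satisfy.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, optionally plus a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(428798, 431776)  # Start bp and End bp from the record
cds_len = coding_length(760)            # Protein Length from the record, plus a stop codon
# Bases in the span not accounted for by the coding sequence
# (introns or untranslated sequence; the record does not say which).
non_coding = gene_len - cds_len
print(gene_len, cds_len, non_coding)
```

The remainder is consistent with the record marking the gene as spliced, but the record lists no exon boundaries, so the sketch only bounds the total amount of non-coding sequence in the span.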
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.160536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATCGCCAC CGTCTACTTT ACTCGCTTAT CACTCACAGT AAAATACCAT TGGCGGCCTA CAAAAGACCT TCGATCACCA TGACACTTAC TGTTTGCGTC GGAAGCTCCG GTTCCGGAAA AACAACCTTC TTGGAAGACG TTTATAAGAG TCACAAATGT ATCTATATTC GTCAGTACCA TATCATGCGT CCGTACATAA CGGTTTCCAA GATCCCCAAC TTCGATGCTA CCCGACTTCC CTATTGGGAC ATTTACGTCA AGGAAGAGAA GGCTGAAAAA ATCCAAGTCG GCGGTACTAT GGCCGGAGAA TTCACGGCTG GACTTTCGGG CGGGCAGCGC AAGTTGCTTC TCTTTGAATT AATTTGCCAG CGTACGGCCT CACAGTCTGA GCTTTTGGTT GTCCTTGACG AACCCTTTGC GGGAGTCACA GACGATTTTG TTCCGTTTAT TGTTGAGCGC CTGAATGAGC TTCGTCAAAA GCACAACGTG CTGCTGGTAA CCAATGATCA CGTGACGACT CTCACCACTA TGGCCGACAA CAAGATCACA GTCTCTGCGA TTGATCGTTC CACCGTTCGC ATCAACGATC GCGAAAAGGT TGACCGCGAA AAAGCCATCA TGGCACTTTC CGTCGGAGAC GCGTACTCAT ACCAGGCTAC CAACGCTGAT CTGAAGTTCT TTTACGATGT AGAAATACAT TCCAGCAGCG CCTTGATTGG TATCGCCTGC TTTACCATTT TTTGCTACAG TCTCTTTATA GCTACATTCT GGGATTCCGA AGAAAGCAGT CAAGCGCTAG TGCTGGTTGC CGGAGGTATC ATTTCGTACT TCTGCGTCAA TCCATATTTG CTAAGTCTCG TTGACTGGCG GAATGCCCAA AATGAAGAAG CGTATGTATA CTTTGGATGC CGGCAATATC AACAAGTGTT CCTCTGCAGA TCAGTCCCTC GCTCACTTCT CATTCTTATC ACAGTGAGGC TCTAGTCCAT GCTTCGAAAA CAATGAACAA GACTCTCAAA ACACTCTTAA CGTTTTCGCT TATACTTATC ATTTCCTTGA TTGAGTTCGG AGTCGTCAAT GCTACTATCG ATGGGCTCTC TGAGATTAAA TTCTGGGTCG CAATGCTTTT CGATAGTGCT TCCTTAACGT TTACTTTGAT TTGCTTGGGG CTCTACACCA ATATGCCATT TCAAGCGGTT CAAGTTGTCG GAAGCTTGCC ATTTTTGCTG ATGATCTTTC TTTCCACAAC GTGAGTGTGC TATGAACATA TACATATTCC TTTGTTCTTT TGCTGACACA CACCGTGATC GGTTTGAATT GCGCAGGTTT TCTCCAGGGT CAGGTGTTCC TGTCCTGAAG GAACTTCGCT ACTTGTATGC GCGATTCTAT TTCTGGTGTA TGGTTCCTGC TGTACAAGAC ACAATGGAAA ATTGTCCGTC CGACAATGTG ATTCTTGTGT ACGTGATCTT GAGCGGATGC TTGGGTGTTT TTATTTTCCT TGTAGTTATG GCAATCCTTA AAATCAAGAG GGGAATCCAG AAGGATAAGG CTGAGACGAA GCGTGCAGGA CTCCGTGATG ATGAGTTTAC GGAGCTTCAA GTCGAATTAT ACGGGACAAA GGCTTTACAT CGTCTGATGC ACATGAACAG CAGCCTTTCG CTCAAGAAAC CTGCTTCGAA TGGGACCATC AAAGAAGCGG TATAGCTGAA GTGAGGTGCT AGCTGGACGG GAGTGACAGT GAGATGCTGT TTTCGCACAT TCGTTTATAT TAGAATTTCA TAAGGGTAAC ATCGACATAG 
CTAAGAATAA GTCAAGACTT TTATTGATAA CAGCTGTTTA TTTCTTCAAG TAGATCGGAA AGTCAGTAGG TAGGGCTTTC ATTTTGATCC TTTTATGGTG CAAACGGCTG ACTGTGAGTA GTGCGTTCAA GCACGCAGCT CCGTCTCGTG TGACAGGTGA CGCTCAAGGA GCTGATAAAG TGTTTTCTCT TGATGTCACC CTTGTATGTA ACTAGAAACA CACGAATCGG GCTTCGGTGA GCCCGCAAAA TCCATAATCC AGACGAAACA GAGATCCTCG TTTCAGGCAA TAATATTCCC AAGAAATCAA ACCGAGTTTC TGTGTTCTTC GTGAAATAAC AGCGCTCTGT GCTCTTAAGC AAACGATCGG CGAAGATGCG ATGCTTGATG GTTGGCAGCA GCCTCATCGC CTTGCTGGTG GATAATGCTG CTGCCCTCAA CATCGTTCTA CCCGGAGGAA CAGGGTCTAT CGGTAGTAGG CTCTCGGCAA AGCTGATGGA TCACACGGTT ACGATTCTAA CACGGAATGC ATTTTTGGCA GCCGCTCCCA ATCGAGTGAC AGAACAGTTT GGGTGGGTCG GATCCAGCTT CTTACGGAAA AATCCGCATG TTAATCTGCG CGACTGGGAC GGTGGCGACC TGCTCGATAT TGTCGGCCAA GACTGGATTG GATGGCAGGA AGAAGCATTA TTAGATGCAG ATGTGGTCGT ACACTTTGTG GGAGGGTTCA CGGAACAACG TACAATGGCT TGTGAAAGAC TGGTACGAGA ATCGATGAGA GTGAACAAAG ACGCTTTGCA GATTACAGTC AATCCTCTAG ACGAAGAGAT TGGCGTTATT TCCGTCGGCG CTGTAACGCA GAAAAAAGAA CGCATCCGTG CCTGCGAAGA AATGGTCAAA ATGAATTGTG TTCACTCAAT GTGCTTACGC ATCGAGTGCT ATCGCGAAGA TGAAGGATGT GAAAAGATTA AATCTACTAT TGTCGACTGG GCGAAACGTC AGGGGAACAA GTAATTCTTG TATTCGTGAT CTTACACTTA GATTGACCAA TTTATTTTCC AATTACTGGA AAATTTCGGG CTAATACTTG TGACCGTGCG TTGAGGGCTG CCGAACACTG CATTGACTGT GGATCAGAAT AGTAAACGAG GTCAAATCCA TCAAACATCA ATTCTTTGC
|
Protein sequence | MTLTVCVGSS GSGKTTFLED VYKSHKCIYI RQYHIMRPYI TVSKIPNFDA TRLPYWDIYV KEEKAEKIQV GGTMAGEFTA GLSGGQRKLL LFELICQRTA SQSELLVVLD EPFAGVTDDF VPFIVERLNE LRQKHNVLLV TNDHVTTLTT MADNKITVSA IDRSTVRIND REKVDREKAI MALSVGDAYS YQATNADLKF FYDVEIHSSS ALIGIACFTI FCYSLFIATF WDSEESSQAL VLVAGGIISY FCVNPYLLSL VDWRNAQNEE AEALVHASKT MNKTLKTLLT FSLILIISLI EFGVVNATID GLSEIKFWVA MLFDSASLTF TLICLGLYTN MPFQAVQVVG SLPFLLMIFL STTFSPGSGV PVLKELRYLY ARFYFWCMVP AVQDTMENCP SDNVILVYVI LSGCLGVFIF LVVMAILKIK RGIQKDKAET KRAGLRDDEF TELQVELYGT KALHRLMHMN SSLSLKKPAS NGTIKEAIGK SVGRAFILIL LWCKRLTVSS AFKHAAPSRV TGDAQGADKV FSLDVTLRSV LLSKRSAKMR CLMVGSSLIA LLVDNAAALN IVLPGGTGSI GSRLSAKLMD HTVTILTRNA FLAAAPNRVT EQFGWVGSSF LRKNPHVNLR DWDGGDLLDI VGQDWIGWQE EALLDADVVV HFVGGFTEQR TMACERLVRE SMRVNKDALQ ITVNPLDEEI GVISVGAVTQ KKERIRACEE MVKMNCVHSM CLRIECYRED EGCEKIKSTI VDWAKRQGNK
|
| |
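The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to do that in Python and to compute a GC fraction. The record does not state how its GC content value was derived, so the plain G+C fraction over A/C/G/T used here is an assumption, and both function names are illustrative.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used to group the sequence into blocks and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases A, C, G, T."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    return sum(base in "GC" for base in counted) / len(counted)


# First three blocks of the gene sequence from the record, used only as a short demo.
demo = clean_sequence("CGATCGCCAC CGTCTACTTT ACTCGCTTAT")
print(len(demo), gc_fraction(demo))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` in the same way would give a value to compare against the GC content field.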