Gene Information |
Locus tag | PHATRDRAFT_25549 |
Symbol | |
ID | 7197289 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1467954 |
End bp | 1469598 |
Gene Length | 1645 bp |
Protein Length | 512 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178149 |
Protein GI | 219112795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
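The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed gene length of 1645 bp). The helper name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
START_BP = 1467954
END_BP = 1469598
PROTEIN_LENGTH_AA = 512

length = gene_length(START_BP, END_BP)

# Coding bases needed for the protein: three per residue plus one stop codon.
cds_length = 3 * (PROTEIN_LENGTH_AA + 1)

print(length)               # 1645
print(cds_length)           # 1539
print(length - cds_length)  # 106
```

The 106 bp difference is consistent with the record marking the gene as spliced: the genomic span would then include non-coding sequence such as introns or untranslated regions in addition to the coding bases.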
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACGCTGTG TGCGAATGCA AATCGACCAT GACGATTCTG GATCCTTCCG AGACATTTGC GTCGCTGGCC GAATCCGTCG GTCTGGATGT CCGTCTCCGC AAAGCCGTCT CCCGTTTGGG CCACGTACGA CCAACTCTGG TCCAGTCCAA ATGCCTCCCC TTGGCGCTGT CGTCCGGTAG GGATTTGCTG GTCCGGGCCC GCACAGGCAG TGGCAAAACG CTTGCGTATT CACTCCCTCT CTTGCAGAAA ATATTGCAAA GATCCAAATC TGGTGTTGGT GCAGTTGTCT TGATACCGAC TCGTGAGTTG TGTACACAAG TCCACCAGGT CTTGCAGGGA TTGTCGTACT ATTGCAACGA CATTATCTCG ATTGCTATTT TATCGGCGGG ACGAGGACGC GGGGAAAAAG CCCAAGAAGA GTTGACCAGG CAAGAAGCCA TGTTGCGCGA TCAGCCTAAT GTGTTGGTGG CGACACCTGC CGGGCTCCTG ACGCAGATAC GCAGTGGGTT GTTGGATTTG AAATCGTCGG TAGAAACCTT GGTCGTGGAC GAAGCCGATC TTGTTCTTTC CTTCGGGTAC GCCAAAGACA TTGCAGAAAT CGTCAAATCT TTGCCTCGTA TTTGCCAAGG CTTTCTCATG TCAGCGACAC TGTCGCCGGA ACTGGATTCC CTTAAAAAAA TTGTCTTGAA TTCACCTGTC GTGCTAAAGC TGGAACAGGA TGAGAAGACG AGCACTGGCG TTGGCCACTT GAAGCAATTT TACGTTGCGC TGCCCAAGCG AGACAAAAAT TTGGTTGTCT ACGTCTTTTT GAAGCTCGGA CTATTGAAGG GTAAAGGCCT ATTCTTCGTA AACTCGACTG ATGCTGGATA CCGACTCAAG TTGTTTCTGG AACAGTTCTC CATTCGTTCG GCAGTTTTAA ACGCCGAACT ACCATTTCGC AGTCGCCTGA ATATTATAGA ACAGTTCAAT GTGGGTAACT TTGATTATTT GATTGCCACC GACGCGAGTA CAGATGCTGA GCAAAAGGAA GACTCGGATG ATGAGCACGA GGCAAACGTA AAGAAGCTGA AGACTCGTAA GGCGGATTCG CAGTACGGCG TCTCGCGAGG TCTAGACTTT CGAAATGTTT CATTTGTTGT GAATGTAGAC TTTCCATTAA ACTCTCGTTC ATACTCTCAC CGTGTTGGTC GCACAGCGCG CGGTGGAGCC AAAGGGGTGG CCTTAAGCTT CGTGGAACTT GAATCAAAGC AACAACACGA TACACTTTTG GCTGTACAAG ATGACCAGCC ATCGACACCA CTGGTGGGCG CATCTGGCGA CAAGTTACAA GCTGTTGCAA AGGATATGGT TGACGAATCG GGTGCCGCAA CACAACAGCC TCAGCCCGTT CCGCTCGATT TTGATCTTCA CGAAATCGAA GGCTTCCGGT ATCGGTGTGA AGATGTACAA CGAGCTGTAA CACGAATGGC AGTACGAGAA ACACGAGCGG CCGAGCTCAA AGCCGAAATA CTGAATTCGG AACGGCTTCA AGCCCATTTT GAGGACAATC CTGCCGATTT GCAACTACTC CGTCATGACA GAGTCGCAAC GCACATATCC CGCGTCCAAG ATCACTTAAA GCATGTACCT AAATATCTGT TGCCA
|
Protein sequence | MTILDPSETF ASLAESVGLD VRLRKAVSRL GHVRPTLVQS KCLPLALSSG RDLLVRARTG SGKTLAYSLP LLQKILQRSK SGVGAVVLIP TRELCTQVHQ VLQGLSYYCN DIISIAILSA GRGRGEKAQE ELTRQEAMLR DQPNVLVATP AGLLTQIRSG LLDLKSSVET LVVDEADLVL SFGYAKDIAE IVKSLPRICQ GFLMSATLSP ELDSLKKIVL NSPVVLKLEQ DEKTSTGVGH LKQFYVALPK RDKNLVVYVF LKLGLLKGKG LFFVNSTDAG YRLKLFLEQF SIRSAVLNAE LPFRSRLNII EQFNVGNFDY LIATDASTDA EQKEDSDDEH EANVKKLKTR KADSQYGVSR GLDFRNVSFV VNVDFPLNSR SYSHRVGRTA RGGAKGVALS FVELESKQQH DTLLAVQDDQ PSTPLPVPLD FDLHEIEGFR YRCEDVQRAV TRMAVRETRA AELKAEILNS ERLQAHFEDN PADLQLLRHD RVATHISRVQ DHLKHVPKYL LP
|
| |
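The relationship between the two sequences above can be illustrated by translating the start of the gene sequence. This is a minimal sketch, assuming the standard genetic code (the Translation table field in this record is blank, so that choice is an assumption). The start codon offset of 28 was read off the displayed gene sequence by aligning it with the listed protein; it is not a field in the record. Because the gene is marked as spliced, only the first ten codons are translated here, and translating the full genomic sequence this way would not be expected to reproduce the whole protein.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; spaces are ignored, trailing bases dropped."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases of the gene sequence, copied from the record.
GENE_PREFIX = "CTACGCTGTG TGCGAATGCA AATCGACCAT GACGATTCTG GATCCTTCCG AGACATTTGC"

# 0-based offset of the start codon within the displayed sequence (observed, see above).
START_OFFSET = 28

bases = GENE_PREFIX.replace(" ", "")
first_ten_codons = bases[START_OFFSET:START_OFFSET + 30]

print(translate(first_ten_codons))  # MTILDPSETF
```

The output matches the first ten residues of the listed protein sequence. Note that an earlier ATG occurs at offset 15 of the displayed sequence, so simply searching for the first ATG would not locate the annotated start.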