Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42794 |
Symbol | |
ID | 7196158 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1132664 |
End bp | 1134667 |
Gene Length | 2004 bp |
Protein Length | 490 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176728 |
Protein GI | 219109951 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAATGGAAC ACGCAGCGGG GTCCATTCTA GAATCTACCA ACCACAAGTG GTGGGTGTTT CTTGTTGGTC GTCGTCGTCC CTCCTCTTTC TCCATTGGAA CGACTCTAGT CGTCGTTGGG TATCACACAC ACATCCACAC AGACACATAT CGTTCGATTG AAGCGAGTCA TCGGATTGCT TTGTTCGACA GGTATGAGTG GCACCAACCA TTCCGCTCCG CACGGGTCGC CGAACCACAC CAGTACTACT GCTACCACTA CCACCACGAC GACGACAGCT CCACCCACGT ACTACACTGC GTACGGAGGA GGTGCCGGGA ATCCACCCCC ATCCTTTGCT GACGGATTCC CCCACAGCCA CACGGGACCG CCACCTTCCC GAGTCTACCA CGGCACGGGA TCGCCCGAAA CAGGATCACC CGCGCCGCAC CAGCCACCAC CATCACAACC ACCATCGTAT CACCACCACC ACCGTTCGCC TTGGAAAGGA TATCCCTGGG GAGGATCATC TGCGGAATCC GTACCGGTCA ACGCATCTTC CGCTACTACT ACCAATAGTA CTACCAATAC TACCACGGGG GCTCCACCGC CGTACGCCAT TGGAAGTGGT GGAAGTGGAA ACCACACCCA CCACAACGCG ACTACGGCGG GATCGACCGG ATACCGTCAC TCGTACGTGG GACCACCACT ACCACCCGCC CACGCAAGTT ACTGGGGACC ACCACGGTCC ACCAGTGATC TCCTAGCCGG AGGAGCATCC GTCGCCAACG ATGGACGTGA AGGCGCCACG TCGCCCTCGC AAATTGAAAC GGATCATCTC GAATTCGTTC AAGCCGTGGG TTGTACCTGC AAGAAAACAC GCTGTTTGAA ACTATACTGT CAATGTTTCG GAGTCAAGAT CTATTGTGGT CCCAACTGCC GTTGTTTGGA CTGTCACAAT GTTCCCGCAC AAGAAGATGC CCGGCAGAAT GCCATGAAGG TTATACTCTC ACGCAATCCC CACGCCTTTG ACACCAAGTT CCAAAAAACA CCCGTCGACG GCGCTACGGT GGAAACGCCT TCCAAGCTAT TGACGCACAA GTTGGGATGC AAGTGCCGCA AATCGGCTTG CATGAAAAAG GTACCTAAAC CCAACAACAC TGCGGGTCGA TGACCTTGCG TCTCCTCCGT CACACTCTAG TCACTCACCC CTTGCTCGCT TTTGTCGTGG TTGTTTGTTT TTGTGTATGT GTGTATTTGG TTTGTTTGTA TTGTTTCTTA CAGTATTGCG AGTGTTACGC CGGTCACGTG TACTGCAACA CGCACTGCCG TTGCACCGGT TGCAAGAATC GGGATGGCTT ACTTCCGGGA CCGGGTGGTC CCGGAGGTCC GTACGGGGCG ACGGTCCACC ACCACGATCC CCGCTTCGCC TCGCCGGCAC GGGCCACCGC CCCGGTCTTT GCGCCGCCAC TGCCGCACCC GACGCACGTT ATGCAACCCG TCGGTAGTCG CTCCAACGCA ACCAACGCGG GGGGTAAACG GGGTGAGCCC TTTGTCGCGG CCGCACAAAA TTTGGCCTTT CTGAAACGGG GATCGCCCGA AGATGCCACC ACCACCGGGC CGGTTAAAAA GGCCCGTGGT CCGCCCTCTT CGGAAGGAAT GAACAGTCTC ATGATTGCGG CGCAAGCCAT GACGGAATTT GGACAAGGAT CGTCGCCCTC GAAAGCCCGC TTGTCGGTCA AGGAGGCGCA GGAACTGGCC AAAAGAGCCG TGGAAACACC GACGCCCCGC AAACATTCGG TCTACAAGCA AGAAACCGAT ACGGTATAAA GGTAGTTCGT CGCAGAAGAA GAACAAAAAG ACAAACAAAT GAAAAGGCAC GCCAATGACC CAACCAAGGG TTTAAGCTAG GATTAATCCG ATTGTGAAAG TGAAACGTGT GATTGTCTGT GGTGGATGCA CTTGTCCCCA AACAACCAGT CTCACCATAC CAATCGATGG TTCTAGGTCT TACTCGTTTA AAGT
|
Protein sequence | MSGTNHSAPH GSPNHTSTTA TTTTTTTTAP PTYYTAYGGG AGNPPPSFAD GFPHSHTGPP PSRVYHGTGS PETGSPAPHQ PPPSQPPSYH HHHRSPWKGY PWGGSSAESV PVNASSATTT NSTTNTTTGA PPPYAIGSGG SGNHTHHNAT TAGSTGYRHS YVGPPLPPAH ASYWGPPRST SDLLAGGASV ANDGREGATS PSQIETDHLE FVQAVGCTCK KTRCLKLYCQ CFGVKIYCGP NCRCLDCHNV PAQEDARQNA MKVILSRNPH AFDTKFQKTP VDGATVETPS KLLTHKLGCK CRKSACMKKY CECYAGHVYC NTHCRCTGCK NRDGLLPGPG GPGGPYGATV HHHDPRFASP ARATAPVFAP PLPHPTHVMQ PVGSRSNATN AGGKRGEPFV AAAQNLAFLK RGSPEDATTT GPVKKARGPP SSEGMNSLMI AAQAMTEFGQ GSSPSKARLS VKEAQELAKR AVETPTPRKH SVYKQETDTV
|
| |