Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35165 |
Symbol | |
ID | 7200559 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 202299 |
End bp | 203490 |
Gene Length | 1192 bp |
Protein Length | 359 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179813 |
Protein GI | 219118061 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.258874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGACA TCCGGGTTAT ACCATTGCCC TTTCCAAACT ACGATCGTGT TCCAGAGTAC GATCCAAAAA GTCCATCCGT CGTAGATTTA TGCGTGACTT TGATACCGAA TCAAACGTTT CCGCGACCTA TCGGAACTCG CGCCAGTCGC GACAACATCG TTTCCCACAC TGGTGACGAT ATTCACTTGG ATGGACCTCT TTCGGTCGCT GTGGCTGACC AAAACCCCGT CCCCGAATCG TCAATACCGT CATGACGGAG TCGTCGACGC CATCCAGTAG GCGAGACGAA AGTCCAGGTC CTGGATGGTT CACCGCGTAC TCCGTTCCAG CGACGCGGCC TGAGGAATCC GAGCTGCCCC TACGAGTTCC GCGGCTCCGT CGCGTGCGTC CGCGTTCCGC AATGAACCAC GGCGGGTCGA GCGATACCCT CGCCGATCCG TCGATCCAAC TCGGAGCGCC ACCGCGGTAC TACATGTTCC AGGACTTTTT CCGAGACGGG CGTGACGCGG TCGAAGATGC GCCCGATGAT GGCGAGGATG GGTACGGCCA AGAACCTCTA CCCAAGCGAC TCCGCTGGAT ACACACTCCC ACACAATGTA CCGTGGAAAA GGACCAAGAC AGTGCGAGCG ACCTTACCGC GTCTTCTTCT TTTAATGAAG CCTCCGAGGA TGACTTCAGC GAAGACGGAA TATCCAACGT CCATGCAACG AAGACGAGTT CCGGAACTAG CAATCAAGAG TATACAGTGT CATTCTTTTC GTTTGGGGAC ACACCGCTCG CTCACACCCA GGCGCTTCGT AACGGCAACA AGTCTCCTCC ACCGCTGCTA CTACCGTCAC CAACCAGTAC AGCACAATCG AGCGTTCCCA ATCAAAAAGG GGGCATTGCT CCCAAGAAGC GATTCATGGA TCGGTACAGG CAAGCTTCGT GGTCTACCCC CTACGCCGAA CAGCACGGTA CGGACGTCAG GAGGTATTTA TAGATTCACC AGATTTCTCG GCTTTTGTGT TTGATGACAG GAATTGATTG TTCCTGCTAG AAATTCGGAG CAACCGGAGG GCCTTCGAGG CGTGTCCCGC GAAGTCGGCA CCGTGTTGGA TCGTCGGGCG GCGACCACGT TGGTGGAGGT GTCACTACAA GTGGGAAATG GAAAATGGGG AGGATACCGG TTTGGTGGAT GCTCCGGGTT AG
|
Protein sequence | MEDIRVIPLP FPNYDRVPEY DPKSPSVVDL CVTLIPNQTF PRPIGTRASR DNIVSHTGDD IHLDGPLSVA VADQNPVPES SIPRRDESPG PGWFTAYSVP ATRPEESELP LRVPRLRRVR PRSAMNHGGS SDTLADPSIQ LGAPPRYYMF QDFFRDGRDA VEDAPDDGED GYGQEPLPKR LRWIHTPTQC TVEKDQDSAS DLTASSSFNE ASEDDFSEDG ISNVHATKTS SGTSNQEYTV SFFSFGDTPL AHTQALRNGN KSPPPLLLPS PTSTAQSSVP NQKGGIAPKK RFMDRYRQAS WSTPYAEQHE IRSNRRAFEA CPAKSAPCWI VGRRPRWWRC HYKWEMENGE DTGLVDAPG
|
| |