Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38891 |
Symbol | |
ID | 7203616 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 467557 |
End bp | 468760 |
Gene Length | 1204 bp |
Protein Length | 373 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182969 |
Protein GI | 219125397 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.319117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCACCA CCGACAACTC CAAACAAGCC GCTAGTGGCG CTGCCAGTCG TCTGGCTCGC AAAAATATCC TGGAACTCGC GCCCTACCGA TGCGCCCGGG ACGACTATAG TGAAGGCGTC CTGTTGGACG CCAACGAAAA CGCGTTTGGT CCTCCTACCA GACCGGAACA CATCAAGGAT CCGTTGGAAC GGTACCCGGA CCCGTACCAA GTGCCGTTGA AGCAAAAGCT CGCCGCGCAC CGAGGCAACG AGCTGGAATC GTCGAATATT TTCGTGGGAG TCGGATCGGA CGAGGCAATT GATCTACTCA TGCGCATCTT TTGTGTTCCG GGAAAGGATA AAATCATGCA AACCCCACCA ACCTACGGAA TGTACAAAGT TTGCGCCAAA ATTAACGATG TGGAAGTGGT GAATGTCCCA TTGACTGCCG ATTTTGACTT GATTATTCCC AATGTACGTC TGACGAACCG ATACCGTGTT CATTGCTATC CTCGGCCTTT CTTAAACGAT TTCAACGCTT TGCTCTTGCA TGTAGATTTT GGAAGCCATA ACGCCGGAGG CCAAACTGCT CTTTCTCTGT TCTCCCGGAA ATCCTACGGC CAAGGCGTTG CCGTTGGCCG ACATTGAAGC CGTCTTACAG AGTCCCCAGA CGTGTGACAC AATCGTGGTC GTAGACGAAG CGTACGTGGA CTTTTCGACA CAGGGATCGG CCGTGGGTTT GGTGCACCGG TACCCCAACG TGGTGGTGTT GCAAACGCTG TCCAAAGCCT TTGGATTGGC GGCGATTCGG TGCGGATTCT GCATCGGACC ACCGGATATC ATCCAACTCA TGAACAATTG CAAAGCACCG TACAACGTCA ACGCGTTGAC TTCGGAATTG GCAATACAAG CGTTCGATCA CGTGGATGTA CTGGACACGA ATATTGCGAG TTTGCTGTCG GAACGTGCCC GGGTGGCGGC CTCGTTGGCA GAGTTGGACT TTGTGGAAAA GGTGTATCCG TCGGACGCCA ACTTTTTGCT CTTTCGGGTG GCGTCGCACG CACAAGCCGT GTACAAGGAC ATGGCGGATC AGGGTGGTGT GGTGACTCGC TTCCGGGGCA CCGAAATGCA TTGCGACGAA TGCATTCGGG TCACGGTCGG CACTCCGGAC GAAAACGAAG CCTTTTTGAA GGCTTTGCAA ACGTCGTACC GGGCGTTGGC GTAA
|
Protein sequence | MCTTDNSKQA ASGAASRLAR KNILELAPYR CARDDYSEGV LLDANENAFG PPTRPEHIKD PLERYPDPYQ VPLKQKLAAH RGNELESSNI FVGVGSDEAI DLLMRIFCVP GKDKIMQTPP TYGMYKVCAK INDVEVVNVP LTADFDLIIP NILEAITPEA KLLFLCSPGN PTAKALPLAD IEAVLQSPQT CDTIVVVDEA YVDFSTQGSA VGLVHRYPNV VVLQTLSKAF GLAAIRCGFC IGPPDIIQLM NNCKAPYNVN ALTSELAIQA FDHVDVLDTN IASLLSERAR VAASLAELDF VEKVYPSDAN FLLFRVASHA QAVYKDMADQ GGVVTRFRGT EMHCDECIRV TVGTPDENEA FLKALQTSYR ALA
|
| |