Gene Information |
Locus tag | PHATRDRAFT_40447 |
Symbol | |
ID | 7198167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 413402 |
End bp | 414727 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184366 |
Protein GI | 219128325 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.12237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCGT CCCTTCAAGA CGAATCGGCC CTGGAAGCAG AAATTCATAA ACTGGAACAA GAGATTGAGA AAAAACGCCT CGAAGAACAG ATATCTCTGT TGGAAATGCA ACTCCATCGG GCCAAGCCTG AGGCGCAGCG CCCGAACGCA CCATCCAACC CATCGGTGGT AGGACAATCT ACCTCCACTT CTCGAACTTC CAGTCCCCGC TCGCTGGAAC ATATGCTTCC CACCCAAAAC GTCAGTTGGT CTGGGGCCAA CGGACAAAGT CCAGAAGACG ACAGTATTAT CGCGTTGGCG ACTCAAGATT CCGAAGGCGT GGAATACGAC GAGGAAGACT ATGACGAAGA AGAATGCGAC GAAAATGAGT ACGAGGAAGT GTTCGAAGAA TTCGTCGAGT ACGTTACCGA CGACGAAGAA GAAGAAGTTG GAGAGCAACC TCTCGAAAGC GTTGAAGAAT ACGAAGAGGA AGAGGAAATC CAACAAGCCC CGGTTTCACC CTCCCGAGCT CCTCCTCGTC AGTGGCCTCC TCCGGCACGT CCAGTGAACG ACGAGCATGT AGTTCAATAC GTGGCACCAA AAAAGAACAC AGAAGCTCCC GAAGAACCGA AGCAAGTTGC CAAGCCTCGA AAGAAGTGGG TGCCCTTGAG TCAACGGGAT CCCAAAAAGT ACCAAGCAGC CAAAGAAGCA ACCCCTACAC CTCCCAAGTC ACCAGGACTG CCCGCTTTGC CGTTCACGCG ACGTAAGCTC CCCGACGTTA CGACGTCGCC TCCCGGTGAG GAAACAGTTT GGGAACAACT GTTGGGTCCC AAACTCATTG TCAATGAAAA GTTGGTCAAA TGCACAACCA ACTGTGCTGC TCAGGGACAA GAACTTATTC TGCTCCTGTT TGGCGCCAAG TGGCGTGCAG AATGCAAGAT CTTCTACCCA CTCATGATCG ACTTCTTCAA ACTAATGGCT CACCAGCACA AAATGGAATG CGTGTACATC TCGAATGATC GTACCTTGAT GGAGTTTAAG GATATTTTTG TCAAAATGCC CTTTTTAAGT TTGCCAACAG GTACGGTGGA AATCAAGAAT ATCTTGGCGC AACGACTGAA AGTGAACGAC TTGCCTGTAT TGGTCGTCAT GACCGCCGAC GGTCGTGTCA TCACAACGGA AGGATACCGC ATGGTGGCAG CCCTGGAGCG TCGGAACGAG GACCAGGCTA ACAAACTGGT TGATGTCTGG AAAAAGGCGC AGACGTACAA CATCGATCAA GTACCAGCCG ATACCAGTCT CAAACATGGC AATTTGGCGC GGGGAACAGT CTACTGGCAA GCATAA
|
Protein sequence | MASSLQDESA LEAEIHKLEQ EIEKKRLEEQ ISLLEMQLHR AKPEAQRPNA PSNPSVVGQS TSTSRTSSPR SLEHMLPTQN VSWSGANGQS PEDDSIIALA TQDSEGVEYD EEDYDEEECD ENEYEEVFEE FVEYVTDDEE EEVGEQPLES VEEYEEEEEI QQAPVSPSRA PPRQWPPPAR PVNDEHVVQY VAPKKNTEAP EEPKQVAKPR KKWVPLSQRD PKKYQAAKEA TPTPPKSPGL PALPFTRRKL PDVTTSPPGE ETVWEQLLGP KLIVNEKLVK CTTNCAAQGQ ELILLLFGAK WRAECKIFYP LMIDFFKLMA HQHKMECVYI SNDRTLMEFK DIFVKMPFLS LPTGTVEIKN ILAQRLKVND LPVLVVMTAD GRVITTEGYR MVAALERRNE DQANKLVDVW KKAQTYNIDQ VPADTSLKHG NLARGTVYWQ A
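
The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates, and that the unspliced CDS ends in exactly one stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final stop codon encodes no residue.
    return gene_len // 3 - 1


# Values taken from the record: Start bp 413402, End bp 414727.
length_bp = gene_length(413402, 414727)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1326 441
```

Under these assumptions the computed values agree with the Gene Length (1326 bp) and Protein Length (441 aa) fields, and with the gene sequence ending in the stop codon TAA.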