Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_12331 |
Symbol | |
ID | 7200846 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 822311 |
End bp | 823351 |
Gene Length | 1041 bp |
Protein Length | 314 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180324 |
Protein GI | 219119115 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGAGAGACCC CACCGTACGT CTTTTGGACT CCTCCGCTAC CAAACACGCC GACGTCCTTG GCTTTGGTCG GCGATTTGGG CCAAACCGAA AATAGTACCC GAACCATGGG ACATATTTGG CGTTCGACGC ATCAGAACTC TAGATACTTG AGTGGAAAAC TCCCACCCGT ATCACAGCTT TTGATCGCCG GGGATATGTC GTATGCCGAC TCGGATCCGT ACCGATGGAC GTCTTGGATG GAACTGATGG AACCCTTGAC ACGTAGTCTT CCCTTACACG TCGCGGCTGG CAACCACGAA ATCGAATGTA ACACGGACTC AAACGATATC TTTGTTCCCT ACGAGCATTA TTTTCGAAAC CCGAATCGAG TACGCGACGC CGAGATGCAA CCAATTTCCG AGCAGTATCG AAAATCTCTT TGGCGCGAGA GCTGCTCAAC TCCCAGTGCT TTCCAAGGCC AATACAACTA CGGGAACTCT TTCTATTCGT ACGATCACGG GTCGGCCAAG ATTGTTGTCC TGAATTCATA TACCAATGCA ACCGAAGGTT CAGCGCAGTA CGAATGGACG CAAGCGGAGC TGAGGAGCAC AAATCGTACA CGTACACCCT GGCTGATTGT ATCGTTTCAC TCTCCTCTGT ACACGACTTT TTTGGGCCAT GTGAACGAAA TAGAAGCAGT CAACATGAAA CAGGCTATGG AGCCCTTATT CTGTCTTTAC GGCGTCAATC TAGTCATTTC TGGTCATGAC CATGCCTACA TGCGAACGCA CTCGCTGTAC GAAGACTCCG TCGACACCGA AGGCCGGTCT CCAATATATC TTACCTTAGG AGCTGGCGGA AACCGGGAGC AGCATTCCGC TGGCTATCGA CAAGATGAGC CCGAAACTTG GGTGGCTCAT CGGACGTTGG AAGATTTCGG GTATGGGCAT CTGTTCCTAG CCAACGCGAC TCACGCCCAA TTTCGTTGGA TTCGGGACGG CACGTCCTCT TTCGGTGTGA ACGACCAGGT TTGGATTAAG AATGCCCACG TCCTTTCTTA G
|
Protein sequence | GETPPYVFWT PPLPNTPTSL ALVGDLGQTE NSTRTMGHIW RSTHQNSRYL SGKLPPVSQL LIAGDMSYAD SDPYRWTSWM ELMEPLTRSL PLHVAAGNHE IECNTDSNDI FSCSTPSAFQ GQYNYGNSFY SYDHGSAKIV VLNSYTNATE GSAQYEWTQA ELRSTNRTRT PWLIVSFHSP LYTTFLGHVN EIEAVNMKQA MEPLFCLYGV NLVISGHDHA YMRTHSLYED SVDTEGRSPI YLTLGAGGNR EQHSAGYRQD EPETWVAHRT LEDFGYGHLF LANATHAQFR WIRDGTSSFG VNDQVWIKNA HVLS
|
| |