Gene Information |
Locus tag | PHATRDRAFT_51897 |
Symbol | |
ID | 7200375 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 641841 |
End bp | 643541 |
Gene Length | 1701 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179888 |
Protein GI | 219118217 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTTC AAAAGCGAAA CGATGTCTCG ATTTACTGTC TGTCTTCAGG GCCAACTCTT CCCGAATGGC TGGGTGATCG GGCCAAGAGA AATTTAGCGA AAAAAGACGA GAATGTGAGA AGAAGAATCG AGCTAATTCA GGAATTTCAG ATGCCGGCCT CGTCGTCTCG CTTGGCACAA TCGGCAGATG GCCGCTATAT CGTCGCCGCT GGAACGTATC CACCAAGGAT ACGATGTTAT GATGTTCATG AGCTGAGCAT GAAATTTGAG CGATATGTTA ATGCCGGTGT TCTGGATATT GCTATGTTGG GGGATGATTA CGGGAAGATG GCTCTACTGT TGGATGATCG AACAGTTGCA TTCCACGCTC CGTACGGGGC ACACGAGGCA ATTCGCATAC CATCTTTTGG TCGGACCATG GCATATGAAC CTACAACTTG CGAGCTTCTC GTGGCTACAA AAGGAAACCA GGTATTTCGT ATCAATCTGG AAGAAGGTCG TTTCAGTGAA CCATGGAGTT TTGAACCCTC AGAAGCGTCG GGCACTTGCA TCGCGGTCAA TCAAGCTCAC CCTCTGACTA GTGTCGGCTG CGATGATGGG ATTGTTAGGT TTTGGGACAA TCGAAGTCCA GATTCGCTTC TCAAACCCTT TATGAAACTT GACGTTCAGA GTGCGACGAA GGGATATGGA TTTGCGGAAG ACGCTTACAT AAACGGAAAC CCAAGCGAGA TAACTTCGAT CGCGAATGAT CCCAGTGGTA TGTTTATGGC TGCTGGTACC GCGAATGGAA TTGTTGCGCT GTACGATATC CGTTCCAGCA GACCACTGCA CATCAAAGAA CACAAACATG GGCTGCCAAT TCACACTGTC AAGTTTCATG CGGGTTCAGG TATGGTTCTC AGCTCGGACG AGAAGCTCGT CAAAGTATGG AGATACAAAT CGTCCATGGA TTCACACATG GGGCTTTCTG CAAAAGACAC ATATCCTGCG GTAAATGAGG ATTCGTCGCT TGGATCAGTA AAGGTAAATG TAGAAGGGAC TGGAAAACTC CAACACTTCA TTGTGGCCGG TGACGAACAC GATCCGTACG GCGACAAGAG CGGAATCATT CTTTGTGCTA CGGACCAACC TAAACTCGAA ACCTATTACA TACCAGCTAT TGGCATAGCT CCCAAGTGGT GCTCCTTTTT GGAAAGCATT ACTGAGGAGT TAGAAGAGCG AGATCTCAAT CGAGAAACCA CTGGAATCAC TTCAAACTTG GTTCGTGACG GTCAAGAAAC CATTTACGAA AATTACAAGT TCGTAAGTCG GGATGATCTG GAAAAGCTCG GTATATCCAA CTTAGTTGGG ACGCCGCTTC TTCGTGGCTA CATGCATGGT TTCTTCATGG ACATTAATTT GTACAACAGA GTAAAATCAG TGGCGAATCC GTTCGAGTAC GAAGACTATC AAAAGAAGAA GCTGAAAGAG CGTTTGGAGG CTAAACGTTC GAGTCGTATA ACGCCTCGGC CTTCTGACAA GAAGCCCAAG GCGGCTGTCA ACGCAGACCT TGCCGAGCGA TTGCAATACA AGGCCTCCGA TTCTACCAAA GCGGGAAAGC TCGCCAATCA AGTCTTGTCC GATGACCGTT TCGGTAACTT GTTCACCAAC CCTGACTTTC ACATCAACGA GGAAGACGAT GACTTTAAAC TCCGCAACCC C
|
Protein sequence | MNVQKRNDVS IYCLSSGPTL PEWLGDRAKR NLAKKDENVR RRIELIQEFQ MPASSSRLAQ SADGRYIVAA GTYPPRIRCY DVHELSMKFE RYVNAGVLDI AMLGDDYGKM ALLLDDRTVA FHAPYGAHEA IRIPSFGRTM AYEPTTCELL VATKGNQVFR INLEEGRFSE PWSFEPSEAS GTCIAVNQAH PLTSVGCDDG IVRFWDNRSP DSLLKPFMKL DVQSATKGYG FAEDAYINGN PSEITSIAND PSGMFMAAGT ANGIVALYDI RSSRPLHIKE HKHGLPIHTV KFHAGSGMVL SSDEKLVKVW RYKSSMDSHM GLSAKDTYPA SGIILCATDQ PKLETYYIPA IGIAPKWCSF LESITEELEE RDLNRETTGI TSNLVRDGQE TIYENYKFVS RDDLEKLGIS NLVGTPLLRG YMHGFFMDIN LYNRVKSVAN PFEYEDYQKK KLKERLEAKR SSRITPRPSD KKPKAAVNAD LAERLQYKAS DSTKAGKLAN QVLSDDRFGN LFTNPDFHIN EEDDDFKLRN P