Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47099 |
Symbol | |
ID | 7202173 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 396116 |
End bp | 397366 |
Gene Length | 1251 bp |
Protein Length | 324 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181372 |
Protein GI | 219122060 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAAGTTGC TGTCGGTCTA TACACCCCTA AAGGGATCCG CTTCTAAAAG TACAACGCAT CAAAAGAGCT CCATCACAAT GGCGTCCGAC GCAACACCGT CATTTTACGA AGATATTGAC AATCTCAAAA AGCTATGCGA CTTTTTAAGG GGTAAACACG GACCTCCGGT TCGTGAAGCT TTGTTAATAG AGAAGCGCGT TCACTACATG AAAGGTGAGT TTTTCGAGGA GAGCCCCAAT ATCTCAGACA AGTTCGTGAC AACAAGGAGA TGGGAAGAAA CTTCATTTTC TGATTTTGCG TATGCAATCT TGCCTTATTT CGGCGCAGGT GAAAAACTCG TTAATTTTTT GGTGGAACCA AAGAAGGGTA CGAAATGGCC GACCAATCTA CCCAAATTTG CCAGCCGATC AGACGCCATT CTGGTTTGCA AAGAGCTGTG CAAACAGCAA TTTTTGCTAA GGTCCGAAAA GCGCGGCAAA GGAGAACTAG ACGTACGTAT CTTGCGCTTG GCACAACCGC TATGTACTAG TGTCGATCGT TGTTCTAACC GCGGCCACCC ACATGTGTTC TCATAGGTTG CCCGTGTTCG TGATTTCGAC GAGGCTGGCT ATTTTACGTG GGTGTACGAA GGTGATAAAA CCATGAGTCA CCTTATGTCG GCCGGTCTGA TTGTGGGGTT TCTCTTCTGC GTGTGTTTTC CGATTTGGCC ACAATTTCTC CGCGTTTTTG TTTGGTACCT GTCCGTCACG TTGCTGCTTT TCATCTTTAT CCTCGTGACT TTCCGGGCAC TGGCATTCTT ATTCATTTGG ATCATCGGCT TTGAATTCTG GTTTTTGCCG AACTTGTTTG ACGAGACTTT GAGCTTTGTG GACAGTTTCA AGCCAGTATA TTCGTTCGAC CCCGCAAAGC CTGGACAGCT ACCCTACCGG ATTGGTGTAG CGGTGGCGTT TGGATCGTTT TGTTACTGGG CCGTTACGCA GCCGTCGGAA TTTGATGGTT TCCGGGCAGC TCAAGGGGAT TTCTTGAAGG ATCTGTACGC TGGCACCCTG CTATCGGACA TGTCGCAAGA GGATAAGGAG AATATCGACA AGCCAAAAAT ACAATCATTA GACGATCTTC TTAAAAGTTT GGACCAAGAT ATCAAAGAGA ATGCAGACTT CCTTTCGGAA GAAGACGAGG ATGAGAAGCT GGACTCTCTG CTCGATAATC TTGTTGATAT TGAGGAAGAC ATTGCGGAAG AAGAAGAGTA A
|
Protein sequence | MASDATPSFY EDIDNLKKLC DFLRGKHGPP VREALLIEKR VHYMKGEKLV NFLVEPKKGT KWPTNLPKFA SRSDAILVCK ELCKQQFLLR SEKRGKGELD VARVRDFDEA GYFTWVYEGD KTMSHLMSAG LIVGFLFCVC FPIWPQFLRV FVWYLSVTLL LFIFILVTFR ALAFLFIWII GFEFWFLPNL FDETLSFVDS FKPVYSFDPA KPGQLPYRIG VAVAFGSFCY WAVTQPSEFD GFRAAQGDFL KDLYAGTLLS DMSQEDKENI DKPKIQSLDD LLKSLDQDIK ENADFLSEED EDEKLDSLLD NLVDIEEDIA EEEE
|
| |