Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_18927 |
Symbol | |
ID | 7197849 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 380662 |
End bp | 381890 |
Gene Length | 1229 bp |
Protein Length | 312 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178495 |
Protein GI | 219115399 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCAAAAAAG CCTTTCATTG CCCCTGTCAC CTATCGACCT CTTCCTGCTG TCATCAGTAT TGTCGCGATG ACAACAAAAG AAACCTATGC TATGCCGAAC GATGAACAAT ACGAAATGCC TACAGAACAG GCTGTTGCTA AGAAACGTAA GCCTCGAAGT CTACTATTTG TGCGGTTTAT GTTTGTTTTC AGTTCGGCTT CTAACACGTT TGGTCACTCG GCAGCGGATT CGGCATTAAA GAGTTTCCTC TCGGGAGGCG TCGGCGGAAT CTGCGTTGTG GTCGTGGGTC ACCCTTTGGA TCTCATTAAG GTAAGAGTGT CGACACTGGC TTTGTTGCGT GGATATCAGC GAATGGATAG GTCTCAATGT TGAGCCTTGC ATGAATATTA GGTTCGCATG CAGACAGGTG GCATTGCTGG AGCCAGTGGT TCAGTTCTTG GTATTTTTGC CAACACATTC CGCTCGGAAG GAATGCGTGG CCTGTATCGA GGTGTTTCGG CTCCTTTGCT GGCCGTGAGC CCCATTTTCG CCATTTCATT CTGGGGCTAC GATATAGGGC AACGCCTCGT CCAGTACGTC CAGCCGAGTC CCGGAGATCT CTCGCTCACG CAAAAATGTG TTGCTGGGGG TCTAAGCGCA ATTCCAACAA CGGCGATAAT GGCTCCGTCG GAACGTATCA AATGTCTGTT GCAGACCAAT GGCGATAAAT ACAAAGGCAT GAAGGATTGC GCCACTGCGA TTTATCGAGA AGGAGGATTC GCTAGCTTGT TCCGGGGAAC GGGAGCTACA CTATTGCGAG ATGTCCCGGG ATCCATGGCG TGGTTTGGCA CGTACGAGGC CGTCAAGATG GGAATGATGA AAGCTCAGGG AATCGAGGAC ACGTCCCAAC TTTCTCCATC GGCAGTACTT ACGGCGGGAG GGCTAGCTGG TATGGCCTGC TGGGTCATCT CTATTCCTGC GGATGTTTTG AAATCACGTT ACCAGACAGC TCCCGAGGGC ATGTACCGTG GCTTAGGCGA CGTGTACAAG AAACTAATGG CAGAAGAAGG GGCGGGTGCA TTGTTCACCG GAATTCGCCC CGCTTTGATC CGCGCTTTCC CAGCCAACGC TGCGTGTTTC TTCGGAATGG AAGTAGCCAG AAAGGTGTTT TCATTCATGG ACTAGGATTT GCGGAAAAGT GCTGTAAGCG AAAAGCGGGT TTGTAATAGA TGAGGTTTAC CGTGTATTC
|
Protein sequence | MTTKETYAMP NDEQYEMPTE QAVAKKPDSA LKSFLSGGVG GICVVVVGHP LDLIKVRMQT GGIAGASGSV LGIFANTFRS EGMRGLYRGV SAPLLAVSPI FAISFWGYDI GQRLVQYVQP SPGDLSLTQK CVAGGLSAIP TTAIMAPSER IKCLLQTNGD KYKGMKDCAT AIYREGGFAS LFRGTGATLL RDVPGSMAWF GTYEAVKMGM MKAQGIEDTS QLSPSAVLTA GGLAGMACWV ISIPADVLKS RYQTAPEGMY RGLGDVYKKL MAEEGAGALF TGIRPALIRA FPANAACFFG MEVARKVFSF MD
|
| |