Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_17967 |
Symbol | |
ID | 7196968 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2452771 |
End bp | 2453854 |
Gene Length | 1084 bp |
Protein Length | 315 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176983 |
Protein GI | 219110463 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.827795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCAGCCCAA CCCAATTAGA ATACGATGGG ACGTCTATCT CGAGACAAAC GCGACGTCTT CTATCGACTC GCCAAAGAAA AGGGCTACAG AGCTCGATCA GCATTCAAGC TGCTCCAGGT TGACGCCGAG TTCGACATTT TCGGTGCCCG TGGGGCTCCT GCCTCACTCG GTAACACAAT TGAACCACTG CGCGTTCAAC GAGCCGTCGA TCTGTGCGCC GCACCCGGTA GTTGGTCGCA AGTGCTCAGC GATAAGCTTT ACGAATTAAA TCATGCGACA GGGGATGCCG GCGCAAATTC CGATCAAGCG TCCGATGCTC TAGCCAACGA CTTGAGCAGG ACTAGTTTAG ACATAGACGA ACAACCCGAA GAACCTAGTA TTGTTGCAGT TGATTTGCAG CCAATGGCAC CGATTGATGG CGTTTTGTGT CTACAGGGTG ATATAACAGC TCAGTCCACC GCACAAGATA TTATTAAGCA CTTTCAGGGT AATCGCGCGG AACTCGTGGT CTGTGACGGC GCTCCCGACG TCACGGGACT GCACGACGTT GACGAATACT TACAAGGACA GCTCTTGTTA AGCGCCATGA TGATCACCAC CCACGTACTG TGCGAAAGGG GAACTTTTGT GGCCAAAATA TTTCGTGGCC GTAACGTTGG ATTCTTGTAT GCACAATTGC GACTGTTGTT TGAACGGGTC AGTATTGCGA AACCCACCAG CTCGCGCAAT TCGTCCATGG AGAGTTTCGT TGTGTGCCAG CGATTTAAAG GAGCTCCGTA CTTGAATCTC CCGTTGGATC TCGGGGGCTA TCTAAATTTG CGGAAACTGC GACAGGGCCA AGCCGGCGAT GGAGATGGTG CGGATAGTCA TGACGAGCTT TCTGATCCTT TGGATTCCAT CGATATTCCC TTTCTTGCGT GTGGCGACTT GTCCGACTGG AGTCCATCCG GCGAGATCTT GGATGCAGAC AAGAGCTACC CAATCGATGA GAGCCAGTAT ATTGCTCCTA TTGCGCCTCC AATTCAACCC CCGTATCAAA CCAGCATGGA AAAACAAGCG GAAGACCGGC GTAAAAAGGT GTGA
|
Protein sequence | MGRLSRDKRD VFYRLAKEKG YRARSAFKLL QVDAEFDIFG ARGAPASLGN TIEPLRVQRA VDLCAAPGSW SQVLSDKLYE LNHATGDAGA NSDQALDIDE QPEEPSIVAV DLQPMAPIDG VLCLQGDITA QSTAQDIIKH FQGNRAELVV CDGAPDVTGL HDVDEYLQGQ LLLSAMMITT HVLCERGTFV AKIFRGRNVG FLYAQLRLLF ERVSIAKPTS SRNSSMESFV VCQRFKGAPY LNLPHDELSD PLDSIDIPFL ACGDLSDWSP SGEILDADKS YPIDESQYIA PIAPPIQPPY QTSMEKQAED RRKKV
|
| |