Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47140 |
Symbol | |
ID | 7201933 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 588528 |
End bp | 589962 |
Gene Length | 1435 bp |
Protein Length | 375 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181405 |
Protein GI | 219122129 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.136457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTGA GAAACAGCCC ATCCAGGGGG AGGATCCATA GTCTAGCCGT GATAGCGACG GCCTTCATAT TCGTTTTGCA CATGTACGTT GGTGTATACG AAAGTGCGTC TTTTTACCAG CAAATACCCA TGATTGGCAC GTGGTCTACT GAGTCAAAGT CCCGTGTGGA GACAGACGAC ACCAAGCGCG CTACCCGCAA GCATTCCATG AGAGAGATTG TGCCGGAACC AGTTCCTCGA CCTTTGATTG AAACTCTCGT TACTGGACAA AACGTTACAG GGGACGTTGC GTGGCTCCTG AATATGGCGG TGCTGGGATT CTCAAAGTGC GGTACATCCT TCATGATGCG CTATCTCGGT CGACACGAAG AAATAGCCAT GCTGACTGAC GGTGAACACT GTGAGCTGAC AAGGCGCAAT GAAGATTCGG CCCTTATCAA GTCCTTGATG GATGGGCTTC CCAGCGGAAA GATAGCGCGC GGCTTGAAAT GTCCTATCCA TTTGGAAAGC CCCAGAGCCA TGCAGAGCTT CTCCCGATAC TTCCCGAACA CAAAGATAAT TGTTGGAGTC CGACATCCCG TCCTTTGGTA AGTAGTATGC TTGAACACAC ATGAGCAACC AGAAAGGGCT CTATCCTGAT TTCTTCTTCT AATAACATTG GAACAGGTTT GAATCCTTCT ACAACTGTGA GTGGATATAT GCTGAATTCT TCTTTCGGGA CTGTCTTTGT ACCATAACTC TCATCCAACA TATTTGTATT CAGATCGGCA TCGAGACGGT AAGACCCAGC TGCTGCCAGC CCAGGAACTG ATTGGAAAAT GTGCGGATTT GGGGCCCTTT GAAAAGGTGG CTTCGGTCTG TACCGAAGGA GCCAAGTTCC ACGAGCCTTT GGCTCGCTTG GGAAAGACGA ACATGCAGAG CACAGATGAG CGACAATACT TTTCGGCCGA CGCGCAGAAT GTTTCAGACA CCGATGCTTT CTCCGGTATG AAAGATCTTC GTGTACGACG TAGCCCAGCT CCAGGACAAG GACCACGACC GTTCTCAAAT CCTACTACAA GACTTGCAGA ACTTCCTGCA AGTCACAAAG CCGTTCCAGC CGATGGTGGT AGAGCCCAAA AGACTGCATG ACGGAACCCG TATTGATATC TGTGACCCCG AGTACAATCA TTTACGCGAG GTACTTGTGG ATACTGGAGT GAAGGCGTCG AGATGGATTC GGCGATTTTT TGTCCATGCC GAAGGCGTGA CGGTGTCGTC TCCCAAATTT TTGGACCAGG TGTTGGCCAA GTGGGAAGAA GATCCGTGCG AAGAACGCCG GGCCGAGAAG AGCTCCGCCC CATCCCCTTG AATCAGATTG CCATGTACAA GTTGTAGGAG TCGTCCAGTT GATAAATTAT GCAGGTATTG TTCGACCGGT ACGCATCATG TTACT
|
Protein sequence | MSLRNSPSRG RIHSLAVIAT AFIFVLHMYV GVYESASFYQ QIPMIGTWST ESKSRVETDD TKRATRKHSM REIVPEPVPR PLIETLVTGQ NVTGDVAWLL NMAVLGFSKC GTSFMMRYLG RHEEIAMLTD GEHCELTRRN EDSALIKSLM DGLPSGKIAR GLKCPIHLES PRAMQSFSRY FPNTKIIVGV RHPVLWFESF YNYRHRDGKT QLLPAQELIG KCADLGPFEK VASVCTEGAK FHEPLARLGK TNMQSTDERQ YFSADAQNVS DTDAFSGMKD LRNFLQVTKP FQPMVVEPKR LHDGTRIDIC DPEYNHLREV LVDTGVKASR WIRRFFVHAE GVTVSSPKFL DQVLAKWEED PCEERRAEKS SAPSP
|
| |