Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50054 |
Symbol | |
ID | 7198803 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 261659 |
End bp | 262884 |
Gene Length | 1226 bp |
Protein Length | 352 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184931 |
Protein GI | 219129512 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.702034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACGGACATA GAAAGATCTA TCCGTGGAAA GTCCAACAAA ACAACAACAG CAACACAAAA CAGCACACAC GAGCGGCACA TTAACGAAGG AACACCGAGG GAGTGAGAAG GCCTTTTCGT GGATGAACGG TTGGCCATGA AAAAGCCCGA AGTACTGCGT GTTGTCAAGT GTTGCTTGCC CGCAATCCTC TTCGATTATA AGAGAAAAAA GGCTAGTGAG GATGCTAGCG AAATAAATAT CGACTTGCAA GTTGACGATC TACAGGTGAC AACATTGTGT CGGTTGTGGG CAGGAATGGG GCACATTCAT CGCGTCCGCA TTTCCTTACC GGTAGGCTCA ACAACCAGTG GCAATAGAAC TACCGGCAAC GACGATATTG CTATTAAGCA CATAATTCCG CCTCCTTCAT CACAGCGGTC GTTCGGCGAC CATCGCAAAG CCTCCAGCTA CCGCGTCGAA GCCAACTTTT ACGAAAATCT CGCACAAGAA CTCATCGCCA AAGGTGTCAG TGTACCCACT CCGTACCATG TGGAGCGGGG TAAGGGTGAT TCTGTCATCA TCGCCATGTC GTACCACGAA AGCAAGGTCA ATCCGACGGA AAACCAGCAA CGCGTGCGGC TCGTCTTGTC GTGGTTGGCA CAATTTCACG CTATGTATTG GGATGCCGAT GCGGCGGATC GCGTGGTCCA ACAAGCGGGG CTGCAGGCTG TGGGAAGCTA CTGGTATCTG GCAACGCGTC CAGATGAACA CGAGGAGATG CCGGACCACG GCTGGCAAGG AAGACTAAAG CGCGCCGCCC GTGCCATTGA TGCGCGGCTA CAGCGCGATC CTTTGCAATG CGTTATACAC GGGGACGCCA AGGATGCAAA CATCTTGATG GACGAGCATG GAAAGGTCAC CTTTTGTGAT TTCCAGTACG CGAGATCTAG CTTACTTTTT CTGCAGCTCC GTATCAATCG ACGACGAAAA GGAAGCTCTG GAATATTATT GGAACGAGCT GAAAGCTCGA TTGCCACTAA ACGTGTCGCC CGCGCCTACT TGGGAGCAGC TGCAAGATTC CATGGAATTT GCATATGCGG ACTTTTATCG ATTCATGAGT GGGTGGGGAT TTTGGGGGTC GGGAGCTGAA CGTCGCGTAA TTGCGTTGTT GGACCGGCTC GATCACGGTA GCAAGTTGGC GACCGAGGAA GACTATGACG AAGCCGTACA ACGAGAGTTT GGCTAA
|
Protein sequence | MKKPEVLRVV KCCLPAILFD YKRKKASEDA SEINIDLQVD DLQVTTLCRL WAGMGHIHRV RISLPVGSTT SGNRTTGNDD IAIKHIIPPP SSQRSFGDHR KASSYRVEAN FYENLAQELI AKGVSVPTPY HVERGKGDSV IIAMSYHESK VNPTENQQRV RLVLSWLAQF HAMYWDADAA DRVVQQAGLQ AVGSYWYLAT RPDEHEEMPD HGWQGRLKRA ARAIDARLQR DPLQCVIHGD AKDANILMDE HGKVTFCDFH SVSIDDEKEA LEYYWNELKA RLPLNVSPAP TWEQLQDSME FAYADFYRFM SGWGFWGSGA ERRVIALLDR LDHGSKLATE EDYDEAVQRE FG
|
| |