Gene Information |
Locus tag | PHATRDRAFT_50115 |
Symbol | |
ID | 7198917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 85290 |
End bp | 86696 |
Gene Length | 1407 bp |
Protein Length | 404 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184965 |
Protein GI | 219129585 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ACGGTCAACT TTCAACATCC ATCGCCTCGC CAATCGCAAC CCACAGCAAG GTAGGTAGCC AGCCAGCCAG CCTTTTGCAA CGAGTGACGG TGAAATAAAC TGTTCCCAAA GCCCAGGGAC CTGCAGTACG GTAGACCTGG GATTGATCGC CGGAATCCGA GTCTGCGATA CGCAACGCCT CGATTCTCGA TCATGTCGGG CATTGGGAGA GGTGATGGCA AGAGTCACCC CCGTGGCTTC CCGAGCCGCG GCACGGTTCG CTTCGCGGAA GGAACCCAGG GACCGAGTCA CTCAGCATTG ACGATGGTCA CGAGAGGAGG CGATGATGAC GATACGAAGC TCGAAGAGAC GACGCCCCGA CGGAAACGCC CCCGTCACGA GCGGCCCAAC GAAGATGAGA TCGACGATGT GGATGAATGG AATGATACAG AGGAGCAAGA GGATGATGCA CCCAGCGTAC CGACAGAGCA CGAACTCTTG CAAGCCAAAC AAGAGCGACG GAGAAAACGA GAGAAGGGTG GTCCGGAATT AGAAGGCGAC GAGGTTTCCA AAGAGGGCAG AACTAGTATA GATAACAGCA CATCCCTGGC TTCAGAAGGC ATCGAAATTG AACCGTTCCA TATGCATCAA GAGTCTTCGG ACGGAACTGG CTATTTTGAT GGCGATACGT ACGTTTTCCG CAAGGTTCAA GAGGAAGACG AAGAACCGGA TGCTTGGTTA GAATCGCTCA GCAGTCGTAA CGACGAAAGT GAAAGAGAAC AGCGAGGACA CCAGCCACTC ACCAAACCGC AACCTGACTT TGAGAAGAGC AACATGGACG ATTGGCCCAA GGAATCCTTG TACGCAAAAA TTATTCCTCT CGTGAGTGAT ACAGAAACTG TTATGCAAGC TATCGCGCGC TACGGGCATT TACTCAAGCG CAAGCATAAA ACTGGTGCGA ATATCGGAAG CAACAATGCC AACGACGAAT CTCGCAAATT CGCGCAATCC GCATTGAACG ACCTGACCGG AGCTGCGAAC GCACTATTGC TTAAAGGAAA CGTTGATATC TACCAAAAAA CAAGGAAAGA GCTCGTTTGC CAGCTACCTC CTCAACTAGG ACGAATGAAG ACAGCAACCA AACAAAAAGC GCTTTGGGAA TACATGGGCA ATCAAGATGG AGCAATTCAC GGTCCATTTA CCACCGAGCA GATGCATGGA TGGATTGCCC AAGGATACTT TGTTGGTCCC ACAGCGGTTC AAGTTCGATC AGTGATCGAA CAGCCCAAAG AGACAAGCCT AAAAGACGAC CTATTGTCCG ATCTGATGGA TGACGACGAT GACACCGCGG CCACACCATC TGCGCTCGAG AGCGTTCGCG GAGATTGGAT GCAATCGGAC CAGGTGGTAT TTACTAGCTA TACTTAG
Protein sequence | MSGIGRGDGK SHPRGFPSRG TVRFAEGTQG PSHSALTMVT RGGDDDDTKL EETTPRRKRP RHERPNEDEI DDVDEWNDTE EQEDDAPSVP TEHELLQAKQ ERRRKREKGG PELEGDEVSK EGRTSIDNST SLASEGIEIE PFHMHQESSD GTGYFDGDTY VFRKVQEEDE EPDAWLESLS SRNDESEREQ RGHQPLTKPQ PDFEKSNMDD WPKESLYAKI IPLVSDTETV MQAIARYGHL LKRKHKTGAN IGSNNANDES RKFAQSALND LTGAANALLL KGNVDIYQKT RKELVCQLPP QLGRMKTATK QKALWEYMGN QDGAIHGPFT TEQMHGWIAQ GYFVGPTAVQ VRSVIEQPKE TSLKDDLLSD LMDDDDDTAA TPSALESVRG DWMQSDQVVF TSYT
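
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the coding region consists of the 404 codons of the protein plus one stop codon; the function names are illustrative and not part of any database API. Under those assumptions the gene spans 1407 bp, of which 1215 bp are coding, leaving 192 bp. This agrees with the gene sequence as listed, where the first codons matching the protein (ATG TCG GGC, i.e. M S G) begin at base 193 and the sequence ends in the stop codon TAG.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int) -> int:
    """Coding bases for a protein of the given length, counting one stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the record above.
start_bp, end_bp = 85290, 86696
protein_aa = 404

span = gene_length(start_bp, end_bp)   # bases covered by the gene
coding = cds_length(protein_aa)        # bases needed to encode the protein
leading = span - coding                # bases not accounted for by the CDS

print(span, coding, leading)
```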