Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46220 |
Symbol | |
ID | 7201187 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 645451 |
End bp | 646638 |
Gene Length | 1188 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180477 |
Protein GI | 219119433 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0715961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTCCCTT GGAAGCCAGT ATAGTGTACA CAGCTTGGTT TTTTGCCCAA GCGAGGAACG ATGGGAAGCG TATCCGGCAA GAACGATAAG AGTCACGCCG TATGTGCCAG TGACCAATAC ATTCGGCGTG CCCATCACGC TGGTTCGTGG TACGAAAACG ATCCCGTTAC CCTCCGTACA ACGTTGCAGC AGTACCTTGA TCACGCTGCC TCGGACAGCA GTACCAAGAA CGGCCGTGTT AATACAAGCG GTGCCGATGG GCGAATATTT CTGCGAGGTT TGATCGTCCC ACACGCGGGC TATTCGTATT CCGGACCGAC TGCGGCCTAC GCATATCAAC CCCTTTTCCA GGAACTATCG CGAGTCGACT GTCCCATTCA AATTTTGCTC GTGTTGCACC CATCCCATCA TGTTTACCTA GACGGATGTG CTATATCCAA CTCCCACACA ATCAACACAC CGGTAGGGAA CTTAGCGACC GACGATGGGA TTAGGGAAGA ATTACTCCTC CTCAACCACA ACAATAAATC TATTTTTACC GTCATGTCAC AAAAGGAAGA CGAGGAGGAA CATTCTGGTG AAATGCAGTA TCCGTATATA GCCCACATTC TTCAAGCATG TGGAAAACTG CACAACAATG GCAGTAACAA ACCAATTCGT GTGCTGCCAA TCATGTGCGG AGCTCTATCG AACCAACAAG AAGCAAGCTA CGGGCACTTG TTGCAACGTG TTATTGCTCG AGAGGATGTT TTGACAATCG TTTCGAGCGA TTTTTGTCAT TGGGGTCCAC GGTTCCGGTA CCAACCAATT CCTACCAAAG AAAAAAGTTA CAAAGATTCG ATGCCTCTTC ACGAATTTAT CAAATCCCTG GATCGCCAGG GCATGGATGC CATCGAGGCG CAGCAACCGG GGGCGTTTGC AAATTACTTG GCACGCACAC GCAACACCAT TTGCGGCCGA CACGCCATTG CCGTATGGAT GCAAGCCATT GCTGCATCCG AAACTACTAT TGGCAACAAG GACGACACCG ATCCAACCGG TGAATTGCTG CGAGTGCGAT TTGTGCGCTA TGCACAGAGC AGCCCTGCTG AAAGCCTACG GGATAACAGC GTGAGTTATG CCGCAGCTTT AGCCACAAGG ACAATTGCAG CAAAACCAAA TGATGAGTCT GCTCTTTATG CTCTTTAA
|
Protein sequence | MGSVSGKNDK SHAVCASDQY IRRAHHAGSW YENDPVTLRT TLQQYLDHAA SDSSLIVPHA GYSYSGPTAA YAYQPLFQEL SRVDCPIQIL LVLHPSHHVY LDGCAISNSH TINTPVGNLA TDDGIREELL LLNHNNKSIF TVMSQKEDEE EHSGEMQYPY IAHILQACGK LHNNGSNKPI RVLPIMCGAL SNQQEASYGH LLQRVIARED VLTIVSSDFC HWGPRFRYQP IPTKEKSYKD SMPLHEFIKS LDRQGMDAIE AQQPGAFANY LARTRNTICG RHAIAVWMQA IAASETTIGN KDDTDPTGEL LRVRFVRYAQ SSPAESLRDN SVSYAAALAT RTIAAKPNDE SALYAL
|
| |