Gene Information |
Locus tag | PHATRDRAFT_47097 |
Symbol | |
ID | 7202172 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 391708 |
End bp | 392892 |
Gene Length | 1185 bp |
Protein Length | 287 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181200 |
Protein GI | 219121702 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
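The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record. It assumes that Start bp and End bp are 1-based and inclusive, and that the stop codon is counted inside the gene span. The function names `span_length` and `coding_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields above.
START_BP = 391708
END_BP = 392892
GENE_LENGTH_BP = 1185
PROTEIN_LENGTH_AA = 287


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per codon).

    Counting the stop codon is an assumption, not something the record states.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


span = span_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)
non_coding = span - cds  # bases inside the span that do not encode protein

print(f"span from coordinates: {span} bp (record says {GENE_LENGTH_BP} bp)")
print(f"coding length incl. stop: {cds} bp")
print(f"non-coding within span: {non_coding} bp")
```

Under these assumptions the span computed from the coordinates matches the listed Gene Length, and the bases left over after accounting for the protein are consistent with the "Is gene spliced | Yes" field.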
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTCCA TGAAAGCATC ATTGGCTTTT CTGTGGCTTC TTCTTGCCTC TAATAGATTT TGTGTCCAAT CGTTTCGCCC GGATTGGTCG AATCACCAAA CGAGACTTTC TGGAAGCTGC GTCTTCAGTG CGAAAAATAT TGACCGTCCA AATACGACCG ACAACAAGGC CATGGCGTTT CTTCGAAGAA TGGGTCGCGT AGGGGGCAAT CGTGATTTTA CGCACGCAAT CGGAATCGAC GAAGGTCCTT CAACCAAGTC CACCGGAAGT GGGAAAAAAG TGAGTGCAAT GGATAAGCAA CGCCGCCGTG TGCCTGATCT GGTTGATTCA GATCCTATCT CAATACTTCG CTGGGTTGCC ACCCTCTCTC TTCCAGCTGC AGAAAAAGAA AGCCGCCTTT CAGTCTTGTG CACTTACAGG TGTCATTGAT GATCTCTCAG AACCGTTTCC AACCACCAGT TCCGGGTAAG CCGAAATCTA GTAATGAGTG TGCCTAACGT TGGTCGTGTG CACGCCGCAT CTACTGACAT ACTAAAGAGA CACCTACTAA CTTCAAATTG ATTCTACAAA CAGAACACAG TGGGCTGGAT ACACAGATCA AGTAATGGGC GGTGTATCAA CTGGGCATCT TTGTCGGGAA GATTTTGACG GACGAACGTC TAACGTTTTG CGTGGCAAAG TCAGTTTGCG CAATAACGGC GGCTTTATCC AAATGGCGAC AAATCTGGCG CACGACGCAA AAGACTCTAG GCTCGTTGAT GCGTCCTCCT TTGACGGCAT AGAAGTAGAT GTACAGTATC AAGGAGAACA AGAGGAAGAA ACATTCAACA TACAGTACGT ACGAAGCGAT CGGTCATTTT TTCGTAAACA TTCCGATTTC CTCACGCCTA TCCATGGCTG CTCTTATTTC TCCAGCTTGA AAAATGTTTG CTGCCCGCTC CCGTATAGCT CATACCGTGC ACGGTTTTCC GTTCCGAAAG GATCCTGGAT GACAGCTCGG GTTCCATGGA CAGACTTTCG CGGACACGGG CCGGGTGCTT CCGATATTCC GTTCTCTTCT AATTCGCTGA CAAGAGCCGG GATTGTGGCT ATTGGTAAAG AAATGGAAGT GCTGCTAGCT GTTTCTGGCC TCCGTTTCTA CAGAGAAAAA AATTAAACCC ATTGTAGAAA AAGTTTGTTT ATTTC
|
Protein sequence | MVSMKASLAF LWLLLASNRF CVQSFRPDWS NHQTRLSGSC VFSAKNIDRP NTTDNKAMAF LRRMGRVGGN RDFTHAIGID EGPSTKSTGS GKKKKKAAFQ SCALTGVIDD LSEPFPTTSS GTQWAGYTDQ VMGGVSTGHL CREDFDGRTS NVLRGKVSLR NNGGFIQMAT NLAHDAKDSR LVDASSFDGI EVDVQYQGEQ EEETFNIHLK NVCCPLPYSS YRARFSVPKG SWMTARVPWT DFRGHGPGAS DIPFSSNSLT RAGIVAIGKE MEVLLAVSGL RFYREKN
|
| |