Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48036 |
Symbol | |
ID | 7203252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 789215 |
End bp | 790336 |
Gene Length | 1122 bp |
Protein Length | 296 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182299 |
Protein GI | 219123995 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.43007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGATTAAAAC TTTATATATC TTTGTTGCAG CAATTCTGCT ATAAATTCCA AATAGCTTCT CGCTGACGCC AACCTTCAAA CATATATTGA TAAAGCTAGA TCGATTAGCC CCGACGACAA CGTAACCAGA AGAATGGGAA CGGGTACTTT GTTTGCTTCC CCTCGAGTAA CAGAATGAAA ATTATTTGCC GGATGATGCA GCTTCTTCCT TTCGGAAGTC CTTCTTCTCA AACAACACTT CTGGTTTCTC GCGATAGTCA CACAACCGAA TTCGAAGTTA CATCAAATTC TACGAAGCTG GTTCAGCAGG AAATGGCTTT CCAAATCTTC TCACTGAACG ACAATGGTGT TGTCGCTATG CAACAAGGAA ATTATGCTGA AGGGGGAAGA TTTTTCCTTC AAGGTTTGGA TAGTATACGC CAAATCCTCG AAACGAAGGC TCCAGCTGTT AGTGACTCAG AAAGAGCTAC CGCTTTCTGT AGCTTCGCAG CGAGCTTGAC ATCTAAAGCT ATCGGCTCCG AAGCATGTTG TGCAGTGGGG CCCAAGCCAC ACCACACAGC ATTAGAAAAT AGGACATCAT GGCAACTTGA ACAGCAGCAC GAAGATGCCT TTGATTTTTT TGCAAGGGTC TTTTTGGTAT CTGCTTCGGA CTTAGGTCAA AGTTCGTTCG ATGCTATTTT GGGAGCAGTG TGTGTTTTGC TCTTGTTTAA CGCGGGAGTT GCCTTCCATG CAGATGGAGT TCAGACCGGT AGGTCAAAGA TGCTTCTCAA AGCATACAAA TTTTATGAAA AGGCGGTAAG CATTGTGCAA CAAGAACATC CCTCACCCTG GGAGGCAGGA AAACTCCCGC TCGTCTACCT TTCAGTACTG GCAAACATGG CCCATATACA GTCAGAGCTT TTCGAAAGTG ATGCTCTTCA TCAGACACAA CTACTTCTTG AGAAAGTTCT AAACTGTGTT GTTCATCGCT TCTCGGCGGC CTTGGAAGCG CACGAACTCC GCTTTTTTCT GATCAAGTTA ATGCTGCTCA ACTGCCAGAC ATCAATTCAA GCGGCAGCTG CGTGAAGGAA GCCTTGAATG GATTTCTCCT TTCCCAAGAA ATAAAAGCAA TGTAGAGATA AA
|
Protein sequence | MKIICRMMQL LPFGSPSSQT TLLVSRDSHT TEFEVTSNST KLVQQEMAFQ IFSLNDNGVV AMQQGNYAEG GRFFLQGLDS IRQILETKAP AVSDSERATA FCSFAASLTS KAIGSEACCA VGPKPHHTAL ENRTSWQLEQ QHEDAFDFFA RVFLVSASDL GQSSFDAILG AVCVLLLFNA GVAFHADGVQ TGRSKMLLKA YKFYEKAVSI VQQEHPSPWE AGKLPLVYLS VLANMAHIQS ELFESDALHQ TQLLLEKVLN CVVHRFSAAL EAHELRFFLI KLMLLNCQTS IQAAAA
|
| |