Gene Information |
Locus tag | PHATRDRAFT_43127 |
Symbol | |
ID | 7196741 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2105155 |
End bp | 2106510 |
Gene Length | 1356 bp |
Protein Length | 322 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177447 |
Protein GI | 219111391 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGAACTGT GATCGCACAT AAAAACACGG AACGTCCACC CATTGTAATC GTTAAGAACC ATGTCCACGA TCAACTCTGA GACCGGCGAA GTCGTCCAAG GACGCTCGCC ACGCTTCACC ATGTGGGTCG TCTTTCTCAT CTTCGCAACC ATCTCGTTGG GATCTTCCGT CGAAGTGGTG AGTGACGCAA TCCTTTCGTG TACAGGATCT GTTCTTACGC CTCCTTGATT CATAAACACT GCCTATGCCT CTTCTGGAAA GATCCGGCAA AACGCTGCAC AACGGAAGAC TTGGCATTGG TTCTCTCAAC GACAAGTATC CCGCGCTATG TATTGACCAG TCGTGTTGCG CGACCCGTAC GCAACGCACT GTGTGCCAAG GTTCTGTGCC GCAGTAGATG ATCGCACACT CACACTTTCC TTGCTTCAAT TCTTTGGGAA TAGAAAAATG GATCTAGTGA TCCCAATAAG GCTTCGAAAT GGGCCATTGC CTGTTCGTCC GTTACGTTTG CCGTCACGTT GCTTGTAACG ATTGCTCATG TGTTGCCTGT CGCCAGTACG CTTATCGTCG GCACCAAACT AGAAGGAATT ATCTCGCTTA TTATGGCAGC TTTCTGGGCG GCAACTGTTG CAATCGTCAC CAACGCGAGC AACGGACTCG GCGTCGAATC GCAGACCAAC ACGGTTTTGA ACGGCAATCT CTACTACGCG AGCTGGGGAG GATTCATTAC ATCAATTGTG TTGTTGGTGA ACTATCTCAA GGAAATCTTC GGCGTGGATG TGGTCGGTCA AGTCCGCAAT CGCTCCGCGC GTCTCAGCTT GTGGGCCGGC CTCTTGGCGT CCGCTTTCAT TGTCTTGGGA TCCAGTGTAC GTGTGTTCAA CGGAGATTGC GATGGCAGCT CGGCAAGTTC GCAAGAGTAC TGCAAGCGAA CCAAGTTCGC CATTGCCAAC GGGACTATCC TCTGCTTTTT TTCGATTGCG ATTGTCGGTA TGAAGCTCAT GACGTCTTCG GCACCCTTTG TGGTAGAATT TGTGACTTCT ATTTTCCTCG CTGTACTCAG TGCCTTTGGC GTCGCTTACG TCACGTCGGC AAAAGGTCCT GGAAGTCAGA TTGGCAATCT TTACTACGCC TCCTGGATTT GCTTCATGAG CTCAGCCTTC CTGGCCGCCG AAACTTTCAA TGAATACTCG TCTGCTGGAG CCGCCGGGGG TTCCAGTAAC CACAACATGT CCAACGGCGA CGACAAGCAC GACGGGGACA TTCAGGTAGA AGAGCTGGGC GACGAGCGCA TCTAGAGGCG CAGTCCATCT ACAAACATAG GTAGAATATA TGTTCCGTTT AAAATCTCGT TGCAAG
|
Protein sequence | MSTINSETGE VVQGRSPRFT MWVVFLIFAT ISLGSSVEVK NGSSDPNKAS KWAIACSSVT FAVTLLVTIA HVLPVASTLI VGTKLEGIIS LIMAAFWAAT VAIVTNASNG LGVESQTNTV LNGNLYYASW GGFITSIVLL VNYLKEIFGV DVVGQVRNRS ARLSLWAGLL ASAFIVLGSS VRVFNGDCDG SSASSQEYCK RTKFAIANGT ILCFFSIAIV GMKLMTSSAP FVVEFVTSIF LAVLSAFGVA YVTSAKGPGS QIGNLYYASW ICFMSSAFLA AETFNEYSSA GAAGGSSNHN MSNGDDKHDG DIQVEELGDE RI