Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48316 |
Symbol | |
ID | 7203739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 101966 |
End bp | 103046 |
Gene Length | 1081 bp |
Protein Length | 314 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182896 |
Protein GI | 219125246 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00626379 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCGAATGAG ACTGGCAACT TACAACCAAG TTACTTTCGT TGAGTGACAG TGAGTCATAA TCAGAATACA CTGTATCTGG CTATTCCGTT ATAATAGTTC TTGAGCCTTT TGGGGTTTCT TGTGGTTACT TTTCGCATGT CACTCGTCTT TCGATCGCTC TCCTACGTCG CGCGGCGGAA CCTGAGGGCA GCCTGCCCTT TGGTGGAATC ACCGTCAATG TGCGATACCA TTTTCTGGAA CTTCGGCGGA TGTGTTGACG ATTCCGTCAA AAGCAAACCT CGACATGGAC AGTTACGTTG GAAGCACGGT GCAAAAATGG GTCACCACCT GGACACCCTC GACGAGATGG CGCATTCCAA TTCGCGTAAA GAAGCCCAGG AACGTCGCAA AAAGAAAAAA GGCAAGAAAG AGTCCAAGGA AGTCAATAGT GCACCAAAGA ACCACAAACC AGGCGAGTCC GATTCTTTCA TCGGCTGGGA CGAGGAAGCG GAAGAAGACG ATGAAGACGA GGACGAAACC GTGCTGCCGG ATCCCGCGCA AGTCAAGATC CGTATGACTA AGGTAGTCGA CTCCTTTGTT ACCAAACTGA AAACAATTCG GGGTTCTGAA CCCACCGCCG ATATTTTTGA CGACATTCAG GTCGACGCCT ACGGTTCACA CGCCCCTCTG AACTCGGTCG CACAAGTCGT GATTTCATCA TCAACTTTGG CACAGGCAAC CTGCTATGAT CCGGAGTTGG CTAAGAATGT TGGTGTCGCG ATTCGGGATA AGTTGGAATT GAATCCATCC GTTGAGGAAG GCGGGGTCGT GCGCATCCCG CTACCGCGCG TGTCGCTGGA AGTTCGCCAA CAAACGGCCA AGGCCCTGAC TAAACACACG GAAAAGTACC GACAACGTGT ACGTACTATC CGCCGGAATG TCCTCAAGGT CGTGAAGCAA GGCGAGGCTG GAAAGTTAGA AGGTATTTCC AAAGATGACG CGTTCCGAGC TCAAAAAGAA ATCGAGGATG TTACGGAAAA AATCATGGTC AAGCTGAACG AGGCTGCCGA ACAAAAACAC AAGGCAATAA TGGCTCTTTA A
|
Protein sequence | MSLVFRSLSY VARRNLRAAC PLVESPSMCD TIFWNFGGCV DDSVKSKPRH GQLRWKHGAK MGHHLDTLDE MAHSNSRKEA QERRKKKKGK KESKEVNSAP KNHKPGESDS FIGWDEEAEE DDEDEDETVL PDPAQVKIRM TKVVDSFVTK LKTIRGSEPT ADIFDDIQVD AYGSHAPLNS VAQVVISSST LAQATCYDPE LAKNVGVAIR DKLELNPSVE EGGVVRIPLP RVSLEVRQQT AKALTKHTEK YRQRVRTIRR NVLKVVKQGE AGKLEGISKD DAFRAQKEIE DVTEKIMVKL NEAAEQKHKA IMAL
|
| |