Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40938 |
Symbol | |
ID | 7198815 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 314678 |
End bp | 316233 |
Gene Length | 1556 bp |
Protein Length | 416 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184942 |
Protein GI | 219129535 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.445023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGTC GCCGTCTGCC TCGCGCATTG ACGACGTTGA CGATTTTTTT GGGAGGCCTC TATTTAGGAA ATTTGCAGCA TCAACTGACA TCACGTCAGG AAGCTCCCGT GACGGCTCTG GGCGGCCCGG CATTGCCTTT GTCTTTCTCG GGAGCAAGCG TTCCACACTA CGAATCCGCA CAGTCGAGAG AAATAGTTAA GTCGCCCGCT CTGTGGGAGA CACAGCTGGC TGACGCTCAG CAAGAGACGA AGCGACTACG CTCTCAGATT AAGGAGATGG AGAAGCAAGT TCGGCGTAGC CGATACCAAT CTTCGACATT CGAGCAACCG CTCGTTGATT CGAATCCACG GGCTGTGCCC AATCCTCACT GGGTCTGTCG GCAGCTATCT ACAGCTGTCA ATGCTACTAA CGCAATCCCC AGTGCATCCT TTCTTTGGAA TACTCGTTTG AAATCGATTC ACGCGGCTTC CCAGCTCAAA GTGAACGACC CCCGCTACTA TTTTTCCGAT TTTACGGCCC AACTCTTGGC CATTGTTGCA CCGCGATTGT CCCGGTCCTC CGGTCACGAT GCCGATGGTG TGACTGTCCA GTATTTGCTC GATCGGATAC AGGCTCGGTA CGAATACTTA CACGCTCAAG GGCCGATTGC GGAGCCCGTC AAAATCGTCG TCCTTGGAGG CAGCGTTTTG GTGGGACGCA ACTGCCGCAA GCTCTGCAAG GATCTGGGGT TGCAACTGCG CATGCCTCAA CGCGAATGCA CCTGGGCTCA TCGACTAGGA GTCTTCCTGA ACGTCCTGGT ACCTGACATC TTTCGCGTCA CCAAAATTGC CATGGGCGGG ACCAATACGG CGGTCGGCAC GACCATTTGG AAGTACGATT TGTTGCCACC CGAAGCGCGT CAACCCGACG TTGTCATCAA CGCCTACAGT ACCAACGATA TGCACATCCT CACGGCGTTG GAAGCATCCT CCGGTAACCA AACGCTACGG GATCGCGTTT TTGTCATGCT CCAAGACTTT GCGCGGGAAG TCCTGGTACC GCCGCCATTG GCGTGCACCA ACGCGCCTCC ACCGCCACTT TTTCTCCACG TGGACGACTA CCTCGGCAAC GAGCAACGCG CAATTCTGGC CACGACCGAG CTGCGACAGA GCGTGGATGT ACTAGCGGCG TACTACATTT TTCCTACCGT CTCCTACGCA GACGTAATCC GGGATTTGGT ATACGGTGAT ACGGCGGAAT CGTGGTTTTC TCCCGAAGGC TGGTACGTCA AGGGTATGTC AGGGATGCAG CGGGAGATTC ACCCCGGAAT GGGTATGCAC ATTGTTATGG TCTGGGTGAT TGCCTTTAAC CTGTTGCACG TGGCGACAAC ACACTGTAGT CGAGAGATAA GTTCCAGGCA GAACCTACAG CTTGATTACG ACAGGTCACT GTTGGCTCGG GACGTGCCAT TGCAGAATGG GCCGTACACG AATGTTCGCG GCAAGCCAAA CCGTCTTCCC GAAAGCTTGC CACCCCCGTT GACGTCCAAC ACAACTCTAG AGACAATATC GATTGA
|
Protein sequence | MASRRLPRAL TTLTIFLGGL YLGNLQHQLT SRQEAPVTAL GGPALPLSFS GASVPHYESA HASFLWNTRL KSIHAASQLK VNDPRYYFSD FTAQLLAIVA PRLSRSSGHD ADGVTVQYLL DRIQARYEYL HAQGPIAEPV KIVVLGGSVL VGRNCRKLCK DLGLQLRMPQ RECTWAHRLG VFLNVLVPDI FRVTKIAMGG TNTAVGTTIW KYDLLPPEAR QPDVVINAYS TNDMHILTAL EASSGNQTLR DRVFVMLQDF AREVLVPPPL ACTNAPPPPL FLHVDDYLGN EQRAILATTE LRQSVDVLAA YYIFPTVSYA DVIRDLVYGD TAESWFSPEG WYVKGMSGMQ REIHPGMGMH IVMVWVIAFN LLHVTVGSGR AIAEWAVHEC SRQAKPSSRK LATPVDVQHN SRDNID
|
| |