Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43495 |
Symbol | |
ID | 7197190 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 629302 |
End bp | 630830 |
Gene Length | 1529 bp |
Protein Length | 452 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177972 |
Protein GI | 219112441 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000588522 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACTTTTGCT CATCTACTCT CACCATGAAA TCCGTTTCTG CTTTCTTGTT AGCTTTCCTA GTGTCAGTCA CCGCGGCCAA TCTCATCACC GATGGTAAGT TGCACTAATG TCCGAAAAAT GCTAGGAAAT TTGGGAACTG TTGATCACTA TCCACCTGAT TTTCGATGGT GGAAGAAAGT GAATGCACGT GGTTTCTTAC ACTCAGTAAC TCCTGTCATT GGCCCCTGAC CTCTCTGTAG CCGGACATCG TAAACTTGAA GAAAGCAGCA GGTGTACTAG CAAGACCCTG AATTTCGACC AGTTCAGTGC CGGAGATCTT ATCACGAATG AAAAGTTGAA GCAAGAGTTC AAGTCACTAG AGATCTCTGT GAAACCCGAT GGAAGGTGTG CGTCTGGTAG TTTGGCCCGA ATTTTCGACA CCGAAAACCC GACCTGCGAA GATGATGACT TGAAGACTAG TGGGGAAGGC ATGGCGGCCA TTATCCAGGA CAAAGGAGAT TGCGCTCCGA ATGACTGTCG ATTTGGAGGC GAAATGATAT TCTCCTGGAG CAGCAAGGTA ACCCTCAACT CCATTCGTCT CTTGGACCTC GACCAATCAG TGGCTATTAA ATTATTCAGT GGGGATACGA ATGTGGCTAC CATCAATACC CCAAAACTTG GAGACGGCCA GCACCAGACT TTCACCATTG GCAAATCTGA TATCACTAAG ATGGTGTTGG TTCACTCAGG ATCTGGTGCA GTTGCAGAAG TTGATTACAC AACTTGTGGC CCTGGCGCCA ACGGAGATCC TCACTTTCAC ACATGGACTG GGCACAAATT CGATTACCAC GGGCAGTGTG ACCTTGTCTT TATGAAGGCT CCCTCTTTTG AGGGTAGGGG TCTCGAGATT CACGTGCGCA CCGAGCAGCG CTACTTCTTC TCTTTCATTA AAACGGTAGC AATGAAGATT GGAGATGATT TACTGGAGTT TGGATTCGAT CAAGTGCTCC TGAACGGTAG CCCAGCTCAC GAAGGTTTGG CATCGGGCAG CAGCTTCGAC TTTGCGGGAT ACCCTGTGTC GTTCCACGAC GAGCCCATGC CCAACGGCCG TCCACGTAAG CTCTACCGTG TCCACACTCC CCACGATACC GTCGACATCA AGGTTTTCAA GCAGCTGATG GCCATAGAGA TCGAGGATGC GTCTCACGTC AATTTCATTG ATTCGGTCGG TATCACGGGC GACTACAACT CTGGCTTGAT GCTTGGCCGT GATGGTGTCA CTGTTTTATC GGATCCTAGC GACTTTGGTC CGGAGTGGCA AGTCACCAGC GACGACCCCA GCTTGTTCAG TTCTGTTCAG GCTCCCCAGT TCCCTGAGAA GTGCTGGGAA GCCGCGGCTA TTGATAAGGT CCGCCATTTG CGTAATGGAG TGTCGCAGGT GCAAGCGGAA GAGGCCTGTG CTATCTTGGG AGAAAATGCT GACATTGAAG ACTGTGTGTT TGACATTATG GCCACCGGAG ATATCGAGAT GGTCGGTGCG CACCTTTAA
|
Protein sequence | MKSVSAFLLA FLVSVTAANL ITDAGHRKLE ESSRCTSKTL NFDQFSAGDL ITNEKLKQEF KSLEISVKPD GRCASGSLAR IFDTENPTCE DDDLKTSGEG MAAIIQDKGD CAPNDCRFGG EMIFSWSSKV TLNSIRLLDL DQSVAIKLFS GDTNVATINT PKLGDGQHQT FTIGKSDITK MVLVHSGSGA VAEVDYTTCG PGANGDPHFH TWTGHKFDYH GQCDLVFMKA PSFEGRGLEI HVRTEQRYFF SFIKTVAMKI GDDLLEFGFD QVLLNGSPAH EGLASGSSFD FAGYPVSFHD EPMPNGRPRK LYRVHTPHDT VDIKVFKQLM AIEIEDASHV NFIDSVGITG DYNSGLMLGR DGVTVLSDPS DFGPEWQVTS DDPSLFSSVQ APQFPEKCWE AAAIDKVRHL RNGVSQVQAE EACAILGENA DIEDCVFDIM ATGDIEMVGA HL
|
| |