Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47389 |
Symbol | |
ID | 7202436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 499321 |
End bp | 501096 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181738 |
Protein GI | 219122824 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00749667 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGACA GAGCCTTACA AATCTTTGCT ATCGGACAAA CGATCGTTGC TTCCATCCGT GCCGGATCGC GAGCATGGCA TGTTCTAAGA AAAGGTCCCA TTGGAGTCAT GAACGGACCT CTGATTCTGC CTCCTCCAGT CAATGAGACA TCAAATTTAT TCGATTTGAC TCTGGCTTTT CGTCATGCAG TTCAGGTGTG TGATGCGCCG CAAACCGACC GCGAGGCAAG TGCTATGCTC GGAAGAGAAA AAAAGTGCGC ACGCATCGAC AGAGAAAACC AGTCTGGTTT TTCGCAACCA AGCGATCCTT TGGCCCAAAA TATAAATAGA GACCATCGAA CGGAAAACGA GGAAATTGAT CGACTTCTGG ACGAATTGGA GAGTAGTGAG AAAGAGCCAA AAGTCTCGTG GAAAAGCTTT TTCGAAGCAT GTAGACAAAC TGCTGTTGTC CCAGTTCAAC GGGCATCCTT GTTCGGTGCT TCGCTACCAG TCTCTGCGTG GTTCGCTACC AAACGCTTTT GGAACCTTGA TGCGGCATCG GCATGGAGCA CAATTGAGAT CCTGGATGGG CCCTCAGGTT TCACTGACTA TGTTTCTGCA GAGCTAGTCT TTGAGAAAGG TGATAAAAGC GCATACCCTC GAGTAGCTCT GATAAAGGCG TATGCACCGG AGGAGTTCAC CAACTTGCGC TCGAACTTTG GAATTTCCGA ATCAGACTAT GCAAGGTCGA TTCTGCATTC TGGTCCGTTT GTATCTTTCC AAAGCAATTC CAAAGGAGCA GCGCGGGTTG GTGGGGTTTT CTTCTTCACC CGTGATGGCA ACTACATGAT CAAAACAATA AAGGCAGCGG AAGTACATGC TTTGCTACAA ATGATGCCAA AGTACGACAA CTTCATGAAG CGAAATGGGC GTAGATCATT GCTGACCAGA ATTTGTGGTC TTTACGATAT TGATATTCAG GACGCCTCAA GCGGTGTCAA TGAGAAATAC ACTATAGTTG TTACCAACTC GGTCTTTCCA GCGGAAAGTT CTAGTATAAT TTCCGAACGA TTTGATCTGA AGGGATCGAC CCTAGGCCGA GAGTGCTCGC CAGAAGAACG GCGTACCAAA GGGTCAAATG CTATCCTTAA AGATCTTGAC CTTTCGCGAG AGGTACAGCT AGTCAAGTCT TTTCAAGACG AAGGAACACC GCACTTTGAG GGCTACGGCC TGCATATTGG ACCTGCTGCC AAGGCAGCTG TCCTCACGCA ACTCCGAAAA GATGTGCACC TGTTGGTGCT GTGCAACGTA ATCGACTACA GTCTGCTGGT TGGTGTCTCT CGCTTGGATT CTCGTCACTT TACAGTCGAC GAATTGCACC TTATAGATTC GAGCACAGAA GCTGAGCTTC GTCTAAGTTT AGCTCGTCGA GGACAAGCAG CGGATGCCAT CTTGTCCGCA CTAATAATGC CTGTTCGATT GCTTACTGCT CCGCCAATCT ACCTGTATCG GAGAGCGTGG TCCCTTTTTC GAAGGACAGT ATCCTGGCCT CTTCCATATT ACGGTTCCGG AGAATGTGGA ATAGATGCGG GTGGGCTAGC CAGGGTACAG GGAGATCGGC TTGGCCACCC TTCCGTCTTT TATTTAGGGG TAATTGACTT TCTCCAGCCT TTTAATATCC CAAAGAGAGC TGAATGGAAG TACAAGAGCT GGAAGTACGG GGAAGGATTT AGTTGCGTCC CTCCTGAGCA GTACGCAGAA AGATTTTTGG CGTTTCTCGA AAGCCATATC AGTTAA
|
Protein sequence | MLDRALQIFA IGQTIVASIR AGSRAWHVLR KGPIGVMNGP LILPPPVNET SNLFDLTLAF RHAVQVCDAP QTDREASAML GREKKCARID RENQSGFSQP SDPLAQNINR DHRTENEEID RLLDELESSE KEPKVSWKSF FEACRQTAVV PVQRASLFGA SLPVSAWFAT KRFWNLDAAS AWSTIEILDG PSGFTDYVSA ELVFEKGDKS AYPRVALIKA YAPEEFTNLR SNFGISESDY ARSILHSGPF VSFQSNSKGA ARVGGVFFFT RDGNYMIKTI KAAEVHALLQ MMPKYDNFMK RNGRRSLLTR ICGLYDIDIQ DASSGVNEKY TIVVTNSVFP AESSSIISER FDLKGSTLGR ECSPEERRTK GSNAILKDLD LSREVQLVKS FQDEGTPHFE GYGLHIGPAA KAAVLTQLRK DVHLLVLCNV IDYSLLVGVS RLDSRHFTVD ELHLIDSSTE AELRLSLARR GQAADAILSA LIMPVRLLTA PPIYLYRRAW SLFRRTVSWP LPYYGSGECG IDAGGLARVQ GDRLGHPSVF YLGVIDFLQP FNIPKRAEWK YKSWKYGEGF SCVPPEQYAE RFLAFLESHI S
|
| |