Gene Information |
Locus tag | PHATRDRAFT_45780 |
Symbol | |
ID | 7200918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 252418 |
End bp | 255805 |
Gene Length | 3388 bp |
Protein Length | 1060 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180000 |
Protein GI | 219118456 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATC TTGAAAGATA CAATAGCGGC TCGAGCGATG GGAAGATCGA GGACTGCAAG GTTACACTTC CAAAAATAAG CGAATCAGCA ACTGAAGAGT CAACATTCTT GAGTACCGAA GCCTCTTCAT TTCATTTAAC TAGAATAGCC GAGGGTGGAT CGATCAAGGA GTCACACGAA AATAAGAAAG GCGGCAAGTT ACGAGTTCCT CCAAGGAAAT TCACGAGCAA TCTCTACCAC CGATCCGAGG AAGCCGAGCT GAAACTCCTG TACGAGCATT CTGTAGACGC AGAGAACTGC TCCTTAATTC TGCTAACGGG TAGGACTGGT ACCGGAAAGA CGCATTTGGC TCAAACGCTC GAGTCCGCTG TGCAGAAGAG ATCCGGATTC TTTTTGAAGG TCAAGTTTGA TCAGCGTATG CACCCTATCC CTTACGCAGC ACTTATTTCA GCCGTTACTC AGTTTGTGCA CATTGTCGTG GAAGAAGGGG AAACAGCCAC ATTGCAAGTT CGCGAGGCTG TCGGCAATCA CGTCGGTGAT GGAATCGAGG CACTGACCAG CATTATCCCG GAATTGGAAG TGTTGCTTGG AAAGTCCCGG ACAGCACACC TCCCTATGAA AGCCGCAGCT GTTGGAACTC AGCGTTTTGT TTTTACATTC CACCTGTTCC TTCAGGCTAT CTGTACGCTG AAGCGACCAA TTGTGGTATT GCTTGACAAT TTTCATTACG CGGACCCTTG TACTCTCGAT ATACTGAGCT TTGTAGTTGC CGACGGCGGC CAGCAAGGCC TTGTTCTTCT CGTGACGTGT GATGTGACCG AAGTCGGCAC TGATAGCTAC TTGGCTGTTA AGCTCCGAGA TATTGAAAGA CAGACGCAAG CTCAAATTCA CACTTTAAAC ATTGACCATC TTGACCTTGA TTCTACTGAA GAATTTCTTG CTGGTTCGCT GGAAGCTGAG AAAGGCTATA TTCACTCTTT GGGAACAATT GTTTTTCAGG AAACTGGAGG AAACTTTCTA TTCATGATGG AATTTCTCCG GCGGCTTTCT GCTTCGGACC TGCTATACTT CGACCATGAC ATTGAACAGT GGAAATGGGA CGTATCGGAT ATTCGGACCA TGAGTGAGTC TAAGACCGTG CATGAATTTC TAGCGGAAGT ACTCGAGCAA GTCTCCCGTG ATCACAAAGA CGTCCTGAAG GTTGCGTCCT GTCTGGGGTC ACAAATCGAC GAATCTCTTA TAGAAATGGT ACTGGGCTTT CCTGTTCTAC ACTATTTGGA GGAGTCCCTG AGGTGTGGAT TATTGCATTT CGACAAAATC TGTGCTTGCC TCACTTTCGC AAACGACGTG ACTGAACAGG CGGTATATAG CCTCATTCCT CAAACAGAAA GGCCGCTTTT TCACTTGGAA ATTGGAAGGA GATTATGGAG AAGGCTTAGT GATGACTGTT TCGATAAGAA ACTGTTCATT GTACTATCGC AGATGAGGAT GGGACAAAAC CTTATCACAA GAAACAGCGA ACGAACAAAA GTTGCTTGGC TCTGCCTCCA TGCAGGAAGA AGAGCGGCTA TGGCATCGGC CTTTCAAACT TCTCGCGTGT ATTTGGTATT CGGAATCGAG CTTTTAGAAA ATACAAGCTG GCGGGAAAAC TATGACCTTA CACTGGCGCT GCACAATTCT ATTGCTGAAG TTGATATGTG TCTAGCTCTG TTTGAGTCAA TGGATACGTG CTTGGCAGCA ATCTTGAGCA ATGCTCGGAC CTTTTCTGAT AAACTTAGCG CTTATAGGAC AAAAATCCAT 
GCCTTCGGAA TCCGAGAGCA TCAAATTCGT GCCATTGGTC TAGGCATGGA AGTACTCCAG GGACTAGGAG AGAAATTTCC GAAGCGTCAT CTTGCCTTTC ATCTTCGGAA CGAATTGAAA GGGCTCAGAG CGGCGCTTAA GGGCAAGAGC GACGAGCAGC TCCTGCGGCT ACCAATAATT GGGAATCCGG AGAAATTGGC AGCAATTCAG ATTCTCCATA CTTTGATGAT GTCTTGTATG CTTGCGAAGC CTGAGTATCT TCCGTTTGTC ATTCTCCGGA TGCTCAAACT TACTTTGATT TACGGGTTGA GCCCATTGGC TGCCACAGCA TTTGCGGCGT ATGGTATGCT TTGCATCCCT TCGGAATGTC AAGGCGATGT GGATGTGGCT TTTCGTTTTG GCGACCTTGG TCTATTGCTA TTGAAACGCT TCAAGGCCAT TGAATTTTGT CCACGTGTAT ATGCACTGTA TTTTGGTTGT ATATATTGTT GGAAGAAGCC ACTTAAAGAT GCCATGGATC CTTTACTTCA TGGTCATCGT ATTGGAATGC ACACTGGCGA TGATGAATTT GCCAGTGTAT GCGCAAGTAT ATACTGTATT CATGCATTTG AGGTTGGCGT ATCATTGCAT AGCATGCGGC GAGTATGGGA AGGTATCTAC GAGTGGATGC TGTCGAAGAG ACAAAGTTCA CTCCTTTCAA TTTCACTTCC TTGGCTGCAA GCCTTGCATC ACTTTATGGG ACTGTCGGAC GATCCTTTAT CCTCTAAGGG AAATTTGATT GATTACGATG ATGCAGTTTT TCGTGCCGAG GCGATCGGTG CAACAGTGTA TGTTGTAGGA ATTCGCTTCA CTCAGATGAT CTTAGCTTAC ATTTTTAACG ACTACGAAAA AGCCGCCACC ATTTCGCTGG GATTACAAGA CATTTCTCTG ATTCCGCCAA CCATCGAGCG AATGGTCGGA GTGTTTTATC TTGCTTTAAC TTCGCTCGCA ATGGTAAAAC GGAAAACGCT TGTCCGACAG AATCTAAAGG TTGCCAAACG GTCAATATCC ACCCTTAAGA AATGGGCAGA CCATTCGCCG TACAACTGTT TGGAGAAACT ATTGCTCGTT CAAGCGGAGT TAGCATCAGC TCAGAATCAA AACTCAAGGG CCAGTACTAA GTACATCCTT GCGATTGCCA TGTCTAAGGA GTCCAATCTT TTGATGAACC AGGCTTTATC CAACGAGAGA GCAGCGAGGC ACTTTCAGGA CATGGGAGAT ACTGATGCGG CAAAACCTTT TTTCGTTGAA GCATTGCGTT GTTATGAAGA GTGGGGCGGC AACGCCAAAG TTGTCCGCCT GTTCGCCGAA GTCTCACAAA TCTATGGAAG TGAGCAAGCT TAAATGAGGC GCCTTTCTCT CGGCCTTTTC AGATCACCGA ACATGGTTAT CTCAACTTCA AGAAAGAAGG CTTCCCGAAC CCTAACACCC AACAGTGCAA ATGATCAAAC AATGTCCACT GAACAGGGCT GAATAACCAA GGTTGCTTGG CATGCCGTGA TCTAGTCTCT TTCTACTCTC GCTTCGTTTC GAGGCCTGCC ATTGCGAC
|
Protein sequence | MNDLERYNSG SSDGKIEDCK VTLPKISESA TEESTFLSTE ASSFHLTRIA EGGSIKESHE NKKGGKLRVP PRKFTSNLYH RSEEAELKLL YEHSVDAENC SLILLTGRTG TGKTHLAQTL ESAVQKRSGF FLKVKFDQRM HPIPYAALIS AVTQFVHIVV EEGETATLQV REAVGNHVGD GIEALTSIIP ELEVLLGKSR TAHLPMKAAA VGTQRFVFTF HLFLQAICTL KRPIVVLLDN FHYADPCTLD ILSFVVADGG QQGLVLLVTC DVTEVGTDSY LAVKLRDIER QTQAQIHTLN IDHLDLDSTE EFLAGSLEAE KGYIHSLGTI VFQETGGNFL FMMEFLRRLS ASDLLYFDHD IEQWKWDVSD IRTMSESKTV HEFLAEVLEQ VSRDHKDVLK VASCLGSQID ESLIEMVLGF PVLHYLEESL RCGLLHFDKI CACLTFANDV TEQAVYSLIP QTERPLFHLE IGRRLWRRLS DDCFDKKLFI VLSQMRMGQN LITRNSERTK VAWLCLHAGR RAAMASAFQT SRVYLVFGIE LLENTSWREN YDLTLALHNS IAEVDMCLAL FESMDTCLAA ILSNARTFSD KLSAYRTKIH AFGIREHQIR AIGLGMEVLQ GLGEKFPKRH LAFHLRNELK GLRAALKGKS DEQLLRLPII GNPEKLAAIQ ILHTLMMSCM LAKPEYLPFV ILRMLKLTLI YGLSPLAATA FAAYGMLCIP SECQGDVDVA FRFGDLGLLL LKRFKAIEFC PRVYALYFGC IYCWKKPLKD AMDPLLHGHR IGMHTGDDEF ASVCASIYCI HAFEVGVSLH SMRRVWEGIY EWMLSKRQSS LLSISLPWLQ ALHHFMGLSD DPLSSKGNLI DYDDAVFRAE AIGATVYVVG IRFTQMILAY IFNDYEKAAT ISLGLQDISL IPPTIERMVG VFYLALTSLA MVKRKTLVRQ NLKVAKRSIS TLKKWADHSP YNCLEKLLLV QAELASAQNQ NSRASTKYIL AIAMSKESNL LMNQALSNER AARHFQDMGD TDAAKPFFVE ALRCYEEWGG NAKVVRLFAE VSQIYGSEQA
|