Gene Information |
Locus tag | PHATRDRAFT_18027 |
Symbol | |
ID | 7197076 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 76883 |
End bp | 79296 |
Gene Length | 2414 bp |
Protein Length | 706 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177552 |
Protein GI | 219111601 |
COG category | |
COG ID | |
TIGRFAM ID | |
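The coordinate fields above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions on the replicon. The short Python sketch below checks that arithmetic. The function name `feature_length` is hypothetical and is not part of any database API. The assumption of a single stop codon is an illustration only, not a statement about how the record was annotated.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 76883
END_BP = 79296
PROTEIN_LENGTH_AA = 706

gene_length = feature_length(START_BP, END_BP)

# Assumption: the coding sequence is one codon per residue plus one stop codon.
coding_nt = (PROTEIN_LENGTH_AA + 1) * 3

# Bases in the gene span not accounted for by codons
# (for example introns or flanking untranslated sequence).
non_coding_nt = gene_length - coding_nt

print(gene_length, coding_nt, non_coding_nt)
```

Under these assumptions the computed span matches the listed Gene Length of 2414 bp, and the span is longer than the codons alone would need, which is compatible with the record marking the gene as spliced.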
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGAGGTACA TCGAGGAGAA CCAACTACAT CCCCCGACAT TTCCCCCCTT TTTCCCAAAC ACACCAACAC TGACAAATCG CCGCATATGA ATCCCCCGGT GGCTTTGGAA TCCTGCGAAC CCGCGTCTCG CGCCGCGCTC CGTCGGGCGC GTCGGGTCTG TGTCAAGGCC GGTACATCCG TCGTAGCGAA TGAAGACGGA CGGCCTTCGT TGACGCGTCT CGGCGCCATG ACGGAACAAA TCGCCGACCT CGTCCAATCG GGCATTCAAG TCATTCTCGT ATCCAGTGGA TCTGTGGGAA TGGGGAAGCG ACTCTTGCGC AAACAACGGA ACCTGCAAAT GAGCTTTCGG GACATTCACA ACAACGATCA CGTCAATATT ATCGGTACCA ACAACGGAAT GATGCCGGAC GATGTCTCCC TTTTGGCCAA GTCCGGCGTG CCCCGGACGG CGTCCAGCTC GTTCGTGTCG CTGCTTGACG TCAACGAACG CCCCCACACG TTGGCCGAAA AGAAAAAGTA CTACGACTCG GCCTGTGCGG CCGCGGGACA GTTCGAAATG ATGAATCTCT ACTGCGGGTT GTTCGCATCG TACGATATTA CCGCCTCGCA AATTCTCGTT ACCCAGACAG ACTTTGTCGA CGAATCGCGT CAACGGAATT TGCAGTATTC CATTGAGCGA CTGCTGGGGT TGGGAATCGT CCCCATTATC AACGAGAATG ATGCCGTCTC GGCCAATATG GGATACACTG CGGACGACGT GTTCTCCGAC AATGATTCAC TCGCGGCCCT CTGCGCCCGC CACTTTGGCG CCGAAGTATT GCTGCTGTTG ACGGATGTAC CCGGGGTCTT TGATCGACCA CCAACGGAAC CAGACGCCAC TCTGCTGCGA TTGTACCAAT CGCAACCCGT CGCTATTGGG GAAAAATCCA GTCAAGGTCG CGGCGGCATG GCCTCCAAAA TCGACGCCGC CTTGTCCGCC GTCCAACCCG GATCAACCTG TTGTGCCTGT GTCGTGGCGG CAGGGAACGA TTTGAACGTC ATTCGTTCGG TTCTCGCCAA AACACCACCG ACCACAGCCG TGTCAATGAA AGACATCAAA GGTACCATGT TTTGTACACC GGGAAGTGCG TTGGAAGCAC AGGCCGTGGC CGATTTCGTC TCGGACACCC ACCAAGACGC GAGTGTTGCC GAACAAACAC GAATCTTGGC GACGGCAGCC CGAACACAAG CGCGTAAACT CCAAGCCTTG CCGTATGGGG CACGACAAAC GATTCTCAAC GCCGTCGCGG ATGCCTTGCT CACCCATCAG GAGGCCTTGA TGGAAGCCAA CTTGCTCGAT TTGCAAGCAG CCGAGCGGGA TGGCGTTAGT GAGGTGTTGA AGAAGCGTCT GGGTCTGACG CTCCAAAAGT TTGACACGCT GGCGGCTGGT ATACGCCAAA TTTCGGCCAA CAAGGACCCA CTCGGTGTCA TACACAGCAG GCGTGAATTG GCGGACAATC TTGTCTTGTC GCAAGTGACG GTTCCGATTG GCGTATTGCT TATTATTTTT GAATCGCGTC CAGACAGCAT GCCGCAAATT TCTGCGCTGG CTCTGGCCTC AGGCAATGGA CTGTTACTGA AGGGAGGAAA GGAAGCAACG CATTCGAACG CAGCCATACA CAAGGTTATC GGAGACGCGA TTGAAGAAAG CAGTGGAGGA GAAATTACCA GAGATATCAT TGCATTGGTA ACAAGTCGAG GACAGGTGGC TGATTTACTG AGTCTAGACG ACGTGATCGA CTTGGTCATC CCTCGAGGGA 
GCAATGACTT GGTATCGTAC ATCAAGTCCC ACACGAAGAT TCCGGTCCTC GGACACGCCG ACGGCGTATG TCACGTCTAT GTGGATGAGT CCGCCGCTGC GGATGCCGCA AGCAAACTGT GCGTGGATGC CAAAACGGAC TATCCATCGG CTTGCAACGC CATGGAGACA CTTTTGTTGC ACGCAGCAAC GCTTTCCAAC GGGGTAGCCG CTGCGACGCT CATGGCCCTG CGAGCATCCG GAGTGCAGTG TCTAGGAGGA CCGGCCGCAA TGAAATCTGG GTTGTGCGAT CGGGCTGCCC CAGAACTCAA ACATGAATAC GGGGACTTGA CCTGCTTGGT CGAGGTCGTT CCGAACCTAG AGGCCGCAAT TGATTGGATC CACAAGTACG GTAGTGGTCA TACCGAAGCG ATTGTCTGCG GTGAGGAGAG TGACGTTGGT GAGGAATTCT TACGAAAGGT TGATGCAGCT TGTGTCTTTC GCAACGCATC GACACGATTC GCCGATGGCT TTCGATTTGG CTTGGGCGCT GAAGTGGGTA TATCGACCGG TCGTATTCAT GCACGTGGCC CCGTAGGCGT GGAAGGTTTG CTGACGACGA AATGGCAACT GCGA
|
Protein sequence | MNPPVALESC EPASRAALRR ARRVCVKAGT SVVANEDGRP SLTRLGAMTE QIADLVQSGI QVILVSSGSV GMGKRLLRKQ RNLQMSFRDI HNNDHYYDSA CAAAGQFEMM NLYCGLFASY DITASQILVT QTDFVDESRQ RNLQYSIERL LGLGIVPIIN ENDAVSANMG YTADDVFSDN DSLAALCARH FGAEVLLLLT DVPGVFDRPP TEPDATLLRL YQSQPVAIGE KSSQGRGGMA SKIDAALSAV QPGSTCCACV VAAGNDLNVI RSVLAKTPPT TAAVADFVSD THQDASVAEQ TRILATAART QARKLQALPY GARQTILNAV ADALLTHQEA LMEANLLDLQ AAERDGVSEV LKKRLGLTLQ KFDTLAAGIR QISANKDPLG VIHSRRELAD NLVLSQVTVP IGVLLIIFES RPDSMPQISA LALASGNGLL LKGGKEATHS NAAIHKVIGD AIEESSGGEI TRDIIALVTS RGQVADLLSL DDVIDLVIPR GSNDLVSYIK SHTKIPVLGH ADGVCHVYVD ESAAADAASK LCVDAKTDYP SACNAMETLL LHAATLSNGV AAATLMALRA SGVQCLGGPA AMKSGLCDRA APELKHEYGD LTCLVEVVPN LEAAIDWIHK YGSGHTEAIV CGEESDVGEE FLRKVDAACV FRNASTRFAD GFRFGLGAEV GISTGRIHAR GPVGVEGLLT TKWQLR
|
| |