Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49837 |
Symbol | |
ID | 7198663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 40012 |
End bp | 41667 |
Gene Length | 1656 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184635 |
Protein GI | 219128891 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGCCACTTCA CAAAGCAAAG ATCGGTTTTT AATCTACAGC CAGAGAGTGC CGCGCTTATC CAATTTAAAG GAGTCAGAAC AAACATGGCT CCGTTTCTTC GTTTCTTGTG CAGCTCAAGG CAACAGCAAC GATTTTTGGT CAAGACCCTT TTGGTATCCA AGTCGTACTG CAGTCGCAGC ACTAACAATA ACTGGTGCAA GTCAGCCGCA ACCTCGTCTT GGATCTCATC TAAATCCTCA GACTCCGCAT CGTTAGCTAT TTCTACCGCT AGTGCCACGA CGCCTCCAAC AGCCGAGCCT TCCCTTGCTT CTGATCCTGT TTTGAGTGAA TCAACCTCGT CCTCGTCGTT TGTGCCATCT TTGTCGTCGT GGGGCTGGAC TGTGTCATCG ACATCTCCAA CCTCCTACGT CATTGCAGTA TGTCTCGCCG GCGTGGCGTC GTTGGCTCTG GCAACCCGCA TTGCCCTCAT TGATGGCACC GGCACCACAA CGTTTTGCGA ACGAGCAACC ACTGCGCGGT CCGTTCCACA ACAGCCAAAA GATCACGAGA GAAAAGATAG CTACATACCG ACCAGCGAGT CGAAGCAAGA TGTGGATACT GCCATGCTAA GTAGTTACCC TAATGATTCC AGAGAGCATT TCATTCTTGA GAACATTGCT CCCCAAGCCG AAGGTACAGA CGAAGAAATT TCCGACCAGA CAAGCCAAAC CAGAAGTGCC ACTCCGCTTC CACCAAATCC ACTGGCTGAC GGATTTTCGT CCCAAACGGT GGACGAACTT GTCAATGAAT GGCTTCAAGA TCCTTCCATC AACATTCGTG CGCTCCCGGA TGCCTTGGAA CGTCAAGTGT ACCGTTCCAC CGTTCAGCTG ACCCTCAACG CTGTCTACCG TTCTTTGGCA TCGTTGCACG GTAAGGCTTT GCTGGGGCAC GAGTTACGCT TGCGCAAAAC GTCCGACCGT TCGCCCTTCG AATTTCTATC ACGTTCGCAT CACTCGACGC ATACAATAGA GGAATCGGTC CTGGGGCAAG TCGCTGATCG GCTCTTGCAG AACAAGGCCA TCAATCGACC ACTGATTCCG GATATGGTGG AGCGCCAATT GTATGTTAAC TGTCTCAAAT TGATCTTTCG AGTATTGGAC TTGCTGGCCG CCTCTTTGAA CCTGACCATT TGCGGACATG ATGTCTTTGT TGGTTTGGAA CCTGCCAAGC ATCAGGGGGG TGCAATCTTG CAGGATTCGT TGAATCGAAC CACATCTTCC TTGACCAAAC TCGATCCTGA AGTTCTGCGA GAATTTGCCC GCCAAGCCGG GGTCCGGGAA GCCTTGTCAA ATCAGTCCTG GTGGCAACGA TTCTGGTACG CTTCACAGAA AGAATTACTG GCGCAGCTGC ACGGATCATT GTATAGTCTA GTACTCGGAA TTGTAGACGA CATTCTCGCC AATACCAGTT TGCGGTTACT TTCTGAAAAC GTACAAGTCG ATATATGTCA GACATCTGAA CCCGCGAGTC CGTACATCCC ACCTGGCGGT GCCGTTGTGG AAGAGAATCT CAACAGAGCA AATACTACAG GACGGTTCCT AGCTGCCTTT ACAACGGGCG TTGGCGTAGG ATTGGCTACA ATGGCAGCCT TGGGCGGCGG CGGTGGCCGT CGATAA
|
Protein sequence | MAPFLRFLCS SRQQQRFLVK TLLVSKSYCS RSTNNNWCKS AATSSWISSK SSDSASLAIS TASATTPPTA EPSLASDPVL SESTSSSSFV PSLSSWGWTV SSTSPTSYVI AVCLAGVASL ALATRIALID GTGTTTFCER ATTARSVPQQ PKDHERKDSY IPTSESKQDV DTAMLSSYPN DSREHFILEN IAPQAEGTDE EISDQTSQTR SATPLPPNPL ADGFSSQTVD ELVNEWLQDP SINIRALPDA LERQVYRSTV QLTLNAVYRS LASLHGKALL GHELRLRKTS DRSPFEFLSR SHHSTHTIEE SVLGQVADRL LQNKAINRPL IPDMVERQLY VNCLKLIFRV LDLLAASLNL TICGHDVFVG LEPAKHQGGA ILQDSLNRTT SSLTKLDPEV LREFARQAGV REALSNQSWW QRFWYASQKE LLAQLHGSLY SLVLGIVDDI LANTSLRLLS ENVQVDICQT SEPASPYIPP GGAVVEENLN RANTTGRFLA AFTTGVGVGL ATMAALGGGG GRR
|
| |