Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46983 |
Symbol | |
ID | 7202090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 99804 |
End bp | 101756 |
Gene Length | 1953 bp |
Protein Length | 570 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181299 |
Protein GI | 219121908 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.817353 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGAAGAAT CTGAATTCCA CCACGATCTT CTGGCATAGT CGTATTCTCT GTATTCGTGT CACTCCAGAA GTACTTTGTT GAAGGGACAG TCAAGGATGA AACTTTCGGG TGGCCTTGCT CAGGGACTCA GCCTTGCGGT GTTGGGCGTG GGCCTTGCTC GCGCCGAGAT CCGGGACATG GAGTTTCTGC TGGAACCGAC CGAAGACTCT ATCCACTTTA CCGACGGATA CTTGCGGGGT CCCGGCTATA TAGATCTATC CGCGCTGGTC TTTACGGCGG TTTCCGAAAA TTTCACCCCC GTCGACGTCG GCACCAACGA CGATTCGCCT CTCAATGACG ACGACAACGA CGACAATTCG AACGCACCCG ACGATACTGG TGCAGACGGT GGAAACCGAC GAGACTTGAG CGCACGTGGA TTGGACGGAG GTGTCGAGGA AGGTCTAACG ACGGTAGATA TGGCCGTCAT TGGGCTACCC CAAAAATGTG CGAATAGTCG ATCGGGCTGC GATTGGACCG AGCAAGGAAT AGGAGGGCGT CTACCGGATG GAAACTTGCG ATGGTGCTGC TCTACCGAAG CTGTTGCCGC TGGGCTTTGT GAAAGTCACA ATAGCGGCAA ACTCATTATC AACTCGACTT CGTTTACTGG GGCATACAAA TTTGTAAACG TCCCCCCTCA CGGACCGATG AGTAAACATC TTCGGTTTGG TCAGATTAAC GAGGCAAATT CCGGTCAGTT TGTAGTGGTT TTCGCCAACT GCAACGAAGG CGGACGACAG ATTGTTGCGC GTGGCAAAAC TGTGTGGAAG TCTGTGCACG GCTATCTACC GGGAGAACTC TACGGATTTA TGAATTTCTA CGCCATCTTG ACGGGTATGT ACCTTATTCT GTTACTTTGG TACGGCTATC TCATGCACGT GAACGAAGAA TCAAGAATTG CCATTGAGAA ATGGATCCTC ATGACAACTG TACTCGGTTT ACTGGAAATG TTCTTCCGCA CTGGCGACTA CTTTGTCTGG AACGTTGACG GATATCGGAT GAACTTGGCC ATGTACATTG GTATAATTCT GGGAGTCGTC AAACGCGGAC TCAGTCGAGC ACTGATCATC ATGGTAGCGC TTGGCTGGGG AGTTGTGCGT GATTCGTTAG GCTCGGCTTT GAGGACAATC ATTGTTCTAA CCGCTGCGTA TGTTGGTGTT TCTGTCTCTC GCGATCTCAT GCTGGTCTTC GCCATTGAAG ACATGGAAAC GCTATCCTAT GATGCGGAAG TTGAGCTCTT TGATGTTGTG ACGATATTGA CATTTGTTGT TGCGGCTGTC GATGTCATTT TCATTCTCTG GATTCTTGAT GCCCTCAATA ACACCATGGA ATACTTACAA AGTATGAATC AAAGTCGCAA ATTGATGCGG TACTTGCGTT TGCGGTGCAT TTTTTTGTTT TCAATACTGT TTGCTACCAT CTGGGTTGTT TTCTCCTTGG TTGATACGTA CGATGAAAAC GGAATCATTC GGGAAGAGCA TGAGTGGTCA GTAGATGCCG CAACTGAGAT CAATTACTTT TACGTGCTGG CGGGGGTAGC TTTTTTGTGG CGGCCCAACC CCAGTGCGAA GGAGTACGCC TACGTCATGG AGCTCTCAGC TACGGGCGAA GGGAATGACG GAGAAGATAC ACACGAACTA GAATTAACAG GGGTTGTCCC ATCAGCGTTA GACGATGACG ATGATGAGGA ACCGTCAGGG AAAAGCAACG GTTACCATGA TAATGATCAC GATGACCGGT TTCGAATCGA CGACTCGGAA GCGGCATAGC ACACAAGCAA GTCATTCGTA TTTGTAACCT TTTCCCCCCA AGGGTCAATT CATCTGCAGG CATATATACC GAAGCAGTTT CCTACAGCCT GTTCACGCCC TAGACATTAA CGTAATACAG CTCCTTCTTT TCCACTCGGG TTA
|
Protein sequence | MKLSGGLAQG LSLAVLGVGL ARAEIRDMEF LLEPTEDSIH FTDGYLRGPG YIDLSALVFT AVSENFTPVD VGTNDDSPLN DDDNDDNSNA PDDTGADGGN RRDLSARGLD GGVEEGLTTV DMAVIGLPQK CANSRSGCDW TEQGIGGRLP DGNLRWCCST EAVAAGLCES HNSGKLIINS TSFTGAYKFV NVPPHGPMSK HLRFGQINEA NSGQFVVVFA NCNEGGRQIV ARGKTVWKSV HGYLPGELYG FMNFYAILTG MYLILLLWYG YLMHVNEESR IAIEKWILMT TVLGLLEMFF RTGDYFVWNV DGYRMNLAMY IGIILGVVKR GLSRALIIMV ALGWGVVRDS LGSALRTIIV LTAAYVGVSV SRDLMLVFAI EDMETLSYDA EVELFDVVTI LTFVVAAVDV IFILWILDAL NNTMEYLQSM NQSRKLMRYL RLRCIFLFSI LFATIWVVFS LVDTYDENGI IREEHEWSVD AATEINYFYV LAGVAFLWRP NPSAKEYAYV MELSATGEGN DGEDTHELEL TGVVPSALDD DDDEEPSGKS NGYHDNDHDD RFRIDDSEAA
|
| |