Gene Information |
Locus tag | PHATRDRAFT_49212 |
Symbol | |
ID | 7195519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 242101 |
End bp | 244243 |
Gene Length | 2143 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183838 |
Protein GI | 219127221 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCAACACTG GCGTGTTATT TCCTTGTGGA CTACGTACGA GACTTTTTCT CTCTTCCGTA TCGAGGTCTC GACGATTGGA TCGACGAGAT TATCGCTACC GAACGTGAAA AGTGGAGAGA AAGGGGTAAT TTGAGTACAC GGAATCGCAG TTTCAACCAC AGGTTGCTGC TAGTCTCTTT CCGTTGTTTC ACCGGTACTC TGCAAGGTAT GCGGCAGTGC TGAAGACGAG GTGATTATTT CTTTACAGTG AGTCATCCGC CGGAATATTT GGACAGTAAT AGAATCATAA GTATGAGCAG CAACAACAGC ACCGATTGGT CGCGCAAGCG CAGTCGGGGC GAGGAAAAAC TAGACGAGAA GCAGGGAGAC AAAGATCTCT CTTCCACGGC TCCTCGCGAT CCCCATTCCC CTCCGCTTGG AGGCATTGGA CCTCCTCCTC CTCCACAGGG AGGTCCTCCA CCAGGAAATT CTAGAGGAGG CGGCCCCCCG CCCCCGTCTC ACCAAGGATA CGGAGGATAC CCCCCGCCGC ATATGGGTTA TCCTCCTCCT TCCGGTCCGG AAGGACAGGC CCCCTATCCT CCCAGAGGCT ACGAAAACGC GCCTCCTTAC CCAGGCCCTG GGTATGGACA GCATCCTCCG GGATATCCTC CCTACGGAAT GCCACCTTAC GGCATGTACC CGCAATACCA GCATTCCATG TACGGACCCC CTCCTCCGCA CCACCCCTAC CCACCGCACG GACAACCCCC ACAGCAACCC TACTATGGGG ATCCCAACAT GCCCGGCGGG GGCAACAGCA ACGGACCCCA TCCTCCCTCA TCACGCTACG ATGCACCACT GGAGGAACGG AGCGGACCGT ACGGTGGCAA TGCCGCTTCG GGTGGTCCCA TTGCACCCCG CAGAGGAAGT GGCGTCGATC GGAAACTATC CCGGGCCTCG GCCGAGTCCT CCCGTGCCGA TGACGACGGA GACACGGAAG GCGATGACGA CAACATTGTC CCTGGCGCGG TGACGGCGCG TCTCAAAACC TACATCAAGC CGCGGACTCC GTCCACTCGC GAAGTTCTGG ATCGCCGAGC ACGCAAGAAT GCACAGTCTC GAGCACGCGC GGCGAAATTG CGTTTGCGCA TTGCGGAAAT TGAAATGAAG CCAGAAGACG AACGCACGGA AGAAGAAATT CATTTATGGC AACAGTACGA ATCAAGGAGG CAGCGGAAGA ATGATCGGTC CCGCGAACGC GCACTGGAAA AGAAAGAGGA AATTGATCGC ATTTTGGCCA AGCCCGATAA GAAGCGAACT AAAATCGAAC GCCAATTTTT GGAAACCGCT TTGTCAGCGA AAAAACGTAA AAACGAAGGC GATCGATTGC GGCGACAGAG ACTTAAGGAG CTGGGTTTGG CAACGAAAGG AACTGGAATA AAACCTGGCA TTAGTGCTCG TGGCCCACTT CCGCCTCAGT ACCAAGGCAT GGTGAATCCG CCGCCGCACC ATCATCCCAT GTACGGACAG ATGGGCGGTA TGGGCGGACA CCCGCATCAC CCAATGGGAG ATATTCCCAT GTCACCGCTA CCCAGCATGC CCCACCATCA ACACCATCAG TATGCTCCCA TGCAGTCGCC ACACTTTGGA TCGCCTACTA TGATGACCCA CCCGCATCAC CAACCATACG GCTACCCGAG TCCGCAGCGC AGAGGCCCTC ACGGAACCAC CAGTCGCCAT GAGGGTGGTA TGCCCTACAT GCCACCAGCG CAAGGCTACG ACGCAAGTCC GCCTTCACGT CATCCTCCTC CGGTGCAACA 
GCGTCGCAAT CCGGACGGCT CGATGAGTAT TTCGATTGGA GGTAGAAGTG GACAGCAACA AGGAGGGCCT CCTTTTGGCG GGGAGGTCCG AGGAGATGAC ATATCAAATA TGATGATGGA CAACGACAAC CGTGGTGGAT ATGATCGGAG TGGACCAAGA GATATCAAGG ACGAGTAGGT AAAGACGCTG CGTGCGTTCT GAATGTTTCA AAAACAAGTT TGTTTTGTGA AAGAGTGACG AAGCCCTCAG GCATTGTTGA ACAAAGCTTA AGGGTAGAAT TTAATGTTCG AGACCATTGT GGCCTCGCAA CTAGTATTTA TGTAAATCTG TTTCTGCAGA GTT
|
Protein sequence | MSSNNSTDWS RKRSRGEEKL DEKQGDKDLS STAPRDPHSP PLGGIGPPPP PQGGPPPGNS RGGGPPPPSH QGYGGYPPPH MGYPPPSGPE GQAPYPPRGY ENAPPYPGPG YGQHPPGYPP YGMPPYGMYP QYQHSMYGPP PPHHPYPPHG QPPQQPYYGD PNMPGGGNSN GPHPPSSRYD APLEERSGPY GGNAASGGPI APRRGSGVDR KLSRASAESS RADDDGDTEG DDDNIVPGAV TARLKTYIKP RTPSTREVLD RRARKNAQSR ARAAKLRLRI AEIEMKPEDE RTEEEIHLWQ QYESRRQRKN DRSRERALEK KEEIDRILAK PDKKRTKIER QFLETALSAK KRKNEGDRLR RQRLKELGLA TKGTGIKPGI SARGPLPPQY QGMVNPPPHH HPMYGQMGGM GGHPHHPMGD IPMSPLPSMP HHQHHQYAPM QSPHFGSPTM MTHPHHQPYG YPSPQRRGPH GTTSRHEGGM PYMPPAQGYD ASPPSRHPPP VQQRRNPDGS MSISIGGRSG QQQGGPPFGG EVRGDDISNM MMDNDNRGGY DRSGPRDIKD E