Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_27039 |
Symbol | |
ID | 7200690 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 203734 |
End bp | 205874 |
Gene Length | 2141 bp |
Protein Length | 634 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179814 |
Protein GI | 219118063 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.384955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGAATAAGT GTCCTACTTG AGCGAGACCT GCTACTGTGG CAAGATGTTA CAGTACGACG ATAACGGATT TTATTTCTTC GCGCTGAGTA CGCTCAGCTT TTACTTGGTA CCTTGTACGT TCGAATGGGT ACATCGGTTG GTCGCGGTCG ATGTCCGCCA TTCGTCACTC ACGTAGGTCA CCCGTTCCTT GTCTTTTCTC TCTCTCTCTT TCTCGCTCTC CCACAGCCTG GTATTCCATT CTACAAAAAG TGTTCAACGC CTTTTGGGTC AACGATGAAA AGATTGGTGC CGTCGCGCGG ACTTCCGCCG AACAGAAAAA GGCCGATCAG CTCAAAAAGT CGCAAAAGGG CATGAGCGTC CTGCATTCCC AAGGCTTCCT CATCAACGTC GGTATTACCC TCGCGCTCAG TATGCTCTTC GTGTGGCTCT TGTTTATGGT ATCGCAGGAC GGCGAAGTCA ACTCGTTCGA TCCCTTCTCC ATTCTCGAAA TTGATCACGG CTCGGACTCG AAATCGATCA AAAAGGCGTA CCGCAACCTC TCGCTCAAAT ACCATCCCGA TAAGAATCCC GGTAACCGCG CGGCGGAAGC CAAATTCATG ATGGTCAGTA AGGCCTACGA AACATTGACG GACGAAACGG CCAAGGAAAA TTACGAAAAG TACGGCAACC CGGACGGCAA ACAGAGTTTG GAAGTGTCCA TTGGATTGCC GTCGTTCTTG CTCGACACCA ACAACCGTAA TCTAGTCCTT ATGGTGTACC TTGTCATCAT GGTGGGGGTC ATTCCCTTTT GCGTTTGGAC CTACTACAGT GATTCCTCCA AGTACGGAGA AAAGGATGTC ATGTACGATA CCTATTCGTG GTTCCATCAC ACTCTCAACG AACACACGGT CGTCCGAGCC CTCCCGGAAG TCCTCGCGGG TTCCGCCGAA TTCCGCAAAC GCAACATTCC CCGTGACGCG GACGATAAAA AGGCCGTTTC CGCCGCCGTG ACCAACGTCA AATCGCTCAT GCCTAAACCC AAGTACAATC ATCCCGTCTG CGTCAAGGGC AACGTACTCA TGCATTCCCA TCTTTTGCGC CAAGACGTCG CCAAAGTGCA CGAAGAAGAT TTAAAGTACA TGCTGCGCTA CTCCACTGCA CTGATTGATG CCATGATTTC CGTCTGTAAG CATCAAGACT CAATTCAGAC GGCGGCTAAT TGTATTGAAT TCGGACAGTA CGTGACCCAG GCCATGTGGA CCAAGGATTC GCCGTTGTTG CAGCTACCGC ACTTTACGCC GGCAGAAGTA GCACACGTGG ATAAAGGCAA GGTCAAGATT GGAACGGTCC AAGAATATCG CGCGCAGGCG GAAGACCAGC GCAAAGGCAT GGCCACATTT TCCGACTTGC AGAAGAAGGA TATCGCCAAC TATCTCCACA TTTTCCCGGA TATCACGGTT GAATCCAAAG TTTTTGTGGA CGACGACGAA GATGACAACG TGTACGAAGG GGATTTGGTA ACCATTATGG TTACAATAAC TCGGAACAAT CTGGCAGACG GTGAAAAGGC GGGTCTCGTG CACGCACCCC GATTTCCCTT TCCCAAGAAG GAAGCTTGGT GGATTATTTT GGGGCAACTT AAGGAGGGCA AGATCATTTC GATTGATAAG GTTGGTAATT CCAACAAGAA GGTGCAACAC GCCATCAAGT TCTTGGCACC GCCGCAGGGT ACGTACGAAT TCGATCTACT TATCAAATCG AACGGATACG TGGGTGTCGA CCAAAAATTG AAAGTAGACA TGACCACATT GGACAACTCG GCCTTACCGG AATACAAGGT GCATCCGGAT GATGCCGAGC TGGACGATGA GCCGACACTG TTCGAGGAGA TGCTGAACGC CCACATTGAG CAGGATTCGG ATGATGATGA TTCGGACGAG GAAGATTCCG ATGACGAAGA TCAGCCACAA ACAGAAGCCG CCAAGAAAAA GGAGCAATTG CGAAAGGCAC GGCAAGCTGA CAAAGATGAC GACGACGATG ATTCGGATGA CGAAGCGGAA GAGGTGTACG CCGATAAGTA GACTCCTCCG GCGTTACACT TCCTATTTCG GCCCGATACA TTTCTAACTA TTAAGAACTT ATAGGTTGGA TTTACATATA C
|
Protein sequence | MLQYDDNGFY FFALSTLSFY LVPSWYSILQ KVFNAFWVND EKIGAVARTS AEQKKADQLK KSQKGMSVLH SQGFLINVGI TLALSMLFVW LLFMVSQDGE VNSFDPFSIL EIDHGSDSKS IKKAYRNLSL KYHPDKNPGN RAAEAKFMMV SKAYETLTDE TAKENYEKYG NPDGKQSLEV SIGLPSFLLD TNNRNLVLMV YLVIMVGVIP FCVWTYYSDS SKYGEKDVMY DTYSWFHHTL NEHTVVRALP EVLAGSAEFR KRNIPRDADD KKAVSAAVTN VKSLMPKPKY NHPVCVKGNV LMHSHLLRQD VAKVHEEDLK YMLRYSTALI DAMISVCKHQ DSIQTAANCI EFGQYVTQAM WTKDSPLLQL PHFTPAEVAH VDKGKVKIGT VQEYRAQAED QRKGMATFSD LQKKDIANYL HIFPDITVES KVFVDDDEDD NVYEGDLVTI MVTITRNNLA DGEKAGLVHA PRFPFPKKEA WWIILGQLKE GKIISIDKVG NSNKKVQHAI KFLAPPQGTY EFDLLIKSNG YVGVDQKLKV DMTTLDNSAL PEYKVHPDDA ELDDEPTLFE EMLNAHIEQD SDDDDSDEED SDDEDQPQTE AAKKKEQLRK ARQADKDDDD DDSDDEAEEV YADK
|
| |