Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37787 |
Symbol | |
ID | 7202764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 74650 |
End bp | 76098 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181994 |
Protein GI | 219123360 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCCA CCGTGACACC GTTGCTACGG TGTGAAGTGG GTGGGCGACT GCGGTGGTTA CTGCCTTTGC TGGTCGTGCT GTATCACCGA GCCGTCGGAG CATCCGCCGC AGTCACCGAC GGGGTTTCCT TCGGCATGGA CTTGCAGTTG CGGAGTTCGG CCGATTCGGA TCCATCCAGC ATGATTGTGC AATTACTCAA CCCGACGATT CTCTTCCTCG ACGACATCTT TACCGACGCA TTCCCTCAAA CCTTTACGCA GGTCCGCTTC ACCGTCAACG AGTACGACAT TGCCGCTGCT ACCGGTGACA ACGTCGACCG ATTCTATCTC GGCGCCATTC GACTCAAGGG AACGGCGTAC TTTCGAGTGG ACGACAATGA GGATGCGGTG GACGGTTTGC CTTCGACCGC GCGGGTCGGT GCGGTAGCCG CCCAAGCTTT TCAGGAATAC GATGTGGAAT ACCTCGAGAC CTTGATCGGA ACCACGACGG ATCCGTTCCT CAACGGACTC ACCTACACCA TTGTCAAGAT CAACGGCGGC GGAGCCGTCA ATCCGGAACA GCCCAGCTCG CTTCCCAGCA GCGACGAGCC GGCCCTGGAG ACCTGGATGA TTGTCCTCAT TGCCGGAACC GCCGTCTTGT GCGCCGTCCT CTGTAGTTGC ATCCTTTGGT TATGTTGCTG TGTCTCGGAC CTGGACGACA CCCACGCGGA GCAGCGTTCT ACCTACCCCG CCGTCACCAA GTCCGACAGC CGCGGGACCA CCAAGACGAC CGAGTCGCCG GAGGCCGACT CGTCCGCTCC CGGACGACCG CGATCCCTCT CCCCCGTGCA TTCCATTGCC AGTCAAGATT CCTCCATTTT CACCTACAAC CCGCGTTCCA CCACCCGCGC ACGCTCCGTG GACAAGCAAG CCTTTGCGTC CTTCGCGTCG CAAACGACGG ATATGGACAT GGAACAGTGG CAGCAAAATT CCGTATTGGG CGGTAGTTCT GCCGACACCA GTATGGATCG CAGCAACCAC GATTCCAGTT CCAGTCTGCC CTTTGGCAAC GATATTTCTG CCATTGAAAA CAAGAAGGAT CTCTCCTTGA TTGCCGAAGG CACCGACGAG GATTCCGCCA CGCCGCTCAA GCCGGAAGAA TTGATGCAGT TGCAAGCCCA GCAACTCGCG CTCGCCGCGC ACAATAACAA CAACGCGCGG GAACAGTCTC GGTCCTATCT CTCCCAAGCT GCGCTCGAAG ACTTGGAACA GGGCCGCCGA CAGCGACAAC AACAGCAGCA ACGCGAATCA GACTCGAACC TTTCCATACT TGCGCGACAG CCGCCACAGG ATGCCACTAA TCACGTCATT CAGGATCTCC ACGATTTGAG TTACGAGATT GCCCAGATAC GGTCCGTCAA GTCCGCGGAC AGTAGTCGAG GAAGAGGCCG CAGTCGCCGT CGTTCCTAA
|
Protein sequence | MVPTVTPLLR CEVGGRLRWL LPLLVVLYHR AVGASAAVTD GVSFGMDLQL RSSADSDPSS MIVQLLNPTI LFLDDIFTDA FPQTFTQVRF TVNEYDIAAA TGDNVDRFYL GAIRLKGTAY FRVDDNEDAV DGLPSTARVG AVAAQAFQEY DVEYLETLIG TTTDPFLNGL TYTIVKINGG GAVNPEQPSS LPSSDEPALE TWMIVLIAGT AVLCAVLCSC ILWLCCCVSD LDDTHAEQRS TYPAVTKSDS RGTTKTTESP EADSSAPGRP RSLSPVHSIA SQDSSIFTYN PRSTTRARSV DKQAFASFAS QTTDMDMEQW QQNSVLGGSS ADTSMDRSNH DSSSSLPFGN DISAIENKKD LSLIAEGTDE DSATPLKPEE LMQLQAQQLA LAAHNNNNAR EQSRSYLSQA ALEDLEQGRR QRQQQQQRES DSNLSILARQ PPQDATNHVI QDLHDLSYEI AQIRSVKSAD SSRGRGRSRR RS
|
| |