Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47924 |
Symbol | |
ID | 7203119 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 438578 |
End bp | 441422 |
Gene Length | 2845 bp |
Protein Length | 884 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182227 |
Protein GI | 219123845 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCCTG GACAAACTTC AGAGGACAAA ACAGGCGAAG AGGATAAAAA ACCTGCGGCA TCTCGGACCA CCAACCTAAC AAGGGCGCGG TCACACAACT TAAGTCCTCC GTTGCCTGAT GCGTCGAGCA ATTCTTCGTT TCCGGCGTAT GCGAACTCGG ATCTCGACCG GAAAATAGCC GCACGATCTT CAGATCGCAC ATCGCCTGGT GCTTATCGCG AAGGGGGGAC GGATGGGGGC AGTGACGAAG ATAATCGTTC GGTAGCCAAT AGTGTTGGTT GCGATTCGGT CACAGAGGCA GGGACCCTGG ACTCAACAGC GGATGAGGAG ACCGGACTTG TTGTCTCCGA AGAAGTATCG GAAACTGATA GCCGGGCGGA TGATTTACGG CCGAGTGCAA GCTGCAATAG GCCCATGACC GATTCGACAC AAAATAACGA ACTCGAGTTG GTGCCACTAC CAATACAACA GCCCCTCGCA GAAGAGTTGA GGCAAGGAGT TCCACAGTCT GCCAAATCTA GTTTTGGTTT GGGGGACGCA GAAACTTCTT TGGACGCACC CTTGGTTTAC ACAATGGTCG ATCCCGTTCC TGCCGCCGAT CTTTGCTCAC AGCCGAGTTG GTGAATGAAG CCCAGGAGCA ACAGCGTATA CACGCCGCCG AGGAGAGAGG CCGTGAGTCG GCGCTCGCAC ACTTTGTCGA GGCCAACGTT GAACCGATGC CGACTAGTTC TGGACGTGAC GTCTACTGGG GTTGCAAGCG TTGCTGGCGC GATCGTGGTC CGTGTTTCTA CTTTTGGCTC GGAGTTATCG TTGTGATTCT AGCTTTGGCG CTGGCTTCAG CATTGACTGC AGCACTCGTC GGTGAAGAGG ATTCTCCAAG CAGTCCGCGG CCTCTACCTG TTGATTTGGA TCCAACAGGA GCGCCGTCGG TACAGAACTA TATTGACCCT GCTACCTGTG ATTTGGCGTT TCCTTTGGCC CTTGATACCG CCCCGACTCG TAGTGTACTG ACCATTCGAA ACAGCACACT TTTTTTTAAT TCGTGTACCG AGAACATTCT TGTCCAGAAT GCAAACTGGT ATACCCTTAC AGCAGAAGAC ACCCCTGCAG TACAGGCCAC TATCTGTGAG GAAGGTGAAG GTAGCTCCGC AATTTCCACA GACACTATTC TTTTTTTGGA CGTCTCGGTA GGAGATTGTG AGGCCAAAAT GTGTGTTCCG ACCGTAATAC GGCTGAAAGA AGCGTCTTGT CTCGTGTTCG CCTGGTTAGT CGAGTCTGGG CAGGCTTACA ACCTTGCCGT ATTCGAGGCG GAGGGTTCGC CAAGGGTGCC ATTTTCGTTG ATGGTTGAGA CAACCCAATC ACTGGACAAT TTGTCTTGTA CGGAAGCGAG GGCAATCACT ATCGGTGAAA GCGTTTCTCT GGATACGTTA GAGTCCGATT CTTATGAGGC GAATGAGCTT GAGTTATACA CAAACTGCGA TGGTTTACAA ACGGACACGC CAACTCGTTG GTACAAAGTC TTCGGACGAG GGTTTGGCGT GACGGCCCGG ATTTGTTCAC GAGCCGCTTT GGCTATCGAG GTACATACAA GCTCGTCTTG TCGAAACAAT ACAGCACACC AATGTGCTAA AGTGTCGGCT AGAAAAGACG ACACTTTTTG CACAGCGTAT TCCTGGGCAT CGGGACTTGA AAGCGAGCAC AGAGTTGCGA TCGCTGCTGC GCAGCCTGAG CAAAATGGCG GATACTCCGT CGAATTTCTT ATTAATGACT CTTGTGAAAA CGCGCTCGAG CTCCCAGAGT TACCTTTCTC TGACTCTGGT AATACCCGGT CAATTGTTCC CAGTTTCAAT ACGAAAGCAT TTGTTTGCTT GCTACTCGGT GATTCGTTTT ATAAAGGCGT TTGGTATCGC TGGACTGCAT CCACGGAAGG TTGCGTTACT GTGTCTGCAA AAGGCGCAAA TGCTGTTCCG TTCGAGGAGA GATTAGATCC CGCTATTGCC ATTTATCAAG GTGACTGTTT GACAACCTTG GAATGCGTTG CAATGAACGA GGACGCGTCC TTTTTCCAAG CAGACTCGGA GGCTACCTTT GATCCCCAAG TAGACACAAC ATATTTCGTT CTTGTCTTTG AAACCGGTGG GGGTTCTGGC GCCTTTGAAG TGGAAATAAA GGTTTGTGCA TTACAAAATG ATTCGATTTT GCCGACAAGT ATGTGGATCT CACCCCTTCA ACATGCTCGT TGCCACTTCA GCAATCGCAG GAAACCTGCT CGAGATCATC TAGTAAGGAA ATGTGTTCGC TTTGTCCGAT GGGCGAAGAA GTTCCCGAAA AATCTTGCCT GTATATCTCT GAAGCTGTAT CTGCCACCTT CGTGTCGGGC GATTCACATT GCTTGTCCGT GCAGTACCGT GGCGTACAAC AATGCGGGTG TGCTGCGTCA ATGTCCGTAT CCTGCAACAC GTGCCCTGAC GGCTCGCTCG CACCTGATAC AAACAAGATT CTACCCCTTT CGAATATAAC TTGCGGCGAC ACTCAAACTC TCCCTCTTGT CGGGAACGGC TCGACTTGCG ACGATACGGC TCTTTCACTA GCTATGTTTT GCGAGTGTCC AGGAAGTATT CCCTGCAGTA TGTGCGGTGA GAATCGACAG CTGACAAATC CGGAACTCAT CATCACCGAA TCAGTTTCGT GCTCGAGTGC CGAGGGCTTG ATTCAGCGAG GCTTGCTCTC AGAGCAAATT TGTGACGTGA CCACTTTCGA TAATTATCTT ACACGAATAT ATCAAAAGCA GAATGTTGTG GGATTCTGTT GCCGCGGAGA ATCGTTACAA GATTTCCAGC TATAG
|
Protein sequence | MAPGQTSEDK TGEEDKKPAA SRTTNLTRAR SHNLSPPLPD ASSNSSFPAY ANSDLDRKIA ARSSDRTSPG AYREGGTDGG SDEDNRSVAN SVGCDSVTEA GTLDSTADEE TGLVVSEEVS ETDSRADDLR PSASCNRPMT DSTQNNELEL VPLPIQQPLA EELRQGVPQS AKSTQEQQRI HAAEERGRES ALAHFVEANV EPMPTSSGRD VYWGCKRCWR DRGPCFYFWL GVIVVILALA LASALTAALV GEEDSPSSPR PLPVDLDPTG APSVQNYIDP ATCDLAFPLA LDTAPTRSVL TIRNSTLFFN SCTENILVQN ANWYTLTAED TPAVQATICE EGEGSSAIST DTILFLDVSV GDCEAKMCVP TVIRLKEASC LVFAWLVESG QAYNLAVFEA EGSPRVPFSL MVETTQSLDN LSCTEARAIT IGESVSLDTL ESDSYEANEL ELYTNCDGLQ TDTPTRWYKV FGRGFGVTAR ICSRAALAIE VHTSSSCRNN TAHQCAKVSA RKDDTFCTAY SWASGLESEH RVAIAAAQPE QNGGYSVEFL INDSCENALE LPELPFSDSG NTRSIVPSFN TKAFVCLLLG DSFYKGVWYR WTASTEGCVT VSAKGANAVP FEERLDPAIA IYQGDCLTTL ECVAMNEDAS FFQADSEATF DPQVDTTYFV LVFETGGGSG AFEVEIKQSQ ETCSRSSSKE MCSLCPMGEE VPEKSCLYIS EAVSATFVSG DSHCLSVQYR GVQQCGCAAS MSVSCNTCPD GSLAPDTNKI LPLSNITCGD TQTLPLVGNG STCDDTALSL AMFCECPGSI PCSMCGENRQ LTNPELIITE SVSCSSAEGL IQRGLLSEQI CDVTTFDNYL TRIYQKQNVV GFCCRGESLQ DFQL
|
| |