Gene Information |
Locus tag | PHATRDRAFT_34964 |
Symbol | |
ID | 7199949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 767088 |
End bp | 769052 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179287 |
Protein GI | 219116985 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
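The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 767088
end_bp = 769052
gene_length_bp = 1965
protein_length_aa = 654

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                                     # True
print(remainder == 0 and expected_protein_aa == protein_length_aa)   # True
```

Under these assumptions the reported gene length (1965 bp) and protein length (654 aa) agree with the coordinates.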
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACCA TGGCCCTCGT TGTGGGCGTT CTCGCTCTAT CTCTTCGTAT GGGTGATTCC TTTTCGTCAC AATGTACAAA ATTACCACGG AGAACGGCAC TACCGTCTCG GCGTACGAAT CGTTACGTTT TATGGAGCAG CAGTAGCCGT AGGAGTACAG TTGCACCACC GGATCTCGAT ACGGGTGCGA ATCCGATTCC GGACGCCGAC GTAATTACAT CCACCGACGC TGTGGTGGTA GGCGGAGGTC CCACTGGACT CTTGACAGCT ATCATGTTGG CACAAAAGTA CCCCGACCGA ACCCTCCAAC TCTACGATCG TTTGGCGGAA CCGCCGTCTC CAGACGACGA CACCGTCTGG AACGATGTGG CCAAGTTTTA TTTGATCGGC TTAGGAGCAC GTGGACAATA CGCCTTGCAA ACCTTTGGAG TATGGGACGA AGTTGAGCAA CGCTGCGTAG CGGTCGTCGG ACGCAAGGAT TGGCCGCCAG ACTCGGAAGA AGGTGTGGAA CGGATCTTTA CCAAAGAAGA CAAGAAGGTC GCAACGCAAG TCTTGCCTCG TGACAAACTA GTCGGTGTCT TGCATCAGCA TATTCGGGAA AATTATGATG GAAAAATATT CTTAAATTAT GGCTACGAAG TGCATCCGGT AGATTTTGAA TTTCGTGGCG GTAGTCAAGT CCTACTGCAG ATTGTCCAAT GTTCCGAGAC CGTCGTACGG CTGAATCCTT CGGCTGTACG AACAGCTATA GATAAACAGG ATGAGATGCT ATGTGATACC CAAGGTGGCA AGTTTGTGGC GTCAGATTTG GTGATTGCTG CGGATGGTAC GGTTCGTACC ATTGCCAATG CCATGGAACG TCAAGATCAG CAGCGATTCC GTGCAATGAA TCCACTCCAA AGGCTAAGGG CTGGCCGACC TTTTCGCGTG AAGCGATACC TTGATGACAA TCAGCGTATA TATAAAACCA TTCCGATGAA AATCCCCAAG GATTGGCGCC CTGACCTAAA CTACTCGGCT CGTACGAAAG AGGGACGAAT CAATTACGAT GCTCTACCGG CCAACGAAAA TGGGGAATAC TGCGGGGTGT TGCTGCTCAA AAAAGGAGAC CCAATGGCGC AGGCGGACAC CAGTCCAACC GAGCTCCGCC AATTAATGGA CGACGTTCTT CCACAATTTA GTGCTCTTCT GGATGACGAG GTTGTCGCTG CGGTTGCCCA AAAGCCTGTT TCGTATCTAC CGGGCTTTCG GTACGCCGGT CCGCGTCTTA ACCAAGGCGA CCGTTGTGTA CTTCTCGGCG ATTGCGCCCA TACTGTCAAG CCGTACTTTG GACTCGGTGC CAATTCGGCC TTGGAGGATG TTAAGATCAT GAGCGAGATT CTCGATGCCA CGGAACATGA TATATCGGCG GCGGTCCGTG AGTTTTCACG ACGCCGGGCT CCCGAATCGG AAAGCCTAGT GCGCATCTCC CGTGACCTCG ATCGTCCCGG AAAGATTGGA TTCGTCACGT TTATTCTGCC TCTGATCCTG GACTCTATCT TTAGCAAAGC CATGCCGAAA TTGTTCCAAC CTAATATCAT CACCATGCTG CAAAAAGAAA ACTGGACTTT TCGACAGGTA GCATCACGAA AACGGCTGGA TCGGCTAGGA CAGCTTTCCA TCATTGCGGC AGGCTTAACA GGGATGGGTT TCGTTGCGCG AGTGTTAGTT CGTTCGGTGG CAAGAATGAT GGGCAAGAGT ACGACAAAGG TTGCAATGGG ACTTATCGGA GCCGCCTTTG GAATTGGGCT GCTCCGACGG 
TTCGCTGGGC TAGTGGCACC AGGATCAGCA CCAGCCGACG TTGTCACCAA AATGGCTACA AACAAAAAAT CCAAGGAGCA AAGCGATAGC CCACTCAGCA GTCGACAATC ATTTCTGACA CCTCGTCTTG GCTTTAGCAA CAAAGGAGAA AGGAAGTCCA AGTAG
|
Protein sequence | MTTMALVVGV LALSLRMGDS FSSQCTKLPR RTALPSRRTN RYVLWSSSSR RSTVAPPDLD TGANPIPDAD VITSTDAVVV GGGPTGLLTA IMLAQKYPDR TLQLYDRLAE PPSPDDDTVW NDVAKFYLIG LGARGQYALQ TFGVWDEVEQ RCVAVVGRKD WPPDSEEGVE RIFTKEDKKV ATQVLPRDKL VGVLHQHIRE NYDGKIFLNY GYEVHPVDFE FRGGSQVLLQ IVQCSETVVR LNPSAVRTAI DKQDEMLCDT QGGKFVASDL VIAADGTVRT IANAMERQDQ QRFRAMNPLQ RLRAGRPFRV KRYLDDNQRI YKTIPMKIPK DWRPDLNYSA RTKEGRINYD ALPANENGEY CGVLLLKKGD PMAQADTSPT ELRQLMDDVL PQFSALLDDE VVAAVAQKPV SYLPGFRYAG PRLNQGDRCV LLGDCAHTVK PYFGLGANSA LEDVKIMSEI LDATEHDISA AVREFSRRRA PESESLVRIS RDLDRPGKIG FVTFILPLIL DSIFSKAMPK LFQPNIITML QKENWTFRQV ASRKRLDRLG QLSIIAAGLT GMGFVARVLV RSVARMMGKS TTKVAMGLIG AAFGIGLLRR FAGLVAPGSA PADVVTKMAT NKKSKEQSDS PLSSRQSFLT PRLGFSNKGE RKSK
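The sequences above are printed in space-separated groups of ten residues. A minimal sketch of how one might strip that grouping, compute GC content, and translate the coding sequence follows. It assumes the standard genetic code (NCBI translation table 1), because the "Translation table" field of this record is blank. The helper names `clean`, `gc_percent`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the record leaves "Translation table" blank, so table 1 is a guess.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
fragment = "ATGACGACCA TGGCCCTCGT TGTGGGCGTT"
print(translate(fragment))  # MTTMALVVGV
```

The translated fragment matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the reported GC content and protein sequence to be checked in the same way.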