Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_36668 |
Symbol | |
ID | 7204484 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 91871 |
End bp | 93749 |
Gene Length | 1879 bp |
Protein Length | 579 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185624 |
Protein GI | 219120785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGACG CCAAACGGCA ACGCAATCAA CAAGTCTTGC TGGAATCGGG ACTCTTGTCC CTCGCCGCCA CCGTGCGCGC ATCGGCTCGT CCCGTACCGA CCCGGACCCA CACCGGCCTC ACGGCCCGGA AACGGAACGC GACCCACGAC GGGAAATTCG AATCGGCCAG ACTCCAACGC AAATCCAACC GGATTGCCGG AATTGCCTCG GACGGACTCT ACGTGCAAGA AGAACGGGCC GGACGATTCA CCCTCGCCGG TACCCACGAC ACCAACAACC CTAGCCGTAT TGACAGCATG GACACCGCCA ACACCACCAA CCACAACAAT AACCACAACC AAAACGACGT CAAGGAACAC TACCGGGGAC GCGTCAACGA TGGGGCGGAT CTTTCCTTGG CGCAAGTCGT GGCTCTCGCC GGACCCAAGT GGACCGACGA CGATTCGCTC TCTCACGCGC AAGCCTTGTT TCGACGCCTC CAAAGTGCGG ATTTTGCTCC ACCCGACGAT TGTGCCCTAC CCGACACTCC CCACGCGGCA CACAAGGAAA TGGACAATAC CACCACAGCC ATTCCTTGTA AAGTGGTAAC TCCCGGCAGT ACCCCCACCA ACACTCTCCA CAACACTACA GACACCACCA TCCCTTCCCA CGACCACTTT GTCCGTCAAG TCCAACACAT GTCGGTCGAT TCCGATCACC AGGTTGCCAA GGTCGTCCCC GATCGGATAT ACGGTATTGC TACCCATCCA TCTCCCCACC AGCTCATTGT TTGTGCCGGG GACAAGTCGG GATACGTCGG CATCTGGAAC GTGGATGCCT ATCACCCCGA AAAAGACACG GACAAGGCCG TGCACGTCTT CAAGTACCAC TCCGGAGCCG CCGCTTGCTT ACAATGGAAC TCCAACGGTA CATCCCTCTT GTCGGCTTCG TACGACGGTA CCGTTCGCGT ATTGGACGTA GCTACGGCAT CGGCACAACA AGTTTTTGCC ACTTTCGACG ACGATCCCGT CCACGCCCAC AGGCCTGGCG CCAACACGGA TACCGGCTAC CGTTTCTGGA CGCAGTACGC TTGTTGGGAT GCCTCTGAAC AAGGCCTCTT CGTAGCCACT TCCATCGGCA CCGCGCTGCA CGTGGACTTG CGGACCGCTC CGGCCTCCAA AGTTACCTTT CACGAACAGC TTGCCGAAAA GAAAATCAAC ACGCTCAGGT ACGTGCGTCC AAGCGTCTGT GCGAGCGTGC GATTTCCCCC ACCAATCCGC CAGACTCTAT ACTTTCTACA AAACTCACCC CGATTGCCAT CATTGCCAAT TCTCCGGGTT GGACCTGTTT CTGTGTCAAC GGCTCAGTCT CCACCGCAAC GGCCACACGT TACTTTCCGC CGGCTTGGAT TGCCAGTTGC AAACCTGGGA TTGGCGCAAA CTGGGTGACA ACCGTACGTC CCGACACTCC AAGGCTCCCT CACCGGTGGC ATCTTACCAC TGTGGCAAAT CGGTCAATTC CGCCTATTTT TCTCCTACCG GAACCTACGC CGTGGCGACC ACCATGGCGC ACAAACTGGA CATTTTTACC AATCTGGAAC GGGCCAGCGG CTCGAACAGC AAACCGACTA AAAGTCTTCG CCACGATAAC TTGACGGGTC GTTGGTTGAC CACCTTTATG GCTGTTTGGC ATCCGACGCT GGACGTCTTT GGAGTGGGCA GCATGCAGAA ACCCCGTGCC GTGGAGATCT TCGACCCCAG TCGTACCGTT CCCTTGGTAC GGGCTATCCA AGGCGACGCC TTGACGGCCG TGGCGAGTCG ATGTGCCTTT CACGCTTCGA CGGGTCGACC CGTCCTGGTC GGCGGCAACT CCAGTGGACG AGTCACGATT GTGCGGTAG
|
Protein sequence | MVDAKRQRNQ QVLLESGLLS LAATVRASAR PVPTRTHTGL TARKRNATHD GKFESARLQR KSNRIAGIAS DGLYVQEERA GRFTLAGTHD TNNPSRIDSM DTANTTNHNN NHNQNDVKEH YRGRVNDGAD LSLAQVVALA GPKWTDDDSL SHAQALFRRL QSADFAPPDD CALPDTPHAA HKEMDNTTTA IPCKVVTPGS TPTNTLHNTT DTTIPSHDHF VRQVQHMSVD SDHQVAKVVP DRIYGIATHP SPHQLIVCAG DKSGYVGIWN VDAYHPEKDT DKAVHVFKYH SGAAACLQWN SNGTSLLSAS YDGTVRVLDV ATASAQQVFA TFDDDPVHAH RPGANTDTGY RFWTQYACWD ASEQGLFVAT SIGTALHVDL RTAPASKVTF HEQLAEKKIN TLSLHRNGHT LLSAGLDCQL QTWDWRKLGD NRTSRHSKAP SPVASYHCGK SVNSAYFSPT GTYAVATTMA HKLDIFTNLE RASGSNSKPT KSLRHDNLTG RWLTTFMAVW HPTLDVFGVG SMQKPRAVEI FDPSRTVPLV RAIQGDALTA VASRCAFHAS TGRPVLVGGN SSGRVTIVR
|
| |