Gene Information |
Locus tag | PHATR_18551 |
Symbol | |
ID | 7204375 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 514714 |
End bp | 516538 |
Gene Length | 1825 bp |
Protein Length | 471 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186362 |
Protein GI | 219113557 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.507401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCCGCTTCA GAGAAACAGT TCAACCTAGA CTAACTGTAA GCGCTCCAGT TCCCTTGCGC GAGTATACAT AGCAGAACAC CCAGCCGACT AGGCTGTATG TAACCGGGAT TGTGAGGTGA TTTGACTGCA AATACGAAAA GCGAGCACAT TGCATCGACG AACATGAGCA CTACAGTGAT GGGGAAAAGC ATCTTTGCGA TCGGCGCCTT TTTGACCGCT TCTTTGTACA ATTTTGCAGC TTACAAGAAA GAAACTCCAT CGGATGTACG GGTGCATCTT GCGGTCCAAG AACGTACTGC CAGTGCTGAG GACGATCGCA TAACCATGTT GCCCGGCTTA GACTACGATC CAGGGTTTGA ACAATTTTCG GGGTATTTGG ACGTATCGGC AACGCGACAC ATATTTTACT GGTACATGGA GAGCCAGTCT GATCCTGCCA ACGATCCTGT CGTGCTCTGG ACCAAGTAAG TATTCGACCG ACGCACAGAG AAGCAACACA AACTTTGAGC GATTTATTTT GCTAATTCAA ACTTGTCGTG TTCCGGATGT CTTTTCTCCA GTGGCGGACC GGGATGTTCA GGTTTACTGG GCATGGGTGC TGAGCACGGG CCATTCTACA TTTCAAAAAG TGGGAGGCTT CACGACAATC CATACAGTTG GAATAAGGTT GCCAACATGA TCTATTTTGA ACAACCTGCT GGAGTAGGGT TTTCATATTG TGATGCAGCG GAGGACTACA TCACTGGCGA CGAGCAAGCA GCAGCAGACA ACTACAACTT TATTGTGGAG TTCTTGCAGC GCTACCCAGA ACGCCAGACT AACGACTTCT ACGTATCGTC CGAGTCCTAC GGGGGTCACT ATATTCCCCA AATGACTTTG GAAATTCTTC GTCGTGATAT TGACCATTTC GTCAATTTCA AAGGATTCTT GCTTGGAAAT CCATATGTGG ACCCTTTATC GAACATGGTA ACTCAGTTTG AAGCTTACTA TAGCCACGGT CTCATAGCCA AACCGCTCTT TGACGATTGG AGTAAAAAAT GCAAGGATTC TAACTACTGG ATGTCCAGAG AATGTGACCA AATCACCACA AACATGTTCA AACAGTTTGG ACACGGTATC AATCCGTATG CCTTGGATTA TCCGGTTTGT AAGAAAGATG CTGCTGAGTA TTCCCATCTG GAGCGACCAG TCAGCAATCG TAACCACAGA GTCTTGAAAA CGACGAAAGA CGGCCACGAC CCTATGGCGA CCGCCACGCT TGACATTTCT AACCCGTCGA CCTCGTCCAT CGAAGTCGAC ACCTTGCTGA ATCGTACGAA CGGCCAAAAC ACGCAGAGTG ATCCACAAGC TGCTTTTAAG CCTTGTTCAC AAGAGTTTCT TGAAAACTAT TTGGATCGAG AAGAGGTGCG GGATGCCTTG CATGTTGCAC CGAGTGCTAA GCCCTGGGAT GTTTGTGGAG GCGTACGATA CTCGAAATCT GACGTAGATA TTCCGACGAT TGGGCTTTAC CAAGAGCTTA TCGATCAAGC CAAAGCCGGC AAGCACGATC TCAATATGCT CATTTATTCC GGCGATGATG ATAGTATCTG CTCGACCGCC GGGACTCAAT ATTGGCTCTG GGATCTAGCC GAAGCATCAT CAATCTGGAA GGCATGGCAA GCTCAAGAGC AAACCTCCGG ATTCGTCACA ACTTTCGATC TGGGAGACAA GACCAACGCT ACCTTCACAT TTGTGACAGT GCACGGAGCT GGCCACGAAG TACCCTCCTA CCGCCCTGTG GAAGCTCTGG AGATGTTTCG 
ACGGTTTCTA GCACATGGGT TCTAG
|
Protein sequence | MSTTVMGKSI FAIGAFLTAS LYNFAAYKKE TPSDVRVHLA VQERTASAED DRITMLPGLD YDPGFEQFSG YLDVSATRHI FYWYMESQSD PANDPVVLWT NGGPGCSGLL GMGAEHGPFY ISKSGRLHDN PYSWNKVANM IYFEQPAGVG FSYCDAAEDY ITGDEQAAAD NYNFIVEFLQ RYPERQTNDF YVSSESYGGH YIPQMTLEIL RRDIDHFVNF KGFLLGNPYV DPLSNMVTQF EAYYSHGLIA KPLFDDWSKK CKDSNYWMSR ECDQITTNMF KQFGHGINPY ALDYPVCKKD AAEYSHLERP VSNPAFKPCS QEFLENYLDR EEVRDALHVA PSAKPWDVCG GVRYSKSDVD IPTIGLYQEL IDQAKAGKHD LNMLIYSGDD DSICSTAGTQ YWLWDLAEAS SIWKAWQAQE QTSGFVTTFD LGDKTNATFT FVTVHGAGHE VPSYRPVEAL EMFRRFLAHG F
|