Gene Information |
Locus tag | PHATRDRAFT_50439 |
Symbol | |
ID | 7199252 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 65197 |
End bp | 66947 |
Gene Length | 1751 bp |
Protein Length | 331 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185371 |
Protein GI | 219130436 |
COG category | |
COG ID | |
TIGRFAM ID | |
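
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is an assumption about this record's convention, although it is consistent with the listed Gene Length. The function names `feature_length` and `min_cds_length` are hypothetical helpers introduced for illustration only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: 3 per residue, plus a stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values copied from the record above.
gene_len = feature_length(65197, 66947)
cds_len = min_cds_length(331)

print(gene_len)            # 1751
print(cds_len)             # 996
print(gene_len - cds_len)  # 755
```

The 755 bp left over is the part of the gene span that does not encode the 331 aa protein, which fits a record marked as spliced. How those bases divide between introns and untranslated regions cannot be determined from these fields alone.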
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0618415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CAAACTGTAA ACGCACAAGT TAGGAGAAAT GCGACGTTTT GTCCAGTTTA ACGTCCTCGA TTTAGCGGCG ATACGTGTAA GGCCGTCGAC CTTCGGCCAA AAGAGTAACT GCGACTATAT GAAGGTGGCT ACTAGCTTTG AAGGTAGATC CATGGTCTTG TGGTTCATCG ACAGCAAAAC TCTGTTTCAT GGTGGCTGTC ATCAAAAGTC AGTAAATCCA AAGCAGACAT ATATTGGCTA GTGTGCTTTC TGGCCATCCC CTATTCACTC ACACTTGTTT TTGATTGCCT GTGACAATGG CAGAGAGTAA CATATGTTTG CACAATTCAG GGAAAGGCCT CAAAAAATTG AGTGCGGCAC TTGCATGCAC CATCGAGCGC AGGCATGTCC GAGGTGACCG AACTTGAAAT CGGGGACGAG GTGTTGCAGA ATAACGGCGG GTTTTCCTTG ACTTGTGACG TGCCAAGTTC ACCTTTCGGA GACTCCGACG TGTCTTCCGT AGAAGGGAAC GAAGAAACGG CAGTGATGGA AGTTGAATAC ATTCGTACCA CAAAAGTAGA GCAATCATTC GGCAGTAAGG CTATTATTTC TCTTGGAAAA CAAGCCACGG TCGGTGCTAG CGATGTATTT GTCGGCTCAA GCCACCCTTT TCATCACACA TCCGAATCGG CTTCGCCGCA AGCAATCAGC GTTAATACAT TCAAACACCT TGAAGAGAAT GCAACGAGCA CGACGAATTC CGTAGCGGGA AAACATATAG GAAAGCGGTC GGATCACCAA CCGCAGAAAC AGCCGAGATC GTCTTTTGCT TTACTTGAAG AACGTTCAGC GGGTACGGCC TTGAGGCCGA AGACCGCCTC CTTAGATCAG GATGGGCGTT CAACGCCGCG GGCTGCGCTC ATGCACAGTC TACTACTGAG TACTTCAGTT CCGCCAATAT CTTCGGTGGA TCCGGTTTGT AGATCGTCAA AGGAGTCCCG ACAAGGGCAG CGTTACATCC CAGTCGAAAC GACCTTCTAC GTGGACACCG TTTTGAATCG AAATGAAATG GAGTTACTTT GGCTAGGAGA AGAGAGCATG TGTTGCCTGT ATAGAACCGA GTTCGACTTT GACTTCGTAT GGGAGTATGC TTCCGAATCG CTGAGGATGG AACTTAGAAA GTTCAGCCCG GACTCATCGG AAAACCAAAA AATTATGCAA ATTGTGCGCA CGGAAGACAT CATCATTGGC AAACGATTTA CGGTACGTGA AGCTTCTATA AATTGGCGTT ATTTAAATAT GCACTAGCTC CTGATGATTA TTTCTCCAAT ATTGCTAGAT TTTGAGAGAA CAGGTCCGCC TCGCGCTCGG AAAGGGTTTT CGGCGTCCTA TGATGAAATC GGTTATTGCC GATCTCCGGT TGCAGCAGAG AGATTCGAAA CTCAAGGTCT TTCCGCCTTT GAAGCCGGGG GTGTGAGACT GGAAGGCAGA TTGGAATGAA TTGCTGCTAT GTAATTTCTT TAAATTTTTA CAAGCGGCAG TTGTTTTTAT CCTGCGACGT CAAAAGGTCT TCTTTGTTTG CGAAACGCAA AGGCTGCTAG GACCGCCAAA ACTTTTTACA CGACAATCCT TGATGTGATT TCAATTGAAA GAGCACACGA GATTGTGGCT GCTGACCAAG GTCGCCCATA CATTCGACTG CTTCGACAAT TGTCGCCATT CAACTTTGCA GAGAATTGCT GTACTAATGT AAATGAAACG TTTAAGATTG T
Protein sequence | MSEVTELEIG DEVLQNNGGF SLTCDVPSSP FGDSDVSSVE GNEETAVMEV EYIRTTKVEQ SFGSKAIISL GKQATVGASD VFVGSSHPFH HTSESASPQA ISVNTFKHLE ENATSTTNSV AGKHIGKRSD HQPQKQPRSS FALLEERSAG TALRPKTASL DQDGRSTPRA ALMHSLLLST SVPPISSVDP VCRSSKESRQ GQRYIPVETT FYVDTVLNRN EMELLWLGEE SMCCLYRTEF DFDFVWEYAS ESLRMELRKF SPDSSENQKI MQIVRTEDII IGKRFTILRE QVRLALGKGF RRPMMKSVIA DLRLQQRDSK LKVFPPLKPG V
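
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample input is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the 47% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, copied from the record.
sample = clean_sequence("CAAACTGTAA ACGCACAAGT TAGGAGAAAT")

print(len(sample))                   # 30
print(f"{gc_fraction(sample):.3f}")  # 0.367
```

Applying `clean_sequence` to the full Gene sequence and Protein sequence fields and comparing the resulting lengths with the Gene Length and Protein Length fields is a quick way to confirm that a copy of the record is complete.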