Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39650 |
Symbol | |
ID | 7195283 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 348252 |
End bp | 349792 |
Gene Length | 1541 bp |
Protein Length | 466 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183730 |
Protein GI | 219126994 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTCGA ATCTTTCGTC GCCTCTTTCT CTCGCGAGTA TATCCTCCTC CGACGTCTTC CGCCCAACGA GGCGCCTTCA GGATCTGTTC GGTTTCCAAC AATACCACCA CGATGACAAC AGTGAAGATC AAGTCTTGAC CGAAATCGAA GCAGATTGGC ATGCTATTGT AGACGCGGGA TACTCGTGGT GTGATGTTTC GCGAGTTCTT GAAGGACGGG TGGTCTGGTT GAATCATGAA AATAGGGATG GTGACGTATT CTTGGCATCC CGTACCACCG AAGTGTCCTG GATAGGATTT TTGACGAGAA ATACTATTGA GTCCTCCACA ACTTGCCAGT TTTCCATTGA TGTCGCTTAC AGTAACGGAG GAAAATGTAC ATTAAATGAC GCTCCAACCG CACGAAACGT TGGGATTTAC TCCGATTCTG TCACGCGGGC GGCACAAGTA CTCCAAGATA TTCTCAGCCA GCTGGTAGGA TCTCAGAACT GTCCTTCGAC ACCGTCATTC GAGGAAGTCA CACTAGCCTG TTGTCATGGG AATACAGGAG ACAGGACTCT GCCCCTAACT GCCTCTCTGT TGAAGAAATG GCTATGGGAT GGCAGTGAAA GTATATCATC ACCTCTCCGA CGCTTAGTTT TTTGCAATGT TGTACTTTCC GCCGAGCAAT GTCGAGCCAT CACGCAGGCA ACATCGTTTA GCACTAGGAA CATGGAACTA AGGCTGGAGC GATGCACATT TGAAGACGAC GGAGAAACCT TGGTGCAAAT ACTACGCCAA AGCGATTCTG CTATTCGTCG ATGGAGCTTT TGGAATGGAC TCGGAATGAG TACGAACGCT AGCTTGAATT TTTTTGCCGC GTTGGGCGAG TCGGACCAGC TCGACAGCCT TGATCTTTTT TGCATCCCCT TAGAGCCAAC ACAGTGGGAG AAGATGGTCG CGTGCATTGC CCAAAGTCAG AGTATCCAAA ACCTGCGATT GTTTTGCCGA CGAGGGATAT CGAACGGTCA ATGGAATATG CTTTGTGAAG CTCTGCGAGA TCACCATGCG ATCCGGAAAC TGTCGGTTAG GTATGCGTTT CCCTTTTCTG TACCTCATTT GTCAGAAGAG GCCAGAAATC GACGAACACA GATCCTGCTG CAAGCTTTGC AAGATAACCA CACCTTGGTA AACGTACAAC TTTCGAGTCA AGAACACGAC TTGGAGTTGT ACCATCCAGC GCTACGTGCT ACTCTCCTCC TTAATCGGTA CCGACCTTTG TGGTCGACGT TGTGGCATCC TCCAGAGACG GCATCATTCG ATCCCGCACA TTGTCAGGCG GCACGCCTGG TACACGTCTT GACCAGACCT TGCCTTCAAA ATAATCCGGT CGAAATCCAT TTAATACTTC GCTCCACTGT AAGTCTTTGG AGCAGTTGGC ATTCCTCAGC AATTGCAAGG TGAGGTCCGG TTACTGACTG TGACAGTGAA ACCAACGGCA ATCCCCGTAT GCAATAGTTC GTATTTCTAG AAAGATTAAC TGTAAACTAA TGAAACAGTG A
|
Protein sequence | MNSNLSSPLS LASISSSDVF RPTRRLQDLF GFQQYHHDDN SEDQVLTEIE ADWHAIVDAG YSWCDVSRVL EGRVVWLNHE NRDGDVFLAS RTTEVSWIGF LTRNTIESST TCQFSIDVAY SNGGKCTLND APTARNVGIY SDSVTRAAQV LQDILSQLVG SQNCPSTPSF EEVTLACCHG NTGDRTLPLT ASLLKKWLWD GSESISSPLR RLVFCNVVLS AEQCRAITQA TSFSTRNMEL RLERCTFEDD GETLVQILRQ SDSAIRRWSF WNGLGMSTNA SLNFFAALGE SDQLDSLDLF CIPLEPTQWE KMVACIAQSQ SIQNLRLFCR RGISNGQWNM LCEALRDHHA IRKLSVRYAF PFSVPHLSEE ARNRRTQILL QALQDNHTLV NVQLSSQEHD LELYHPALRA TLLLNRYRPL WSTLWHPPET ASFDPAHCQA ARLVHVLTRP CLQNNPVEIH LILRST
|
| |