Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_14850 |
Symbol | |
ID | 7203369 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 679638 |
End bp | 681737 |
Gene Length | 2100 bp |
Protein Length | 653 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182583 |
Protein GI | 219124590 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCGCTTCCT ATTCTGGACA AGTCGAAGAA GAGCTACTAG ATTTGGAAGC CTCGTGCATT CAGGTCTATC GAGACAAGGC GGGCGAGATC GTTTCGTTAC GCTCCGACTT GCAAGAGTGT CAATCGGTTC TATCGTCGCT GCAAGAAATG CTGCTGGGAT TCCAGGCTGA TTTGGGAGGT TTGAGCGGGG AAATCCGACA ACTGCAGGAA AAATCCCGAA CATTGGATGT TCAACTCAAG AATCGGCGAG AAGCCGAAGA AGGCTTCCGT CAATTCTTAG AGCATATAAT CGTTGCGCCA AATCTAGTGC ACGCCATTAC AACTGGATCC GTCAACACTG CGTTCCTACA AAGTGTGCAA GAAATCGACC AAATTTACAA AAATACCCAT TCCCCCACTC CGCAGCCTTG GTCCGGCGGC AAGCCGCCTT CGGATACTGT TGCTGGAAAG CAAGTCCAGG AACAAGTTCG GAATCTACGA TTGCTTGCTG TCTCGCGAGT TCGTGACTAC TTCTTGTCAC AATTGGTTAG TTTGCGACAA CCACAGACAA ATATTCGTAT GATTCAAGTC AATGGACTTC TAAAGTACGC GGAGCTTCAA GACTTTTTGG AAGAGGCCAG TCCGGAAATT GCCACTGAAA TTCTGAATGT TTACACCGAG TCCATGGGCA AGACTCTACA GCAGCTCTTT CGTACTTACC AATCGCAATT ATTGCAACTC GACTTATCCA GATCGTCATC ATCTCGGCAC GAGGTCCTGG CCATGGATGA CGCGCTACTG AGAGATACTC TTACGACTCG TGCGAAGAAA CGCGTCGATG TCTTCTGCCT TGGTACCCGA GCTACTGAAT GCCTGGACGA AGACGCTTCG AACCATGCGC AGCCAATACT GGCTCATGTA GCGTTGGCCG AACGTCAACG GTACCATTAT GAACGATTAT TTCGTTCTAT TCTTGGGCAC TTGGTGGACG CCGTAACAAA CGAACATGTT TTCGGTCGTC AATTTTTCAA GCGAGATATC TTCACGCCCC TCTTCCAAAA TACACTCAGT CTGCTATTGG AACAATCTGA AAACTACTTG TTTGGTTGCT ACGACGCGCT TTGTCTCCTT CTAATGATCA AGGTGACGCA TTACTACCGA CGCTTGATGC GATCGCGGAA AGTACACTCG CTCGATGGTT TTTTCGATCA GCTCACTAAT TTGCTTTGGC CACGCTTAAA AACTGTTATG GAGGGGCACT TACGGAGTTT GAAACAAGCG ACGGCTGTCA AGTTGGGTGG TGTTGACTTG CACCCCCATG TTGTCAGCCG ACGATTTGCT GAGTTCTGCT GCAGTATTCT ATTGATTTTA CAAAACAAGG CTTTTCACAA ACAGCACGGA CTTGGCAAAA TATCGGGAGG AAAGTCAATG CAATCGCCTC CTGCGAAGGG ATGGCCCGTT GAAAATTCGA CGCCAAGCCA TGCTGGTTTA GACGACTCGA TTCGAAGCAC GGTGGCGAAC AGGAGTGCAG GGGACATGCT GTTGGAGGAC CTAACAGAAA TGGTGGACGC TTACGTAGCT CTGATGGAAC GATTGTCCGA CGAACACACG TCGCAAAAGA GTCGTGTCGT TTTTTGGATC AATAATTTAG ATGCAGTCGT CTGTATTTTT CAAGAACGTC GCGTTGTTGG CAAGGAATTC AATCGAGTTG TTGAGTTGCT GATGCAACAA CGAGAAGTGT TTGTCGAAGA GGAGCTCTTG ACAGGATTTT CAAAAATGAT TGCCTTCGTT CAACAGACTG AGGCGCACAT GGCGACCACG CCAAGGGGTG AAACCTACGA TGCCAACGCA GCCGTCGTCG AAGCACTGGT ACTAGACTTT GCCTCCAAAT GGAAAGGAAA TATTGACGTT ATCAATCGCA ACGTACTATC CTACTTTTCC AATTTCCGGA ACGGAATGGA AATTCTCAAA CAAGTATTGA CGCAGCTGCT CCTTTACTAC ACGCGCTTTC AAGATATTAT TCGAAAAGTT TGGAAAAATC GATTGCCACC CTTCTGTGAA AATCTGATCA GTACCAACAT TATTCTGACC GAGATTAAAC GTTATGCACT GGCGATATGA
|
Protein sequence | LASYSGQVEE ELLDLEASCI QVYRDKAGEI VSLRSDLQEC QSVLSSLQEM LLGFQADLGG LSGEIRQLQE KSRTLDVQLK NRREAEEGFR QFLEHIIVAP NLVHAITTGS VNTAFLQSVQ EIDQIYKNTH SPTPQPWSGG KPPSDTVAGK QVQEQVRNLR LLAVSRVRDY FLSQLVSLRQ PQTNIRMIQV NGLLKYAELQ DFLEEASPEI ATEILNVYTE SMGKTLQQLF RTYQSQLLQL DLSRSSSSRH EVLAMDDALL RDTLTTRAKK RVDVFCLGTR ATECLDEDAS NHAQPILAHV ALAERQRYHY ERLFRSILGH LVDAVTNEHV FGRQFFKRDI FTPLFQNTLS LLLEQSENYL FGCYDALCLL LMIKVTHYYR RLMRSRKVHS LDGFFDQLTN LLWPRLKTVM EGHLRSLKQA TAVKLGGVDL HPHVVSRRFA EFCCNDSIRS TVANRSAGDM LLEDLTEMVD AYVALMERLS DEHTSQKSRV VFWINNLDAV VCIFQERRVV GKEFNRVVEL LMQQREVFVE EELLTGFSKM IAFVQQTEAH MATTPRGETY DANAAVVEAL VLDFASKWKG NIDVINRNVL SYFSNFRNGM EILKQVLTQL LLYYTRFQDI IRKVWKNRLP PFCENLISTN IILTEIKRYA LAI
|
| |