Gene Information |
Locus tag | PHATRDRAFT_40489 |
Symbol | |
ID | 7198415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 51638 |
End bp | 52825 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184560 |
Protein GI | 219128732 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0229238 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACA CCCCTAGGTC GGACTTCAGT AAGGCAGCAT TGCTGTCGGC AGTCCGTCAG GCCGCTGGTT CGACACCTGT ACGCCGAAAT AAAATGAGAT CTGCACTGGA TGCGAAGACG GCACCAGATA AGGCTCGAAA ACTAATTGCG CCCACAGTCA AATCGTTTGC GCTTATGAGG CAGGTTTCAC GCTTGGGAAT GGATGACCCC GTATATACCC TAGCTGATCG TGGGGTACCC AACCCTGCGA ATAGAATATA CGACGACATG AATGTAGTTG ATGTCCCTGA TGATGTATTG GAGCAAATGT CGATCGCTTC CGATCCAACA GCAGCTTTTG GAAACGGCAT CGATGTCACA GTTTTTGAAA AAAGTCTAGG CGACCTGCAG CCTCATTTCA GCATGTCCCA ATTTTCTCAG ACGGAGTCGT CAGTACCTGT CGTTCCGCAA TCTCCTTCGC CTAGGTCGAT GGCTACAATC GGATCCAACT CCTATTTGTC GCATATTCAG CAGTCCCAAA TGCCCTCCTC CTTTCGGCGG TCGAGACCCC AAGAAATTAC AATAAGTACG GAAGATACGA GGATGGAGGA TTTGGGTGCT TCCATGCCGT CCCTCGACGT CATATCGCTG GATCTTGAGG AAGCAGTCTT TGGTGGCATG GTCCGCAAGC TAGCACGTAC CGATGATTCC AATACATCAC GAATGAATAA AAGTGCGGAT GATAGCATGT TGGGCGTACG AAATTCGCGA CGGGGCTGTC GTCGAGGAAA AAGCAGCGTA TCCAACAGCA TGATGCGGGA GGCCGCCCTG GCTATGGAAA AAGACGGAAA CACAAATCAC CAATTAATCG CCGATGCTCT CAACAATTCG ATTCAAGACT TGCGCTCCGA AGGCTTGCAA GTCCGCCATG TGCCTCGGAG AACCAAGAGC AATCAAGAAA AGTACGAGGC CCCAGACCCC GTTTTTCCTT CCAATAAATC CGTCAACTCT GCTTCCAGCA AATCTCGTAC ACTCCCCTCA ATTGCTAGTC TTTTCCCGGA AGAAAATGAA ATATTTCCCT CAAAAATTCC ACTCCGTCGC CGGGTTGGTG TGCGAGTGAT ACAGCGAAAG CACAGCGGGG ACACAGATGA GGCTTTGGTA GGCAACCCAG ATCTCTTTGT GCGATCTCTC GTGAAGGCAA TACAGTGA
|
Protein sequence | MDDTPRSDFS KAALLSAVRQ AAGSTPVRRN KMRSALDAKT APDKARKLIA PTVKSFALMR QVSRLGMDDP VYTLADRGVP NPANRIYDDM NVVDVPDDVL EQMSIASDPT AAFGNGIDVT VFEKSLGDLQ PHFSMSQFSQ TESSVPVVPQ SPSPRSMATI GSNSYLSHIQ QSQMPSSFRR SRPQEITIST EDTRMEDLGA SMPSLDVISL DLEEAVFGGM VRKLARTDDS NTSRMNKSAD DSMLGVRNSR RGCRRGKSSV SNSMMREAAL AMEKDGNTNH QLIADALNNS IQDLRSEGLQ VRHVPRRTKS NQEKYEAPDP VFPSNKSVNS ASSKSRTLPS IASLFPEENE IFPSKIPLRR RVGVRVIQRK HSGDTDEALV GNPDLFVRSL VKAIQ