Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50474 |
Symbol | |
ID | 7199272 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 159756 |
End bp | 162213 |
Gene Length | 2458 bp |
Protein Length | 747 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185391 |
Protein GI | 219130478 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0241823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGATC AGTCGGTCGG ATGGGACCAT GAAAAGGAAA ATGTCGGCTA CTTTCATAAG CATGTAGAGA AGGGTGGGCT AATTATGGTC AGTCGCACGC CGCTATGCAG ATTCGGGTAT CGGAACCGAA ACTCTTTCTC GGCTCTTATT GTATTCATCT ATTTAGTGGC GTACACTTTG GCCTGGTGCA GACCTTCGAT CAGTTGGAGT CGCCGGATCT CCTGCTGTTG CTTTTCAAGT CCTTCCTATG CAATCAATCA GGCTTCTAGA GACAACGACG AGGCAAACAT TGATCTCGCG GTCTTGTCCG TACCACTGGC TGAAGCGACG TCCACACTGC TAGAACTCTG CTCCTCCAAC AACACAAGAA TCGTACAGAT GGAAGGATAC GTAACGGCGA AACGAGGCTT TGGGTCCTCC TTTTGCTTCC TGGATCTGTC CGAACGGGGG TGGGAACGTA GACCTGTACA AGTCATGCTG AAGAGACAAA ATTATAGCCC ACCAATGGGA GCAGATAGCG ATGGGAATGC ACCTACATTC GATGGGATTT TTAGGTCAAT GCTCCCGGGA ACATACGCTT CCATCACGGG CGTTGCGTCG CCGACCCGCA ACCCTGGGGA AGCTGTTTTA TTGTGTCGCG AAGTAGATTT GTTGGGCCTC CCGCGCAACC CGCAGCATAT TCGTGTGATT CTGGAATGCA CCGCCAAAGG TCTATTGCCC ATTGCGAGTG TCGCTCGTCT CTATAACCAA TCGGCCGAGC AACTTCAGCG TGATCTAGAA TGCAGCGGCG AAGGATGCTA TGGAAAGCCT TTCGACAGCC TAGCCAAGAT GGTTTTGCGG TCGCTGCCAG CCGATGAACG GTACCCTGAT TTGGTGAAGT ACAAGCAAAG CTTCCGGCTA CCCGTTGCAC CCGCCGAGAC TTACTCTTTG CCAGAAAGCG TTAAAATGGC GACAAAGGTC TCGGTTTCTG GAATTAAAGA GAGCGCTCCA CCGCTTTCTG TCGAGGCCGT CTTGTCACAA TTTTATGAAT CAGAGGACAT AGATTGTTCT GGCTCCACTT TGCAGCTCCC AGCCTGTGTA CATGGATGGA TCCAGAATCG TCGACGATTC GATCGCAATG TGACGGTCCT GGAATTGGTG GATTCCTTGC GGGAAAGCGA CAATAGCACT ACACTGGAGA GCGCGCCTCA CTTTTGCCAG CGGCTGAAAT GTGTTTCGCA TCCCCAAATA CTTGCCGTGT CGGACACGAT GGCGCATTTG CTGGCTCCTT CGGCCCAAGT GCAAATACAA GGCGTGGTGA TCGGGGATAA GAATGATGGT GCACCGACGC TTTGGATAAC AAACATCCGG CTGCTCCAAG CAAGCTGGTG GCCTTCGGTG ACGCGCTTTC TTTTGGAATT GGTTTTGGAA AAGCGATTTA GTGTCCCCGA TGCAGCTTTG GCCTTGCAAG TGTCGGAATC GGAAGTTGTC GAGGCCACCA ACGCGAACGC ATTGGACTTG ACAGCCCGTC AGTGGAAGGC TGCCGAGTTT TCGCAAGCGC TAAAGCTCAA GGCGCAAGAA GGATCCTCCG TGACTTGTTC CACAGAAGAG ATTTTCATTC TGGAAAAGTA TGAAGCAAAG TGTGCCGATC AGTTTCCCAT CCAAGATGTA ACGAATCACG TGTCTGAAGT CCATTTTTCG TCGTCCACGT CGTCTACGAA TGAAACAGTG GTTACACACT CCCGTGAGGG GAGTCGTTGG AGAAGGCAAA AAGAGCCGCA GTTGGTTTGG ATGGGCAAAC AAGTATTGGA GGTTCTACAA ACACACCCGA ATTGGAATAG CACACCAGGT CAGTGAGTTT CGTATCCTCG ATGTTGGAGG CGGCCAAGGT TGGTTGGCGA ACCATTTGGC CCAAACCGTC CCGGAGGCTC GAATTCAAGT TATCGATATT GCTTCGGGTG CCGTCCAGAA TGGTGCCATG CGTTCCCGGC GCCTCGGACT ATCCAACAAT AACCGGGTAT CGTATACTGT TGGCGATGCA TCTTCCCCAA ATTTGACGCT CTGGGAGGAT GACTTCGACT TGGTCGTGGC GTTGCACGCC TGCGGAGGCT TGACCGACGT GGCTTTGTCG CATGCGCGGT CCCGCCAGAT CCCATTTGTA ATTTGCCCGT GCTGCTACCG TTCGAATGCA CATTTACAGG TACAAGCGTC GTCATCGTCA TCGTCGTCCA CAAGTTCCGT CAATATTTCG AGATGGCTCG ACGTGGCGCC AACGACCGGG GCGGATGAGT ACACCATCTT GACGCGTTTG GCCGAAACGC AGCACGATCT GGTGCTGTCC CGCCGGGCGA CGCACGTCGT TGGTCGCTTA CGCGCCGCCG CGACGGAACG AGATATACCC AACATCCAGG TATCGCTGTG GACCTTTCCC GTGGCTTTTT CCACCCGCAA CCTCTGTTTA GTGGGTAAGT ATAGATAG
|
Protein sequence | MMDQSVGWDH EKENVGYFHK HVEKGGLIMA SRDNDEANID LAVLSVPLAE ATSTLLELCS SNNTRIVQME GYVTAKRGFG SSFCFLDLSE RGWERRPVQV MLKRQNYSPP MGADSDGNAP TFDGIFRSML PGTYASITGV ASPTRNPGEA VLLCREVDLL GLPRNPQHIR VILECTAKGL LPIASVARLY NQSAEQLQRD LECSGEGCYG KPFDSLAKMV LRSLPADERY PDLVKYKQSF RLPVAPAETY SLPESVKMAT KVSVSGIKES APPLSVEAVL SQFYESEDID CSGSTLQLPA CVHGWIQNRR RFDRNVTVLE LVDSLRESDN STTLESAPHF CQRLKCVSHP QILAVSDTMA HLLAPSAQVQ IQGVVIGDKN DGAPTLWITN IRLLQASWWP SVTRFLLELV LEKRFSVPDA ALALQVSESE VVEATNANAL DLTARQWKAA EFSQALKLKA QEGSSVTCST EEIFILEKYE AKCADQFPIQ DVTNHVSEWL HTPVRGVVGE GKKSRSWFGW ANKYWRFYKH TRIGIAHQVS EFRILDVGGG QGWLANHLAQ TVPEARIQVI DIASGAVQNG AMRSRRLGLS NNNRVSYTVG DASSPNLTLW EDDFDLVVAL HACGGLTDVA LSHARSRQIP FVICPCCYRS NAHLQVQASS SSSSSTSSVN ISRWLDVAPT TGADEYTILT RLAETQHDLV LSRRATHVVG RLRAAATERD IPNIQVSLWT FPVAFSTRNL CLVGKYR
|
| |