Gene Information |
Locus tag | PHATRDRAFT_40831 |
Symbol | |
ID | 7198773 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 48114 |
End bp | 49495 |
Gene Length | 1382 bp |
Protein Length | 374 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184882 |
Protein GI | 219129409 |
COG category | |
COG ID | |
TIGRFAM ID | |
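
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported Gene Length of 1382 bp. It also assumes that any part of the span that does not code for the 374 aa protein (plus one stop codon) is intron, which is plausible because the gene is marked as spliced. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 48114
END_BP = 49495
PROTEIN_LENGTH_AA = 374


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)
coding_len = cds_length(PROTEIN_LENGTH_AA)
# Assumption: everything in the span that is not coding sequence is intron.
non_coding_len = gene_len - coding_len

print(gene_len, coding_len, non_coding_len)  # 1382 1125 257
```

Under these assumptions, about 257 bp of the 1382 bp span would be removed by splicing.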
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.353784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACCGA ATCAGCACGG TCAATTTCTA CGCTCAGTGT CCGAACTCTT CGACAATTTG CGAGCTGCGA TACCTGGTAT AACGCAAACA GCAAAGAAAG CAAGTAAGTA GCTACCCGAT TGGCTGTCTT TACTCTAAAA GTGAATGGAG CCGAGCATTC AAGGGGGCGT GGGAGGGGCG GGGTAGCGAT TGGGGATGCG GAGACTGTGC CGGAATCGGT AAGGGCAACA ATAACGACTT CGCAGTGGGG CGTGTCCGGA TAAAGAACGT TGAGATCTTC TTATTGGGCA CGGCACACGT TTCCAGCGAT TCTAGCGAGG AAGTGAAACT TCTGCTCCGT CATGTGCATC CCGACGCCAT TTTCGTTGAG CTTTGTGAAG CTCGCATACC TCTCCTTGAA GGAACGGCGA AGGACGAACA CGAAGAAGAA GCATTGGCAC ACCAGAATCG CACGATGTGT GAAAAAATAC GGCAGGTACA GTCCACACAG GGAGGCTCCC GTCTTCAAGC TCTTTCCACA GTTTTGTTGA CTTCTGTCCA AGAAGACTAT GCATCCGAGT TGGGAGTAGA GCTGGGAGGC GAATTTCGGG CCGCATACCA ATACTGGCAA GCGCAACAAT CCATACCGAC TGGAACAAGT TCTCAATCTT GTGCTTTGAT TTTGGGCGAT CGTCCTCTAC AATTGACACT TGTACGTGCC TGGGAGTCTC TCGGGTTTTG GCCCAAGGTA AAGGTTTTGC TAGGTCTGCT TTGGAGCTCA TGGCAAAAGC CGAAAAAGGA GGAAATCCAG GAGTGGCTAC AGTCTGTGCT TCGGGACGAA ACAGATGTTC TCACGGAAAG TCTGAAAGAA CTGCGCCGTC ATTTCCCTAC CCTTTTCACA GTAATTATTG CAGAACGTGA TGCATGGCTA GCTGCCAAGC TTGTACAAAG CTGTCGAGTA TTATCAGCCT CAGCAACAGC AGCTTCTCCT GTATGCACGG TCGTGGCCAT CGTTGGTGCT GGACATATCC CAGGAATTGT AGCCTGGCCC CACATTGTTG CGCACTGTCA AAGACAGTGT CGTTACGAGG AACATCCTTG TCGAAGAGTC GTTGATTTCC GTTCGGAGGC GTTGTCAGTT GAGGTAGTCC AACCGACATG GTGCTTGTGC AAGTTCCAAA GAATTGCACC CCTCACTCAC TTTCATATCA ACCTTCTGTT CTACAGATCC ATACAGCGCA GCGTAAGTGA GAACAAAGCA ATAGGTTTTT GATGCGATAA CAGTTTCTCG GGATCACGAG GATGACACTT TTTTGGCTTT GTACAAGAAC AAAGCAGGCG CAAAGATAAG CGGTCGGAAG TCAAAGTCAA AGTGTACAGG TACGACGGCC TTTACCGAAT AG
|
Protein sequence | MGPNQHGQFL RSVSELFDNL RAAIPGITQT AKKAMGRVRI KNVEIFLLGT AHVSSDSSEE VKLLLRHVHP DAIFVELCEA RIPLLEGTAK DEHEEEALAH QNRTMCEKIR QVQSTQGGSR LQALSTVLLT SVQEDYASEL GVELGGEFRA AYQYWQAQQS IPTGTSSQSC ALILGDRPLQ LTLVRAWESL GFWPKVKVLL GLLWSSWQKP KKEEIQEWLQ SVLRDETDVL TESLKELRRH FPTLFTVIIA ERDAWLAAKL VQSCRVLSAS ATAASPVCTV VAIVGAGHIP GIVAWPHIVA HCQRQCRYEE HPCRRVVDFR SEALSIQRSV FDAITVSRDH EDDTFLALYK NKAGAKISGR KSKSKCTGTT AFTE
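
A GC content figure such as the one reported in the Gene Information table can be estimated directly from the Gene sequence field. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the database's exact method and rounding are not stated in the record. The helper name `gc_content` is hypothetical. Whitespace is stripped because the sequence is displayed in space-separated blocks of ten.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First block of the gene sequence shown above: 4 G + 2 C out of 10 bases.
first_block = "ATGGGACCGA"
print(f"{gc_content(first_block):.0%}")  # 60%
```

Passing the full Gene sequence string to the same function gives a whole-gene value that can be compared with the reported GC content.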