Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49722 |
Symbol | |
ID | 7198417 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 63994 |
End bp | 65869 |
Gene Length | 1876 bp |
Protein Length | 431 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184562 |
Protein GI | 219128736 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.703059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACGAAGCGGC CTGGAGCGCG AACACAAAAT CACACAAGTC TTCCGTGTCA GCACGGTTCG TCCTTGGTTT TTCTAGAGGC TTCCAATTGG GACCATGATC GCTGTATCTC CAGAGCCTAC GGGATCTGGA GAACCTGCAG TCTCTGCGGA ACATGCCTTA AGCGTAGTAG GACAGCTTTT GGGTGCTTCC TTCCTTCGCG AAGCCGAAAA GAACATCTGC TCGTGCTCGG GTTCCGTTGA CAGGCTCCGA GAGTGCGGCA TTCTGCCCGA CACGCCCCAA ATGTCTCACT TTAGCCTCCA ATGTCCTTCC ATATCGCTCG TTCACGAAAA GCATACGAGC GTCGATCACC GCACTCGTCA AACTGCGCAT GACTTGGCCA TTGCCCTCCT CTCGCGTCCC GTTGTCTTGC GGCGGGCCAA CTACCATTGC TCTTCCAGCA TGAAACCGCA AACCAGCCAG GGTGCTTCCA ATCCCGTCTT CCAAACGGAC TCATTGCCGC AGCTGTCGCA GCAGATTCTT GAAAACGCGT ATCAATCCTT CACCGTGCTC ATCGACAGCC GCCTGCGTGC CTACGCGAGC TTTCTGGCAC GACACGCAAT GGCCGTCGCC GATGAAAAGA CCAACGAGAT GGGCATGTTC AGCGTGGAGC AAAAGCTGGA GACACTCTTG GATGTTGGCG GTAAGATCAC CGTTTCGCGA GTCTCCACGC GCTTTGATGT GGCGGAAGTC GAGGGCGTTC AGGAAGGCGA TCACTACTCG TTTCCTCTTT CGTTTTACGT CGAAATGTCG CTGATGATCC CGCGTCCTCT GGCCACCGAT GAGATGGTCT CCGTTGCCTT TTCCGCCCCG GGAACGATTG CGGGTAAGTC GCTTTTGCCT TTTGCCCTCG AGACTTTGCG ATGGCTACCT CAACACGCAG CGTACGTACT CACCAATAGA TTTGTGCCTC GATTGTGTCA ACAGCAATGG TTGGCGAAAA GCAGATTCTT TCTCAAGTCT CGGTCTCGTT GAATGTGGAT GCACTTCTAT CGGAGATGAT GGACCGCGCG TCTTGTATTG TGGCCGCGGT AGTGGAGATC GCCAACAACG CCTTTTGCAT TCCGGAAGAG CCCAAGAGTA TCCAACGCGG TGACAGCTAT CTCGCTATGC CGCCACCACC ACCCCCTCCG CAACCCATCC CCAGACAAAA TTCGACGAAT CTGAGGGCCA AGCTCGTTAA CCCGCTTGAA CTTCTCAGCA ATGCCGCTGC CGAACTCCCG GTTGTTTCGC CAGATCTGTC CGGCCTGATG TCCCCAAGAC ACGGTGTGCC ACCTTTGACC TTGGAGATCC CTACAGTGGA TCCCTACCTG GAGGACACTG ACGATTCCGA GAAGGGTGTT TCCGAATTCT CTGCCGATCA GTGCGCAGAT ATTGTCGACG GCGTCTTTGG CGCTCTCGAC GATGCCTTTC TCAAGGAACC GCGCTACAAA AAGGCCAAGG GGCAACCCTG ACCAACGCCC TTTTACCCAC TATATACAAA GTGACACGAT GTCGCAACTT CTACAATCTC CAACCAAAAC CTCGAAAATT CCTCTGTTGC ACAGAACCGA TCATTCCACC GCTTCAAGTC TTCGCATTCT GCTGTTTCCG AGCTATACAT TTACCCGCGC ATCAATTCTC AAGTTTTGTA TCTCTGCTCG GCGCAACCTC TGTAGACATC CCAGACAGCC CTCTCATCTG ACGTACCCAT CCACTTATCT CAATATTGAA TCAATTATCA GCTATAGGGC AAAAGGCTGC CCACGTTTCC TTCTCCATGT ACTGTTTTGA CGCCTTTCGT CTGATTCCGT AGCTAAATAT CACGAAATAG AACACTTTAA CACGGACGAT GGACCT
|
Protein sequence | MIAVSPEPTG SGEPAVSAEH ALSVVGQLLG ASFLREAEKN ICSCSGSVDR LRECGILPDT PQMSHFSLQC PSISLVHEKH TSVDHRTRQT AHDLAIALLS RPVVLRRANY HCSSSMKPQT SQGASNPVFQ TDSLPQLSQQ ILENAYQSFT VLIDSRLRAY ASFLARHAMA VADEKTNEMG MFSVEQKLET LLDVGGKITV SRVSTRFDVA EVEGVQEGDH YSFPLSFYVE MSLMIPRPLA TDEMVSVAFS APGTIAAMVG EKQILSQVSV SLNVDALLSE MMDRASCIVA AVVEIANNAF CIPEEPKSIQ RGDSYLAMPP PPPPPQPIPR QNSTNLRAKL VNPLELLSNA AAELPVVSPD LSGLMSPRHG VPPLTLEIPT VDPYLEDTDD SEKGVSEFSA DQCADIVDGV FGALDDAFLK EPRYKKAKGQ P
|
| |