Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50961 |
Symbol | |
ID | 7201619 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 70682 |
End bp | 72709 |
Gene Length | 2028 bp |
Protein Length | 357 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180748 |
Protein GI | 219119999 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0280626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTCG GAAATGTGAA ACCATGGAAA GCTCTGTCCG TCTGTCCCGA TGACAACATA CAAGACTAAA ATTCTATACC GCGAGGTGCA AGGGACGACA TGACGCGCGT CTCAACGATG CCTGGGTTGA CATGATGTCT GTCATCCGCC ACAGATGACA GACATATTTG ACTAACAGTA ATGTAAATAC ACGAAGGACC GTGACACAAG ATTTTCATTT GGTTCGCCGC TGATTCCATC ACGTGACCGT AGCTGTTTAT AGCAACGTCT ACTTCTTTGA TCGGGATTTC CTCTCGCTAG CATCGTTTCC ATTGCGAACC ACAATGTTGC AAACTGGGTC TCGGTCGGCT CTACGATGCC CGAAGGCGAC CCTACGTTAC TTTTCTGGCG GCGCAGTGAA ACTGCATTTG CCTGCTCTCG GCGCTACCGT ATCGATGGCG GTCGGTCACG GCAATAGTGT GAACCCGAAC ACGATTAACT ATAGAGAACG ACACCAGTTT CCGGTGAAAT CTTGGAGGTA CGCTTCTACT ACAGCTGCTC CATTATCCAT TACGGACACC GAGCGAGAAA TGATAACCCA CTATGCGCAC GAACCGCAAA CGTCCGTTTC GCTACAAGCT TTGATGCGTA CCGGTCGCGG CGAGTACTTG CACAAGACCT TTGCGGAAGA GAAGCTTGAG CGCCACGCAG CGACGGAACT TGTGTTGATC CAAGTAGCCG GATTCTTACG TCGCGAGCTG CCCATCCGAC TCGCTCATCG TATCCGCGAT CTGGAAGGCA TACCACTACT CAAGGACATG GCCAGTATCC AGTTGGTAAG GGACTTGTAC GTGAAGAGTT TTCTTGAATT GCTCAGCTTC GACAAATTGA TACAATCAGC CGAACAGGAA GAAGGGTTCG CTGCCTTGAT TGAAAATATC TACGATCGGC ATTCCAAAGT TCTGGTTCAA ATGGCCCAAG GTGCGTACGA GTTCCGCAGC GCTGTTCGCC AAGAGAAGGG GGCGGACGGG TTTGAATTAC AGGAAGAAAC GCACCGCTTC TTGGATCGTT TTTATTTGGA TCGCATCGGT ATTCGGGTGT TGATTGGCCA GTATCTGGCC TTGCGGCAAC CGCCGGTGGA AAACTACGTC GGTATCATAT GCTCCCATAC TTCGCCTTAC GAAATTGTCA AGCGTGCCAT TGATGATGCG GCTTTTATGT GCACACGCAA ATATGGTGAC GCTCCGGAAG TCATTATGAG TGGAAGACTA GATTTGACCT TTCCTTATGT ACCAACTCAT TTGCGTAAGT AGAAGGAGAG TGGTAGGGAT GAGGATGGGG ACAGGACTTT TGGTTATCCG CGAAATGTGA CAGGATACGG TTCATTTTTT CATCCGCCTA ACGTAAATTC GCTGTGTTGT AGACTATATC ATGTTGGAAT TGATTAAAAA CAGTATGCGT GCCACTGTGG AATGGCATGG TATTGATTCG CCGGAATTTC CGCCTATCAA GGTAATTATT GCCGATGGTG CCGACAACGA GGACGTAGTG ATCAAGGTTA GTGACGAAGG TGGCGGAATT CCCCGATCCA ATATGGGGAA GATTTGGTCC TATCTTTTCA CGACCGCGGA TCCAGCCATT CAAGCGGGTA TGGTCGGAAC CGCTGGTGCC AAAGGACAGG GCCAAGATCA CGGAATTGAC TCCCCTTTGG CTGGGCTGGG CTACGGGTTG CCGATTTCGA GATCGTACTG TCGATACTTT GGTGGCGATT TAAGCATCAT GTCAATGGAA GGATTCGGGA CCGACGCTTT CGTGTACTTG ACGCGATTGG GCAATACAAG TGAACCCGTG CCAATTTAGA CAAGAAATAT ACAGTCAGCA TTGCAATTTC ATCGGGAGAG AAAGTGGCTG AAGTAACAGC TGTCATAGAG TCAGCTCGTT TTGCTTTCGC CTATGTTTCT AGCAAAAAGA ACGCACAAAA ACAATTTACG TTCCAGCGAG AATTCTTTTA CTTTGCTGTA AAAAGTAAAA AGAACTTTTG TGCTTGGT
|
Protein sequence | MEFGNVKPWK ALSVSTELVL IQVAGFLRRE LPIRLAHRIR DLEGIPLLKD MASIQLVRDL YVKSFLELLS FDKLIQSAEQ EEGFAALIEN IYDRHSKVLV QMAQGAYEFR SAVRQEKGAD GFELQEETHR FLDRFYLDRI GIRVLIGQYL ALRQPPVENY VGIICSHTSP YEIVKRAIDD AAFMCTRKYG DAPEVIMSGR LDLTFPYVPT HLHYIMLELI KNSMRATVEW HGIDSPEFPP IKVIIADGAD NEDVVIKVSD EGGGIPRSNM GKIWSYLFTT ADPAIQAGMV GTAGAKGQGQ DHGIDSPLAG LGYGLPISRS YCRYFGGDLS IMSMEGFGTD AFVYLTRLGN TSEPVPI
|
| |