Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_21122 |
Symbol | |
ID | 7204682 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 566226 |
End bp | 568207 |
Gene Length | 1982 bp |
Protein Length | 447 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185730 |
Protein GI | 219120997 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACCTCGTC GCCCCTTGTT GTTCAAGATC TGCATTCCAT TGACAGCCTT TTCAACGAAA CCGTTCGCTC GTTTGATTCC ATACGTCTTT GAATACCAAC AGAAAATATG AGGGAAGTCG TCCACATCCA AGCCGGACAA TGCGGTAACC AAATCGGTGC CAAGTTTTGG GAGGTAAGAC CCTTTGGTGT GAAGTGCATT GGAAGGCTCC TCCGTTCCGT TACGGGTTGG CTGGCGCTAT TTGATTTATG GACCAGACAG TCATTTCCTA TGTATATATG TCGTATGTGA AAAGCGAAAT TGAAAAATTG AGTTCCGGAC CATGGAAATG GTTCCACCAG TTGAGCGCTT GCGGTGTGTT TCCACGGCGA GAATACCGCA CCAGCTTCAT TTTCATACCA AATCGATGGC ACTGATACAC AGACGACGCG AAGTTGGAAA TTTGGCCGGA GAAATGTCGC GAAACACCTG TTGAAGTGCT CTTTACCCCG TCGTTTCTTA CCTTGATTGT CTACTCTCTT TGTGTTTTAG GTCATGTCTG CCGAGCACGG TATCGCAGGA GATGGTACCT ATCACGGAGA CAATCCTCTT CAACTCCAGC GCATCGATGT CTACTTCCAC GAAGGTATGG AAGGGCGTTA CGTGCCGCGC GCAGTCTTGA CGGATTTGGA ACCAGGAACC ATGGATTCGA TTCGTGGCGG ACCTTTTGGT GACCTTTTCC GCCCGGACAA CTTCGTCTTT GGACAGAGTG GAGCTGGAAA CAACTGGGCA AAGGGACATT ACACTGAGGG CGCAGAACTT GTCGATTCCG TCATGGAGGT CATCCGCAAG GAGAGTGAAT CCTGTGACGT GCTGCAAGGT TTCCAGCTTA CTCACTCGAT GGGAGGAGGT ACCGGTTCTG GAATGGGAAC TCTACTCGTC TCCAAGATCA AGGAGGAATA TCCCGATCGG ATTTTGAGCA CGTACAGTGT CGTTCCCTCG CCGAAGGTTT CCGACACGGT GGTCGAGCCG TATAACGCGA CTCTTTCCAT CCATCAGCTT GTGGAGAACG CCGACCAGTG TTTCGCTTTG GACAATGAAG CACTCTACGA CATCTGCTTC CGTACGTTGA AGCTGTCGAA CCCCAGCTAC GGAGACCTGA ACCAATTGAT TGCCAACGCT ATTACCGGTA CCACCTGCTC GCTCCGTTTC CCCGGACAGC TCAACTGTGA CCTTCGCAAA CTTGCCGTCA ACATGGTCCC CTTTCCACGT CTCCATTTCT TCCTTGTCGG ATTCGCGCCT TTGACAGCCG CGGGATCGCA AAGCTTCCGA GTTTTGACCG TCCCTGAGCT CACTCAACAA GCCTTTGATG CCAAGAACAT GATGTGTGCA GCCGACCCCC GTCACGGACG TTACCTCACC TGTGCCATGA TGTTCCGTGG TACCATGTCC AGCAAGGAAG TCGACGATCA AATGCTTCAG ATGGTCAGCA AGAATTCTTC CTACTTTGTC GAGTGGATCC CGAACAACTT GAAGGCTTCG ATTTGTGATA TTCCTCCCAA AGGACTGGCC ATGTCTAGTG TCTTCATCGG AAATTCCACT GCGATTCAGG AAGCCTGGAA GCGAGTTGCC GATCAGTTCA CAGTAATGTT CCGTCGTAAG GCTTACTTGC ACTGGTACAC TGGCGAAGGT ATGGACGAAA TGGAGTTCAC GGAGGCTGAA TCCAACTTGA ACGATCTCGT GTCCGAGTAC CAGCAATACC AGGACGCCAC TGCTGACGAA GAGGATATTT CCGGGGATTT CGAGGATGAG GGTGCCTACG GCGAATAAGC GCGTGCTCAA AGAGCTTCGC TTACGTAAAG TAGCAGTGGC TCCTTGTGAA AAAAAGTCTC GTCTTTCATG ACGCATGCGT GTGGGAAACC GGAATGGAAA ATGATGGTGT GTTTCCGTGT GCAGCAATAT CAAAATAACG GACCTACTCT ATTTTAGCGG CGGTTTATTA AC
|
Protein sequence | MREVVHIQAG QCGNQIGAKF WEVMSAEHGI AGDGTYHGDN PLQLQRIDVY FHEGMEGRYV PRAVLTDLEP GTMDSIRGGP FGDLFRPDNF VFGQSGAGNN WAKGHYTEGA ELVDSVMEVI RKESESCDVL QGFQLTHSMG GGTGSGMGTL LVSKIKEEYP DRILSTYSVV PSPKVSDTVV EPYNATLSIH QLVENADQCF ALDNEALYDI CFRTLKLSNP SYGDLNQLIA NAITGTTCSL RFPGQLNCDL RKLAVNMVPF PRLHFFLVGF APLTAAGSQS FRVLTVPELT QQAFDAKNMM CAADPRHGRY LTCAMMFRGT MSSKEVDDQM LQMVSKNSSY FVEWIPNNLK ASICDIPPKG LAMSSVFIGN STAIQEAWKR VADQFTVMFR RKAYLHWYTG EGMDEMEFTE AESNLNDLVS EYQQYQDATA DEEDISGDFE DEGAYGE
|
| |