Gene Information |
Locus tag | PHATRDRAFT_47846 |
Symbol | |
ID | 7202980 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 223564 |
End bp | 226157 |
Gene Length | 2594 bp |
Protein Length | 770 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182349 |
Protein GI | 219124099 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
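The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the gene span runs from the start codon to the stop codon. The function names `gene_length` and `coding_nucleotides` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_length_aa + 1) * 3


# Values copied from the Gene Information fields of this record.
length = gene_length(223564, 226157)
coding = coding_nucleotides(770)

print(length)           # span of the gene on the replicon
print(coding)           # coding nucleotides, stop codon included
print(length - coding)  # remainder not explained by coding sequence
```

Under these assumptions the span comes to 2594 bp, which agrees with the Gene Length field. The 770 codons plus a stop codon account for 2313 nt, and the remaining 281 bp is consistent with the gene being marked as spliced.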
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCCCG AACAACCCCG TGCCGATCCG AATCCGTTCG GAGGAATATG GAGCTGGTCC TGCGCACCCT TCGCATCACG TGCCAAGGAA TTGGTACCGA GAAGGAAAAG GACGCCTCGG CCTTCGCCAC GAAATCTGTA CCCATCGCAC CCCCAACATT GTGTGCTCAA CCGGAATTGG CGTACGTGAC CTTTCCCGTG GACATTCTTT CCACCCAAAA GCTGTTGACG ATATTGTCAT GAAAGATCGC GAATCGGACA GCGAAGGAGA AACCGCTGCC CCCGCAAGCA ACCACGCTCC CGAGGCGATA ACGGCGACGC CACCAGCGCC CTACGGCAGC GATGACGTAC CTAGTTCGTA TCAACACAGT AGTCGTCTAG CACATAGAGG CGTATCCACG CCTTCGACAT CAAGCTCCAG AGATTCGTCC AAATTTGCGA AAGAAATATC GGTGTCCCGA ACCCCGACGA ATTTCCACGA ACAAACGATT CCTATACATT CTACATCTCT CAACACTCTT GAAGACGACG ACGACAACGA TGAGACTTCG TCCACGAGAA GTTCGTCTTC TCGTCGTTAC GATTGGCGGG TCTTCCGTCC CCGAGACGGC GGATGGGTGC ACTTTCTGCG ATGGTTTGTG GTGGGCGGCG CCTCGTACGC CAAGAAAACA GACGACAAGT TACAAGAAGA TTTAGGCAAA CTCGCGCGTT GTTTAATCCT TCTCCGCGAG TATCTCTCCA CCTTTGGTAT GCCCGCCAAG GGGGGACCAA TGGATCAAGA GGTTGTCCTC CGGCAGGTAC TGCGAGATTT GTACGCTGGC GGGGCGCCGT TGTGGGCTGT CGAACCCGTC ATGCAAAAGG CAGCGGAAGG CTTAACCGGC ACTCCCGGGA TTAACTGGTT CCTTTTACCC CGCAAGGCCT TTTATAGCTC CACATCCACT ATCACATCTA CCATGTTCAC CATCGAACGG GGGTTCAACG TGCAAAAGCT GGGGGCCATG GAAAAGGTAG CCATTCGACT GGCCTCGTTC GCGAGCAACA CTAAGGGTGT AAGTAACATT CCGGAACGCT TTCCGGAACC GAGTGAACTA CAGAAGGCGG CTCGGATGGA ATCAGTGCGC TTTGACGAGT CCTTTCGAAC GGGTGGGACC GTCCTGAAGA GTCAGGATCC CAGCGTGGTG GCTAAAAAGA TACTCAGACT AGCCTCACAT GCGGAAGGAC TGTTTTACTT TGTCAATGCT CGGGAATATT CCGAAATGCC GGGGCAACGG ATGAATGATT TTTGGGTAGT AACGGAACAG GAACGAGAGC TCTTCAGCCG ATTGGCAACC ATCGAAGCGA TGCAGATGAT TAAAGAGATC GACGCCAATT GCGGAAAAGA AACGTATTCA TACTGGACAA TTTCACTTTT TCGAGTAATG GCTTCCGCTG GAGCTTGTGC CTTTTGGTTT GGTGGCTCGT GGGTAGACAT GGCCGTATCT GGAGTGTTGG CGCTGGTAGT TGCCTTCATT GGACAGTCAA AATTTCTATC ACAGCAAGAA CGACTGGTAT TTGAAATTGT TGCAAGTTTC GTAGTGGGCA TTACAGCGGG AACTATTGCT TTGAAATGGC CGGACGATAC TTGTTTTGGA GCCATGGCAA TTGCCGGTGT TCTGGACCTT CTGCAAGGTT TCCGCGTAGT CTACGCGGTC ATTGAAATCA TGAGCAAGCA AACCGTTTCT GGGGGCGCCG ATTTGATGGA GGGTATTCTC TTTACTGGAT TAATTTCGTG TAAGCAGAAA GTCGACGTGC CTGTCAGTCA 
ATCAACAACA TTGTCTCATT CTTTTTATCT GTTCTATGCA ATAGCATCCC TTCGATTCGG GCAGTATACG GCGGCATCTG TCTTCTCTGA CACAGCGGAT AATATTGGCT TTGCAGCTTG CGAGCATGGT ATCGATCAGC GCTGGTTCAT TTTAATTGTC CCAATAGCTG CAGTCTCTTG GTCGGGACTC TTCAATCCGA GGTATCACGA TCTTCCAATG GTAAGTAGTG GTAGTACATA GGCTTGGGAT TGCGCAGATT GGATGCACAG CAGCCTCTCA ATTTAAGTGA TTTTCTTTGT CGGTTTTCGT AGATGGCTTT CCATGGTTCG TTGGCATACT TGGTGAACTT TGGACTAGCC CAGTTCAACG CAGCCGATAA CTTGAATAAT TTCGTATCGT CCTTTGCGGT TTCCTTCTCC GCCGGAATTT TTTCTCGTTT TACTGGTCAT CAAGCCGTTG GAAACACAGT CGCCGGAATG TACGCTTTAG TGCCTGGAGC ATATCTCGTA ACTTCGTTGT TTTCTACCGA CACTTTGGAT ACTAGTTTCT TTGTTGAAAT CATTCAGCGT TCACTCATTA TTGGTATTGG TGCCTGGTCT GGAACAATTC TCTGTTCGGT AAGTTTTAAT ATGATTTTGT GGCGTCCCGA AGCCATAGAC GCTGACGTAT CTCTCCTGAA TACTTTATGT AGCCTGCACT CCTTGGAACG ACGATGGGTC TCATTTCGCA GCAGCACCGG GATCATAATA GGCGAGGCTC CTCGACGACA GGAAATGCCA TGCTTTTCTT CTAG
|
Protein sequence | MSPEQPRADP NPFGGIWSWS CAPFASRAKE LVPRRKRTPR PSPRNLYPSH PQHCVLNRNW PVDDIVMKDR ESDSEGETAA PASNHAPEAI TATPPAPYGS DDVPSSYQHS SRLAHRGVST PSTSSSRDSS KFAKEISVSR TPTNFHEQTI PIHSTSLNTL EDDDDNDETS STRSSSSRRY DWRVFRPRDG GWVHFLRWFV VGGASYAKKT DDKLQEDLGK LARCLILLRE YLSTFGMPAK GGPMDQEVVL RQVLRDLYAG GAPLWAVEPV MQKAAEGLTG TPGINWFLLP RKAFYSSTST ITSTMFTIER GFNVQKLGAM EKVAIRLASF ASNTKGVSNI PERFPEPSEL QKAARMESVR FDESFRTGGT VLKSQDPSVV AKKILRLASH AEGLFYFVNA REYSEMPGQR MNDFWVVTEQ ERELFSRLAT IEAMQMIKEI DANCGKETYS YWTISLFRVM ASAGACAFWF GGSWVDMAVS GVLALVVAFI GQSKFLSQQE RLVFEIVASF VVGITAGTIA LKWPDDTCFG AMAIAGVLDL LQGFRVVYAV IEIMSKQTVS GGADLMEGIL FTGLISSSLR FGQYTAASVF SDTADNIGFA ACEHGIDQRW FILIVPIAAV SWSGLFNPRY HDLPMMAFHG SLAYLVNFGL AQFNAADNLN NFVSSFAVSF SAGIFSRFTG HQAVGNTVAG MYALVPGAYL VTSLFSTDTL DTSFFVEIIQ RSLIIGIGAW SGTILCSPAL LGTTMGLISQ QHRDHNRRGS STTGNAMLFF
|
| |