Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48414 |
Symbol | |
ID | 7203666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 387102 |
End bp | 389695 |
Gene Length | 2594 bp |
Protein Length | 593 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182825 |
Protein GI | 219125098 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0500715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAGTG ATGGAGACGA CGAGAGCTTT GCCACGGCGG TGGCTGCCCT CTATAGTCCG CTGCATCAAT CGGTATCGCC GGAAAGCATT GCGAGGGCTG CGGCCCGGCG GGTGGGCACC GTCACCGACA TGCGGGTGTA CCTACAACGC GCTGAACTGC ACACGATTGC TCGTAACAGC GACTCAGTAA CGAGTTCGAA CGTGCCCAGG ACTCATCCTC GGATCATTCA CGTAACGGGC ACGAAAGGCA AAGGATCCGT CTGCTGTTTA TGTGAATGCA TCTTGCGCGA GCGTTTCGGA CTCCGGACGG GTTTGTTCAC GAGTCCTCAT TTGGTTGACG TTCGTGAACG AATTCGGGTG CAAGGAAAAC CAGTCTCCCC TGATGTATTT GCACACGCGT ACTGGAGGAT CCGTTCTAGG CTCGAAGCGT TTCGAAACAC GAACATAGGA GGCGACGGCG TCGGCGATGA TGAGGATCTA CCGACACTAC CGGGCTACTT TCGCATGCTT ACTCTCATGG CCTTGTACGT CTTTTACCAT TACGAACCAC GCATCGATGT TATAGTACTC GAGGTAGGTA TGGGCGGGCG ATATGATGCA ACAAACATCT TGGACTTGGA TCTATACGAT GTACGTTGTG GAATCACCCA GCTGGATTAC GATCATGTAC GCGTTTTAGG CAACACCTTG GAAGAAATTG CCTGGGAAAA GGGCGGGATA TTCAAAGTTC ACAAAGCTGA CAATGCAAAC GTGACGCTCA AACCGCACTC GGCGGAGGCA GCGACGAAGC AAGAGATTAT CGATTTTTCT CTTGCAACTA CTGATCATGC TTCCGAGGAT GACTCAGGCA ACACTCGCGG GCACATCTTC TTCGCGCTTG ATACAAATTC GCCCTCAGCT TTGCAAGTTC TACATGATTG TGCCCGCATT GAAGGTCGTG GTGTTACCTT GCAACTCGTG GGCGTAGCTG ACGCAGAGTC GTCGCACCAC CAAAAGCGAA GTGGGTACAC AATGCTACCC GACTCTTGCT TGATTGGACT ACCGGGTTCC CATCAGCGAC TCAACGCCCA ATTGGCCATT ACATTGTGTG AATCCCTGTC CTTGTCCGGG AAGGACAAAG CAAAATCAAT CCAGGATCGG GGTGTTGATG TCATGTTTGA TGCCTTATCA CGGGCTTCCT GGCCGGGTCG TTGCCAAACT GTCGAGGTTC CGGAACAAAG TATGACGCTA CGTTTGGACG GTGCGCACAC GGTGCAGTCG ATAAAAGTTG GGTACAATTG GTTCAAGTTG CAACAGAAGG ATTGCAACAA CAGTAGAAAT ATTCTAATTT TCAATTGCAG TCACGAGCGC GACCCCGTGG AATTGCTGGA TCTACTGGAT TGCGAGATGT TCACAGCCGC TTTTTTTTGC CGGGCCGACT CGGAACGTCC CAGTGCCGTA GTCAAAGCAT CAGCCCAGGA CTTGTTGCAG CGATCGGGGC GGGTCGTTCG TCAAGATCTG CTGCCAAATA CACAGGCAAC TTGGCAAGAG ACGCTGGAAT CGCTTTGGAA CCATCGATGC GAACAAACAC CCCACCGCAG CGAAATCTCC ATGACCACTG CTAGCAATCT AAGTGTCAGT CAAGCTTTGA AGCAAATCCG GAAATCGCAG TATGCCGGGG AACATAGCGA AGTGCTTGTT ACAGGGTCGC TGTACCTCGT TGGTTCTGTA CTCAATGCCG TCAAATGGAG CGAACGTGAA GCGGATGGAA AGTTGCAGGA CCTGATAACA TGTTGCTTAT AACGACACGG GTCCATAGTG GAGTGTACAC AGGTGGTTTT GAGTTTGAGA GGCTAAAGTT TGTCGGTGTT TTATTCCGCC GCTTTTGTAG GATCGTCTTC TCGAAGGCAC AAAAAATGTA CATGGTCTAT GTCGGGCAAT GATTTCAGGT GGGGAGGATT GATCCAATGT AAAAAATTCT CCACGTTGCC CAATTGCTGC TGGATCTGAT TGCGTGCATC GTCTATATCT TCCGTTGTAC AGAATCCGCC AATCTTCCAC AGAATCCAAT GTTCTATATT ACCGGCTAGA TAATACGGGA AGTCGTTCTG TACAATACAC TTTCGAGAGC CTTCGAGCTC CGAAATTGGA GGATAGGCGT AATATTTACC GTCATCGCCT TTTCGCTGTT CCAACTCAAA CTTGGAACAC AAAATGTGGT CATAGACGGA TTTCCACTGA TGCTTTAAGT CTCTGATATA AACTTGGTAG TCTCTTTCCT GCTCTACGCT TCTAGAGAGA CGGGCCAAGT TCTGGTCCAC CTTAACAATA TACGTAAGCT CGTTCCATTC GAACGGTACA GTTCGATATC CGAATTTTCG TACAGGCATA TCGATCCGTG TTTTTGTTGT GTCTTCGATC CATAAACCTG TTTCGTCACT GCGTTTCAAA TCTTTGGCGC AATCTGGGGA GGTTTTCAAT GAAAGTGATT CGGCGACAGT GAGCGACGAT CGGACGAAGC CAGTCACTCG TTTGGATAAG TAGAAACTTA TCATGAAAAG AGAAGCATAT GCCGGCCGCA AGTATCTTCA TGATTCGTAA AACACATTTG CTTT
|
Protein sequence | MTSDGDDESF ATAVAALYSP LHQSVSPESI ARAAARRVGT VTDMRVYLQR AELHTIARNS DSVTSSNVPR THPRIIHVTG TKGKGSVCCL CECILRERFG LRTGLFTSPH LVDVRERIRV QGKPVSPDVF AHAYWRIRSR LEAFRNTNIG GDGVGDDEDL PTLPGYFRML TLMALYVFYH YEPRIDVIVL EVGMGGRYDA TNILDLDLYD VRCGITQLDY DHVRVLGNTL EEIAWEKGGI FKVHKADNAN VTLKPHSAEA ATKQEIIDFS LATTDHASED DSGNTRGHIF FALDTNSPSA LQVLHDCARI EGRGVTLQLV GVADAESSHH QKRSGYTMLP DSCLIGLPGS HQRLNAQLAI TLCESLSLSG KDKAKSIQDR GVDVMFDALS RASWPGRCQT VEVPEQSMTL RLDGAHTVQS IKVGYNWFKL QQKDCNNSRN ILIFNCSHER DPVELLDLLD CEMFTAAFFC RADSERPSAV VKASAQDLLQ RSGRVVRQDL LPNTQATWQE TLESLWNHRC EQTPHRSEIS MTTASNLSVS QALKQIRKSQ YAGEHSEVLV TGSLYLVGSV LNAVKWSERE ADGKLQDLIT CCL
|
| |