Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20082 |
Symbol | |
ID | 7200403 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 792054 |
End bp | 794546 |
Gene Length | 2493 bp |
Protein Length | 716 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179716 |
Protein GI | 219117858 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACGAACGCGT CCTACCTAAT CGGAACCGGT ATTTACGATG TGACTGCGGC GTGTGCCGAA ATTAATTTCA TGGGCTACGC TCGTGCAAAT CAGAACGGCC ACGGTATTCA TCAAAGGCTG CGTGCACGGG CCTTTGTGAT GTCCGAACCG TACGGTGCTC CCTCCGCGGT ACACGATGCC TCGAGGCCGC ACTCCCACGA AATCGTTGAA CGACTCGGGA GCCGGTCTCG GCCTGACCAC TATCCTCGCC ATATCTCGAG GCAGGGCCGA GTCGAAGACC AGGCGAACAA AGAGGCTCTC TCGGTCCTAG CGGATCCAGC CAGAACAGTA TGCTTTGTGA GCGTAGATAT TGGCATGGGC TCGGATTTGC TCACACAGCG AGTTATAGCA CGGCTCGAAG AGCTATTGCC AATACAGGAT GGCTTTGACA AGCGTCTTTG TCATCTCGAC AATCTGAGTA TAAGTGGTAC GCATACCCAC TCGGCACCAG GGGGTTTCCT TCAGTATGCA CTGTATCAGA TAACTAGTCT AGGATTTTTC GAAGAGGTAT TGGAAGTTTA TGTTGAAGGC GTTGCGCAGG CCATTCGACG TGCCTACGAC AACTTGCAAG TGGGCTCAAT AGCGGTTGCT CAGGAGCGTC TACAAGGAGC CAGTATCAAC CGGTCGCCAT CCAGCTATCT CCTGAATCCG GTTGAGGAGC GCGATCTGTA CGTCGAGGAT GGAGACACCG ACAAACGTAT GCTGCAACTC AACTTTCTCA ACGCAAATGA AAAACCTATC GGGGCACTTA ACTGGTTTGC GGTACATGGA ACTAGTATGA ATTCAAGCAA CCGACTAATT ACTGGGGATA ATAAAGGCTA TGCTTCGTAC TTGATGGAGA AGCACTTCAA CGAAAACGGA ACGTTACCGG GAAAGGGACA ATTTGTTGCC GCGTTTGCTT CGACAAACCT TGGCGATGTC TCTCCGAACA CGGCTGGCCC GAGATGCATT GACACTGGCC TTCCATGTGA CTATTATACT TCTACGTGTA ACGGCAGAAC CGAGCTTTGT ATTGCTTTTG GCCCTGGCAA GAACATGATT GAATCGATGG AAATAATCGG ACGCAAGCAA TACGTTCTTG CTTCTGCGTT ACTTGGTACC TCGAACGTTA AAAAGCTGAA GGGACGTGTT GCCTCTCGGC ATTCGTTTAT CAACATGGCC AATCTGACCG TCAGAATGAA CAACACAACC TTTGCTCGGA CGTGCCCCGC TGCCTTGGGC TATTCCTTCG CCGCGGGGAC GACTGACGGC CCGGGCGATT TCGACTTTAC CCAAGGAACC AACACGTCGA ACTGCATATG GGACATCATC GGCGGGTTCC TATCCACGCC ATCCACCGAG CAAATACAAT GTCACGCTCC AAAGCCAATC TTGTTGAACA CTGGGGAGGC GTCGCTTCCT TACGCCTGGG ACCCCAATAT TGTACCGATT TCTGTCTTTC GGATAGGAAG CCTTTTCATT CTCAATGTTC CAGGTGAACT TACTACCATG GCTGGTCGCC GGCTGCGTAA AGCCGTTTAC GAAGTAGTCA GGTCAAATGG CGTTGCGGAT CCGATAATCG CCATTGCGGG GCTGGCCAAT TCGTATACAC ATTATGTAAC GACCTTCGAA GAATACAGTG GCCAGAGATA CGAAGCAGCG AGTACTCTAT ACGGACCGCA TACTTTAAAT GGCTACATAC AAGAATTTCG ACGCATAACA TTGGATCTGC TAATAAATAG AGCGTCTGCT TCAACCAAGG CCCCAACCGA TCTCACTCGA AAGCAAATTA CCGTCATACC TCCAGTTGAA CTGGATACAA TCGGTCTGGG TCGGAAGTTT GGCTCAGTAG CTGTGGACAG CAAAGATCAG TACATTCGTG GAAATGACAC CGTGGTTGTG TCTTTTCGAT CTGCCAATCC TCGAAATAAC CCACGGATCG AAGGTACTTT TCTTTCCATC GATTACTTGG ACAATGATGG GAACTGGCAA ATGCAATATA ATGACGGAGA CTGGTGCACA AGGTTTATCT GGAAAGGCGG TATAGTGCGA CTTGGATCAT CGTTTGCAGA AATACATTGG AAAATACCAA GCGATACAAT GCGAGGCATT TATCGCGTTT GCCACTATGG CACGCGGAAA AGCTTGCTAG GCTCTGCTGA GAGCGCTATA TACTATGCAC CTGAATGGAT CATTTCAAAT CTCCTTGGCT CTATCACTGC CAACATGATT TTGCAGAGCG TGAAGCTAGC AATTGCAGTG TCCGACCAAA TTCAGCGATT CACTGCAGGT AGTTTGGGCC ATTCACGGTA CAAGGAGTTC TATGGTTGTA CCAGAGCTTT TCTTGTGCAC GATCACGCAA ACTAATAATG AACATAATTA GTGAGGTATT CCACTGATCC TGTGAAGCCA TCTTCTCTCT AGCTACAAAA ATAGTAGAGA ATACCAGAAA TAATTGAGAA TATATGTTAA CAT
|
Protein sequence | MGYARANQNG HGIHQRLRAR AFVMSEPYAR TVCFVSVDIG MGSDLLTQRV IARLEELLPI QDGFDKRLCH LDNLSISGTH THSAPGGFLQ YALYQITSLG FFEEVLEVYV EGVAQAIRRA YDNLQVGSIA VAQERLQGAS INRSPSSYLL NPVEERDLYV EDGDTDKRML QLNFLNANEK PIGALNWFAV HGTSMNSSNR LITGDNKGYA SYLMEKHFNE NGTLPGKGQF VAAFASTNLG DVSPNTAGPR CIDTGLPCDY YTSTCNGRTE LCIAFGPGKN MIESMEIIGR KQYVLASALL GTSNVKKLKG RVASRHSFIN MANLTVRMNN TTFARTCPAA LGYSFAAGTT DGPGDFDFTQ GTNTSNCIWD IIGGFLSTPS TEQIQCHAPK PILLNTGEAS LPYAWDPNIV PISVFRIGSL FILNVPGELT TMAGRRLRKA VYEVVRSNGV ADPIIAIAGL ANSYTHYVTT FEEYSGQRYE AASTLYGPHT LNGYIQEFRR ITLDLLINRA SASTKAPTDL TRKQITVIPP VELDTIGLGR KFGSVAVDSK DQYIRGNDTV VVSFRSANPR NNPRIEGTFL SIDYLDNDGN WQMQYNDGDW CTRFIWKGGI VRLGSSFAEI HWKIPSDTMR GIYRVCHYGT RKSLLGSAES AIYYAPEWII SNLLGSITAN MILQSVKLAI AVSDQIQRFT AGSLGHSRYK EFYGCTRAFL VHDHAN
|
| |