Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49568 |
Symbol | |
ID | 7198234 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 70399 |
End bp | 72469 |
Gene Length | 2071 bp |
Protein Length | 605 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184298 |
Protein GI | 219128183 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.110101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTATTGTTG GAGAGCATAA AGAGGGTGTC GGTAGAGGAA CGGAACGGTA CAGTACAGAA CGGCAAGGTG CAGTAGTGTA AGATGCAGCC GTTACAGGAA GACGGCGAAC GTCCAGCACC GGATCCACTC GCGGCTCCGG ACGATACCAA CAACAACAAC AACAACAAGG ACTACGACGA CTACTCTCCC GCTCACACAC ATCTCAAAAC GTACACGGCG GGAGAAGACA AATCGCGACT CCAACAAGCG GCTCGGGAAC TCGGACTTCG TCAAGCTCGT CAGCCGCGAC GGCAAGTCCG CAAGGTCCAC GACTTTCACC AGAACCCCGT ATCCATACAG GCTTTCTGGA ATGCTCTCTG GAACTCGGCG ACGGAAGACG CGCGCATGCG AGCCGTCTAT GTCCCACCGC ACTACGATGC CCAAGGAATG CTGCTCGGGG ATTACTACCA CGACAACATC GACGACGACG ATTACTTCTT CCGACACGCC CCGTCGCAGC CCGATACGGA ACTCGGACCC TCCCGACCTC CCTTGTCGTC TTCGTCCGCC GTAGATTCAC ACGGTGAACC ACTCGGCGGG GGAAATTTGA CCGCTGCCGT CCTCGGAATC GTCAAAGGCA TGGTCGGTCC GGCGATCTTG TATCTACCCC ACGGATTCGC CACCGCCGGA TACCTAGTAG CTCTCCCTAT CGTCATGGTT TGTACCCTGC TCTTTTTGTA CTCCTCCCGC TGTCTACTGG ATGCCTGGAA GATCGAACAG GAAAAAGTAT CGCCACCGCA ATCTCGAGAA AGTGACACAT CGTCCTCCAA CGAACGTACC GCACTACTCC CACATCACCG TCCACGGCAG TTCTTGTCCT ACCCGGAACT CGCCTACCGA GCTCTCGGAC CTTCCGGAGA ACGGACCGTC CAACTCGGAA TCGCACTCAT GCAATCCGGC GTCTGTTTGA CCTACCTCAT ATTCGTACCA CAAAATCTAC ACTCCTCCTG GTTGCACCTC ACCAATCAGT CCGTCGCGCC CTCCTACTGG TTGATTGTCA TGTTGGGTGT CCAGGTACCT CTTTCCTGGA TACGGGATAT TCGCAAATTT ACACCCACCA ATTTGCTCGC CAACGGACTC ATCCTTTACG GACTCGTCAC CTGCATTGGC TTTGCTCTGG ACGAAGCCTC ACAACCCTGG CAACCGGTTG CGGCTGTTGT GGACCCGGCA GCCGTCCACG ACAGTCCCTG GGCCAATATT GGACAACATT TCGCAGCCCT GCAACCGTGG GCTGCCGACT GGTTCCTCTT TGTCGGTACC AGTGTACGTG CATATTCACG TACTCATACA TGTCGACAAC CCGGAAATTA GTCGCTGCCT TTTGTCTGTT GGTTCGCGGT GTATATCCCA ACGTACCCAC ACGTTCTTTG TTTTTAATTT CCTTTCCTCC TATACAGGTC CTGCTTTTTG AAGGATCCAT TACGCTCCTG GTGCCGCTGC AGGAAGCCGT GGATGACGAA CAGCAACGTG CCCAATTCCC GGCCGTCTAC CAACGCGTGA TTCTCAGTAT TGTTGGCTTT TACGTCGTCT TTGGCCTGAC CTGCTGGATG GCCTTTGGTC CCGACGTGCA AACCGTCTTG ACAACCTCGC TGCCCAACAC CAACCTCGCC ACGACCGTGC AACTGGCGTA CTCAGTCGCC GTCTTGCTCA CATTTCCGTT ACAAAATTTC CCTGCTCTAG AAATTGCCTG TCGCGGAATC CAATCACAAG TCCGCAAACG TACGCATCTG GCAGTTTCCC GGAACGTGAC GAGTTCTGTC CTGGTCTGTT TGCTCGGTGC CGTGGCTGTC TGGACCATGG ATGATCTCGA CAAGGTCGTA TCGCTTATGG GATCACTTTT GGGTTGTCCG ATTGCCTTTT GTTTTCCACC GCTGATTCAT TCGCGCCTCG ATCCCAATCT ATCGATCCAG CGATTGTGGG CCAACCGGAT TGTGGCCGGC TTGGGTGTCG TCGCCATGGT ACTGGCGTCC GCCGTGACAC TCATTACGTG GTAGACAGAG CATTGTCGCG CGACGCAGGC CAGTGGATTT A
|
Protein sequence | MQPLQEDGER PAPDPLAAPD DTNNNNNNKD YDDYSPAHTH LKTYTAGEDK SRLQQAAREL GLRQARQPRR QVRKVHDFHQ NPVSIQAFWN ALWNSATEDA RMRAVYVPPH YDAQGMLLGD YYHDNIDDDD YFFRHAPSQP DTELGPSRPP LSSSSAVDSH GEPLGGGNLT AAVLGIVKGM VGPAILYLPH GFATAGYLVA LPIVMVCTLL FLYSSRCLLD AWKIEQEKVS PPQSRESDTS SSNERTALLP HHRPRQFLSY PELAYRALGP SGERTVQLGI ALMQSGVCLT YLIFVPQNLH SSWLHLTNQS VAPSYWLIVM LGVQVPLSWI RDIRKFTPTN LLANGLILYG LVTCIGFALD EASQPWQPVA AVVDPAAVHD SPWANIGQHF AALQPWAADW FLFVGTSVLL FEGSITLLVP LQEAVDDEQQ RAQFPAVYQR VILSIVGFYV VFGLTCWMAF GPDVQTVLTT SLPNTNLATT VQLAYSVAVL LTFPLQNFPA LEIACRGIQS QVRKRTHLAV SRNVTSSVLV CLLGAVAVWT MDDLDKVVSL MGSLLGCPIA FCFPPLIHSR LDPNLSIQRL WANRIVAGLG VVAMVLASAV TLITW
|
| |