Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50045 |
Symbol | |
ID | 7198740 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 238023 |
End bp | 240772 |
Gene Length | 2750 bp |
Protein Length | 696 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184843 |
Protein GI | 219129328 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.361429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACTCTACTC GAATCGAGAG TAACAATAAA GCTTCCAATC CAATACTGCG GAAGGTTAAC AACTCGCAGT GTTTTTCTTT CTCTTTCAGC AATACACGGC TAATTCATTG CAGTAGACTG CGTTCCAGAG TATGAACGAA GACAGTACCG AAGACCCGGA TGATGGCAGT GTGGCGCTCT GTGCATCATG CGATCGTTGC CGCTCGCGAA AGACCAAGTG TGACGGCCAG CGCCCCTGTG GAAACTGCTT GGCCAAGTAC ATGAAGAAGA ATAAACTCAG TAGGTAAGTG GCAGCGAGGT TGCTAACGAC GGTGACCGTA GAAACATCTG TGGTGGTATC TGACGAGACC GAAGTGGTTT TGACCAAAAA GTGAAAAGGC GAAAAGCGCC GAAACAACGG GGAGGTTTTA TAGATATTTG AATCTCGTGA AACTATCAAG GGGAAAAATC GATGGTCGTA CGATATACGG TTGTAGTGCC GGGCTTGGGG TGGTACGATT GTGTAATACT CATAGTCCGT CGAATACATC GGAACATCAA TCGGGTTGGG CTCCTTTCAT TGATTGGCAG TGCGACGGAT TCTAGCTCTA CCCATGCATC GCTTTGCCAT CTTACACCTA TCCTTCTTTT GTACATCAAA CAGCGCGGAT GGAATCGATT TTACCGAGTG TGAGTGTGTC TATTCACCCG CTAAGCGTCG TGGCCCTATT CCGGGTCGTA CCGCTGGCCA AGCTCGGAAG GCCACCGAGC TGCAACATCA CCAACAGCAG CAGCCGAATG ATTGGCCTCA AAATTATCAC AACAACCCTT CGACTGGGGT GAACTTGAAC GGGACAGGAT TGGACGCCCA AATGACTGCT GCTTTATTTT CCGGCCAAAC CGAACAGGCG TCGTTGCAGC AAAAACTGAA CTTTTTGCAG TCACTGCAAA ATCAAGACGA AGATCATCTC ATGATGCAAC AGCAGCAGCA GCAGCATCAG ATGGACGAGC CTGCCAATCG ACGAGTGAAA CGTGAAGATG CTGGACAGAA TACCAGCACG AACGGGATTC CTCGCACTAT CACCACCCAC ACGCACCTTT TGGAACGCTC CAATCCAGAT GGAGCCCGTC TTCGTGCGTA CTACCAGCTA TCGATCGACG AACTCTATCG TTTGCCTCCG ATACCGACGG ACGAAGAATA CTGTGCCCGC CTTAACGTTC CGGGGATGAC GCCTCAAATG ATCCCAGGTC CACATCTGGC CGCCCTGAGT GCCGCACGCT TTGCTGAGAT CGCGCTCGGC GCACTTGTTC ACAACGAAGT GTCGTTAGCG ATGGAATTGT GTAATGCAGT TGTTCACTGC TTGCGGGAAT CCGTACAGGA ACCCGTGCAG ACACCAGTGA TGTTCGAAGT TGCCAAGGCG TACTTTTTGC TCGGCGTTTT CCGTGCCTGT CGCGGAGACA TGGAACGGTA TTTCAAATAT CGCCGGGTCT GTATGACGTA TTTGGCGAAG CTGGAGGTAA GTGACAGTAC ATTTCACTTC CAAAGATATT ATATAATGTG CTCACGGGGT ATATGTATGA TTTACTCCCT GCACAAAGAA CGATGATAAA ACGGCGGTGC TCCTTGCCGC AGTGGCCTAC TTGGACTCTT GGGCGTACAT GACTTATAAC GCCGACGACA AATTGGTGCC GGCCGTTGAT GATCAGCTTC CACGCGTGAT CGTCTATTAG TCGCAGTCCG TACGCTACTC AGACTGAGCT CAAGTATGAT GTCAAGTTGG ATGCTGGTGC CATTGCTAGC GATCCCAAGA ATCAAAACTG GATTCAAGGT GCTCCGCCGG TGTACCTGAA TAATGAGGCC CCGTTGCATG CACGGGCTTT GGATGCTTTG GCTTGTGCCG TTCGCACTTG TTGCGATCAA GCCAACAGCC GTTTCGCTCT TATTAGCAAG GAGGCTAATA TCGAAGGTCT GGACACGATT CCTTCCGAAT CCATTTCTTC TGCAACGTAC AATGCAGTTC TATCGCACGA GAATGAGCTC TGCAGTCGCA ATATTGTTCT TTCAGCGTAC ACTCTGATGC AACAGCACGA ATCTACTGAC AGTTCTCGAC ACAAAAACGA GGGACAGCAC ATGGTCATTT CTGCGATGGA CGCGTTTCTG GAAAATAGTG ACGAAGATGG CAATGGTGGA TTCACCGACA GTCAGATTCA GAGTTTGCTT TCTGTTTGTA ACACTGCGAT TGAGAATCCG TTCCTCTTGC ACCATGCTGG TCCAACATAT CACATGGTGT CCAACGCGGC CGTACTATTG TGTCATTTAT TAAACGGCCT TCATATGGCC AAGATGAACG GTCAAGATTT CGGTCGGATG GAACAGTCCA TGTTTGAAGA AGTCTTTGAC GCTTTTATAT CGATTCGCAA ACTCTTGACG ATTCATCGAC GTAAACTACC GGTCAAACTG CGTTGCCATG CTATTCCGAG ACCAAGCATG GACGGTTTAA AGGAAGGGCA GCCGTTAATT GATTTGGGGG AAACAATTCT TTGTGCGTGC CGTGGATGCC AGGGTTTTGT CCTTATGGCT TGCAGTCCCT GTGTAGCGGC GGAGCGTGCC CAGGCGGCGC AACATGATTT GTCAGTCGAA GCGGCGAAGG AAGCCGAAGC GATTGAAATG GGCGAGCTCG ACAACGAATT GGACAACTTG GGAGCGGAAT TTGATATGGA CGACGATATG TTGTTGGGAA TGATTAGCAA TCTCATTTCA AGTTGAAAGG
|
Protein sequence | MNEDSTEDPD DGSVALCASC DRCRSRKTKC DGQRPCGNCL AKYMKKNKLS SADGIDFTEC ECVYSPAKRR GPIPGRTAGQ ARKATELQHH QQQQPNDWPQ NYHNNPSTGV NLNGTGLDAQ MTAALFSGQT EQASLQQKLN FLQSLQNQDE DHLMMQQQQQ QHQMDEPANR RVKREDAGQN TSTNGIPRTI TTHTHLLERS NPDGARLRAY YQLSIDELYR LPPIPTDEEY CARLNVPGMT PQMIPGPHLA ALSAARFAEI ALGALVHNEV SLAMELCNAV VHCLRESVQE PVQTPVMFEV AKAYFLLGVF RACRGDMERY FKYRRVCMTY LAKLENDDKT AVLLAAVAYL DSWAPYATQT ELKYDVKLDA GAIASDPKNQ NWIQGAPPVY LNNEAPLHAR ALDALACAVR TCCDQANSRF ALISKEANIE GLDTIPSESI SSATYNAVLS HENELCSRNI VLSAYTLMQQ HESTDSSRHK NEGQHMVISA MDAFLENSDE DGNGGFTDSQ IQSLLSVCNT AIENPFLLHH AGPTYHMVSN AAVLLCHLLN GLHMAKMNGQ DFGRMEQSMF EEVFDAFISI RKLLTIHRRK LPVKLRCHAI PRPSMDGLKE GQPLIDLGET ILCACRGCQG FVLMACSPCV AAERAQAAQH DLSVEAAKEA EAIEMGELDN ELDNLGAEFD MDDDMLLGMI SNLISS
|
| |