Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47762 |
Symbol | |
ID | 7202743 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 759177 |
End bp | 762486 |
Gene Length | 3310 bp |
Protein Length | 1014 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182129 |
Protein GI | 219123637 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTAA TTGACAACAA TAGCGAAGAC ACTGGGGGAA TGCGTTGTTG GTTGCCACGG GAAAATGGTT CTGACGACCA AGAGGCGGCG AGCCTACAGA ATTTGGATTG TCTGTGCTTG GGGACGGGAC GGTTTTTGCG TTCCGTTCTC GTGCCGGCTT TAAACTCCTT TAGTCATTCA GTTTTGGTGC AAACGCGGGG ACGCTCCTTT CTGGAATATA TGGCGACGCA GGATGGGGAT GACAACGGAA CGTTTCCGGT GGACACCGTC CTGCCATCGG GCGAAATAAA GACTGACCGA TACCGATGCT ACGGGGCATT TTCGTGGGGA CGAGTCGAAG ACAAGGCAGC TTTTTATGAC GTTTCCCGGA AGACTAGTGG ACCCTCGGTT ATCGGTGTAG GAGTAACGGA GGCGGGCTTG GCATCATCCG AGACCCAGGC TATGAAAGAC CTGTACGATT TTTTGGAGTA CTATCAAGAC ATGTGGGAGG AACGCAGCCT TTGGAAACCA GCTCTAACCC CACACAAAAA GCTTTGTGTC ATCGATATGG ACAACATTCC CCAGAATGGG GATGTCTTGG CGCGTCATAT GAATAGTTTG GCGCAGGACA ATGCAAGAAT GTTGCGTTTC TTGGCCGACA AGGTTGTATT TTTGAATACC ATGGTCGATC GGATCACATC CCATCGTGAA GGAGACCCAA TGGTTCCCAA GGCAGAACCG GTCCCGGCCA AGGCTTTGGT AATTCTTGAT TCTGAGGGGG ATCTTCCAGT AGCGTTTCAT AAAATGAAAG AATCCCACGG GGTAGTAGTA CGCTCAACGC GGGCCGAACT CGAAATTGAC TTGGCTCTAA AGTTACGGGT TGCCAATGGC ACGCACACAG CCTTAGCACA CATACTGGCT CTGACTAAAC GAACAATGAC AGATGCACTC ACTGTTGACG GAGTTGCTGG ACCGTTGCTC TTGGCATACT TAGATGCGCT TGTGGAAACA CAGATTCTAG CTGCTGGCGG GGCGTCGGGA CTGGAACCCC ACGCTACAGC CGCCTTAGAA GTATGGCAAG ACTGGCGATC AAGGCTGACG CATCCATATT TTGGTCTAAG TTCTTTTTTC ATTACTCAAA ACGGAGCAGC TAAGGGCGGA ATCCGCCTTG GACCAACTGT GCTAGATCTG GTAACAAGAA GTCAGACTAC ACAGCCGCTC AATGTCGCGA TGGCGTTTGC TTGGGCATGC TTGCTGCGCT GGTTGACGCC AGACCGCAGG AGAGATAGTG AGGATGAGAA AAGTAGTCGC TATTCATTGA CGGAAGAAAT GACGTTTACA ACCGCTAAAG GTGTCTATAC AGGTTGGTTA CAAGGGTCAG AACTCAATAA CACGGAAGAC GCAACTACGA CATACGCTGA TGGATTGCAC TACAATCTGA GTCAAGATTG GTATGAATTT CGGTGCTCCT GCAAAGTGCC AGTAGGCAGC AGAACTCAAC TGCAAAAACC ATTGTCAGAT GTTTTGGGTG CTTTAGTTTG TAGTGGTCCG CGGCAGCCGG TAGCATACCA TGGAATAGTC CGGTCGTACC TCTTGGCAAC CGACGGCGGA AATTTAAACG CGATTGCCGA CAAGCGGGCC ATGAATGACC TCGTGGCTGG AGTGTCCACT CTATACGCTC GCATGATTGT CGGGGACGAC ATTTTGAGTA TTCTGAAAGA AATCGGGGAC AACGACGGCG CCTTTATTGA TGGTTTCGCC ACAGCGTGTA CATCTATGGC AGATGTGTCT TGTTTGAGTC AGGGTTGTCC TTTAGCATTT CGACGTAGTC CTGTCCCGAA TCACAGCCGA CTACTGTTGT TGTCTATCCA CAAAGATACG ATCGATACAG TTGTAACTTC TGAAGTAGCC TCCGCTATCG CCATTGATTT GCATACTCAC TTGCTGCCAC CCTCACACGG CCCGCTCTGC TTGTGGGGTA TTGATGAGCT ATTGACTTAT GTATGTGTAG CAGTGGGCGA TCCGTTACTT ATAAGAAAGT ATCCAGAAAC ATTCTTTCAC ACTTTTTCTT TGTTCTTGCG CAGCATTATT TAGTGGCGGA GTTCTTTATA ACTGCTCCGG CATCGATGAC ACCAGACGGC TTCTATGCTT TGCCAAAGAA ACAGCAAGCG GATACAATTT GGCGGGCACT TTTTGTGGAG AGATCACCGC TCTCGGAGGC ATGTCGCGGA GTCATTACAG TTTTGGTGTC TCTCGGATTA GAGAACGCAC TAGCGGACCG CGACTTAAAC TTGATTCGTA AATTTTACAA AGGCTTCCGG GACGAAGGCC TAACCGGAGC AGAGAAGTTC AGTTCTTTAG TTTTCAGCAA ATCGGGTGTC CGGTACAACA TTATGACAAA CATTCCCTTT GATCCCAACG AAGAGAGGTA CTGGCGTCCT AAACCAAAAG ATTATTCGGA CAATTATCGC TCTGCTTTAC GTGTAGATCC CCTCCTGACT GGTGATTGCC GAACGATTGA ATTGGCTTTG AAGGGCTCGG GATACGACAA TACTATCGAG GGGGCGCGTC AGTACTTACG AGACTGGTGC GATACAATGA GTCCGGAATA CATGATGGCG TCGACGCCGC ATGACTTTCT GCTGGAAAAA GGCACCTTGG GTTCCTCGAC ATCTACCGGT ATTAATGAAG AGGCATTGAA ACTCCCGGGC GCTTTCGCTC AGCTCAAGAA CCAAGAAATT AGCTGCAATA GCACAGAAGA TGACAGTCCG AGTGTTATTA ATGAGAACAG TGATTTCTTG GGCAACGTGT TGATGAAAAT CTGCGAGGAG CGCGATTTAC CTGTGGCCCT AAAGATTGGA GCGCATCGGA GAGTTAACCC AGCCTTAAAG CAAGCAGGCG ATGGTATGGT TGCGTTTGCT GATGCTGGTA TGCTTGGGCG GCTGTGTTCC AGGTTTCCAA AAGTTCGCTT TCTCGCAACC TTTTTATCTC GTAACAACCA ACACGAGGCT TGTGTCTTGG CGTCCAAGTT TCGCAATTTG CACATTTACG GATGTTGGTG GTTCTGCAAC AACCCAAGCA TTATTCGAGA GATTACCCAA ATGCGAATTG AAATGTTAGG TACTGCCTTT ACGGCGCAAC ACAGCGATGC CAGGGTTCTT GATCAGCTGT TATACAAATG GCCCCACTCG CGAGCCGTAA TTGCGGCAGT TCTGAAGGAT GAAATGGCCA AGATGGTAGC TTCGGGGTGG ACTCCTACCC GTGCCGAAAT TAGACGAGAC GTCGCCCGTC TTTTCGGGGC TAGCTATGAG GAGTTCATGC GCAAGTCTCT
|
Protein sequence | MTVIDNNSED TGGMRCWLPR ENGSDDQEAA SLQNLDCLCL GTGRFLRSVL VPALNSFSHS VLVQTRGRSF LEYMATQDGD DNGTFPVDTV LPSGEIKTDR YRCYGAFSWG RVEDKAAFYD VSRKTSGPSV IGVGVTEAGL ASSETQAMKD LYDFLEYYQD MWEERSLWKP ALTPHKKLCV IDMDNIPQNG DVLARHMNSL AQDNARMLRF LADKVVFLNT MVDRITSHRE GDPMVPKAEP VPAKALVILD SEGDLPVAFH KMKESHGVVV RSTRAELEID LALKLRVANG THTALAHILA LTKRTMTDAL TVDGVAGPLL LAYLDALVET QILAAGGASG LEPHATAALE VWQDWRSRLT HPYFGLSSFF ITQNGAAKGG IRLGPTVLDL VTRSQTTQPL NVAMAFAWAC LLRWLTPDRR RDSEDEKSSR YSLTEEMTFT TAKGVYTGWL QGSELNNTED ATTTYADGLH YNLSQDWYEF RCSCKVPVGS RTQLQKPLSD VLGALVCSGP RQPVAYHGIV RSYLLATDGG NLNAIADKRA MNDLVAGVST LYARMIVGDD ILSILKEIGD NDGAFIDGFA TACTSMADVS CLSQGCPLAF RRSPVPNHSR LLLLSIHKDT IDTVVTSEVA SAIAIDLHTH LLPPSHGPLC LWGIDELLTY HYLVAEFFIT APASMTPDGF YALPKKQQAD TIWRALFVER SPLSEACRGV ITVLVSLGLE NALADRDLNL IRKFYKGFRD EGLTGAEKFS SLVFSKSGVR YNIMTNIPFD PNEERYWRPK PKDYSDNYRS ALRVDPLLTG DCRTIELALK GSGYDNTIEG ARQYLRDWCD TMSPEYMMAS TPHDFLLEKG TLGSSTSTGI NEEALKLPGA FAQLKNQEIS CNSTEDDSPS VINENSDFLG NVLMKICEER DLPVALKIGA HRRVNPALKQ AGDGMVAFAD AGMLGRLCSR FPKVRFLATF LSRNNQHEAC VLASKFRNLH IYGCWWFCNN PSIIREITQM RIEICYTNGP TREP
|
| |