Gene Information |
Locus tag | PHATRDRAFT_49883 |
Symbol | |
ID | 7198596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 184820 |
End bp | 186941 |
Gene Length | 2122 bp |
Protein Length | 554 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184667 |
Protein GI | 219128958 |
COG category | |
COG ID | |
TIGRFAM ID | |
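The coordinate and length fields above can be cross-checked with a short calculation. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding region ends in a single stop codon. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values taken from the record above.
gene_len = span_length(184820, 186941)   # matches the listed Gene Length
cds_len = coding_length(554)             # coding nucleotides for 554 aa plus stop
print(gene_len, cds_len, gene_len - cds_len)
```

Under these assumptions the span length agrees with the listed 2122 bp, and the remaining bases would lie outside the coding region of the listed gene sequence.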
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0580309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CAGTGACCCT CCCTTGTGCT TTTTTGTTCG TTCAATTCAC TGTTACAAAG TTCAAGTCTT TCGAAGGCTT TCCGCTCATT GCGTGTTCTT ATTGCCCTCA TTCATTATGC TTGCCAATGT CGATTCTGGC ACGGCCACGA AGGCGACATT CGGCAGGTTT CGTTCAAACC GGCACCCCAC TAACAGAAAG CAACTTCAGC GTGCGGCTAC GTCGCGATTC ATCGCTCTTT GTGTGATGAT GCTGGGTATG GGGTCCCTCT GGCAATCCCG ATCTATACTT TCGTTGGTAC TCTCAACATC TACCCAACAG GAGTCTGCAA TTGATGCCAT TCGCACCGAC GTGGTCTCCA ACAACATTGA CACCGGCACC AGATATGAAA AAGACCCGAT GCAATTTGCG GGGAGCTTGC AATCAGGAGG CAGCGCTGGA AAGACCAAAG ATAATACAGA AACGCCAGTG ACAGCCGCCG AGCGAGAGCG TTCAGTGCTG GCCTTAGAGC CACCCCCAAC GAAGGTTACG GTCGAGGCGG ATGATCCGTC AAATGCTGGC AACGAAAGTG AAACGACGTT TTCGTTGATG CAGCGTATCG AACATACGCC TCGAAACCTG ACCAACATCC CAAAAATGCG AGTAGTACTT TGGCCTGACA CGCAATCAGG CGTTCGAAAC TCTGAGAGTT TTCACCTGAC GGAAAACGGT ATCAACGAGT CTGCCTATTT AACGCTGAGC AACGAGACAT GGGACTTTCA TTCCAATGTT GTGTGGGTGG GGGATATGGG AATGGGTGGC CCTCGAAGAC AATGGTGTGG AGCTTTTGGG GCACTGGCGA AACAGGCCAA AGACAAACGG CGTGTATCAA GACTCCCATT GCAGTGGCCT ATTTGTATTG TTGACTACGC TGACGGCCCC TCATTGCCAC GGTGCGCGAA TATCGAGGCG CAAGTAGGGG TTGAAAATGT TCGGTATTCC GTTCGATCCG TAGTGACTGG ACGTAATTGG AATGAAACTA TCGGATGGGT ACAAGGCGGT GGTCGACTAA GTCTGAATAA AACATATGGT ATCACATATC GGCAGGCATC GTATATGGTT CGGACAGACA TGATCAAAGT CTTGGAAAAC TCTCTTCGAG CGAGAAATAT GAGCCTGGCC GATCCAATTG AGCGCATCGA TCGGCCGGTT GATGTTGCCC ACTTCTGGCC CCACTCAAAT TCATCGACAA ATCAACAAGA CCGACGCAGT GTGTTACATT TCCGATCCAA GCTACGTTCC AAGATTAGCG ATCTGGTCGT TGATCTTGGC AAGACCCAAT CAAGTCTCAA CGTCTTTGTT GGTCTCAAAG GGCACGCAAA GGAGAGTGGA CGCACTGGAG TGCACACCGA CTACATGAAT GCTCTACTTG CTTCAAAGAT TGTCGTGGTG ACACAACGAG ACGAGTGGGA GGAACACTAT CGACTTATGG AGGCAATCAT TGGCGGTGCC ATGGTGATGA CGGATCGCAT GCTGACGTTA CCAGCAGGCC TGCAGAACGG CACGTCCATA GTCGAGTTTG ACAGTGCCGA GAGTCTCGTG TCACTGATCA GCTACTATCT GAGACATTCT GATGAGAGGC TGGAGATAGC CCGGGCAGCG CGGGACGTCG CTTTGCGAAA GCATCGTTCT TGGCATCGGA TGGAAGAGAT CATCTTTGGT GAGAGTCTGT CAAATTGCAG CTTCCAGCAC CCAAATAGCC CGTGCCCGTA CGTTGCTCAC GGCATCGATT CAAAGCGTTA AAAAGACATG CGCGAAAAGT TTCCAAAAGG 
AACGGAGCTT GTAAAGGGTC TGTTTATGGT TGTTGGCAAA GAGCCTTAAG ATATGGCTTT GAGATACTGT GTGCACGAAA CTGATAGTGA CGAAAACAGA AGCAGCAATT TGTCTCCAAT GTACATTTTA CAGCGGAACG GACTTAGCTT GCTTGAGGTG TCTATAGGCG ACGATTGCAA TCTATTGAAG TGTTGGAAGG AGCCATTTAC TGTTAAGCAC ACTGATATAT GTCACACAAT GAAGATCACT TTTTTCAAGC AACGAGTTGT CAAAATAACG AAACAAAAAC TAATGTTGTG TCTAAAGCTT GATATTTATT TT
|
Protein sequence | MLANVDSGTA TKATFGRFRS NRHPTNRKQL QRAATSRFIA LCVMMLGMGS LWQSRSILSL VLSTSTQQES AIDAIRTDVV SNNIDTGTRY EKDPMQFAGS LQSGGSAGKT KDNTETPVTA AERERSVLAL EPPPTKVTVE ADDPSNAGNE SETTFSLMQR IEHTPRNLTN IPKMRVVLWP DTQSGVRNSE SFHLTENGIN ESAYLTLSNE TWDFHSNVVW VGDMGMGGPR RQWCGAFGAL AKQAKDKRRV SRLPLQWPIC IVDYADGPSL PRCANIEAQV GVENVRYSVR SVVTGRNWNE TIGWVQGGGR LSLNKTYGIT YRQASYMVRT DMIKVLENSL RARNMSLADP IERIDRPVDV AHFWPHSNSS TNQQDRRSVL HFRSKLRSKI SDLVVDLGKT QSSLNVFVGL KGHAKESGRT GVHTDYMNAL LASKIVVVTQ RDEWEEHYRL MEAIIGGAMV MTDRMLTLPA GLQNGTSIVE FDSAESLVSL ISYYLRHSDE RLEIARAARD VALRKHRSWH RMEEIIFGES LSNCSFQHPN SPCPYVAHGI DSKR
|