Gene Information |
Locus tag | PHATRDRAFT_17172 |
Symbol | |
ID | 7196029 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 113243 |
End bp | 115131 |
Gene Length | 1889 bp |
Protein Length | 551 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176526 |
Protein GI | 219109543 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.496601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATACACGT CGACCATGAA GTCGGTTTTG TGTGGAGTTA CGCTGGCTTG CGGTGGAATT TGGCAAGCCA ATTCGTTCAC CTTTCGTCCG CTACCCGTCA CGAGACCCGT TATTGGCTTG TTGGCTACGT TGGATCGCGT CAATGAAAAG TCCACGGTGG AACCCTCCTC GGGGGATCCA CAACTCCAGC CGATTCGGCG GTTGCGGCAA AACAAGAAGG AGCCTTTGAT TGCGATTGTG GGACGTCCCA ATGTCGGGAA AAGCGCTTTG GTGAATCGTA TTGCGGGATC GCAGTCTGGT GGTGCCATTG TGGCGGACGA AAGTGGAATT ACCCGGGATC GCACCTACCG TCCCGCCGAG TTCCTCGGCG AGCGCTTCAT GATTGTGGAC ACGGGCGGTC TCGTATTTGA CGATGACGAA AGCACACTCT TTGCCAAAGA GATTAGGGAA CAGGCTATGG TGGCGATTGA AGAGAGCGCG GCAGTTATCA TGGTAGTCGA CGGTCAAACC GGACTGACCG GGATGGACTT GATGATTGCC GAGTTTCTAC GCAAAGAAGT GGATATTCCC GTGCACGTGG CAGTGAACAA GTGCGAAAGT GAGAAAACGG GTGCCATGTC CGCTGCCGAC TTTTGGGGAC TCGGATTGGG CGAGCCCTTT CCGGTTTCGG CCCTCCACGG AGTAGGCACG GCCGAAATCA TGGAAACCAT ATTCGACTCG ATTGCCGAGA AGAAGAGTGC CATTGAAGGC TTTGGAACCA AAGTCAAAAA GCTGAAAGAG GCCAAGGGTA TCATGAAACA CAAAGGGCCT CTCCCTGGGG AAGACGAAAC AGATTATAAA ATGCGAAAGT ACGGAATCGG TGATGCGGCT AAAAAGGTGG AAGAAAATTA TCAAGCAGCC ATGGAAGCAT TTGATGCCGA GGACCGTCCC GAGGAGATAA ATATTGCCAT TATTGGTAGA CCCAATGTCG GCAAGTCTTC CCTGTTGAAT TCCATCTTTG GGGACACACG TGCGATCGTG TCTGAGATGG CTGGAACGAC ACGCGATTCG ATCGATGCCG TCATGGAACG TCCGCCACCG CCCGGAAGTG ACGATCTGTC TACAATTTAC CGCTTCGTAG ACACGGCGGG TATTCGTCGG AAAGGAAAGG TTGATTTTGG TCCCGAATTC TTCATGGTCA ATCGAGCACT CCGGGCGATT CGACGTGCCG ATGTCGTCCT TCTTATTCTG GATGCCACTT CCGGTGTAGC CGAACAAGAT CGTGTCTTGG CGCAGAAAAT TGCCGATGAT GGACGCGCAT GCGTGATCGT TTGCAACAAA TGGGATGCTG TCGTTGATAA GGATTCAACA ACGTACGACA AGTCGGTCCA ATACTTTCGA GAAGAATTGC CGATGATTCG TTGGGCCCCT ATCTTATTTA TCTCGGCTGC CACTGGGCAA CGTGTTGGCA AGATATACAG CGCCATTGAC GGTGCCATCG AAGCTCATCG TAAACGGATA AGCACGGCTA TTCTCAACGA AGTTTTGAGA GATGCTATTT TGTGGCAGCC ACCTCCGACA CGCCGCAACG GGTCACAGGC GAAGATATAC TACTGCAACC AAGTGAGTAC GCGACCACCT ACCGTCGTTG TTTTCTGCAA TGATCCCAAA CTGGTCAACG ACAATTACAG GCGTTACTTG GATCGTAAGT TCCGTGAATC ACTGGATGGA TTTGAAGCGA CTCCCATTCG ATGGATTTTC CGAGGCCGTC GTGTACGTGA TGTCATGAGG AATCGCTCCA TGAATGGAGA TCCCGGTGAC 
GGTGGCACCG GCGTCAGTTT CCCCTTTCCC CATGCCGACT AACCATGTAA ATCAAACCAT CTTTTATAAA TATTGATATT TGCTCCTTA
|
Protein sequence | MKSVLCGVTL ACGGIWQANS FTFRPLPVTR PVIGLLATLD RVNEKSTVEP SSGDPQLQPI RRLRQNKKEP LIAIVGRPNV GKSALVNRIA GSQSGGAIVA DESGITRDRT YRPAEFLGER FMIVDTGGLV FDDDESTLFA KEIREQAMVA IEESAAVIMV VDGQTGLTGM DLMIAEFLRK EVDIPVHVAV NKCESEKTGA MSAADFWGLG LGEPFPVSAL HGVGTAEIME TIFDSIAEKK TFDAEDRPEE INIAIIGRPN VGKSSLLNSI FGDTRAIVSE MAGTTRDSID AVMERPPPPG SDDLSTIYRF VDTAGIRRKG KVDFGPEFFM VNRALRAIRR ADVVLLILDA TSGVAEQDRV LAQKIADDGR ACVIVCNKWD AVVDKDSTTY DKSVQYFREE LPMIRWAPIL FISAATGQRV GKIYSAIDGA IEAHRKRIST AILNEVLRDA ILWQPPPTRR NGSQAKIYYC NQVSTRPPTV VVFCNDPKLV NDNYRRYLDR KFRESLDGFE ATPIRWIFRG RRVRDVMRNR SMNGDPGDGG TGVSFPFPHA D
|
| |
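The coordinate and composition fields in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only: the helper names gene_length and gc_percent are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive positions, which is consistent with the reported gene length of 1889 bp. The whitespace that groups the sequence into blocks of ten is ignored when counting bases.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces, line breaks, and any other characters are skipped, so the
    block-formatted sequence from the record can be pasted in directly.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields above.
    print(gene_length(113243, 115131))
```

Passing the full contents of the Gene sequence field to gc_percent gives a value that can be compared with the GC content field. The record does not say how that field was rounded, so small differences are possible.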