Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37297 |
Symbol | |
ID | 7201944 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 656588 |
End bp | 658673 |
Gene Length | 2086 bp |
Protein Length | 527 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181417 |
Protein GI | 219122154 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0209176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGTGA GGGATCCTAG ACTTAGAATT GGAGGGAAGG TGACGGCAAA GGCTTGTCAT GTTGTGCATC TGAGCGAGTG CGCACGGAGA TATGGCGTCA ACAAGCACTC CAAGCGGCTT GTTGGAACGG TTCTAGACGT CACGACCACC CCTGTATCCA TTACAACCGG GCGTACCTCT ACTTTGATAA CAGCAGTTTA TGATTTTGGA GAGAGTTTGT TCAAGGAAAA AACACTGAAC ATTCGGAGTG TAAAGGCATT TGTACCGCCA GAAGATGAAG GAATGTCCTT AATTGAGGAA TTAGCAGCAG AGGCTTTGCA GGCAGCAGAA GCAGACATGG AAGCCGGAAA CTTGATGGAA GAAAGTGTCG AAGCCCCGGT AGCCGAAATG GTTGAAACCC CGGCTGACAT AGAGCCCGAT ACCTTGGTCG ACACAGAGCC CAATACCCCG GTTGACACAG AGCCCGATAG CCCGGTAGCC GAAATTGTCG AGACCCCGGT TGACACTGAT ACCTCGGTTG ACACAGAGTC CGAAAACCCG GTAGCCACAG TGCACCAAAC AGAGTGGTAT GTGAATGAAA GAAAAACCCG GCTGGATGTG AATGGCCATG TCTATGTTAG GCACTTCCAT ATCCGTACTT CAGTTGGTGA CCTTATTGGT CAAGACTCTG ACAATGGGGT GAGATTTTCG CGCCTCGAAT ATTTTCTGCT CATGTTTCCG CCGACCCAGC TGACTACTAT GTGTCGGCTT ACAAATACTA TGCTTGCACA GCAAAACAAG AATCCAATCA CAACCGGAGA ACTTCTTCGG TTCTTTGGAA TGCTCATACT CACTACAAAG TTTGAGTTCA GTAGCCGGGC CCAACTATGG TCCACCACTG CACCCTCCAA GTACATTCCT GCCCCTTCAT TTGGACGCAC AGGAATGTCC CGGCAACGGT TTGACAATAT CTGGAAATAT ATCCGTTGGA GTGAACAATG TCCAGTCCGA CCCGATGGTA TGAGCACTCA TGTTCACCGA TGGCAACTTG TTGACAACTT TGTCACAAGG TTCAATGAGC ATCGTAGCGA AAACTTTGTA CCTTCCCATC TGATTTGTGT GGATGAATCT ATCTCAAGAT GGTATGGGCA GGGTGGGATT GGATAAACCA TGGTCTACCA AATTATATTG CAATTGATTG AAAGCCTGAG AATGGGTGCA AGATTCAAAA TGGTTTTGTG TGGACAATCC GGTATTATGC TTCGATTGAA ACTTGTAAAG GGAAAGACGA TAACTGACGA CGAAGAGGGT GACGAGGAGG ATGAGTATCT ACCGCATGGT GCAAAAATTA TCAAAGAACT TGTTTGTCCT TGGTGGGGGA GTGATCGGAT TGTGTGTGCT GATTCTTATT TTGCCTCCGT TGTGACAGCT GTCAAGCTTA AGAGGATTGG CTTGAGATTC ATTGGGGTTG TGAAGTCGGC AACGAGAAGA TATCCAATGG CCTACCTTTC ACAGTTGGAA ATGACAAGTA GAGGAGAATG GAAAGGATTG GTGACAGACA GAATCTCGGA TGGAAGTTGT GACCTGATGG CTTTTGTATG GGTGGACCGA GACCGTCGAT ATTTTATATC AACAGCATCC AATCTGAATA GAGGCTGGAG TCCAGTTTGC TACCGGTGGA GACAGGTGGA TACATCACCT GATGCAGACC CTGAGAGGGT GGAGATCAAT ATTGCGCAAC CAGTTGCAGC AGAAGTGTAT TATTCTTGCT GTGCAATGAT TGACAGACAC AACCGGAGTC GGCAGGATAC ACTGATGCTT GAAAGAAAAC TTGGCACATG GGATTGGTCG ACACGAGTCA ACTTATCAAT TTTTGGTATC ATTGTTGTGG GCACATGGTT AGCCTACAGC CAATGTACAG GAATAGGAAA GTCTGCTGGA CGAGAAGAAA AGCAGAAGGA TTTCTACAGT GCCTTAGCCG AGGAGCTGGT GGACAACCAG TACGATAGTG TTGGAAGTCG GAAAGTTGGG AGGGGTGAGT TGGACAAGGA TAGCCCAACC ATCTCCAGAA CTGGAGAGCC GCGATGTGGT CTCTCCGCAC ATCTAA
|
Protein sequence | MPVRDPRLRI GGKVTAKACH VVHLSECARR YGVNKHSKRL VGTVLDVTTT PVSITTGRTS TLITAVYDFG ESLFKEKTLN IRSVKAFVPP EDEGMSLIEE LAAEALQAAE ADMEAGNLME ESVEAPVAEM VETPADIEPD TLVDTEPNTP VDTEPDSPVA EIVETPVDTD TSVDTESENP VATVHQTECK TRIQSQPENF FGSLECSYSL QSLSSVAGPN YGPPLHPPST FLPLHLDAQE CPGNGLTISG NISVGVNNVQ SDPMGKTITD DEEGDEEDEY LPHGAKIIKE LVCPWWGSDR IVCADSYFAS VVTAVKLKRI GLRFIGVVKS ATRRYPMAYL SQLEMTSRGE WKGLVTDRIS DGSCDLMAFV WVDRDRRYFI STASNLNRGW SPVCYRWRQV DTSPDADPER VEINIAQPVA AEVYYSCCAM IDRHNRSRQD TLMLERKLGT WDWSTRVNLS IFGIIVVGTW LAYSQCTGIG KSAGREEKQK DFYSALAEEL VDNQYDSVGS RKVGRELESR DVVSPHI
|
| |