Gene Information |
Locus tag | PHATRDRAFT_48647 |
Symbol | |
ID | 7194894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 474007 |
End bp | 475917 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | |
GC content | 64% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183104 |
Protein GI | 219125683 |
COG category | |
COG ID | |
TIGRFAM ID | |
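The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 474007, End bp 475917.
length = gene_length(474007, 475917)
print(length)                           # 1911
print(expected_protein_length(length))  # 636
```

Both results agree with the Gene Length (1911 bp) and Protein Length (636 aa) fields reported for this locus.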
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.453623 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGACG TTTCCCTTCG GTACTTGATC ACCGATCCGG ACACGTGGCG CGGTACGTTT TGGTGCCTCA TGGGTCTCAT CCTGGCGCGT GTCACGGCGA GTGCGGCGGT GGAAGCCTAC CGCCGCGTCC AACTCGCACG CCATCTCCAG GCCTATCCCT ACGCACGCGT CGTCCCGCTC CGGACTCGCG GCGCGGCACC GCACCACCAC ACAATCGTCC ATCGTATCCT CTGGTGGATG CTGGAATTCG TCGGGGCTAT TCTACGCGTG ATTGCGGCCG GATGTATGGT ACTGTGGCGC GCCGGGCAGC GTGCCTGGCG GCGGCAGGGG AGCAGGCGCC CCCACGGCCC CGTCGTTCCC ACGGCTGTCG TCCCTCCCTC CGTCTTGAAA TCGAAACAAT CACGAAGCCG TAGTGAACGA CCGTCGGTAC GGAGTCAGTC CGTTCTCTTT GCCCAACACA CCAACGGAGT CGTCAAGACG ACCGAGTACT ATTACAATCC CCGAGCCACG CCGAATCAGC GCGTCCCGGC CCAACCATCC ATCACGCCAC CGCGGCACGA ACTTCCAGTG GAACACCGGG AACCGACCCC GGGGAGTCCG GAAGCAACAA ATGCCCTCCG CTCCGTCTTT CCTTCGTCCA CTGCCAGCAG CACTACCACC ACGTCCATCA TGGAACACTC CCCGAGTCGA CACCATATCA AGGCGGCTCT ACCACTTTCC GAACAATGGC GCAAACGCCG TCTCGCTGGC GAAGATGCCC TTTTGCCGGC GTTGAAACGA GTTGTTGGTC ACGGGGGTGG GATCACCGTG GGGGTCAGTC CGACGCGGGG ACGCCTCGGG GCTTCGCGAT CGTCGCGAGC CGTGCCGACC CCCCTCTCGG ACAAGGTCCG TCACGCCCGC GAAGCGCGCA TTTGGGAAAG TTTGAATCGG AAACGGCCCG TTCCGGACGC GAACGATACC CAAGGTCGGG CTGCCAAAAA GGTGGCCTTC GGACCCACGA TGCCTACCGC TATTGCACCG TTGGCAGCGG AACCCACCCC GGCCCCCCGG TCCAGTATTT CGTTCGGTAC ATCACTGACT GACGTCACCC CAAAGCCAGA CGCGGCCCCC ACTACTGCGG CACCGCAACC GGCGTTCGTA TTTGGATCCA CATCGGATAC AACACCCGCA CCCGGTCCGA CAACGGCGTC ACAGCCGGCC TTTGCTTTTG GATCCACATC GGATACAACA CCCGCACCCG GTCCGACGAC GGCGTCACAG CCGGCCTTTG CTTTTGGATC CACATCAGAA ACAACACCCG CACACGGTCC GACGACGGCG TCGCAGCCGG TGTTTGCTTT TGGATCCACA CCGGGCACAA CGCCCGCAAT TGGTCCAACG ACGGCGTCGC AGCCCGCGTT TGTATTTGGA TCCACACCGG GCACAACGCC CGCACCTGGT CCGACTACGG CCCCGCAACC GGCGTTCGTA TTTGGATCCA CACCGGGCAC AACGCCCGCA CCCGGTCCGA CTACGGCCCC GCAACCGGCG TTCGTATTTG GATCCACACC GGGCACAACG CCCGCACCTG GTCCGATTAC GGCGTCACAA CCGGCGTTCG CCTTTGGATC CACGCCCGCG CCGACATCAT CGCCCGCACC TGGTCCGATT ACGGCGTCAC AACCGGCGTT CGCCTTTGGA TCCACGCCCG CGCCGACATC ATCGCCGTTA CCAACCACAA CGTCGGTACC GCCGACGTTT GCCTTTGGAT CGGCCGTAGC CGATACCACT GCTGCTCCTG TCACTGGCTT TGGAGCGACT 
GCTAGCTTTG GTGCACCGGC ACCAGTGAGT GGTGTATCGT TTGGGAGTAG TACTCCCACC GCGGGAGGAG CGAGTGCCCG GCGTCGTGCC GCACGCCGCG GACGCCGGTA A
|
Protein sequence | MPDVSLRYLI TDPDTWRGTF WCLMGLILAR VTASAAVEAY RRVQLARHLQ AYPYARVVPL RTRGAAPHHH TIVHRILWWM LEFVGAILRV IAAGCMVLWR AGQRAWRRQG SRRPHGPVVP TAVVPPSVLK SKQSRSRSER PSVRSQSVLF AQHTNGVVKT TEYYYNPRAT PNQRVPAQPS ITPPRHELPV EHREPTPGSP EATNALRSVF PSSTASSTTT TSIMEHSPSR HHIKAALPLS EQWRKRRLAG EDALLPALKR VVGHGGGITV GVSPTRGRLG ASRSSRAVPT PLSDKVRHAR EARIWESLNR KRPVPDANDT QGRAAKKVAF GPTMPTAIAP LAAEPTPAPR SSISFGTSLT DVTPKPDAAP TTAAPQPAFV FGSTSDTTPA PGPTTASQPA FAFGSTSDTT PAPGPTTASQ PAFAFGSTSE TTPAHGPTTA SQPVFAFGST PGTTPAIGPT TASQPAFVFG STPGTTPAPG PTTAPQPAFV FGSTPGTTPA PGPTTAPQPA FVFGSTPGTT PAPGPITASQ PAFAFGSTPA PTSSPAPGPI TASQPAFAFG STPAPTSSPL PTTTSVPPTF AFGSAVADTT AAPVTGFGAT ASFGAPAPVS GVSFGSSTPT AGGASARRRA ARRGRR
|
| |
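The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before use in most tools. The following is a small sketch of one way to do that and to compute a GC fraction for comparison with the GC content field; the helper names are hypothetical, and the short string in the usage example is only the first three blocks of the gene sequence.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a whitespace-free nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, as displayed in the record.
prefix = normalize_sequence("ATGCCGGACG TTTCCCTTCG GTACTTGATC")
print(len(prefix))  # 30
```

Applying `gc_fraction` to the full normalized gene sequence gives a value that can be compared with the reported GC content.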