Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50569 |
Symbol | |
ID | 7199394 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | - |
Start bp | 150316 |
End bp | 151617 |
Gene Length | 1302 bp |
Protein Length | 354 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185530 |
Protein GI | 219130769 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGAAAGAGCA TGAGTTTACT AGGCACAATG AATGTTGGCT TCTGGCTAGT TCCGTTCCGC GCGCTGTTCC AGCTCGATCA AGCAGAGACC GTCTCCAAGT ATCCTTCGAT TGTTTTACAG GCCATTGTTG CATGTCGACT GCGAGCAGTG GCAGAAATGC ACTGCCGAGC GAGGACCTCG GTGTTTTCGC AGCCGATTCG GTGGCAACGC CGTCAGGGGA CGAAGAACGC ACGATAACGA CCAAAGGTTA GTTGCTCGTT TCCGTCACAG CAGCGTGAAG CGTTTCCACC ACATAGGCTC GATCTGGACA GACGGCTGAA GAAGGAGCGA GTAAAAACGA AACGGCGCTT TTGCAAGCCG ATGCGAACCA CCAATAATCG TAGAGGAGGT TGTCACAAGC AAAAGTCGCA ACCGACGACG GCAGGAAAGG AGCGAAATCA GCGGATGGAA AAAACGGCGC GACTTCCATG CCAAAGCCGT TTTACAGCTC CAGTAGGTAT TCAAGTGAGT CGCAACTGCA GGCGTTTCCA AAACCGGTAG GGGAGACATT CGACGATCGC TTCCATGTGA CGAGCGTCAC ACCGATAAGA TTCACCAGGC GAGGGCTCCG GCATGTTGCC GTCTTCTAAC AACCACCCGC AGTAACGTGG GCGCTCTAGG CGACCTTGGC TCTGACAAAG AGGCGTGTCC TGTGGATTAC AAAGCTGACG TAGCGACTGA TCCGGAATAC ATCAACGAAA CCGACAAAGA AATTACAGCG CAAGGAGCCA CTGCCGTTCA AGTCGCGGAG GATTCCAAAG AAGCAAAAAT CAACATCGAA GGGCTTTTAC GGATTGTCTT TCAAACCTTG CTGGAGGAGT GCCATATACC AAATTTTACC ACATACACCC TAGTCGCCGA CAAACAGTTG ATTCTTGCCC AAGGCAAGCC CGTGGAGTTC ACCTTGTACA CGTGGAAGTT GTGGCGCGAC CGGCATCATT GGGCATCATT TTGGAAAGAT CGGTACGTAT TGGTGTTAGA TTCAATGTTG GAACCTTGGC TGGTTTTACG ACCGAACAAT GTAGGACAGC CTCCACGTAC GCGCAACGAC CAATGGGCGT GACAACCGAC AAAGTGAAAG AGGAAGATGA TGCCGCTGCA CCCGAGTCAG CAGACAAATC CCAATCAAAA GCTATCTCGG ATGGCGCAAC GGAACCTCCA ACAAAACAGG AGCGGTCACA GGAACTCCTT GGGAATATAA TGGGATTGCT CGCAGGGATC GCCATTCTAT TTCTTTTTGT GTCCATCTTG AAGTGGGAGT AG
|
Protein sequence | MSLLGTMNVG FWLVPFRALF QLDQAETVSK PLLHVDCEQW QKCTAERGPR CFRSRFGGNA VRGRRTHDND QRLDLDRRLK KERVKTKRRF CKPMRTTNNR RGGCHKQKSQ PTTAGKERNQ RMEKTARLPC QSRFTAPVGI QARAPACCRL LTTTRSNVGA LGDLGSDKEA CPVDYKADVA TDPEYINETD KEITAQGATA VQVAEDSKEA KINIEGLLRI VFQTLLEECH IPNFTTYTLV ADKQLILAQG KPVEFTLYTW KLWRDRHHWA SFWKDRTAST YAQRPMGVTT DKVKEEDDAA APESADKSQS KAISDGATEP PTKQERSQEL LGNIMGLLAG IAILFLFVSI LKWE
|
| |