Gene Information |
Locus tag | PHATRDRAFT_50387 |
Symbol | |
ID | 7199155 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 196445 |
End bp | 197876 |
Gene Length | 1432 bp |
Protein Length | 393 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185339 |
Protein GI | 219130368 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | AGTCGCCCAC ATTTTTGGAA GCATCCATCT GTGAGGAAAC TCTTTTGGTG CATTGCGTTG TCACATCCGA CACTGGTGTA CCGTTGGCAT CGTCGGTGTT GTTACATCGT GTCGGAGTTG CGTTCCGTCC GTCCCAGGGT CGCTCGTTGT AACTGTTCCA CGACGCCATG GGCAACACGG CGTCGGAACC CGACGAGGAC GATCCGTCCG CCACGGATCC TTTTGACGGC GTGGATACCT TGGGCTACCG CGTGCTGGGC GTGCAGCCCG ACAGTCCCGC ATCGCAAGCC GGTTTAGTCT CCTTTCTCGA CTTTTTGGTC GGCGCCCAAG GCCGCATGCT GCTGGGATCG GGCGAAGACC TCGCCGACGG CGAGGAGTAC GACGATATTG ATCTACCGGC GTTACTCAAG GAATACCAAA ACAAGGAACT GGAATTGCGT ACGTACGAGT CCTGATAGAA GGAAGGGAAG CACCTCCCCA AAGGTACCGT CTCTTACTCA CTGGCGTACC CGTTTCCACA GTGGTATGGA ATATTAAATC CCAGCAAGAA CGGTTAATAT GCTTGACACC ACGAGACGAT TGGGGCGGCG CTGGTCTCCT GGGGGTCACG ATTCGACTCG ACAATTACGC GGGGGCCGAA GACCGATTGA TTCGCGTACT CACGGTCGAA CCCCAATCTC CCGCCGCCGT GGCTGGACTC GTCCCCTACC AAGACTTTCT CCTCGGCACC ACGCACCAAA CTTTGGAAAC TACCACACAA CTCGCGGATC TCCTACAAAC CAACGTGGAT CAAGTGGTGG AATTCTACGT CTACAACGTG GACTCCGACT TGGTACGTTT GGTCGCCCTC CTCCCGACTC GGGCCTGGGG TGGTGGTGGA CTCTTGGGCG CCCAAGTCGG CGTCGGCTAC CTGCATCGTC TTCCACACGC GGTCCGTACC ACGCCCGGCG CCAGTGTCGC CCGCAAAGTA CGGTACGTCG GCGTGGCGCC CGGAGGTCGC GCGGGAACGC CGCGATTACC CCGACCCGTC CTCGTCATGG AACCGCAACT CGAAATGGAA GCTCACGATG GGGAACCTGA AGACGACGAT GAAGACAGTG ACCACAGCGA TAATGTGGAA GAAGTATTTC CTCCACGACG AACGCCCGCA CCGAGCAACG AACAAACGGC CTTGGAACCA TTGCCGCTTT CGAATGCCAC CAAAGTGTCG GCCGACACTC CCGGTAACCA AGGTGGGACT CTAGCGTCCG CCGCTGCTAG TGTCTTTGCC GCCCCGCCCG TACATACCAG TATGCCTCCA GCACCGCAAG CCGAACCCTA TGAGAATCAT GGTCTCGTTG CACCGAAAGA GTCAACGGCT CGCTCCGCGT CTAACTCCGG CGTGTTCGAC GCTTTGCCTC CGCCACCGCA AAGGACCGCC TACCGGAACT GA
Protein sequence | MGNTASEPDE DDPSATDPFD GVDTLGYRVL GVQPDSPASQ AGLVSFLDFL VGAQGRMLLG SGEDLADGEE YDDIDLPALL KEYQNKELEL LVWNIKSQQE RLICLTPRDD WGGAGLLGVT IRLDNYAGAE DRLIRVLTVE PQSPAAVAGL VPYQDFLLGT THQTLETTTQ LADLLQTNVD QVVEFYVYNV DSDLVRLVAL LPTRAWGGGG LLGAQVGVGY LHRLPHAVRT TPGASVARKV RYVGVAPGGR AGTPRLPRPV LVMEPQLEME AHDGEPEDDD EDSDHSDNVE EVFPPRRTPA PSNEQTALEP LPLSNATKVS ADTPGNQGGT LASAAASVFA APPVHTSMPP APQAEPYENH GLVAPKESTA RSASNSGVFD ALPPPPQRTA YRN
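The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the sequences are stored in space-separated blocks of ten characters, as shown above. It also assumes the coding sequence ends in a single stop codon, which the record does not state explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed interval [start_bp, end_bp] (assumed 1-based, inclusive)."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (an assumption)."""
    return (protein_length_aa + 1) * 3


if __name__ == "__main__":
    # Values taken from the record: Start bp, End bp, Protein Length.
    print(gene_length(196445, 197876))   # 1432, matching the Gene Length field
    print(coding_bases(393))             # 1182 coding bases for a 393 aa product
    # GC fraction of the first two ten-base blocks of the gene sequence.
    print(gc_fraction("AGTCGCCCAC ATTTTTGGAA"))  # 0.45
```

Under these assumptions, the 1432 bp gene span exceeds the 1182 coding bases, which is consistent with the record marking the gene as spliced. Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content field.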