Gene Information |
Locus tag | PHATRDRAFT_49802 |
Symbol | |
ID | 7198464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 366213 |
End bp | 368143 |
Gene Length | 1931 bp |
Protein Length | 570 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184531 |
Protein GI | 219128672 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
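The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the coding region ends with a single stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 366213
END_BP = 368143
PROTEIN_LENGTH_AA = 570


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)
# Bases in the reported gene span that the coding region does not account for.
extra_bases = gene_length - cds_length

print(gene_length, cds_length, extra_bases)
```

Under these assumptions the span length agrees with the reported Gene Length of 1931 bp, and the difference from the coding length suggests that the reported span includes some sequence outside the 570-codon open reading frame.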
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAACAAGTCT ACTCACACTC ACATACACAT GTAGTATACA TACACACACA CACACCCATT CCCTCACATA ACAATCATAT CCTCTTACCA ACCAACCAAC CTTGCTGCCG ATACCATCCA CGGAAGACAC ACACACACAC ACTCTCACTA CCCTCAAAGT CACTTTTAGG TCACAGCCAC TCCTCACACG TATCCAAGGA GTCGGTTTCG ACTTCCCCAT GGGACTTGGT TCCAACTTAT CGTCGTCTCC GCTCGGTAGG AGCGTTCACC ATTCTTCGAG TGATTCGACG TTGCGGAGTC CCCCACGGAC GACTCGGACT ACCGTCCGGC ATTCCACGGC GTGTGCGCGT TCCTCTCCCA CCAGCGTTGC TCTGGTTCCG ACGACGTATC CGTCTCCTAC TGCTCATCAT CAACAACAAC AACATCACTA CTCCGTCAGT GCGAGTCATT GCCACCGCAT GCCGCATCCC CAATCACCAC CACCGCGTTT CTGGCTCCAG TCATTCGTTC CATCCTCCTT CTTCTCTTCC TTCTCTTCTT TTTCTCTCTT TTCCGCACGT CCCATGAGCC GACGAAGACT GCGGAGATCG CGACCCAACG CTCTGCTATT GTACAGTCTC GTAGCCTGGG GGACGCTACT CGTCTCGTGT TGGACCATCT TTGTTATACT CTCACCGCCA CAATCCGCAT CCGTACCTCC CAATCCGCTA CCACACCATT CCGTCGCACT CCTGCGGCAG GCCTCCACCG AACTGCGGAC ATCGTCAACG GGACAACGGC GTCAAGTCAC CACAGTCCAA GTCCCCGACC TGTCCTCCTG GCGCAACACA CACGGCGTAG TACACGTCGT ACAAACACGC TTTATGCAGT ACCAACCCAA CTTGTTGGAT CTGGGTCATG CCCGTTTGGA AATATTCCAA GCCCTTACCC TGCCTTCCGT CCGCGCCCAG TCCTCGCAAG AGTTCCTGTG GCTCATTCGC ACCGACCCGG CCTTGCACCC AACCTTGCGT ACCGCACTCT GTCACGTCCT GCGGGACGTC CCCAACGTCA TTCTGGTCGC CTCCAACGAA AACCCCGAAG GCTTCCGCGC GGACGACGCC GTCGCGGACG TCAGCGACGA TTCCGTCTGG GTCGGCCACG CCGACACGGT CCGGGCCTAC CACGCCGCGG CACAAACCCA CGTTCTGTTG GAAACACGAC TCGACGCCGA CGACGGCCTC GAAACACACG TCCTCGAGAA TTTGCAACGC CAGGCCGCCA CCGCGCTCGT GCACGCACCG GCCGTGGGAT GGCGCGTCTG GTGCGCCTCG AGTCACCTGG AATACCAACA CTACAACGTG TGGGACGCTG GGGACGTCCG GGGAGCCATT GTGGGGATCA AAACATCCTA CTGCGTCACT CCCGGCTTGA CTTGGGGCTA CGCGGTTGGT GTCGTCCCGC ACAAGGTCGA ATCGAAACAC GATCGCATAC ACAAACGTGT CCCGGCCTGT ACGGAGGCAT CGGCCGAGAA CGCCCGTGTT ACGGGCTGTC TGACCCGCAT ACAGAACGGA CGGCATCCGG CCGCGGTGCG CGCGCGGTCG CCCACCAGTG CCGGCATGGC CAATCTAATA CTCGACGCCA CCATGACACA AAGCGGCAAC GCTGCCGCCA CGATGCACAA ACACAACCTA CAGAAATTGC AAAAATCGCG TTGGAAAACG TTGCAGGACG ATTTGTGGCT CACACTGCCC CTAGTGTTTG GGATTGTACC TGCACGGGTG TGGCAAGCCC GGGAATATCT AGAAGCTCAC ATGGTTAATA 
TTGTGCGCGA TAATCTGGCC GGACAGTGTA CCAAGGGACA CAGTTGCAAG GAATTGAGTA AACAGGCACT ACAAGTTCTG TTGGATATGT ACGAGGCCGA CCAGGCGGAA CCGGAACCCG AACAGCTGTA A
|
Protein sequence | MGLGSNLSSS PLGRSVHHSS SDSTLRSPPR TTRTTVRHST ACARSSPTSV ALVPTTYPSP TAHHQQQQHH YSVSASHCHR MPHPQSPPPR FWLQSFVPSS FFSSFSSFSL FSARPMSRRR LRRSRPNALL LYSLVAWGTL LVSCWTIFVI LSPPQSASVP PNPLPHHSVA LLRQASTELR TSSTGQRRQV TTVQVPDLSS WRNTHGVVHV VQTRFMQYQP NLLDLGHARL EIFQALTLPS VRAQSSQEFL WLIRTDPALH PTLRTALCHV LRDVPNVILV ASNENPEGFR ADDAVADVSD DSVWVGHADT VRAYHAAAQT HVLLETRLDA DDGLETHVLE NLQRQAATAL VHAPAVGWRV WCASSHLEYQ HYNVWDAGDV RGAIVGIKTS YCVTPGLTWG YAVGVVPHKV ESKHDRIHKR VPACTEASAE NARVTGCLTR IQNGRHPAAV RARSPTSAGM ANLILDATMT QSGNAAATMH KHNLQKLQKS RWKTLQDDLW LTLPLVFGIV PARVWQAREY LEAHMVNIVR DNLAGQCTKG HSCKELSKQA LQVLLDMYEA DQAEPEPEQL
|
| |
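The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be normalized and how a GC fraction like the one in the GC content row could be computed follows. The helper names are hypothetical, and the record does not say how its 57% figure was derived or rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for block formatting and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two blocks of the gene sequence shown above, used as a small example.
example = clean_sequence("AAACAAGTCT ACTCACACTC")
print(len(example), gc_fraction(example))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` is one way to compare against the reported GC content.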