Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49964 |
Symbol | |
ID | 7198553 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 448646 |
End bp | 450346 |
Gene Length | 1701 bp |
Protein Length | 387 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184805 |
Protein GI | 219129246 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCATACGAC TTCCTCCTCC CCACCCAACT CAACTACTTA CCCATCCACC AACTGACGGT GACCTCCTTT TAGAGTGTAC TGTGAATACG AGCGTCTACA TCTCACGGCA AGGGAGAAAA CAGCTTTCTC TGAACAGTGT GTAACGAAGT CCTCCAATCA CCAAACAACC CTATCAAAAA TGAAGTTCTC CTTTGCTGCA GTCAGTGCGG CCTTGACAAT TGCTTCCGTC GATGCCTTTG TGCCACAGAC TCACAGTCAC ACCAATGCTC GGCCTGGGTC AAATCCCGAT TCGGTCCGGA CTGTGTCTTC CGCACTGCGA GCCGAGGAGT CACACGCATC CTCGTTTTGG TGGGGACCCG TGGCAACGGC TTTTGCGGGA GTTTCTTTGG TCGCACAGAT CGCCGTGGCG GCTCCGGAAC CTACTTGGAC GACTGGCGCC GCAGGTGTGT GTTTGTGTGT GTCTGTAGTT AGATGTTAAT TGATTGTGAG GAGTGATGCG TACGACTCTG CGGTTCAGAC GCAGAGATCT TGAGAAGACT GGAGCAAGTG CTTCGAAAAT CTCCCCGTCG GTATCCTTGC CTGCTGCTTC CAAGTGAGGG AGGTTGCGAG CCTCATCGTC GCTTTCGTAA TTCATTTCTC ACCCTCCTTT GCCGTTGTCC GAACTTTGCT TGACCTTGCC AACGCAGACG ATCTACTCCC CCAAACCTCA TCGGTACTGA TCGCGGATCG AATAGATTCC ATGGACTTTT CCATGCCTTC ATATTCGGAT GCCATCAAGT CCGCAACTCT GGATACTTCC GCCTCGTCCT CATCCAAGCC AGCCCCTCCG TCCTTTAATC CCTTTGAAGA TTCGTCCAAA GACGACGCTG CCTCGGCTGC AGCTAAAGCG GAAGAAAAGG CGGCTGCCGA GGCCAAAAAG GCCGAGGACA AAGCACGTGC GGAATTAGAG GCCGCCACCA AAAAGGCCGA AAAGGAGGCG AGGCGTCAAG CGGAGCTCGA GAAGCAAAAA GCTGCCGCCG AACGTACCAA GGCTGCTCAA GAAGAGAAAG CTGCGGCGCC AGCGAGCAGT AGTGTGGAGC TGCCGTCGGT CCCTTCGGTG GATCTCAAAC TCCCCGATAT CTCCATTCCA GACTTCAAGG CTCCCGACAT TACCATTCCC GATTTCAAAG CACCTGACAT CAAGATGCCC GACTTGCCCA AATTTTCCAT GCCCAAAATG GCAACGGATG GCTACGATTT CCCCGATATC AAAGCACCGA AGGTCGATAT TCCCAATGTG GATATGCCCA AGATTGCTAT GCCCGCCCTT CCATCTTTTG GTGGTGGTGG TGGTGCTTCT TCGTCAGACA ATCCTTCCTC TCCGCTGGAA TCCCAAGATG TCCGAGACGA ACGCGCGCGC TCTGCTAAAG CCGATTTTGC CGACGCCGAC AATACCGCCA GAGAGATTGA AGCCAAGGCG CTGGAATTGC GGGCCGTTGC CAACGACAAG AAGCAAGCCT TTAAAGACGC CAAGGATGAA GCCTGTGCAA CACGACCTGG CGGCAAAATC TTGTGTTTGC GTAACCCCAT GAAAGCTGGA TTCTAATACG AAAATTATTA CCTATGTGGT GCTGGAACAA ATTATCATCA CAGTCAGTAC TATTTTTATC ATTTTCCAAT GTACAAGACT ATTGCGTAAC ATTTGGGCCA CCAGGCGACA C
|
Protein sequence | MKFSFAAVSA ALTIASVDAF VPQTHSHTNA RPGSNPDSVR TVSSALRAEE SHASSFWWGP VATAFAGVSL VAQIAVAAPE PTWTTGAADD LLPQTSSVLI ADRIDSMDFS MPSYSDAIKS ATLDTSASSS SKPAPPSFNP FEDSSKDDAA SAAAKAEEKA AAEAKKAEDK ARAELEAATK KAEKEARRQA ELEKQKAAAE RTKAAQEEKA AAPASSSVEL PSVPSVDLKL PDISIPDFKA PDITIPDFKA PDIKMPDLPK FSMPKMATDG YDFPDIKAPK VDIPNVDMPK IAMPALPSFG GGGGASSSDN PSSPLESQDV RDERARSAKA DFADADNTAR EIEAKALELR AVANDKKQAF KDAKDEACAT RPGGKILCLR NPMKAGF
|
| |