Gene Information |
Locus tag | PHATRDRAFT_47695 |
Symbol | |
ID | 7202703 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 554174 |
End bp | 555638 |
Gene Length | 1465 bp |
Protein Length | 418 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181932 |
Protein GI | 219123231 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
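The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, not part of the original record: the variable names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon.

```python
# Values copied from the Gene Information table above.
start_bp = 554174
end_bp = 555638
protein_length_aa = 418

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the protein length excludes the stop codon, so the coding
# sequence spans (residues + 1 stop codon) * 3 nucleotides.
coding_length_bp = (protein_length_aa + 1) * 3

# The remainder of the gene span is not translated. The record marks the
# gene as spliced, so intron sequence is expected to account for part of it.
untranslated_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, untranslated_bp)
```

Under these assumptions the computed span matches the reported gene length of 1465 bp, and the gene span is longer than the coding sequence implied by the 418 aa protein, which is consistent with the "Is gene spliced | Yes" entry.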
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00130842 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCAGCTGCA GCTGGAGCTT ATCTTGGTGC AGAATAAACA AAAAAGGACA GGCAAGCGTT TGTGAATTCT GGCATTGTCG AGATCGCTGA CCAAGAAATT TCCATTCACG CTGAAAAATG AACGTTTCGC TCATAGGCAA AATAAAAAGA GGTATTAAGG AAATTTGCTC GGCAACGAGC AATAATTCAT TTCCAGCCGT TGTCGACAAT CCATGTTGGG GTCTGCCGAG CCAATGTTCC TTGTCTGACT TGCTGCGATG CTTTGAAAAG GATAAAGTGG TCGTGTACGA AACTGATCAT TTTCTAGCCT TGAACAAGCC ACCGGATCTT CGAATGGATG GACACCACCC TTCGACGGTA CTCAAGCTCC TAACCTATTG GTATCCTCCT CCTGCCTTCC AAAATCTTCA GAGTAAGGGA CTTCTGGAAA AGGTTAGCGA GTTTGAAAAC TACAGGAATA TTGAAGGCAA TGAGCTTAGA CCCTGTCACC AATTGGATTA CGCAACGTCA GGCATCCTTC TCGTCGCTCG GAATCGGCAA GCAGCGGATC AAGCTCGAGT TTCATTTGAA GAGCGGTCCA CTCGAAAAAC TTATTTGGCA CTCGTCCACG GCCACCTATC GGTACCAAGC AATATTCCGG TCATGAGGAG AACGGACATT GATGAGCGAA TGGGTCGGCT GGAGGAAATA TACCGACAGA GTCGTCGCAA ACACCGGAAG GACACATATA GAGGATATCA GCCTGCCCAT GGCTTGTTTC AGCAGCTCCA GCAACAGCAC AGTAAAAGAG CAAAAAAAGA AAAAAAATCG ATTATCAGAT TCCGAGTGGA AAACTGTTTG GAATGAGCTA CAGTTTACTA AAACAGAAAC AGACCATATC CTTAATTTAA GTTGGAAGGA GGTGAAGGCG TCCGGAAAAA CTGAACCTTT TGATCGCGCA GCAGAAGTAT TCAACAAACT TCAATACAAA ACGTTGTTCC CCGAGGATAA AAGTGCCGAG TTGTCGCTCC CGACTTTCTT TCGCGATGAA AGTGAGGACC CAAACACACT CTTCATTTAT GCTTCTGTCG CACAGGTTCC TCACGATTTT GCCATGCGAA TAAATCCAAG CATGTCGAAT GCCTCTGCAT ATTTGAAGGT TGGGGATTCA TCCCTGGACT ACAAGCCGTC TCTCACACGA TGCGTGATTC TTAAGCATGC TGCCATTAGA GGGCAACCGG TTACAAAAGT TCGTCTTGAG CCTCGAACTG GGCGACGTCA TCAGCTACGA GTACATTCCG CATTGTTAGG GCACGCCATA GTTGGAGATC AAACCTACAA AGCCCCAGGC TCTCCCGACC TAACGGATCG CATGTGTTTG CATTCTCAAT GTCTCGAAAT TCCACTTTTT GAGGAGTTAA TCAAAGTCGA AGCACCAGAT CCCTTCCTTG TAAAGAACCG TGAGATATTG GTACAGCATC TATGA
|
Protein sequence | MNVSLIGKIK RGIKEICSAT SNNSFPAVVD NPCWGLPSQC SLSDLLRCFE KDKVVVYETD HFLALNKPPD LRMDGHHPST VLKLLTYWYP PPAFQNLQSK GLLEKVSEFE NYRNIEGNEL RPCHQLDYAT SGILLVARNR QAADQARVSF EERSTRKTYL ALVHGHLSVP SNIPVMRRTD IDERMGRLEE IYRQSRRKHR KDTYRGYQPA HGLFQQLQQQ HNHILNLSWK EVKASGKTEP FDRAAEVFNK LQYKTLFPED KSAELSLPTF FRDESEDPNT LFIYASVAQV PHDFAMRINP SMSNASAYLK VGDSSLDYKP SLTRCVILKH AAIRGQPVTK VRLEPRTGRR HQLRVHSALL GHAIVGDQTY KAPGSPDLTD RMCLHSQCLE IPLFEELIKV EAPDPFLVKN REILVQHL
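The sequences above are printed in space-separated blocks of ten characters. As a hedged sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical, not part of any database tool), the blocks can be joined and the GC content recomputed for comparison with the reported value:

```python
def clean_sequence(raw: str) -> str:
    """Join a sequence printed in whitespace-separated blocks and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Short demo on the first three blocks of the gene sequence above.
# To check the record itself, pass the full "Gene sequence" field instead.
example = clean_sequence("CTCAGCTGCA GCTGGAGCTT ATCTTGGTGC")
print(len(example), round(gc_percent(example), 1))
```

Applied to the complete gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_percent(...)` with the GC content field. How the database rounds its percentage is not stated in the record.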