Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49420 |
Symbol | |
ID | 7195795 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 208253 |
End bp | 210196 |
Gene Length | 1944 bp |
Protein Length | 621 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184083 |
Protein GI | 219127731 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCTC GATCGCAAGG CACCCCTAGT CGGCTATCCC TTGGAATGTT AAAACTGGCA AGTCTCGGAG CCAATCAGTG GTCGGGAACC GCTGGTGGCG ATTCTCTGGT TCGCACGAGT GTCTCGCAAG CTAGGCTGGA TGAACTGCTG GTACTCGAGC TTCGAAAAGA CGGGTGAGTC ATCAACACTA CCTATACGAT AGTCCTGACA CACGTTTTCT TACTGCATTG CGTTCGGCGT TGTTGTGGCA GTGGGCGAGG GTACCTTACC TTCAGTATAC GAGGTCTCTA CAAATTCGTG CTTACCTCCA TCACCAATCC CCACCATTCA TTACAGGAGA ATATCAGGCC CACAGTTTCT CCGCCCGTCA CGGTGACGGA ACCTTCGATA CATGACGCAG AGGAAGAAAA TTCGGAAAAT GCTGGCAGCG GCTCAATCGT TGCAGATGAG GACGAGCAGG TTACGTTCGC AGTACCAACC GAGACCATTC CGCCGGACGC TGCAGAGGAA GCTCAGGATT CCAGACGTGC CGAAATTGTG GAATCAAAGC CAGTCGTTAT GTTAAAAGAT ATGAAAAGAT GCAACAGGGC ATCCATATCA GGAATGCAGC CAAATAAAGT CACCGAAGAT GGAGGAACGA CGGATGAACA AGTGGCGGGC GAAGGAGAGA TGACGTACAG AGATCGGTTG GGTGGGTACC TACACCCGCG CGATATGAGG AAGCTGACTG CTCCTTTCTC AGCATCTGTC GAACCCGAGC TTATGGTGCG GCGTCACGTA GTGCTTCTAA ACTTTGATCC GTTTCGGGCA ATTATTTTGC GCGACCGGCT TCTTATTCTC GTTCCCGACG GAGCGGATTC AATCCTGGTT CAGTTGGAAC AGCGGGTACG CGGGGGAACG GCGGAATTAG AGAATTCCGT TTTTGGAGCA AGATCGGAGC ATGTGCATAT CTCTGATCCT AAGGAGACAC GACCAACTAG TGGATTCGTC AACATTTTTG ACAAGCTCGT TCGCAAGCCG ACTGGTTCCG ATGACCACGG CGTCAGCTCC ACCAGCAAAA GTTCCGACAG ACAAAATACG TTGCAAACTG TGGGACGACG GCTAAATTTG TCTTCTATCA AGAAACCCGT GGCCAATAGT CAACAGAGTT ATATGGCAAA AATTTCAAAG AATAGCGATG ATTGGAAGCT GCCTACTGTT TACGACTTTG GCGATGAGTG GAACGAAATC CACGGACGCG CTTGGATCGA TATTCCATTT GAACTTCAAT GTATTGATGC CTGCCTCTAC TCAGTATGCG AGATCCTTAC GAACGACACA ACCTCCATTC AAGAGGTGGC GAAGGACTAT ATCGAGGACA TATTGTCTGG TCGTTTTGGT TTAATGGAGG ATCCGCTCAT GGCTATTCGA CATATCAAAG ATGCAATTCG GGAAATGCGT TCCCGAGTAA ATAGTTTTGT CAAGGCACTC GATAGAATCC TTGACAATGA CGAGAATATG GCTTTAATGA ACCTTTCTCG TCTCCTGACG CATCCTGACC GTTTTCTTCA ATCTACCTCT TCTGCCATTC TTGAAGAAGA GGCTGATGAG GTAGAGCTGG TACTTGAAGA AAAGCAATCG AGCGGTTTCA CACTGCAAAA TGCGCTGCGG TTGGTAGATG GCCAGGTTGA TACGGCTTCC GATCTACTTG ACCAAAAGCA AGATGCCATT CGGAATAGGC TTTTGTTCGC AAACATGATC ATTAGTGTGT TCTCGCTCTG TGTTGCGTCG GCATCCTTTG TAGGGTCTAT CTTTGGAATG AACGTGCCGA TCTTTTTGGA AGAAAATTCA AACGCCTTTC GACAGATCAC AATAAGTACG ATTACAGGTG CCCTGTTTCT CGGTGTTTCG ATAATGTCTG CCCTCATTTG GACTGGAACA ATTCCACGAG CTCGATTGGG TTAA
|
Protein sequence | MKSRSQGTPS RLSLGMLKLA SLGANQWSGT AGGDSLVRTS VSQARLDELL VLELRKDGGR GYLTFSIRGL YKFVLTSITN PHHSLQENIR PTVSPPVTVT EPSIHDAEEE NSENAGSGSI VADEDEQVTF AVPTETIPPD AAEEAQDSRR AEIVESKPVV MLKDMKRCNR ASISGMQPNK VTEDGGTTDE QVAGEGEMTY RDRLGGYLHP RDMRKLTAPF SASVEPELMV RRHVVLLNFD PFRAIILRDR LLILVPDGAD SILVQLEQRV RGGTAELENS VFGARSEHVH ISDPKETRPT SGFVNIFDKL VRKPTGSDDH GVSSTSKSSD RQNTLQTVGR RLNLSSIKKP VANSQQSYMA KISKNSDDWK LPTVYDFGDE WNEIHGRAWI DIPFELQCID ACLYSVCEIL TNDTTSIQEV AKDYIEDILS GRFGLMEDPL MAIRHIKDAI REMRSRVNSF VKALDRILDN DENMALMNLS RLLTHPDRFL QSTSSAILEE EADEVELVLE EKQSSGFTLQ NALRLVDGQV DTASDLLDQK QDAIRNRLLF ANMIISVFSL CVASASFVGS IFGMNVPIFL EENSNAFRQI TISTITGALF LGVSIMSALI WTGTIPRARL G
|
| |