Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40433 |
Symbol | |
ID | 7198159 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 381101 |
End bp | 383068 |
Gene Length | 1968 bp |
Protein Length | 585 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184358 |
Protein GI | 219128308 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.150375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAGTT GACTGAAGCG ACGCAGGGGC TGCGGTCCGA AGACTCACAG GCTTCGTTTC GTAATTTTTG TATTTGCAAA TCGGAAAAGC CGGGATTATC TTTGGGAATT GACAGAAGTC GCATTGGTCG CATTGATCTC GTTTTTGCAG TAGCCACGCT AGCCTTCCTC TTTTGCTAAG CTCTTACTTT TTCTTCGTGA CCTTCTATTC CAGTCTGCAG AAAATCCTGA GGTCGAAAAC TTGAAGATGG AAGCGACCGA GGCGGAAGTC AAGTCTCGCA AGGATCGTCC CACCGGCTCC ATTCACTTTG GAGATACGGA CTTTGACGTC GAAGATGAAG TCGCTGATGC TACGTGGTCC GAGGTTTGCC AAGCTTGCTG TGTGCATTCT GGACAAGAGT GGGGCATGAT CGCAGTCGGC ATTTTTTTGG TCGCTTTCTT CCTTTACTTT TTCCTCGTTG GTCTCGACAT GCTGGGTAAC GGTGCCAAAG GTATGTGCGG CTGCACTGCT GGAGAGCTGT TTGGGGATGA CACTAACCCT ATCGCCGGTC TCATGATTGG TATTATTGCC ACTGTTCTGC TCCAGTCCTC TTCCACAACA ACCTCCATTG TTGTGTCTTT GGTTGGATCG GCCGTTTCGG TCCGTCAAGG AATCTACATG ATTATGGGTG CCAACATTGG TACTTCGGTC ACCAACACCA TTGTCGCTAT GGGGCAGATG GGAGATGGCG ATCAACTCGA GCGCGCTTTT GCTGGAGCAA CTGTCCACGA CATGTTCAAC TTTTTGTCAG TTGCTGTCCT GCTTCCGGTT GAGGTTATCA CTGGATACCT CTACCGCCTC ACCAAGGCTA TGGTCAAGAA CGTCAACCTT GAGGACGGAG AGAGTTGGGA CGGACCCATT AAGAAAATGG TTGATCCTCT TTCCGATATG GTTATCATCT CGAACAGCAA GATCATCACT GCTGTGGCTA AGGGCACCGG CACCTGCGAC GAAGGCGGTG GCTTTTATCC TATGAACTGT ACCGATTCTT CGTATTTGGG ATGCGGTAAG AAATTCGGTC TTATCTCTTG CAACAAAGTC AGCGGCAAAT GCCCGGCTTT CTTTCAAGCT GATGCCAGTG CAAAGGATGA TAAGGTGTCT GGGGGCGTTG TTTTTTTCAT CTCGATTGTG ATTTTGTTCA CGTGTCTGGC CGGTCTCGTG ACAGTCCTTC AAAAGATGCT CCTCGGTATG TCTACCCGTG TTGTTTATAA GGCCACGGAT ATCAACGGAT ATCTGGCCAT TGCCATCGGT GCTGGGTTGA CTATGATTGT CCAGTCTTCC TCTATCACTA CGTCTGCGCT GACGCCCTTG GTCGGTATGG GTGCCCTTCG TCTCGAACAA ATGTTTCCTT TGACGCTCGG AGCGAATATT GGAACAACCT TGACTGCCAT CATGTCTGCC CTTGTCTCTG CCAGTCAGGA TTCTCTACAA GTTGCACTCG CCCATTTGTT CTTCAACTTG ACCGGCATTC TCATCTGGTA TCCAGTCCCT ATTATGCGTC AAGTTCCGTT GAGTGCCGCC CGACGTCTCG GAAGGTTGAC CCGCATCTGG CGTGGGTTCC CGCTTGTCTA CATCGCGGTC ATGTTTTTGC TCATCCCTCT GCTTTTGTTG GGTCTTTCGT CGCTTTTCGA TGACGGCAGC AAGGGTCTCA CTGTGCTTGG TTCGTTCCTC ACAATCCTGC TAGCCCTGGT GCTGCTCTAT TCGGTCTACT GGTTCCGTTA CAAGGACGGT AGCCAGAAGT GCTCGGACTG CATGGCGGAA CGTGAAAAGA AACGTTTGGT CATCAAGGAG CTTCCTGAAG ATATGATCTA CCTCAAGGAA CACATGAAGC GCCTTATTGA GCACACCGGT CTTCCCGAGG ACGAAGAAGC TGGTGAAGCA AAGGATCTTT CACCGGATAC TTCGGACTCA GATGAGGTTG CAGCTTAA
|
Protein sequence | MSAENPEVEN LKMEATEAEV KSRKDRPTGS IHFGDTDFDV EDEVADATWS EVCQACCVHS GQEWGMIAVG IFLVAFFLYF FLVGLDMLGN GAKGMCGCTA GELFGDDTNP IAGLMIGIIA TVLLQSSSTT TSIVVSLVGS AVSVRQGIYM IMGANIGTSV TNTIVAMGQM GDGDQLERAF AGATVHDMFN FLSVAVLLPV EVITGYLYRL TKAMVKNVNL EDGESWDGPI KKMVDPLSDM VIISNSKIIT AVAKGTGTCD EGGGFYPMNC TDSSYLGCGK KFGLISCNKV SGKCPAFFQA DASAKDDKVS GGVVFFISIV ILFTCLAGLV TVLQKMLLGM STRVVYKATD INGYLAIAIG AGLTMIVQSS SITTSALTPL VGMGALRLEQ MFPLTLGANI GTTLTAIMSA LVSASQDSLQ VALAHLFFNL TGILIWYPVP IMRQVPLSAA RRLGRLTRIW RGFPLVYIAV MFLLIPLLLL GLSSLFDDGS KGLTVLGSFL TILLALVLLY SVYWFRYKDG SQKCSDCMAE REKKRLVIKE LPEDMIYLKE HMKRLIEHTG LPEDEEAGEA KDLSPDTSDS DEVAA
|
| |