Gene Information |
Locus tag | PHATRDRAFT_38463 |
Symbol | |
ID | 7203260 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 176785 |
End bp | 178265 |
Gene Length | 1481 bp |
Protein Length | 391 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182483 |
Protein GI | 219124381 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.655546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGTGA GTGATCCTAG ACTTCGAATT GGAGGGAAGG TGACAGCAAA GGCTTGTCAT GTTGTGCATC TGAGCAAGTG CGCACAGAGA TATGGCGTCA ACAAGCACTC CAAGCGGCTT GTTGGAACGG TTCTAGACGT CACGACCACC CCTGTATCCG TTTCAACCGG GCATACCTCT ACTTTGATTA CAACAGTTTA TGATTTTGGA GAGAGTTTGT TCAAGGAAAA AACACTGAAC ATTCGGAGTG TAAAGGCATT TGTACTGCCA GAAGATGAAG GAATGTCCTT AATTGAGGAA ATAGCCGCAG AGGCAGCAGA AGCAGACATG GAAGCCCTAA ACTTGATGGA AGAAAGTGTC GAAGCCCTGG TAGCCGAAAT AGTTGAAACC CCGGCTGACA TAGAGCCCAA TACCTTGGTT GACACAGAGC CCAATAGCAC GGTAGCCGAA ATTGTCGAGA CCCCGGTCGA CAATACCTTG GCTGACACAG AGTCCGAAAA CCTGGTAGCC ACAGTGCACC AAACAGAGTG GTATGTGAAT GAAAAAAAAA CCCGGCTGGA TGTGAATGGC CATGTCTATA TTAGGCACTT CTATATCCGT ACTTCAGTTG GTGACCTTAT TGGTCAAGAC TCTGACAATG AGGTGAGATT TTCGCGCCTC GAATATTTTC TGCTCATGTT TCCGCCAACC CAGCTGACTA CTATGTGTTG GCTTACAAAT ACTATGCTTG CTCAACAAAA CAAGCATCCA ATCACAGCCG GAGAACTTCT TCGGTTCTTT GGAATCCTCA TACTCACCAC AAAGTTTGAG TTCAGTAGCC GGGCCCAACT ATGGTCCACA ACTGCACCCT CCCAACAAAG AGGGTGACGA GGAGGATGAG CATCTACCTC ATGGTGCAAA GATTATCAAA GAACTTGTTT GTCCTTGGTG GGGGAATGAT CGGATTGTGT GTGCTGATTC TTATTTTGCC TCTGTTGTGA CAGCTGTCGA GCTTAAGAGG ATTGGCTTGA GATTCATTGG GGTTGTGAAG TCAGCAACAA GAATATATCC AAGACACATG GTTAGCCTAC TGCCAATGTA CAGGAATAGG AAAGTCTGCT GGACAGGAAG AAAAGCAGAA GGATTTCTAC AGTGCCTTAG CCGAGGAGCT GGTTGACAAC CAGTACAATA GTGTTGGAAG TCGCAAAGTT GGGAGGGATG AGTTGGACAA GGATAGCCCA ACAATCTCCA GACCTGGAGA GCCACAATGT GGTTTCTCCG CACATCTAAC ACCCACCAAA AGAAAAAGAA AGAACAAAGA TGGTACTATT AAAAACCAAA GACAGCAGGG AAGGTGTTTG GTGTGTTCCA AGAAGACCAC ACATGTTTGC TCTGTATGCA AAGATGTTGA GACAATTGAA AGCAAAGAAC CATGGATTTG CTACACAACG GGAGGGCAGC TATGCTTTGC CCAACACCTG ACTGCCTTGC ATGGTAGTTA A
|
Protein sequence | MPVSDPRLRI GGKVTAKACH VVHLSKCAQR YGVNKHSKRL VGTVLDVTTT PVSVSTGHTS TLITTVYDFG ESLFKEKTLN IRSVKAFVLP EDEGMSLIEE IAAEAAEADM EALNLMEESV EALVAEIVET PADIEPNTLV DTEPNSTVAE IVETPVDNTL ADTESENLVA TVHQTEWYVN EKKTRLDVNG HVYIRHFYIR TSVGDLIGQD SDNEPENFFG SLESSYSPQS LSSVAGPNYG PQLHPPNKEG DEEDEHLPHG IGKSAGQEEK QKDFYSALAE ELVDNQYNSV GSRKVGRDEL DKDSPTISRP GEPQCGFSAH LTPTKRKRKN KDGTIKNQRQ QGRCLVCSKK TTHVCSVCKD VETIESKEPW ICYTTGGQLC FAQHLTALHG S
|
| |
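The Gene Length and GC content fields above can be rechecked from the coordinates and the gene sequence. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (consistent with 178265 - 176785 + 1 = 1481) and that the sequence is pasted in the space-separated 10-base groups used in this record. The function names are hypothetical helpers, not part of any database API.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(gene_length(176785, 178265))  # prints 1481
    # Paste the full "Gene sequence" field here to compare with the listed GC content.
    print(gc_percent("ATGCCAGTGA GTGATCCTAG"))
```

Note that the gene is spliced, so the genomic span (1481 bp) is longer than three times the protein length plus a stop codon; the difference corresponds to intron sequence.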