Gene Information |
Locus tag | PHATRDRAFT_49253 |
Symbol | |
ID | 7195548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 380396 |
End bp | 382396 |
Gene Length | 2001 bp |
Protein Length | 566 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183862 |
Protein GI | 219127271 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
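The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the convention under which the record's own numbers agree) and assuming a single stop codon after the last residue; the names `feature_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein, assuming one trailing stop codon."""
    return (protein_aa + 1) * 3


# Values taken from the Gene Information fields.
gene_length = feature_length(380396, 382396)
cds_length = coding_length(566)

print(gene_length)               # 2001
print(cds_length)                # 1701
print(gene_length - cds_length)  # 300 bases outside the coding region
```

Under these assumptions the reported gene span is longer than the coding region alone, which is consistent with the listed gene sequence carrying extra bases before the start codon and after the stop codon.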
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.836677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATATTTGA GTTGAATCAT CAACAAAAGA CTAACTGTAA AAACATCTAC ATTACCTTCC GAACAAATCA TGAAGATAGT GATTGTCGGA GGAGTCGCTG CCGGGGCTAG TGCCGCTGCC CGTGCTCGCC GTTTGGACGA ACACGCCGAG ATTCTTCTCT TGCAGTCTGG ACCGGACGTA TCCTTCGCAT CATGCGGCAT GCCTTACTTG ATCGGAAACG AAATAACCGA CCGCGCCTCT ATGGCCGTTC AAACACCGCA ATCCTTGAAC GCTCGTCTCA ATATCATTGT TCGAGTCAAT ACCAAGGTCA ATGAAATCAA TACAACCGAC CAGACTGTTG TCGCTCGGAA TGAAACCACC GGGGATATCT ACACCGAACC CTACGATGAA CTTGTTCTGG CAGTGGGTGC GGCTCCCTTC AAGCCTCCGA TCCCTGGTAT TGACCGTCCT GGATTGTTCA CGCTCCGCAA TCTTCAGGAA ATGGATGCCA TCGTTCAGTG GCTCAATGTC AAAACTGAAA CCAAGAAGCC CGCTGACATG CACTGTGTGG TTGCTGGGGC GGGATTTATC GGACTCGAAA TGGTAGAACA GCTACATCAT CGCGGCATGA ACGTGACTCT GGTCGAAATG ATGCCACAGA TCCTGGCCCC TATGGATCAG GAAATGGCAG CTATGCTACA CAAAGATCTC GAAGACCACG ATGTTAACGT GATTGTTGGA GACGCCATCA AAGAATTTGC CGCCTACGAG AAGGATGCAG ATAGCTCGGT GCTGACTTTA CAATCCGGAC GTGTCCTTCC TCCTGCCCAG TTGACGATTC TTGGCCTTGG TGTCCGCCCC GACACGGGCG TAGTCAAGGC AGCGGGCATC GAGCTTTCGC CTAGAGGCCA CATTCTCGTT GATGAACACT TGCACACATC CGCAGCAAAT GTTTGGGCTG CTGGTGACGC CGTTGAAATC ATTAACCCAA TTTTGCCGGA TGAAAAGTGG GCCGTCCCCT TAGCAGGTCC AGCGAATCGC CAAGGTCGCA TGATTGCCGA CAACATATAC GGCAAAAAAC GCTCTTTTCG GGGAACGTAC GCGGTCAGTG TAGTACGGTC CTTTGATCTA TACGCAGCCT GCGTCGGTCT TAATGAAAAG TTTCTCAAAG CCAAGAATGT TCCCTATAAT GTCGTGCATG TACATCCCAA CAGCCATGCC GGATATTATC CTGGCGCCGA AAAGATCCAT CTCAAACTGG TCTTTGACAA GGAGTCTGGA AAAATTTACG GCGCTCAGGC CGTAGGCAAG GACGGCGTTG AAAAACGGAT TGACGTGATT GGGACTGCCA TGCAGGGTAA AATGACGGTG TCGGACTTGG CCGAGTTGGA GCTCTGCTAT GCCCCTCCGG TTGGTTCGGC AAAAGATCCA GTCAACTTTG CCGGCATGGC GGCTCAAAAC ATTATGGACG GGCTCATTTC CAATGTGGAA TGGTACGAAA TGGATGATCT CGTCAAAAAT CCTGACGTAT TTGTTTTGGA TGTTCGTGGA GGAGCCGAAA TCGAAAAGAC TGGTAAGCTT GCAGAAAAGG CCGTCAATAT CCCCGTCGAT GATTTGCGTG CCCGACTCTC CGAGGTACCG AAGGACAAGC GTATTGTTGT GTCGTGTGCT TCAGGGCAAC GGTCGTATTA TGCTTGCCGT ATTTTGAAGC AAAACGGGTA TGCCAACGTG GACAATTTGG ACGGTGCCTA CTTAACCTTT CACGCCGCCC ATCCAGAACC GGCTGCGTAA GGTTGCTTTC TCTTACGCCA ACCCATAGGA 
TAGAATAAGT CCAAAATAAG GAAAGCCAAC AGTCTATACC TATAGCGGAC ACCATAAGTA AAGACTCCAA TCCGAACTCT GTGCGTTTCT GCAGAGAAGT GCATGCAATC TGTATTGCCG GTCCAAAGAA TCTCGGAATT GTTGCCATAA ACAGCTCAAT ACAGGTAGTT TGTATGTGAA AATCGAGAGA CAATGGCCAG C
|
Protein sequence | MKIVIVGGVA AGASAAARAR RLDEHAEILL LQSGPDVSFA SCGMPYLIGN EITDRASMAV QTPQSLNARL NIIVRVNTKV NEINTTDQTV VARNETTGDI YTEPYDELVL AVGAAPFKPP IPGIDRPGLF TLRNLQEMDA IVQWLNVKTE TKKPADMHCV VAGAGFIGLE MVEQLHHRGM NVTLVEMMPQ ILAPMDQEMA AMLHKDLEDH DVNVIVGDAI KEFAAYEKDA DSSVLTLQSG RVLPPAQLTI LGLGVRPDTG VVKAAGIELS PRGHILVDEH LHTSAANVWA AGDAVEIINP ILPDEKWAVP LAGPANRQGR MIADNIYGKK RSFRGTYAVS VVRSFDLYAA CVGLNEKFLK AKNVPYNVVH VHPNSHAGYY PGAEKIHLKL VFDKESGKIY GAQAVGKDGV EKRIDVIGTA MQGKMTVSDL AELELCYAPP VGSAKDPVNF AGMAAQNIMD GLISNVEWYE MDDLVKNPDV FVLDVRGGAE IEKTGKLAEK AVNIPVDDLR ARLSEVPKDK RIVVSCASGQ RSYYACRILK QNGYANVDNL DGAYLTFHAA HPEPAA
|
| |
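The sequences above are printed in space-separated groups of ten characters, so they need to be joined before computing lengths or base composition. The following is a minimal sketch of that cleanup; `ungroup` and `gc_fraction` are hypothetical helper names, and only the first three groups of each sequence are used for illustration.

```python
def ungroup(seq_field: str) -> str:
    """Remove the spaces and line breaks that split a sequence into groups."""
    return "".join(seq_field.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three groups of the protein and gene sequences from the record.
protein_prefix = ungroup("MKIVIVGGVA AGASAAARAR RLDEHAEILL")
gene_prefix = ungroup("CAATATTTGA GTTGAATCAT CAACAAAAGA")

print(len(protein_prefix))       # 30
print(gc_fraction(gene_prefix))  # GC fraction of the first 30 bases only
```

Applying `ungroup` to the full Protein sequence field and taking its length should reproduce the reported protein length, and `gc_fraction` on the full Gene sequence field can be compared against the reported GC content.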