Gene Information |
Locus tag | PHATRDRAFT_50440 |
Symbol | |
ID | 7199253 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 67018 |
End bp | 70241 |
Gene Length | 3224 bp |
Protein Length | 965 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185423 |
Protein GI | 219130544 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGCCAAAA AAAGCTTGCA CCGAGACCAC AATCGATCGG ATACTTTCTC TCTCCGACAC CTAGAGTTAC TGATTCCAGC CCAGGTTCTT CCTACTTCTA TTTTCGTTCA CCCATCAACT AACTGCTACT GCTTCAACCC ATGACGGAAA CGAGTCTGCC CAGCCAAACG GCGCTTGCGC CCAGCACCAC CAAGCCCGCT CTCGTACTAC CCAAACAAGG CGTCGCCAAG GTCAAGTCCG TCACTTCGGG GGATACCGTG GTACTTTTGG GTAAGCCTCC GCAACCCAAT CTGCCCTGTC CCGAAGTACT CTTTACCCTC GAAGGCCTTT CGGCTCCGGT ACGTAGGCCG TCTCGATATC GACAAACATC ATTATCACCC ACAGTTGGTA TTCACATACA GATACATACC TACACCGATT GAACGCGTGT TGGAATACTC CCGTATACTC GCCGTTGCAC TCGTGCTCTG TGTTTCTTCT CCCACTCACT TTTTCGTTCG TTCGTTCGCT CACTTGTTCC GTAGAGAATG GCGAGCAAGG TCAATCCTAC CGACGAGCCG GGCGCCTTTC CCGCCCGCGA ATGGCTCCGT CAACAGCTGG TGGGCAAAGT GGTCCGCTTC GAGACTCGCA AGCAGCCGAA CAGTGCCGGT GATCGCGTCT ACGGCTGGAT CTTTTTGCCC GCCACCGCTC CCACGGATCC TCCCGTACAC GTAGCCGTGG AATGCGTGCG TGCGGGACAC GCGACGCCCA AATCGCTCAA GTACGCCACC GGCAACGACA CGGAGGCTCC GGCCGTCGTA CCCACCGCGC CGTCTCCCGA CGATGCACCG GAAGTCGCGG CCGCCAAGGA ATACGAGCTG CAGCTCGGGA AAGCCTACGC GGAAGCCAAG TCGGCACGGG TGGGTCTGCA CGCCACGGAT CCCCTACCCC TCGTACGGAC CCTCCGCGTC GCCAACGAAG ACTTCGCGAC GCTCCAGTTC GTGGAAGCCG TGCAAAAGCA CTGTACCCAC AAACGGATTC GTTGCGTCAT TGAATACGTC TTTGACGGAT CCCGGCTGCG TCTGCACGTG ACGGATGCAC AGTTGCCCGA GTTCCAGTAC ACTTCCTTTA CCCTCTTGTT GGCGGGAGTC ACGTGTCCCC GGCTCGGGAG CGCCAAGTCC GATCCACCCA CTCCGAACGA ACCCTTCGCC GTGCAAGCCC GGGAATTCAC GCAGACCAGA CTGCTCCAAC GCGAACTCGA CGTGTCTCTC GTCGGCACCG ATAAGGTCGG ATCCTCCGCC GTCGGAGTCG TCCATCATCC TGTCGGCAAT ATCGCCGTCG AATTGCTCAA GAACGGCTTG GCCCGCATGG CGGACTGGAG TGTCCGCCTC CTCGCGGTCG GCGATGTTCC GGCGCTCCGC GTCGCGGAAA ACACGGCCAA ACGCACCGCC TTGAACGTCT GGCGCAATTA CGCTCCACCC ACGCTGCAGA CGGCGTCGCA AGTCTCCGGA ACCGTGGTCG AAGTCGTGTC CGGCGACACC GTCCTCATAC TCCCCGACGG CAAGGCCTAC GACAGTGAAG CCGTCTTGTA CAAGGTCTCG TTGGCGTCGA TGCGCGCCCC GCGGGTCGGG AACGAACGCG CTGGACGGCC CGACGAACCC TACGCCGTCG AGTGCAAGGA GCGTTTGCGC GTCTTGACTG TCGGTCGGGC CGTGAAAGCC CAAGTCCACT ACGAACGCGA CATTCCACTG CAACCCGGTG TCAACGAAAC GCGGCCCTTT GCGACCCTCT CCACACCCAA GTACGAGGAC GTGGCCGAGG 
TGCTTATTCA GGAGGGACTG GCTGTGACAC AGCGTCACCG GGATGACGAC GAAACCTCGG CACGGTATGA TGAATTGCGG GCGGCCGAGG CCACTGCCAA GGCGGCAAAG AAGAATACGC ACTCGGAAAA GGAGTACAAG AGTGCCACCA TCAATGATTT GACCGATCCA CGAAAGGCCA AATCGTATTC CGGTTCCCTC ATGCGCTCGG GCCACACCAA AGCCATCGTG GACTACGTCT TCAACGGCGC ATTGTTCAAG CTGTACATTC CTTCGGAAAA TTGTTACATA CGCTTCGCGC CAAACTCGAT ACGGTGTCCG CAACCATCGC CGAGTCCGGG TGGTAAGGTG AACAAGGCAG CCGAGCCTTT CGGCGACGAG TCGAAGCGCC ACGCGCGACT TCACGTCCTA CAGCGTCACG TAGAAATTGT GTGCAACGGT GTCACCAACA GTGGAATTAT CACGGGGGAC ATGATGGTCG GACAAGGTGG ACAACGTCGT GATTACGCCA TCGAGTTGGT TGGTGCCGGC TTGGCCACGG TCGACCAACG CAAGATTGAC TATGGAGAGG CACCACGATC GCTCGTTGAC GCGCAATCAG CAGCACAGGA AAGTAAGGTC GGTCTATGGT CGATTGTCCA AGAGCAACCC GAAATTAAGG TTGCCAAAAC AGCAGTCAAA GCCAAGGAAA CGGTCGCCAC GATTCGGTTA AGCGAGATTC GCAGCGGGAA TCACTTCTTT TATCACGTGG TGGATGATGA AACAGCCAAG GTTGTGGAGG AATCGATGAA GGTTTTCACC AAAAGCCACG GCACGGGCGG CGCTCCGTGT GACGCTAAAA TTGGCAAAGT GGTTGCCGCC TTGTTTAACG ACGGCAGCGG AAAGGCATGG TACCGTGCCA AAGTTATCGA ACGCAAAGGG CCTGGCAAGA TGGCGGTATT GTTTTTGGAT CACGGAAATG TGGCGACGGT CCCGGTGGCA ACGCATCTGC GCCCTCTCGA TATGAACCTT GGGACAGATC GTATTCCACC GGTGGCCAAG GAGGCAGTCC TAGCTCTCAC CAACACGCGA CCATTGGACA GCGATGAGGG TATGGATGCG GCTCGACTGT TGCAAAGCAA ATGCTGGGGT CGCAACTTGA CGGCCCGGAT TTTCGCTCCG GACGAGTCAG GCAAAGCGGC TCTATCCATC GCGACGGAAG CTGGTTCCGA CGAAGAAACT ATCAACGCAA GTCTGGTGGT GGAGGGGCTA GCTCGCGTGG CCAAGCCAGA AACTGTGACG AGCATCTCGA GTCGTATGAT CGATCCTTCG TCATTGGTCG AGTTGGCGGC GGCACTCAAC GTGGCCCAGG AAGTGGCTCG CAAGTCTCGA GTTGGTATGT GGCGGTATGG TGATATTGGC GACGAGGATG ACGACGATAT GTAA
|
Protein sequence | MTETSLPSQT ALAPSTTKPA LVLPKQGVAK VKSVTSGDTV VLLGKPPQPN LPCPEVLFTL EGLSAPRMAS KVNPTDEPGA FPAREWLRQQ LVGKVVRFET RKQPNSAGDR VYGWIFLPAT APTDPPVHVA VECVRAGHAT PKSLKYATGN DTEAPAVVPT APSPDDAPEV AAAKEYELQL GKAYAEAKSA RVGLHATDPL PLVRTLRVAN EDFATLQFVE AVQKHCTHKR IRCVIEYVFD GSRLRLHVTD AQLPEFQYTS FTLLLAGVTC PRLGSAKSDP PTPNEPFAVQ AREFTQTRLL QRELDVSLVG TDKVGSSAVG VVHHPVGNIA VELLKNGLAR MADWSVRLLA VGDVPALRVA ENTAKRTALN VWRNYAPPTL QTASQVSGTV VEVVSGDTVL ILPDGKAYDS EAVLYKVSLA SMRAPRVGNE RAGRPDEPYA VECKERLRVL TVGRAVKAQV HYERDIPLQP GVNETRPFAT LSTPKYEDVA EVLIQEGLAV TQRHRDDDET SARYDELRAA EATAKAAKKN THSEKEYKSA TINDLTDPRK AKSYSGSLMR SGHTKAIVDY VFNGALFKLY IPSENCYIRF APNSIRCPQP SPSPGGKVNK AAEPFGDESK RHARLHVLQR HVEIVCNGVT NSGIITGDMM VGQGGQRRDY AIELVGAGLA TVDQRKIDYG EAPRSLVDAQ SAAQESKVGL WSIVQEQPEI KVAKTAVKAK ETVATIRLSE IRSGNHFFYH VVDDETAKVV EESMKVFTKS HGTGGAPCDA KIGKVVAALF NDGSGKAWYR AKVIERKGPG KMAVLFLDHG NVATVPVATH LRPLDMNLGT DRIPPVAKEA VLALTNTRPL DSDEGMDAAR LLQSKCWGRN LTARIFAPDE SGKAALSIAT EAGSDEETIN ASLVVEGLAR VAKPETVTSI SSRMIDPSSL VELAAALNVA QEVARKSRVG MWRYGDIGDE DDDDM
|
| |
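The record's coordinate and length fields can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed Gene Length (70241 - 67018 + 1 = 3224). The helper names `span_length`, `clean_sequence`, and `gc_percent` are hypothetical and not part of any database API. The sequence inputs below are only the first two 10-character groups of the record's sequences, used as short illustrations.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks that separate the 10-character groups."""
    return "".join(grouped.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Coordinates copied from the Gene Information table above.
print(span_length(67018, 70241))  # 3224

# First two groups of the gene sequence, as an illustration only.
fragment = clean_sequence("CACGCCAAAA AAAGCTTGCA")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 45.0
```

Applying `clean_sequence` to the full Gene sequence and Protein sequence fields and taking `len` of each result should reproduce the Gene Length and Protein Length values. Because the gene is marked as spliced, the gene length is not expected to equal three times the protein length plus a stop codon.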