Gene Information |
Locus tag | PHATRDRAFT_54505 |
Symbol | |
ID | 7201038 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 815477 |
End bp | 818897 |
Gene Length | 3421 bp |
Protein Length | 1056 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180126 |
Protein GI | 219118718 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGACGGAATT TGCTGAGTCG CAGAGGATTC GCCACAAGCA ACAGCGCTTG CTGCTGCTTC GTCATGCATC CCGCTGTCAA CATGAAGCGG GGAAATGCCC AGCTACCCCT CACTGCGCTA GTATGAAGCG ACTTTGGAGA CATATTGCAA ATTGTAAGGA TCAGGACTGC TCTGTTCAAC ATTGTCTTAG TAGTCGCGGC GTTCTCAGCC ATTATCGACG GTGTAAAGAT GCGCTCTGTC CTGCATGTGG GCCTGTCCGA GAAACTATAC GGAAAAGTCA TGAGATGGAA AGTCAAAGCA ATCCACAAGG GGTACCGTCC GACAATCGGT TTATGGGTCG AGATGATTCG TTCGGTCGGT CAAGTTCCGT TACATCGCCA ACCGAACAGG AACCGAAGCG TATGAGAACA GAACATCGCC CTAGCGCGGC GTCTATAAAA TCAGCGCGCT CTACGCCTGT GAGCGCGCCT CCTTTGAAGC AAGAACCCCC TCGAAGCATA GGCAAAGGTG AGAAAGTAGC TCCATCTGCT GAAAAAGATT CGAAAGGAAG TGTCGACCGA TCACTCCTCG AGAGTTTCTC GGTGAAGGAG CTCGAAACTC ATTTGCGATC GCTGGAACGA GAGACCCAAC TTCCTCCGGC GAAGCTCAAG TCTAAATGTC TGGATGTATT AAAGGGTTTA ATGGCTCACC AACACGGTTG GGTTTTCAAT GGTCCAGTCG ATCCAGTTGA GCTCGGTCTT GTTGATTATT TTGAAATTAT CAAGAAGCCC ATGGACCTCG GCACCATTCA AAAGCGTTTG GAAAGTAGTG CATACCACTC CATCGATGAC TTTAAAACGG ATATCTTCTT AACTTTTGAG AATGCAATGG TGTATAATGA GGATGGTTCC GTTGTCTACG ACATGGCGAA GCAGCTGAAG GTTAAAGCCG AATCTGACAT GAAGAGACTT GTGGCACAAC TGGAAACAGA AGACCTTGAA AGACGCCAGA ATGAACGCGC GTGCACCTTG TGTGGTTCAG AGAAACTGTT GTTTGAACCT CCTGTTTATT TTTGTAACGG AATTAATTGT CAATCGCAGC GGATCCGACG AAACAGTCAC TTCTATATCG GAGGAAACAA CCAATACTTT TGGTGTAGCC CTTGCTTTAA TGAACTTGAT GATAAAATTC CGATTGAGCT TGCCGACTTG ACAGTCATGA AAAACAATCT GAAGAAGAAA AAGAATGACG AGATTCACGA GGAGAGCTGG GTACAGTGTG ACACTTGCGA ACGGTGGGTT CACCAGATAT GTGGACTTTT TAACACCCGT CAGAATAAAG AGCACCACAG CGAGTACTGT TGTCCTAAAT GTTTGCTTGA AAAACGCAAA ACTGTTTCAA TAACTCCAGC GCCGAAGCCA TTGCTGGCTG CGGACTTGCC GCGGACTACT TTATCGGAGT GGCTAGAACG CAGTGTCACT AAGAAAGTGG AAAAAAGGAA GAGAGAACTG GCCGAAGAGC GTTCGCAGAA TGAGGTACGT GTCTCTACAT TTTGTTCATC GAGCAAGTCG TTTATGTATC GATTTAACTA AGGAAGTCTT TTCTCTTTCA GGGGATATCT CTTGAAGAAG CTTTGCGACA GGTAGAAAGT GGCGGCCCAA TAATAATTCG TCAAGTTACC GCGATGGATA GAAAGCTTGA GGTTCGCGAG CTGATGAAAA AGCGATATGC ACACAAGAAT TATCCTGACG AATTTCCCTT TCGGTGCAAA TCGATTGTCG TTTTTCAGCA TCTTGACGGA GTTGATGTCA TTCTGTTTGC 
GTTGTATCTC TACGAACACG GTGAAGACAA TCCTCCGCCC AACCAACGAA CCGTGTACAT CTCATATCTG GACAGTGTTC ACTTTATGAG GCCTCGCAAA CTCCGGACCT TTGTGTACCA TGAGATTCTG ATTGCCTATT TGGACTACGC TAGGCGACGG GGATTTGCAA CTGCTCATAT TTGGGCATGC CCACCTTTGA AGGGTGACGA TTACATTTTC TACGCTAAAC CAGAAGACCA GAAGACTCCG AGAGATTCAC GACTGCGCCT TTGGTACATT GACATGCTCG TAGAATGTCA AAAAAGGAGT ATCGTCGGCA AAGTAACGAA TATGTACGAT ATTTATTTCG CAGACCCGAA TTTGGACGCC ACTGCTGTTC CCTATTTGGA GGGCGACTAT TTTCCTGGTG AAGCGGAGAA TATTATAAAA ATGCTCGAAG AAGGTGGAGG CAAGAAACTT GGGTCAGTGG GGAAAAAGAA GAAAAGCAAA TCGTCGAAAG CGCAGAAGAA TAAGGGAGGA AATACGGGTA CTAGATCCAC TGGAGTCGAC GAAGAAGCGC TTATTGCGAG TGGTATTCTG GATGGAACCA AGAGTTTAAA GGACCTTGAT CGTGATCAGG TCATGGTGAA GCTGGGTGAA ACGATTCAGC CTATGAAGGA AAGTTTTATA GTAGCGTTCT TAAATTGGAA AGATGCTCGC GAAGAAGATA TGATAGTCCC AGAAGAAATC GAAATGGCTA GGATTGAATA CGCAGCGAAA GGTGATCCAG AGCTTGTTGG AAGCAAACGT GATGCTGCTG GAAACATGAG AGACGCTACG TCGAAGACGG GCGCGAATGG AGAGCCTGTA AAGGTTATTG ATGACGACGC TGAAGATCTA GATTGCGAGT TTTTGAACAA TCGCCAAGCA TTCTTGAATC TTTGTCGAGG AAACCATTAT CAATTTGACG AGCTCCGGCG AGCAAAGCAT ACTTCATTGA TGCTCCTTTG GCATCTACAT AACAGAGATG CACCAAAATT TGTGCAGCAG TGCGTTTCTT GCAGTCGCGA AATCCTCAGT GGCAAACGTT TTCACTGCGA CACGTGCCCT GACTATGATC TCTGTCAAGA TTGCTACAAA GACCCTAAGG CAAACAGAGG TAACTGTACG CACGCTCTTA AACCACTCGC CGTTGAAGCT GATTCCGGAC AGGATCGCAG TGGGCTATCA GAGCAAGAAC GCATGCAACG CCAGCGAAAC CTGTTGTTAC ACATTCAACT TATCGAACAC GCTTCAAGGT GTTCCTCTCA GACATGTTCT TCATTAAATT GCGCAAAAAT GAAAAAATAT CTGCAGCATG CTCGTGTCTG CAAGGTTAAA GTATTAGGAG GGTGCAAGAT TTGCAAAAAG ATCTGGACCT TACTCCGAAT TCATGCGCAG AAATGTAAGG ATACAAATTG CCCCATTCCA CAATGCAATG CGATTCGTGA GAAGATGAGG CAACTGCAAA AGCAGCAGCA GGCTATGGAC GACCGGCGCC GTCTGGAAAT GAATCGTCAC ATGCGTTTCT CCACCGCAGG AGGCTCTTGA GAAACCGAAA ATATTTTTCG TAATATAATA GAGCTTTGTT TTCATATTTT A
|
Protein sequence | MKRLWRHIAN CKDQDCSVQH CLSSRGVLSH YRRCKDALCP ACGPVRETIR KSHEMESQSN PQGVPSDNRF MGRDDSFGRS SSVTSPTEQE PKRMRTEHRP SAASIKSARS TPVSAPPLKQ EPPRSIGKGE KVAPSAEKDS KGSVDRSLLE SFSVKELETH LRSLERETQL PPAKLKSKCL DVLKGLMAHQ HGWVFNGPVD PVELGLVDYF EIIKKPMDLG TIQKRLESSA YHSIDDFKTD IFLTFENAMV YNEDGSVVYD MAKQLKVKAE SDMKRLVAQL ETEDLERRQN ERACTLCGSE KLLFEPPVYF CNGINCQSQR IRRNSHFYIG GNNQYFWCSP CFNELDDKIP IELADLTVMK NNLKKKKNDE IHEESWVQCD TCERWVHQIC GLFNTRQNKE HHSEYCCPKC LLEKRKTVSI TPAPKPLLAA DLPRTTLSEW LERSVTKKVE KRKRELAEER SQNEGISLEE ALRQVESGGP IIIRQVTAMD RKLEVRELMK KRYAHKNYPD EFPFRCKSIV VFQHLDGVDV ILFALYLYEH GEDNPPPNQR TVYISYLDSV HFMRPRKLRT FVYHEILIAY LDYARRRGFA TAHIWACPPL KGDDYIFYAK PEDQKTPRDS RLRLWYIDML VECQKRSIVG KVTNMYDIYF ADPNLDATAV PYLEGDYFPG EAENIIKMLE EGGGKKLGSV GKKKKSKSSK AQKNKGGNTG TRSTGVDEEA LIASGILDGT KSLKDLDRDQ VMVKLGETIQ PMKESFIVAF LNWKDAREED MIVPEEIEMA RIEYAAKGDP ELVGSKRDAA GNMRDATSKT GANGEPVKVI DDDAEDLDCE FLNNRQAFLN LCRGNHYQFD ELRRAKHTSL MLLWHLHNRD APKFVQQCVS CSREILSGKR FHCDTCPDYD LCQDCYKDPK ANRGNCTHAL KPLAVEADSG QDRSGLSEQE RMQRQRNLLL HIQLIEHASR CSSQTCSSLN CAKMKKYLQH ARVCKVKVLG GCKICKKIWT LLRIHAQKCK DTNCPIPQCN AIREKMRQLQ KQQQAMDDRR RLEMNRHMRF STAGGS
|