Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50326 |
Symbol | |
ID | 7198985 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 311233 |
End bp | 315060 |
Gene Length | 3828 bp |
Protein Length | 998 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185257 |
Protein GI | 219130196 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.455166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGACC TGAACGATCT CTTTGGAGCC TTTGACGGAA ATGACAGCGA CCATGGCAGC AGCTCGTTGC AGGATGAGCC AGCAGTACTA CCGAAGAAGC CGCGACTGGA AGAAAGAAAT AGCCAGCGGA GCAGTAGTCC ACCAGAAGAT GAACTATCCA ACAGTGTGTC TACTACACCT GCTCCCGGAA CCGCTACTGC GACTAACGAA TCCATCCAGA GTAGCAAGTC GGCCTATTCC GCCGTCATGT CCAAGCCCAC GGCTTCCCAT TTGCGACCCG ATATAACTAC CGATACTTCC CACGGCAGAG ACCCGGACGG AGTGGTGGCG GAGACTGAGG CGGACGTCAA CCGGGAAATT GCGACGGGTA CATCCCACGA CAAGACAGTC CGCTCCTATT CGGCCTTTCC CAAAAACTTA CCCGCGGGTT TTGCCCCACC CCGAGTCGAA CCCCCACAAG AACCCGCCAA GACGTACGCC TTTAAGCTGG ATCCCTTTCA GGCGCAGGCC GTGGCCTACA TTGACAAGGA AGAATCGGTA CTGGTTGCGG CCCATACATC AGCGGGAAAA ACTGCCGTGG CGGAATACGC CGTCGCGAAG TCACTCAAAG CGGGACAGCG CGTCATCTAC ACGTCCCCGA TCAAGGCTTT GTCGAATCAG AAATTCCGGG ATCTACAGGA AGAATTTGAT GATGTCGGAC TCATGACCGG TGATAGTACG TTTTTACCAC AATGTCGTCG TCGTTGTTGT TGGTGTTATC GACTCATGCA AACTCGTTGT GTTAACCACT CGTATGCTCC GTCAATTTGC AGTCACAATC AATCCCGACG CTACTTGTCT CGTCATGACG ACGGAAATTC TGCGCAGCAT GTTGTACCGT GGCAGTGAGC TCATGCGAGA AATATCGTGG GTCATTTATG ACGAGGTACA TTACATGCGT GATGCCGAAC GCGGCGTCGT TTGGGAGGAG TCCATCATCT TACTGCCCCA TCGGGTCCGA TTCGTCTTTT TGTCCGCCAC CATTCCCAAC GCCACCCAAT TCGCTGATTG GATTGCCGAA ATTCACCACC AACCCTGCCA CGTCGTCTAC ACAAATTACC GGCCAACACC GCTCCAACAC TACATTTTCC CCCAAGGCGG GGAAGGCCTG CACCTGGTCG TGGACGAGCG TGGCAAATTC CGTGAAGCCA ACTTTCAAAA GGCCATGGCC AGCTTACAGT CGGGTAACGT TGATGTAGCG GCCGCGAATG CAATGTTGGA TTCGGGGAAC GGCAAGGGAA ACGCCAAAAA ACGCGGTCGG GGTAAACAAG GTGGAGGTGC CGGCCAATTT GCGGATTTAC ACCGAATTGT AAAACTCATC ATGGAGAGGA ATTTGAACCC ATGCATCATC TTCTCGTTTT CGAAAAAAGA CTGTGAAAAG TACGCGCTAG CTTTGAATCA AGAAGACTAC ACGGACGACG TAGAGAAGGA TTTGGTGGCG CAAGTGTACC ATAACGCAAT CGATTCGTTG AGTGATGACG ATCGTAAATT GCCGCAAGTA GAGGCACTAT TGCCTTTGTT GAAACGCGGA ATAGGAATCC ATCACGGCGG ATTACTTCCC ATTCTAAAAG AAATTGTTGA AATTTTGTTT ACGGAGGGGT TGATCAAGGC ATTATTTGCG ACCGAAACAT TTTCTATCGG TGTGTCCACG ATATTGAAAT CAGGCTACCC CTTATATTGT GTCATTTTGC CTGACACTCT TTTCTTGTCT ACAGGTATCA ACGCTCCTGC TAAAACGGTT GTTTTCACCA ATACTCGAAA ATGGGACGGG AAAGATTTCC GATGGGTATC TAGTGGGGAA TATATTCAAA TGAGTGGTCG TGCTGGTCGT CGCGGAAAAG ACGATCGCGG TATCGTTATT CAAATGGTCG ATGAAAAGAT GGAACCTGCG GTTTGTAAAG ATATGCTGTA CGGTGCTCCA AATCCTTTAA ACTCGAGCTA TCGAATTTCT TACAATATGC TTTTGAACCT TATGCGCGTG GAAGATGTTG ACCCTGAGTA CCTTATTCGT GCCTCATTTC ACCAGTTTCA ACGAGAAAAG GATGCTCCAG GATTGATTGC TGACGCTGAA ATCCTAGAAT CCCAAGCAGA GACAGTAGAC TTTCAATCCG AAGAAGAAGT GACGTTGGTT GCGGAATACT ACCAAATGGA CCAGCAGCTT CTATTGACTC GACGCAAAAT CGGCACGATT GTGAGAAAGC CGGGTTATGT TCTTAAGTTT CTGCAAGCTC CCGGGCGCTT TCTGGATGTC ATTCTTGACG GGGAAGAGTT CGGGTGGGGT GTTTTAGTCT CCTGCAAGAA GCGCCAAGGA ATCGGATCGG GAGGAGAAGC TGGTCGCATT GCATCATTGA CCAATCAGCC TGAATACATT CTGGATGTTC TTTTGAATTG TGTCGATCGG CATTTCGACA AAAACAGGAA AGGAAAGGAT GAAGATGCAG AGAACGTAAA CCTTCTGTGG CGAGGGACGG GCCGATCGTG CCGCCCTGTT CGCAGTGGAG ATATAAGCAA GTTAATTTCG ATGAGGGTCT TCACAGTTGG GCTCGACAGT ATCGAACGCC TTTCTGCGGT TCGAATCTTC ATTCCGCAAG ACATCACAAC CCCCGAAGCC CGTCGCAAAG TCTCCACGTC TGTCAAGGAG GTTTCGAAGC GCTTTCCAGA TGGTATCCCC TTGCTCGACC CTGTGGCCGA CCTTGGCATC AACGATGATG CTTTTATGAC ACTACTGAAG CGGGCTGAAA CACTGACAAA TCTCCTGGCG GAACATAAGC TGGCGAACGA TTTTGTGGAT TCAAGTCGCC TTGAACTAGT TCAACGGTAT GAGAAGAAAG CTGATATGTT GGAACGGGCA AAGACATTAC GCGAAGAGGC GCGCAACTGT CAAACGATGG CAATGAAAGA TGATCTCAAG AAGATGAAAA GAGTTCTCAA GAAGCTCGGC CATGTCGATG CAGGCGGAGT AATTCAGACG AAAGGTCGGA CTGCATGTGA AATTAACACC TCGGATGAGC TTGTTGTTGT TGAGCTCATT TTTGGCGGCA TTTTTAATGA TCTTTCTGTC GAGCAAAGTG TCGCGCTTCT ATCTTGTATG ACTTTTGACG AACGCAATAA AAATGAAGAT GATCCCGCAA GTGGCTTGAA GAGCTTTCTG TCCAACCCTT TTTACAAACT TCAAGAGGTA GCACGGACTG TTGCACGCGT TGTAATTTCC TGCGGAATCG ACCTGAACGA GGACGAGTTT GTGGACAAAT TCAACCCGGG GATGTAAGTG CTTACACAGT GCTATGTTCG AGCACGTCCT GCTGCAATGA TTATTCTTTT CAAGTCATCC ACATCCAATC TGTTGGAGCC GCAAAGCTTT GGCTTTGGAG CTCGCAGAGA GGAACAAAGC GGATGACCCT ACCAACGCTG AGGATCATCA AAGCGACCAT ACTCTCTGAG CAGCTTTACC ACTGACCTTA TTGTTCGCAA TCTTTGGCAT TTCACTTACA CCCTTTGATC CCGATTTAAC ATGTAGGATG GAGGCTGTCT TTGCTTGGTG TAAAGGAGCA AAGTTTATCG AAGTCCAAAA GCTTACTGGG TCGTTTGAAG GCACTACAAT TCGAACTTTG CGTCGGCTCG AAGAACTCGT TCGGCAGATT ACTGCTGCGG CGAAAGCCAT TGGAAACCAC GAACTGGAGG CAAAGTTTGA AAAAGGAAGT GAACTGATCA AACGAGATAT TGTCTTTTGC AGCTCTCTTT ATTTGTAACA TTCTAGAATA ATACCGTGAG CAGGCCATGT TCACGACC
|
Protein sequence | MADLNDLFGA FDGNDSDHGS SSLQDEPAVL PKKPRLEERN SQRSSSPPED ELSNIRSYSA FPKNLPAGFA PPRVEPPQEP AKTYAFKLDP FQAQAVAYID KEESVLVAAH TSAGKTAVAE YAVAKSLKAG QRVIYTSPIK ALSNQKFRDL QEEFDDVGLM TGDITINPDA TCLVMTTEIL RSMLYRGSEL MREISWVIYD EVHYMRDAER GVVWEESIIL LPHRVRFVFL SATIPNATQF ADWIAEIHHQ PCHVVYTNYR PTPLQHYIFP QGGEGLHLVV DERGKFREAN FQKAMASLQS GNGNAKKRGR GKQGGGAGQF ADLHRIVKLI MERNLNPCII FSFSKKDCEK YALALNQEDY TDDVEKDLVA QVYHNAIDSL SDDDRKLPQV EALLPLLKRG IGIHHGGLLP ILKEIVEILF TEGLIKALFA TETFSIGINA PAKTVVFTNT RKWDGKDFRW VSSGEYIQMS GRAGRRGKDD RGIVIQMVDE KMEPAVCKDM LYGAPNPLNS SYRISYNMLL NLMRVEDVDP EYLIRASFHQ FQREKDAPGL IADAEILESQ AETVDFQSEE EVTLVAEYYQ MDQQLLLTRR KIGTIVRKPG YVLKFLQAPG RFLDVILDGE EFGWGVLVSC KKRQGIGSGG EAGRIASLTN QPEYILDVLL NCVDRHFDKN RKGKDEDAEN VNLLIERLSA VRIFIPQDIT TPEARRKVST SVKEVSKRFP DGIPLLDPVA DLGINDDAFM TLLKRAETLT NLLAEHKLAN DFVDSSRLEL VQRYEKKADM LERAKTLREE ARNCQTMAMK DDLKKMKRVL KKLGHVDAGG VIQTKGRTAC EINTSDELVV VELIFGGIFN DLSVEQSVAL LSCMTFDERN KNEDDPASGL KSFLSNPFYK LQEVARTVAR VVISCGIDLN EDEFVDKFNP GMMEAVFAWC KGAKFIEVQK LTGSFEGTTI RTLRRLEELV RQITAAAKAI GNHELEAKFE KGSELIKRDI VFCSSLYL
|
| |