Gene Information |
Locus tag | PHATRDRAFT_49571 |
Symbol | |
ID | 7198190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 76296 |
End bp | 79396 |
Gene Length | 3101 bp |
Protein Length | 1000 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184300 |
Protein GI | 219128187 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
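The Start bp, End bp, and Gene Length fields above are consistent with 1-based, inclusive coordinates. The following is a minimal sketch of that arithmetic, not part of the original record. The helper names `span_length` and `coding_length` are hypothetical, and the comparison assumes that the reported Protein Length excludes the stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values copied from the Gene Information table.
gene_bp = span_length(76296, 79396)
cds_bp = coding_length(1000)

print(gene_bp, cds_bp, gene_bp - cds_bp)  # 3101 3003 98
```

Under these assumptions, the annotated span is 98 bp longer than the 1000-residue coding frame plus stop codon requires.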
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.822968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAAGTTTGT TTTCGTTCTC TTGCTGTTTT CCGTTACACT TGCTTTCCTA CGCCAGTCTT CGTGATCTGT CGTTCCATTC TGAGTACAAG CCCATAAGAT GCGCGGTATC TTCTTCGCAG CATCCACGCT GGTTCTCGGC ATCGCTTCCT TTCGGGCGAC CGATGCCCAG GACGTCAGCT GCCAGTTGGT TCGTCAACCC AATGGCAGCG TTGAATACGT ATGTGACCAA GTAAGGGAGC CAATGCGGCC TCCCCTCGTC CCAGACGGCC AAACCTATCA ACAAGGCCCG GTAGAAAAGT ACTACCTTCA ACAACCGGCG GAAGATGTGA ATGCATCGGC AAACGCTACC ATTCCAATCT TACGTCGTGT AGGCGTCCCG CCATTTGATA CTCCTCTACA GGCTTGCCAA GGAAGCTGTG GAAGTGACGC AGATTGCGCG ACTGGACTCG TGTGTCTTGA TCAGTTGGAT CGGCCGCGAG ATGGAAGCGT CCAAGGATGT TCCGGTAACG GTCGCCTGAC TATGAGCGTG TGTGTGGTAC CGGGTTCGGA AGTTGATCTC AAGATTGGAT CGTTGTCGTT AGAAAAGTGC CAGGGGACCT GTTCTTCAGA CGATGATTGT GCTGGTGGCC TCGCTTGTTT CCGGCGCCAA GGAACGGAAA CCGTACCAGG TTGTCTCGAA CGTGATGTGA GCGAGTCTAA CTTCTGCCAT GATCCCAACG AAACCTTGGC AAGATTAGCT TTCGTGGGTG CGGCCCCTCA CTCACAGTCA TCGCTACTTC CTACATGCTC TGGATCCTGT TCATCAGATT GGGACTGCGT TCGCGGCAGT AATTGCTTTC GTCGAGATGG CACGGAATCC GTTCCTGGAT GTGAAGGAAG AGGACTCAGT GGAGCCAATT ATTGTTACAT TGCTCCGGAG GGGTCTTTGC TTCTGGTCGG CGATGGTGTA ACATACATAC AATATCCATT GAGACAATGC GAAGGACATT GTATTGCCGA TATCGATTGC GAAAGTGGTT TGAAATGCTT TCGACGCGAA GGTTCGGAAC ATATTCCTGG TTGTGACGGA GAGGGCGCGG ATCGAACCAA CTACTGTTAC CAACCATTTC CGGATACTTC CGGTCCAACG GATGGCCCTA CACTGGCACC GCTTTCAGTC TCCCCATCCA ATTCTCCTTC TTTAATCCTA ACGCTTGATC CCACCAATAG CTTGGGATCT CGAGGTGGCG AATCCGAGTC CCCTTCGACA CCGACATCAG ATATGCCGTC GATATCACCA TCAGATGAAC CTTCAATGAT ACCTTCTGAT ACGCCGTCAA TGGTACCGTC TGACGTGCCT TCGATGCTGC CGTCCGACGC ACCCTCGATG ATTCCATCTG ATAGTCCCTC AGACACGCCA TCTGATGTTC CTTCGGACGT GCCTTCGGTG TTGCCCTCTT TTTCACCTTC TATGAGGCCT TCTGATACTC CTTCGACAAT GCCATCGGAT ACTCCTACCG ACGTCCCCTC AGATGCCCCC TCTGATTCAC CGTCAGATGT TCCCTCCGAC GTTCCATCAG ATGCACCTTC TGATGTTCCC TCCGACGTCC CTTCAGATGC ACCTTCTGAC GTCCCTTCAG ATACCCCCTC TGATGTTCCC TCTGACGTCC CTTCAGATAC CCCTTCTGAT GTCCCCTCTG ACGTCCCTTC AGATACACCT TCTGCCGTAC CCTCTGATGT CCCTTCAGGT AACCCTTCTG ACGTCCCTTC AGAAACCCCC TCTGACGTCC CTTCAGATAC CCCTTCTGAT GTTCCCTCTG 
ACGTTCCCTC TGACGTTCCC TCTGATGTCC CCTCCGACGT CCCTTCAGAT GTGCCCTCTG ACGTCCCTTC AGATACCCCT TCTGATGTTC CCTCTGATGT CCCTTCTGAT ACTCCCTCCG ATCTCCCATC CGACGTGCCG TCTGATGTAC CGTCCACGAT GCCGTCTTCT ACGTTAGGTG GTACCTCATC TAATGGGCCA GTAACGGGAA AGGGCAGTCT AGCACCAACC GGTAGCTTTT CGGGTGAGTC GAGTGGAAGG CCTTCTAAGA CTCCTGGTGT GATGGTTGAA TTGAGCGCAA GGCCAAGCGA GTCTTTTACC CCATCTCCGA CTACTTTCCC AGTACTCCTT CCTTCAAAAG TACCGTCAAT AGCTCCATCT CGCACTGAGA CCGTTGCTCC GAGCACAGCC TTCCCCACTG AGACTTCGTC ATCACTGCCC ACTTTGACTA CGACTGTGTC TTCAGCTCCG ACACAAATGT GCAGTATGTC CGCATCAGAG CGCGCAAGCA CCATCATGGG AATTTTGGAA GAGGACTTTA ACGATGTGAG CAGTCCTCAG TTCCGGGCGG TGGAGTGGTT GGTGCAAATA GATCCTCTTT CGCTTTGCCC AGGAGATGAG AACTTGGAGC AACGATACAT TTTGGCTGTG CTGTATTTCC AAACCGGTGG AGAATGGTGG ACACGATGTT CACCTATGAG TGCTGAAGTT TGCGACCAGG GTGAAGCTTT TTTGAGTGGC GCAAATGAAT GTGCCTGGGG TGGAGTCAAT TGCGATTCTT CGAGTCGTGT GACGGCTCTT CACTTGGATT CAAACAACTT ATCGGGTAGT TTGCCCAGCG AGTTGGGTCG TTTGGCATAT TTGGTCGAAC TGGACATGGA CGACAACGAG CTGACGGGTT CCATACCGCG GATTCTGGGA CAGCTTTCTT TTTTGGAAAT TGTTGACCTG GACGACAACC AATTGACGGG AAGCATCCCG GAAGAATTGT ACAGTGTCAG CTCGCTGGAG ATTTTGGATC TGGATATTAA CCAATTGACA GGAACTATTT CTACCCTCAT TGGCAATCTG GTAAATCTGT ACTATTTGCA GATTGACTCG AACAAATTTA CGGGCAGCAT ACCGTCCGAG GTGGGCACTC TGACTCGTCT CGAATACTTC TCCATGACCG ATATTCAAAT AGCCGAGGCT CTACCCGACT CGTTATGCAG CCGCGATACC TTGCTTTTGT TCGGAGATTG TGAGGTTTGT GTGGTAGAAG ACTGCTGCAC CGCTTGTCTC GCCAAGGAGA CAACTCCGTA A
|
Protein sequence | MRGIFFAAST LVLGIASFRA TDAQDVSCQL VRQPNGSVEY VCDQVREPMR PPLVPDGQTY QQGPVEKYYL QQPAEDVNAS ANATIPILRR VGVPPFDTPL QACQGSCGSD ADCATGLVCL DQLDRPRDGS VQGCSGNGRL TMSVCVVPGS EVDLKIGSLS LEKCQGTCSS DDDCAGGLAC FRRQGTETVP GCLERDVSES NFCHDPNETL ARLAFVGAAP HSQSSLLPTC SGSCSSDWDC VRGSNCFRRD GTESVPGCEG RGLSGANYCY IAPEGSLLLV GDGVTYIQYP LRQCEGHCIA DIDCESGLKC FRREGSEHIP GCDGEGADRT NYCYQPFPDT SGPTDGPTLA PLSVSPSNSP SLILTLDPTN SLGSRGGESE SPSTPTSDMP SISPSDEPSM IPSDTPSMVP SDVPSMLPSD APSMIPSDSP SDTPSDVPSD VPSVLPSFSP SMRPSDTPST MPSDTPTDVP SDAPSDSPSD VPSDVPSDAP SDVPSDVPSD APSDVPSDTP SDVPSDVPSD TPSDVPSDVP SDTPSAVPSD VPSGNPSDVP SETPSDVPSD TPSDVPSDVP SDVPSDVPSD VPSDVPSDVP SDTPSDVPSD VPSDTPSDLP SDVPSDVPST MPSSTLGGTS SNGPVTGKGS LAPTGSFSGE SSGRPSKTPG VMVELSARPS ESFTPSPTTF PVLLPSKVPS IAPSRTETVA PSTAFPTETS SSLPTLTTTV SSAPTQMCSM SASERASTIM GILEEDFNDV SSPQFRAVEW LVQIDPLSLC PGDENLEQRY ILAVLYFQTG GEWWTRCSPM SAEVCDQGEA FLSGANECAW GGVNCDSSSR VTALHLDSNN LSGSLPSELG RLAYLVELDM DDNELTGSIP RILGQLSFLE IVDLDDNQLT GSIPEELYSV SSLEILDLDI NQLTGTISTL IGNLVNLYYL QIDSNKFTGS IPSEVGTLTR LEYFSMTDIQ IAEALPDSLC SRDTLLLFGD CEVCVVEDCC TACLAKETTP
|
| |
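The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the block spacing, compute a GC fraction, and translate a reading frame. It is illustrative only: the function names are hypothetical, the standard genetic code is an assumption (the Translation table field in this record is empty), and picking the first `ATG` is a naive heuristic rather than the annotation pipeline's method.

```python
# Standard genetic code (assumed), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, start: int = 0) -> str:
    """Translate codons from `start` until a stop codon or the end of the sequence."""
    seq = clean_sequence(seq)
    residues = []
    for pos in range(start, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# Three consecutive blocks copied from the Gene sequence field.
fragment = "CCCATAAGAT GCGCGGTATC TTCTTCGCAG"
cleaned = clean_sequence(fragment)
start = cleaned.find("ATG")  # naive start-codon search
print(translate(cleaned, start))  # MRGIFFA
```

The translated fragment matches the first seven residues of the Protein sequence field (`MRGIFFA`), which suggests the coding frame begins at that `ATG`.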