Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41576 |
Symbol | |
ID | 7199365 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 223295 |
End bp | 226602 |
Gene Length | 3308 bp |
Protein Length | 1092 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185500 |
Protein GI | 219130707 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.433336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCTG CCACCCGGCA AATGACAAGT GTAGCCGTCT ATGCCCATTT TCTGGACAAT GTACTCCTTC TTCCCCAAGG ACATCCTATT CGTCTTGCCT TTGATCAACA AGGATATGAA TCGGCTGACG ATCTCTTGTG CATCTTTGAG AACGAACTCG ACTCTCTTGA GTACACTCCT CCTGCCATTC CTGACGGTCC CGAAAATCCG TCACGCATCC CTCTAATCAT GGCACATCGA CAGATCATAC GTCACTTCCT ACGTTGGCAA GCATCCTTAA AAGACCAAAA GGGGGCTCCT TTGAAGAACT CCGAGCTCGT TGCACTCAAC AACGAGGACT TTGTCCTGTA CCGTCGGTCA GCACTTGGCC AGGTTTCGAC GGCCACTGCA CCTGCCACTG TTCCCCCGAC TGTTCAGAGT CCCACAGGAA AGACACGTTC GGCTGTCGAG GATTTCAAGC GTGGGATAAA ACGTGATAAA ACTCACTATC CTGTGCTCAA AGATGACCGA TACTGGGACA ACTTCTATCG TTCGTTTGTT GTTACTGCCG TAACACATAA CGTTGACAAG GTTCTAGACC CAAACTACAT TCCTACCGAT CCTTTGGAAA AGTCCCTCTT TGAAGAACAG AACAAGTTTG TATATTCTGC TCTAGAGCAT ACTCTTCAGA CGGACATGGG AAAGAACGTT GTCTGTGAGC ACAGTTTTGA TTTCAATGCC CAAGAAGTTT TCCGTAAAAT CGTGAAACAT TATACGGAGT CAGCCAGCGC AAAGATCAGT TCGTCTACTA CTCTGGGGTA TCTCACAACT GCAAAATACG GATCGTCATG GACAGGTACA GCAGAAGGAT TTATTCTCCA CTGGAAGAAT CACTTGCGTA TTTACAATGA CACTGTACCT ACTGGTGAGC AACTTCCTCA GCAATTGTGC CTTGGTCTTT TGGAGAATGC TGTTCATGAT GTACCTGAAC TCCGACAGGT TAAAATCACA GCAACACTCG ACTTAGCCAA AGGTGGTAGT CCCATTAGCT ACGATAGTTA CCTCAGTCTC CTCCTTGCAT CGGCATCGCT CTACGACAAC GGTAATAATC TATCTAATTC TCGCAGTGGC AAGAACAAGC GCAATATCTA TACTACTGAA CTAGCCTATC ATCCGACGGA TTTTGAAAGC GAACCAGATG TAGACTATGA TATAGATGTG TCACCGACTG CCATATACGA AGCCAATGCC CACGTCCGTA ACAACAGTAC CCGTAACCGT CCCCTGGCAA CTAATCGCGA ACGACCTTAC ATTCCTCGTG AAATGTGGAA TTTGCTCTCT GATGATTCCA AGGCCATCCT CCAAGGTTTG GCTGCACCCG GCAAGCAGGC ACCATTAAAT GGTAGCCCGC CTCATCAAAC GCTGCAGGCC AATACACACG AGACCATTGG CACGGAACAT ACCGCAACGG ACACCTTCCA TGATTGCGCA CCTGAAACTG AATTACTCGC ACATCTTACT GAGCGTGTCA GTCGCATGAG CAGCGGTGAT ATTCGTAAGG TACTCGCCGC ATCACGTGAC GTATCAGAAA AGCCCAAATC ACTGCAATCT AACGTACTGC AATACCAAGT CTCTCGTCAT ACTACCAACG AGACTTCTGC ATCCCTTGTT GACCGTGGCG CTAACGGAGG GCTTGCGGGT GGTGATGTCA TTGTCCTGCT CAAAACAGGA CGTTCGGCAA ACATCACAGG TATCAACGAT CATACCTTGC CAAACTTGGA CATCGTCACT GCCGCTGGAT GTGTTGAATC CCAAAATGGG CCCATCATTC TCATTATGAA CCAGTATGCT CATCTGGGGA AGGGTAAAAC CATTCATTCA AGTGCGCAGT TGGAACACTA TCGTAATCAT GTCGAAGACC GTTCACGCAC GGTAGGGGGT AATCAGCGCA TTGTCACTTT AGATGACTAT ATCATTCCCC TCCATATTCG ACAGGGACTT CCATACATGG ACATGCGACG CCCCACTGAT GCTGAACTAG CGTCCCTCCC GCATGTTGTC CTAACCTCAG ACGTCGATTG GGACCCCTCT GTACTCGACA ATGAAATTGA CCTTGCGACT TCATGGTACG ATGGCATCCA TGACTTGCCC CAGCCCCCAT ACGTTGAACC ACGTTTTGAT CATACAGGCC AATACCTTCA CCGTCACATT TCTCTATGCG ACTACCGTGA TGACGCCATT GCACGTATCA TGCAGTGCCA ACAGCATCAC GTCACACGTA ATGTGCACGA TTATGAAGCC CTTCGTCCTT GCTTTGGCTG GGTCTCTGCC GACACCGTTC GGAAAACCAT CATGGCCACC ACGCAGCATG CCCGTGAAGT CTATCACGCA CCGTTACGTA AACATTTTAA GTCTCGTTTC CCAGCCTTAA ATGTACACCG TCGTAACGAA CCGGTCGCTA CTGATACCAT ATGGTCCGAC ACTCCCGCCG TAGACAATGG TGCAAAATTT GCACAACTCT TCGTTGGCAG ACGGTCTCTT GTCACTGATG CCTACCCCAT GAAAACTGAT AAAGAGTTTG TCAACACCCT TGAAGATCAC ATCCGTTTCC GTGGTGCAAT GGACAAACTA ATCAGTGATC GCGCTCAAGT TGAGATCAGT AAAAAGGTCA CTGATATTAC ACGCGCATAT AATATCGATC AGTGGCAGAG TGAGCCTAAC CATCAGCACC AAAACTTCGC CGAACGTCGT ATCGCCACCA TCGAAGCCAA TACGAACAAC ATTCTCAATC TTACTGGTGC TCCTGATAAC ACCTGGCTTC TTTGCGTGAC ATATGTTTGC TATGTCTTCA ACCATTTGGC GCATGAATCT CTCGATCATC GCACCCCCCT CGAAGTGCTT ACTGGTTCTA CACCTGATAT CAGTGTTCTC CTTCAGTTTC ATTTTTGGGA ACCGGTCTAT TATAGAATTG AAGATGCGAC ATTCCCCTCT GGTGGTACCG AGCAACAAGG ACATTTTGTC GGCATCGCAG ACTCCGTCGG TGACGCTCTC ACTTATAAGA TCCTCAACGA CCGCACCAAC CGCATTCTAT ATCGATCTAG TGTTCGTTCT GCGGCCATTT CCGGGCAAAC CAACCTACGC CTTGCGTCAC AGGATGGGGA GAATGGTCCT AAGCCCATCA ACTTTATCAA GTCGCGTAGA ACCGAAAACC AAAATTCCTA TGCCATTAAG GAGTTGCCTG GTTTTACACC TGATGATCTT ATCGGTCGCA CGTTTCTCAC CGACACTCGT GATGATGGAG AGCGTTTTCG GGCACGAATC ACCCGTAA
|
Protein sequence | MVPATRQMTS VAVYAHFLDN VLLLPQGHPI RLAFDQQGYE SADDLLCIFE NELDSLEYTP PAIPDGPENP SRIPLIMAHR QIIRHFLRWQ ASLKDQKGAP LKNSELVALN NEDFVLYRRS ALGQVSTATA PATVPPTVQS PTGKTRSAVE DFKRGIKRDK THYPVLKDDR YWDNFYRSFV VTAVTHNVDK VLDPNYIPTD PLEKSLFEEQ NKFVYSALEH TLQTDMGKNV VCEHSFDFNA QEVFRKIVKH YTESASAKIS SSTTLGYLTT AKYGSSWTGT AEGFILHWKN HLRIYNDTVP TGEQLPQQLC LGLLENAVHD VPELRQVKIT ATLDLAKGGS PISYDSYLSL LLASASLYDN GNNLSNSRSG KNKRNIYTTE LAYHPTDFES EPDVDYDIDV SPTAIYEANA HVRNNSTRNR PLATNRERPY IPREMWNLLS DDSKAILQGL AAPGKQAPLN GSPPHQTLQA NTHETIGTEH TATDTFHDCA PETELLAHLT ERVSRMSSGD IRKVLAASRD VSEKPKSLQS NVLQYQVSRH TTNETSASLV DRGANGGLAG GDVIVLLKTG RSANITGIND HTLPNLDIVT AAGCVESQNG PIILIMNQYA HLGKGKTIHS SAQLEHYRNH VEDRSRTVGG NQRIVTLDDY IIPLHIRQGL PYMDMRRPTD AELASLPHVV LTSDVDWDPS VLDNEIDLAT SWYDGIHDLP QPPYVEPRFD HTGQYLHRHI SLCDYRDDAI ARIMQCQQHH VTRNVHDYEA LRPCFGWVSA DTVRKTIMAT TQHAREVYHA PLRKHFKSRF PALNVHRRNE PVATDTIWSD TPAVDNGAKF AQLFVGRRSL VTDAYPMKTD KEFVNTLEDH IRFRGAMDKL ISDRAQVEIS KKVTDITRAY NIDQWQSEPN HQHQNFAERR IATIEANTNN ILNLTGAPDN TWLLCVTYVC YVFNHLAHES LDHRTPLEVL TGSTPDISVL LQFHFWEPVY YRIEDATFPS GGTEQQGHFV GIADSVGDAL TYKILNDRTN RILYRSSVRS AAISGQTNLR LASQDGENGP KPINFIKSRR TENQNSYAIK ELPGFTPDDL IGRTAFSGTN HP
|
| |