Gene Information |
Locus tag | PHATR_33459 |
Symbol | |
ID | 7204034 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 971789 |
End bp | 976230 |
Gene Length | 4442 bp |
Protein Length | 1263 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186163 |
Protein GI | 219113159 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.155395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAG AAAGATATTT GTGTACGTTT GATTTTGTTC CTCAGGTTTT GTTGTATCTT GGCAAATCGG AGGAGCAACA AGAATTAGTT TATAGCACTA ATTCTTCAAT CATTGTTTTA TAAGTTCTTT GATCCGCATT GCTAGTTTAC GCATAGGTCC TCGTATCCGC TGCCTAACCC GAGGGGAGCG TCGCCAACGA GTAGAGTTTT ACAGTCCCTA AGTCTCTTAG GTGTATCGCC GGTCGTTCGT AATAACGCCC TCTCTTGTTT ACCAGGCTCT GTTCCTAACA GTGCTTCTGT CCATACGGTG TACATTGGTG TATATCGTAA CAGATTAACA GTCTTGGTTT CTCGTCAAAC CACTGCTGCT TATCGCGCAG AATTTGGTAA ACCCACCTTG AGTTCGTCTC AAGGAAACGT CGAGCTTGGT ACCAGCGTAC CTCGTCCGAC AACACTGCCT ACATTGTTTG TGTTGCATAA AGTATCAGAG GTGTCTAAGG AGACATTTAT CGGCTTGTGT TCCGCATCCT TGGAAGCTGC TACCACCATC CTCCTGACCA CTGGTATTGA AAACCCATTA GTACTTCAGG GTTTGTCAGT AAGTCGTCTC GGCCTCCCTA CTTCCTCGGA ATAACCGGCA CTGGAAATCG TCTAGGTTTT CTTCTGGATT TGCCAGTGGG TTGTTTCGTG ATTACGGGTT CTCTTGCCAA GTGAGGTATT CGATACCAAT AGGCTTGGAA GACCAATCCT GATTCCGGTT CGGACGGGTA TACTCACCGG TTAATAAGAG TACGGGGGGG GACTAATCAA CCCTTCGATA GTTACTGGCA CGGAGCGATA TCACGCACAG CAGTAACACT GCCACCGAGG TGGTAGAAGG GAGTGAACCA GTGCTGGCAA CGTCCACTGG TGTAGGGTCA AATAAAGAAG TCCGCTTAGT TACAGCGAAG GACCTGAAAT ATAGTGGAAG ACCTAACCCC GGAACGCTAC CGACCCGGCT ACCCAGCCCA ACTCAAGATC TGACCCGGCT ATCCAGCCCA ACCCCAGACC TGGTTGGTTA CCCATTTCAA TATGGTTACC GCTACGCGGA CAACGTCCGA TGCAAGCGCT TTCGCTCATT TACTCGACAC GGTCCTTGCC CTACCGGCAA CATCTCCTAT CCGTTCTAGT CTTGTACTAC ATGAGTTAGA TGATCTATAC GGACTCCTTA GTATTTTTGA GCGCCAAATC GAAACTCTAG AATATCTCCC TGTACCATCC GAAGGTGACG CAACCCCTGT CCCAATCAAG TTACGTATGG GTCATCAACA ACTCTTACGT TACTTGCTAC TATGGATACG TCAACTCGAG ACCAACAAAG GAGGTCCTCT CTCGAACTAC GAACTCATCT CTCTCATGAA AGAAGATTTC AGTTTATTTC GACGGTCTCC ATCAATTCAT TTGCCGAATG CAGTCCCAAC ACCCAGTTCA CAATCAAGCA CTCCTTCGAC TGTGGTTGGA AACTCTAGTC GTTCTGCTGT TGCTGATTTT AAACGCGGTG TTAAACGTGA TAAAACGCAT TATCCGGTGC TCAAGGACGA CCGATACTGG GACAACTTCT ACCGTACTTT TGTCGTTACC GCCGTATCGC ACAATGTTGA TAATGTCCTA GATCCAGCTT ACTCCCCTAC AAGTACAGAT GAAATATTGC TATTCAGGGA GCAGAAAAAG TTCGTCTATT CCGCTCTAGA ACACTGCTTG CAAACGGATA TGGGTAAAAA CATTGTTCGC GAGCATGCCT TTGATTTCGA TGCACAAACT 
GTTTTCGCAA AAGTGGTAAA ACACTACACT GAATCCACAG CTGCAAAGAT CAGTTCTGGT ACCACACTGT CATACCTGAC CTCTGCGAAG TACGGCAGCT CCTGGACTGG AACCGCCGAA GGATTTATTT TGCATTGGAA AAATCATCTT CGCATTTACA ACGACACGGT CCCAGTTACA GAGAAGTTGC CACCACAACT TTGCCTCAGC CTGCTCGAGT CCTCTGTACG CGACGTTTCA GAGCTTCGGC AAGTCAACAC TACCGCGAAT CTAGATTTAG CTAAAGGGGG GTCTCCCATT AACTATGAAA ATTACCTAAG TCTACTCCTT GCTGCCGCGA CCTTGTACGA TAAAGGAAAC AATTTTTCTA ATTCTCGTAG CCCAAAATTC AAGCGCAGCG CCTTTGTTAC TGAGACTACC TTTCCCGATG ATGAATATGG CGTCGATTAC GACATTGATT TGTCACCGTC CATCCTTTAC GAAGCGAATG CTCACAACCG CAGAGCAGGC GACCAAAATC GAGACCGCCA GAGCAATGTC AATCGTGAGC GACCGTATAT TCCTCGTGAG ATGTGGGATA AACTGTCCGA CGATGCAAAG GAGATTCTCC GTGGTATGTC TTCTCCTAAA GAAGGAAACG CCTCGGCCAA CGGCAAGTCT TCATCTGCAT TTCATGCCAA CTCCCATTCT TTATCCGATA CAGGACACAC CTCATCAACG GACGAATTGT TGCACGAAAA TGACAACGAT AAATTATATG ATTGCGGGAA CGACACGGAA CTGCTTGCAC ACCTTACTGA TCGCTCCAGT AATATGGCAA ATGGAGACAT TCGCAAAGTC CTCGCTTCGG CTTCCTCCTA TAAGCAGAAT TCGAAGAACT CCCTGCAGTC AAATATGCTC GAATACAGTA TTTCCCGACA CTCCGTTGCA GAGACTACAT CCTCCCTCAT CGACAGAGGC GCAAACGGCG GACTTGCCGG AAGCGATGTT AAAATCCTTA ACAAAACAGG CCGTTCTGCG AGCATCACGG GTATTAATGA CCATACTTTG CCTGATTTGG ACATTGTCAC CGCCGCTGGC CTCGTTGAAT CACAACATGG ACCCATCATT GTCATACTGC ATCAGTATGC CCACCATGGA AAAGGAAAAA CGATCCATTC TAGTGCTCAA CTTGAGTACT ACAAGAATAT TGTCGAGGAC CGTTCCCGTG TTTTAGGCGG TAAACAACGT ATCATAACTC TAGATGATTA CGTTATTCCC CTACACATCC GTCAGGGACT AGCTTATATG GACATGAGAC CTCCTTCCGA TGCAGAGTTT GACACGTTAC CCCACGTTGT ACTTACTTCC GATGTCGACT GGGACCCGTC CATTATCGAC AACGAAATTG ACCTTGTCAT AGACTGGCAT GATGCCATAC AGGACCTTCC CAGCGACCCG TACGTTGAAC CCCGTTTCAA TTCAACTGGT GAATACCGAC ATAGGCACGT TGCGACCTTT GACATTTTCT CGTCATCTGA CTTTGTTCAT CGGTCCACGG CTCTCGATAA TATACTCTCG TCAAACCAAC ATGACATGAC CCGCAATTCG CACAATTACG AAGCCTTGCG TCCTTGTCTT GGCTGGGTCT CCGCCAACAC AGTCCAGAAA ACCATCATGG CCACTACGCA ATTTGCTCGT GAGGTCTATA ATGCACCTAT GCGTAAACAT TTCAAGTCCC GTTTCCCGGC ACTTAATGTT CACCGGCGCA ACGAAGCTGT GGCTACCGAT ACCATTTGGT CGGACACACC TGCTGTCGAT AACGGCGCTA 
AATTTGCGCA ATTATTTGTC GGTAGACGAT CGCTTGTTAC CGACATTTAT CCTATGAAAA CAGACAAAGA GTTTGTTAAT GCACTCGAAG ACAATATTCG TCATCGGGGT GCCATGGATA AACTCATCAG TGACCGTGCC AAAGCCGAGA TCAGCAAGAA AGTTTCTGAT ATTACTCGTG CTTACCACAT TGATCAATGG CAAAGCGAGC CCAATCACCA GCACCAAAAT TATGCTGAAC GTCGAATTGC AACTGTTGAA GCAAATGCAA ATAAAATTCT TAACAAAACC GGCGCACCTA ATTCTACATG GTTATTGTGT GTTTCCTACA TTTGTTATTT GTTTAATCAT TTGGCCCATG AATCTTTGCA CGATCGCACC CCCCTCGAAA TTCTTAACGG TAGTACTCCT GATATTAGCG TACTCCTTCA ATTCCATTTC TGGGAACCGA TCTACTACCG ACTTGAAGAA CCTACTTTTC CTTCCGACGG CACTGAAAAA AAGGGCCACT TTGTTGGAAT TGCTGATTCC GTTGGTGATG CTCTTACCTA CAAGGTACTC ACCAACGACT CCCACAAGAT CCTTTTCCGA TCTAGTGTTC GCTCTGCATT GAAACCTAGT GAAACCAATT TGCGTCTTGA GCCACATGAA GGGGAGAGTC CTCCTAAGCC CATCAACTTC ATTAAGTCGC GCAGGACTGA GGACGGAAAT TCTTATGCCA TCCACACGCT ACCTGGTTTT ACCCCGGACG ATCTCATCGG ACGCACCTTT TTAACCGATA CCCAGGACAA TGGGGAGCGT TTTCGTGCAC GTATTGCCAG GAAAATTCTT GA
|
Protein sequence | MTEERYLFLV SRQTTAAYRA EFGKPTLSSS QGNVELGTSV PRPTTLPTLF VLHKVSEVSK ETFIGLCSAS LEAATTILLT TGIENPLVLQ GLSLLARSDI THSSNTATEV VEGSEPVLAT STGVGSNKEV RLVTAKDLKY SGRPNPGTLP TRLPSPTQDL TRLSSPTPDL VGYPFQYGYR YADNVRCKRF RSFTRHDDLY GLLSIFERQI ETLEYLPVPS EGDATPVPIK LRMGHQQLLR YLLLWIRQLE TNKGGPLSNY ELISLMKEDF SLFRRSPSIH LPNAVPTPSS QSSTPSTVVG NSSRSAVADF KRGVKRDKTH YPVLKDDRYW DNFYRTFVVT AVSHNVDNVL DPAYSPTSTD EILLFREQKK FVYSALEHCL QTDMGKNIVR EHAFDFDAQT VFAKVVKHYT ESTAAKISSG TTLSYLTSAK YGSSWTGTAE GFILHWKNHL RIYNDTVPVT EKLPPQLCLS LLESSVRDVS ELRQVNTTAN LDLAKGGSPI NYENYLSLLL AAATLYDKGN NFSNSRSPKF KRSAFVTETT FPDDEYGVDY DIDLSPSILY EANAHNRRAG DQNRDRQSNV NRERPYIPRE MWDKLSDDAK EILRGMSSPK EGNASANGKS SSAFHANSHS LSDTGHTSST DELLHENDND KLYDCGNDTE LLAHLTDRSS NMANGDIRKV LASASSYKQN SKNSLQSNML EYSISRHSVA ETTSSLIDRG ANGGLAGSDV KILNKTGRSA SITGINDHTL PDLDIVTAAG LVESQHGPII VILHQYAHHG KGKTIHSSAQ LEYYKNIVED RSRVLGGKQR IITLDDYVIP LHIRQGLAYM DMRPPSDAEF DTLPHVVLTS DVDWDPSIID NEIDLVIDWH DAIQDLPSDP YVEPRFNSTG EYRHRHVATF DIFSSSDFVH RSTALDNILS SNQHDMTRNS HNYEALRPCL GWVSANTVQK TIMATTQFAR EVYNAPMRKH FKSRFPALNV HRRNEAVATD TIWSDTPAVD NGAKFAQLFV GRRSLVTDIY PMKTDKEFVN ALEDNIRHRG AMDKLISDRA KAEISKKVSD ITRAYHIDQW QSEPNHQHQN YAERRIATVE ANANKILNKT GAPNSTWLLC VSYICYLFNH LAHESLHDRT PLEILNGSTP DISVLLQFHF WEPIYYRLEE PTFPSDGTEK KGHFVGIADS VGDALTYKVL TNDSHKILFR SSVRSALKPS ETNLRLEPHE GESPPKPINF IKSRRTEDGN SYAIHTLPGQ WGAFSCTYCQ ENS