Gene Information |
Locus tag | PHATRDRAFT_47929 |
Symbol | |
ID | 7203122 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 454678 |
End bp | 458866 |
Gene Length | 4189 bp |
Protein Length | 1272 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182398 |
Protein GI | 219124201 |
COG category | |
COG ID | |
TIGRFAM ID | |
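The reported gene length is consistent with the start and end coordinates if those are read as 1-based and inclusive (an assumption, since the record does not state its coordinate convention). The sketch below checks that arithmetic and also computes the minimum coding length implied by the protein length. The function names `span_length` and `min_cds_length` are hypothetical helpers introduced here for illustration only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: 3 per residue, plus an optional stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


# Values copied from the record above.
gene_len = span_length(454678, 458866)   # 4189, matching "Gene Length | 4189 bp"
coding_len = min_cds_length(1272)        # 3819 nt for 1272 aa plus a stop codon

print(gene_len, coding_len, gene_len - coding_len)
```

The gene span is longer than the minimum coding length, which fits the "Is gene spliced | Yes" entry: part of the span is not translated.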
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTGAACAAG GGTCTCGCTC TCGAAAACAG TCGAAGATAC GCTAACACAG ATACTATTGA TGGCTAGATA GTAGGCATCT GAGGAGCAGT ACGAAACGAG CAAGCAAAAA AACATACCAA TACACGATGG AAGGCGACGG TGTGTCTCCT TCGACAGCAG GTTTGGTCGT TTCCCACCAC GACAACGGAA GCGAGAGTGA CGACGTAGTG CCAACGCAGG CCGATCCCGA ACACGACATA TCCGTAGAGA TCGAAGGCGA CGAAAAGAAG AAGATTGGCA GGACCACGTC GGTGACAGCG TCGTCACCTT GCGCAACTCG AGACAATGTG GGGGACGACG TTTTGAGGGG ACCCGATTTA AACGAATCCA AGACGTCTTT GGATTCCAAG TTATATCGCC AAATTCTACT ACCTAATGGC TTGCGTGCAG TCTTGATCCA AGATACCATC GCCATGCATC AAAACTCCCG TTACGAGTTG GGCGGGTCGG ATGAGGAGGA GGACGACAAC GACATTGACG ATGCAACTGC CGATCAAGCT ACACCTGCCT CCACCCGCTT GCACCATTCG CGCCACGGAA GGAGCGCCAC GGAATCCGAC GACGATAGTG ACGTTGACGA TAATGATGAC GACGATACGG GTTTGCGTGA CGCCGCCGCA TCCATTCTCG TCGGTGTTGG GTCCATGTAC GATCCTGTCA CCTGCCAAGG TCTCGCTCAT TTTCTCGAAC ATTTACTCTT CATGGGATCC GAAAAATATC CCGGAGAAAA CGAATACGAA TCCTTTGTCG CGAAACACGG TGGAACAGAC AACGCTTGGA CAGAATGGGA GTATACCACG TATACGGTTT CGATTCCGCA AGAATACCTT TGGGAAGCCA TGGATCGCTT GGCGCAGTTC TTTGTGGCAC CACTCTTGTT AGAATCAGCC GTCGATCGGG AATTGAATTC TATTGAATCC GAGTTTCAAC TCAACAAAAA TTCCGACTCG TGTCGGTGGC AACAGCTCTT GTGCGCCACG TCTCGTCCCG ATCATCCCAT GGCCAAGTTC AGCTGGGGCA ATCTGCGCTC TTTGAGGGAG ATCCCCCAAG CGCTCGGCGT CGACCCACTC GTTGAATTGC GACGTTTTTA CAATCAATAT TACTACGCTG CCAATATGAG GGTTTGCGTT ATTGGAGCGT ACACGCTAGA CGAAATGGAA CAACGCGTAC AATCTATGTT TGCGAAAGTG CCAGCTTTGC CTCGCACGCC CGGGCCCCTA GCGCTACCGC TCAAGCCAGA AACAGGATTA TGTTCCTGGC AAGCCGAATA TCATAGTCCC TTGCGAGAGG TCGGTTGCCC TTTGGCGGAG CATGCTTTAC AGAAGATTTT TCGGATCGTT CCCGTCAAAG ACAAACACGC GCTGTCCATT ACTTGGCCCT TTCCAGGACA AATGGATCAG TGGCGAACCA AGCCGGGCGA CTTCTTGGCG CATTTATTGG GACACGAAGC AAGTGGTTCG CTGCTCTCAT ACTTTCGATC CCAGTCTTGG GCGACTAGTT GCATGGCTGG TGTGGGCGAA GAAGGTAGTG AAAGAGCGAG CAGCCACGCC TTATTCAATA TGTCCTTCGC GCTCTCGAAA GAAGGCCTGG AGCATTGGAG AGACATGGTT GCTGCTGTTT ACGAGTACAT AGGTATGTTG CGTTTCAAGT CCGAGCATGG TTGGCCGGAA TGGATTTTCG ATGAGTTGCG CAGCATTCAC GAAGTATCTT ATCGCTATGG TGACGAGGCC TCACCGGAAG ATATTGTTGA AGCCATGACG 
GAAAGCATGG CGCCACACTA CCGATTGCCA CCTGAGCGTT TGCTGGATGG TCCACATCTA CTGTTTGGAT TTGACGCGGC GGCAATTTCT TCCCTGTTGG ATTGCATGAC TCCTCAAAAT GCACGTATCG ATTTGACGTC ATCGTCGTTT GGACGGCCAG CAGATTTCGG TGTTGTGATT GCGGAAGATT CCACCGACAC TCTTGTTACG GATCTTCAGA TCGCCGATGA GATGGAGCTA TTCGATGCGT CGGTAGCTGG TCCGCCTCAA ATTGAGCCCA TGTTTGGTAC ATTTTTTTGG TGTTCTGATG TTCCCTCTGA CTGGATTGTG GATTGGTGCT CGTTGGCGCG ACCGCAAGAG CCTACTTTGC GCATTGGTTT GCCGCCACGC AATCCGTTTG TTCCAGAAAA GTTTAATCTC AAACCCTTGC CTTCCGATGA TGCTCGACAT CCTTTGCTGA ACTCTTCATT AAAGCTTTGT ATTGCTGTTG GCAAATCGAA GCAATGGTTC CCGGCGACAG TTGTTCAGTA CAATGAAAAG AAGAATGCTT TGCTACTGTC TTACGAAGAC GAAGATGAAC AATGGCATGT CTTGGATCGA CATATTGAGA CGTTTCCTCC TGATCAGATT ACTCCCGACT TTGAAGGAAC AATGGACGAG AAGAAGGTCA AGTATCGCAT CGTGGCGCTC GCACAGCCAG GCATGGGTCC GTTGCGAAAA TTTGCCGACG ACAGTGATTT TGCCGCCGAG AATGGCACAG CCTTCCCCCC CATTCCACCG GCGTTACCAC CTTCTAGACT ACCCAAACAG ATATGCAACT CGAACTTGCT CAAAATGTGG TATCTGCAAG ATCGAAGCTT CCATCGACCT ATTGCTGAGC TACGTCTAGA AATTATTTGC GGAAAAGCGA ATAGTTCGCC GCTGCACAAG GCTTGTGCCG AATTGCTGGT CGAGCTCTGC GTTGACAATT GCCTGGAAAT GACTTATTTG GCTAGTGTCT GTGAACTGGG CTCGTTATTG GTCGCGACTG ATGTGGGTTT CTATTTACGC TTCCATGGGT TTGACAACAA GCTTTCGGAT CTGTTCGAAA GGTGCATAAT TGTTTTCCTG AGTTTTCGGC AGGAAGTGGA TACCTTGCCG TCCGGTATTG ACGGATCAAG ATTTAGGGCT TGCTTGGAAG TTCTTCGTCG AAGATATCGC AACCAGGACA TGTCCGCTTC ACACCTTGCC GGAAACTTGC GACTCCGTGC TTTGCGACCG AGCATCTGGT CGGCGAACAA AAAATTGCAT TCAATCAAAG ACCTTTGTGT TCCTTTATTC GCAAAAACGG TTTCAGAAGT CTTGGCCGAT TTTGCCACTG AATGTCTCCT CCACGGTAAC ATAGACCTTT CAGATGCAGA CCGCACGAAA AAGATGATCA TTTCGCTTGT TGGAAACGCT AGGTGGCAAG GGTCTTCCAC GTAAAAAGTA TCCAGCCCAG TCGATGATCC GCATTCCGTC GGTTGACAAA CCAGTTTCCC TTATTGCCCC TTCAAAGGAT CCTGGGGAAC CAAACACGGC AGTGGAAGTC TATGTACAGG TGAACAAGGA CAATCTGCAC GAACGTGTTT TGATTGATCT TCTTGTACAC ATAATTGATG AGCCGATTTA CGACCAGGTA AGGGAGCCCC TTGATGCACG AATAAAGAAT CAAAGTGCAT ACACTGGACT CACTCTATTT TGTCTTTCCT ATTCAGATCC GGACGAAAGA CCAATTTGAA TATGATGTAC ACTGTGATAT TAGATGGTCG TACGGTATTA TGGGAATTGT 
ATTCAAAATT GTAACAAACG TGAAGAGTGC ATCTGCAGCT GTCGAACGCA TTGACAAGTT CTTGTCGGAC TTCCGTGTAG ATCTTGAGAC AATGTCGGCA GCCGAATTCT TGGAGCACCT GGTGGGGCTT TCAACTCAAA AGCTGGACAT GTTCAACTCT CTGTCCGAAC AATGCGATCA CTACTGGTGT GAAATTAGGG ACGGGCGATT TGAGTGGGAA GCATATCGGG ACGAAGCAAT TTGCCTTCGA AGCGTGCAGA AAGGCGAACT TCTCAAAGCT TTCGACAAAT GGTTGAACCC AGCTAGCCGT CGCAATGTTA TTGCAATTCA AGTGATCGGG ACCGGAGAGG GCGATGTGTC AATCGGTCGG CCTTCTCTCG AAAGTGACAA AGTCGATGAT TACTTGGACG CAGTGTCATC AGACTTTCAC ATTCTCTGCA AAGCGCAAAC GTGGGGCCGA GTGAACTCGA AGCTCTTTTG AGCGTGTATA TTGCTACAAA TGACCAAGTG GTGCGGCCAC TTTCATAGAA TATTTGTCTT CTAAAGTCCT ACCTAGCTAG CTGACTATCA ATGCGAGCG
|
Protein sequence | MEGDGVSPST AGLVVSHHDN GSESDDVVPT QADPEHDISV EIEGDEKKKI GRTTSVTASS PCATRDNVGD DVLRGPDLNE SKTSLDSKLY RQILLPNGLR AVLIQDTIAM HQNSRYELGG SDEEEDDNDI DDATADQATP ASTRLHHSRH GRSATESDDD SDVDDNDDDD TGLRDAAASI LVGVGSMYDP VTCQGLAHFL EHLLFMGSEK YPGENEYESF VAKHGGTDNA WTEWEYTTYT VSIPQEYLWE AMDRLAQFFV APLLLESAVD RELNSIESEF QLNKNSDSCR WQQLLCATSR PDHPMAKFSW GNLRSLREIP QALGVDPLVE LRRFYNQYYY AANMRVCVIG AYTLDEMEQR VQSMFAKVPA LPRTPGPLAL PLKPETGLCS WQAEYHSPLR EVGCPLAEHA LQKIFRIVPV KDKHALSITW PFPGQMDQWR TKPGDFLAHL LGHEASGSLL SYFRSQSWAT SCMAGVGEEG SERASSHALF NMSFALSKEG LEHWRDMVAA VYEYIGMLRF KSEHGWPEWI FDELRSIHEV SYRYGDEASP EDIVEAMTES MAPHYRLPPE RLLDGPHLLF GFDAAAISSL LDCMTPQNAR IDLTSSSFGR PADFGVVIAE DSTDTLVTDL QIADEMELFD ASVAGPPQIE PMFGTFFWCS DVPSDWIVDW CSLARPQEPT LRIGLPPRNP FVPEKFNLKP LPSDDARHPL LNSSLKLCIA VGKSKQWFPA TVVQYNEKKN ALLLSYEDED EQWHVLDRHI ETFPPDQITP DFEGTMDEKK VKYRIVALAQ PGMGPLRKFA DDSDFAAENG TAFPPIPPAL PPSRLPKQIC NSNLLKMWYL QDRSFHRPIA ELRLEIICGK ANSSPLHKAC AELLVELCVD NCLEMTYLAS VCELGSLLVA TDVGFYLRFH GFDNKLSDLF ERCIIVFLSF RQEVDTLPSG IDGSRFRACL EVLRRRYRNQ DMSASHLAGN LRLRALRPSI WSANKKLHSI KDLCVPLFAK TVSEVLADFA TECLLHGGKG LPRKKYPAQS MIRIPSVDKP VSLIAPSKDP GEPNTAVEVY VQVNKDNLHE RVLIDLLVHI IDEPIYDQIR TKDQFEYDVH CDIRWSYGIM GIVFKIVTNV KSASAAVERI DKFLSDFRVD LETMSAAEFL EHLVGLSTQK LDMFNSLSEQ CDHYWCEIRD GRFEWEAYRD EAICLRSVQK GELLKAFDKW LNPASRRNVI AIQVIGTGEG DVSIGRPSLE SDKVDDYLDA VSSDFHILCK AQTWGRVNSK LF
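The GC content figure in the record can in principle be recomputed from the gene sequence. The sketch below shows one way to do so, assuming the sequence is pasted as text in the 10-base blocks shown above; `gc_content` is a hypothetical helper, not part of any database tooling, and the database may compute its rounded value differently.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters (for example the spaces between
    10-base blocks) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used only as a small demonstration.
demo = "GTTGAACAAG GGTCTCGCTC"
print(f"{gc_content(demo):.1f}%")
```

To check the record, pass the full gene sequence string to `gc_content` and compare the rounded result with the reported percentage.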