Gene Information |
Locus tag | PHATRDRAFT_38712 |
Symbol | |
ID | 7203776 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 25786 |
End bp | 33209 |
Gene Length | 7424 bp |
Protein Length | 1998 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182755 |
Protein GI | 219124952 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
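The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed Gene Length. The function name `gene_length` is hypothetical and not part of the record. The second part assumes the coding region includes one stop codon in addition to the 1998 residues. Under that assumption the genomic span is longer than the coding region, as would be expected for a gene marked as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
START_BP = 25786
END_BP = 33209
PROTEIN_AA = 1998

span = gene_length(START_BP, END_BP)

# Assumption: coding nucleotides = (residues + 1 stop codon) * 3.
coding_nt = (PROTEIN_AA + 1) * 3

# Nucleotides in the genomic span that are not accounted for by coding codons.
non_coding_nt = span - coding_nt

print(span)           # 7424
print(coding_nt)      # 5997
print(non_coding_nt)  # 1427
```

The 1427 bp difference is only an estimate of non-coding sequence within the span under the stated assumptions. The record itself does not list intron or exon boundaries.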
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTGTC CGAGGAAGGT CACTCCCGCT GTTCCGGCCC CTGCCGCAGC GACGGACTCA CCGGCTGATG CCGCGTCCGC ATCCGAAGAG GAAGAGGAGT TCGGAGGATT CGACTCCTCC GACGGTCAGG AGCCTTCGGG CACCGCACCG TCATTGCCGG CATCTTCGGA TGATGAAGGT ATGGCAAAAA GACTGCAAAG CCTTTGGCTC GCAGCAAGAA CACGTCTGAC GAAGTCAGCG TGATTGAGAA AAGCGTCATC AACGCAGAGC CTCACTTGTC TAAAGACAGT GACGGTCTCA ACTCCGTTCC TCGGCAAGAC CGTGTTGAAC AAAAGGCCTT GATGGTTGTC CTCCGTGACG TCATCGCGGC TGCAATGTTG GTTCCGTTCC TCGGCAAGAC CGTGTTGAAC GAAAGGCCTT GATGGTTGTC CTCCGTGACG TCATCTGTGT CCATTGTCAG TTGCGGCTGC AATGTTCAAC AACGGCATTA AATCATCTGA TGATTTCCGT CTTCTCACGA AGGAGGACAT CAATGATCTC TGCATGCGGC TCAAAATGGG CTCCATGCAT ACCAAGCGAA TACTCGTCTT CGCAAAATGG ATGCATCACG CACCCAACTC AGTCGATGTC GCCAAAGAGT TCACGGCTTC CGTGCTACGC TTTGAGATGA TGACTAGAGC CGCGGCGTCG TATGATAATG TGACTACGAC GGCTGCAAAG GCTGAAAAAT CGGCTACTAG CCTCTTGCCT GAACCGTTTG ATGGTTCGCA GAAAAAGTGG CTCACTTTTC GTTACGGTTT CGAAGCGTGG GCAGGCGCAA GTGGGTCCAC TTTTACCGCG TGCATCGCGC ACCATTCGGA TCGGTATTCG AAAGCCGACC CAACCGGACC CCATACGTCG CCCCGTGACG TTTCAGATTT GTTTGCACTC TCCCCAGTTG TCAACATCAC CAGGAACGCA ACAATCTTCT ATACTCTCAT GTCGCTAACC AGCGCTGGGG ACGCCTGGGG ACTTGTTGAG CCCCACGAGC ATACTAAGGA CGGACGCAGT GCCTGGATTT CTCTATGTGC CTTCTATGAA GGAACGAGCC AAGTGGGTCT CACTACCGAG CAGGCTCGCG CGATAGTCAT GGAGTCGGTG TATATAGGAC TGTCCAAACA GTTTTCCTTC ACCAAATATG TCGCTCGGCA TATCTCTGCC AACAATGCCC TTTTGCGTAA CAAGGAGGGC TATTCGGACG CTCAGAAAAC GAATTTCTTT CTTAAAGGGA TTACTGATCC GGCACTCCTT CCTTATAAGG CAACTGCCGA AGCGCGACTC GATGACGGGA ATTTTAATCG GGTCGTCAAC TACATGCGTA CGTCCGCGAC GAAACTCAGT TCCAAGGACA GAAGCGACTC ACGGAACGTA CGTCAGACAA AGACCAATGG CAGAGCCACC GGCAACCAAC GTGGTAACGA CAAGAAACGG CGTGGCTCGT CCAACCGTCC GTCGAACAAG GGGGCTGAGA GACCTTCTCG CCCTCATAAA CACACGTCTT ACCTCCTGAG CAGTGGGAAG CCCTAACCCC AGCTATCAGG GAGAGTATCT TGAGCGCAAA ACGCAGTATT CCACCCCCTG GCCGTGAGGC CAAAAGGGCT AAATCCTCAG ATACAGATAA CTCTAGTTCA ACCGTTGAAT CTTATTCACA ACCGCCTAGT AGTAAAAAAC CTATTCGTAA ACATACATGC GAAACTCACG TCCAAGTAGA TTCCAGTACC CCTGAAACCC TACTTCGTGA CGCACCCACA GACATTTCAC 
CCCACGTCAC CACCAAAAAA GTGACATTTG GTGCAGGTGT CCTCTTTGGT CGGTACGCTA ATCGCGTATC GTTGAATCGT ATGGTCCGCT CCGGCAGTCA TTTCGATCAA GCCCCTTGGC GCAAGTCGGA TTTCCGACTT AACGATGCGA CACTAGTTCG TATTCGTCAG AACCGCTCAC GCGGAACAAA AACTCCCACC AATTATGGTG AAGCGGTAAT TGATACTGGT GCAGACACCG TCTGCGTCGG TGCCGGGTAC TCTGTATTGT CATACACGGG TCGATCAGTC AGCCTTCGCG GTTTTCATGA TGACGGTGAA ACGTTTGACT GTGAACGGAT TCCGGTTGTC ACGGCGGCAA CCGCCTATGA TTATGACGAC GGAACAACCG TGATTCTCAT CTTTCATGAG GCACTGAACC TCGGACCAAC ACAGACCACC TCGCTCATTA ATTTGAATCA AATCCGACAT GCCGGACATC AAACCGATGA CATTCCAAAA TTCTTGTCGC AAGGCAAATC CCTTCACGGC ATCGAAACTC TCGACGGTGA TTATATCCCG TTTGAGCTCA AAGGTCATGC ATCTCTGTTG TATTCTCGCG TACCTACTCA ACATGAGCTT GACAACTGTC AGCACATTGA TCTCACTTGT GACCAACCTT GGGACCCCAA CAGTAAAGAT TGGGAAGATA ATGAAGCAAA GTACACACGA CACGATCGTT CTCGTCGTGC CTGCTACACC AACAGCGTAC CGGTTGACAT TCTCCCGGAT TGGCCTCCAC TACCCGTTTC CCCTGGATCC GTGGTACCGG ATTTCCATAA CCGTGTCATG AACTGTCACG GTATAGTACC ATGCGACATG TCTCCTAGTG ATGTCCGTCC TCGAGACGTC AACACGGTGA ACGGCGGTTG GAGTCTTCAT AAGGGAGAAG TGAGTCGTAT GGAAGTTAAC TCGGTCTTAT TGCATGAAAT TCCTAGATAC AATGATGTAT TGTGCGTTGG AGAGAACGCG ACTGAACATG TCGTGTTCTG ACAGTGGCAA CGGAGTGGGA CTGACAATAC TTATTGCCTG ACTAACGGAT GTAAATGGAT GCGCTCCATT TCCCGCGGCA TGCATGTTAT GAAAGGTATT TCCGGTTCCA CTATAAGTAT ATGTGAGCGC GTGTCATGGT ATAAGGATAC TATGACGGAA GAAAGATATT TGTGTATGTT TGAGTTTGTT CCTCAGGTTT TGTTGTATCT TGGCAAATCG GAGGAGCAAC AAGAATTAGT TTATAGCACT AATTCTTCAA TCATTGTTTT ATAAGTTCTT TGATCCGCAT TGCTAGTTTA CGCATAGGTC CTCGTATCCG CTGCCTAACC CGAGGGGAGC GTCGCCAACA AGTAGAGTTT TACAGTCCCT AAGTCCCTTA GGTGTATCGC CGGTTGTTCG TAATAACGCC CTCTCTTGTT TACCAGGCTC TGTTCCTAAC AGCGCTTCTG TCCATACGGT GTACATTGGT GTATATTGTA ACAGATTAAC AGTCTTGGTT TCTCGTCAAA CCACTGCTGC TTATCGCGCA GAATTTAGTA AACCCACCTT GAGTCCGTCT CAAGGAAACG TCAAGCTTGG TACCAGCGTA CCTCGTCCGA TAACACTGCC TACATTGTTT GTGTTGCATA AAGTATCAGA GGTGTCTAAG GAGACATTTA TCGGCTTGTG TTCCGCATCC TTGGAAGCTG CTACCACCAT CCTCCTGACC ACTGGTATTG AAAACCCATT AGTACTTCAG GGTTTGTCAG TAAGTCGTCT CGGCCTCCCT ACTTCCTCGG 
AATAACCGGC ACTGGAAATC GTCTAGGTTT TCTTCTGGAT TTGCCAGTGG GTTGTTTCGT GATTACGGGT TCTCTTGCCA AGTGAGGTAT TCGATACCAA TAGGCTTGGA AGACCAATCC TGATTCCGGT TCGGACGGGT ATACTCACCG GTTAATAAGA GTACGGGGGG GGGACTAATC AACCCTTCGA TAGTTACTGG CACGGAGCGA TATCACGCAC AGCAGTAAAA CTGCCACCGA GGTGGTAGAA GGGAGTGAAC CAGTGCTGGC AACGTCCACT GGTGTAGGGT CAAATAAAGA AGTCCGCTTA GTTACAGCGA AGGACCTGAA ATATAGTGGA AGACCTAACC CCGGAACGCT ACCGACCCGG CTACCCAGCC CAACTCAAGA TCTGACCCGG CTATCCAGCC CAACCCAAGA CCTGGTTGGT TACCCATTTC AATATGGTTA CCGCTACGCG GACAACGTCC AATGCAAGCG CTTTCGCTCA TTTACTTGAC ACGGTCCTTG CCCTACCGGC AACATCTCCT ATCCGTTCTA GTCTTGTACT ACATGAGTTA GATGATCTAG ACGGACTCCT TAGTATTTTT GAGGGCCAAA TCGAAACTCT AGAATATCTC CCTGTATCAT CCGAAGGTGA CGCAACCCCT GTCCCAATCA AGTTACGTAT GGGCCATCAA CAACTCTTAC GTTACTTGCT ACTCTGGATA CGTCAACTCG CGCACGACAA AGGAGGTCCC CTCTCGAACT ACGAACTCAT CTCTCTCATG AAAGAAGATT TCAGTTTATT TCGACGGTCT CCATCAACTC ATTTGCCGAA TGCAGTCCCA ACACCCAGTT CACAATCAAG CACTCCTTCG ACTATGGTTG GAAACTCTAG TCGTTCTGCT GTCGCTGACT TTAAACGCGG TGTTATACGT GATAAAACGC ATTATCCGGT GCTCAAGGAC GACCGATATT GGGACAACTT CTACCGTACT TTTGTCGTTA CCGCCGTATC GCACAATGTT GATAATGTCC TAGATCCAGC TTACTCCCCT ACGAATACAG ATGACATATT GCTATTCAGG GAGCAGAAAA AGTTCGTCTA TTCCGCTCTA GAACACTGCT TGCAAACGGA TATGGGTAAA AACATTGTCC GCGAGCATGC CTTTGATTTC GATGCACAAA CTGTTTTCGC AAAAGTGGTA AAACACTACA CCGAATCCAC AGCTGCAAAG ATCAGTTCTG GTACCACACT GTCATACCTG ACCTCTGCGA AGTAAGGCAG CTCCTGGACT GGAACCGCCG AAGCATTTAT TTTGCATTGG AAAAATCATC TTCGCATTTA CAACGACACG GTCCCGGTTA CAGAGAAGTT GCCACCACAA CTTTGCCTCA GCCTGCTCGA GTCCTCTGTA CGCGACGTTT CAGAGCTTCG CCAAGTCAAC ACTACCGCGA ATCTAGATTT AGCTAAAGGG GGGTCTCCCA TTAACTATGA AAATTACCTA AGTCTACTCC TTGCTGCCGC GACTTTGTAC GATAAAGGAA ACAATTTTTC TAATTCCCGT AGCCCAAAAT CCAAGCGCAG CGCCTTTGTT ACTGAGACTA CCTTTCCCGA TGATGAATAT GGCGTCGATT ACGACATTGA TTTGTCACCG TCCATCCTTT ACGAAGCGAA TGCTCACAAC CGCAGAGCAG GCGACCAAAA TCGAGAACGC CAGAGCAATG TCAATTGTGA GCGACCGTAT ATTCCTCGTG AGATGTGGGA TAAACTGTCC GACGATGCAA AGGAGATTCT CCGTGGTATG TCTTCTCCTA AAGAAGGAAA 
CGCCTCGGCC AACAGCAAGT CTTCATCTGC ATTTCATGCC AACTCCCATT CTTTAACCGA TACGGGACAC TCCTCATCAA CGGACGAATC ATTGCACGAA AATGACAATG ATAAATTCCA TGATTGCGGG AACGACACGG AACTGCTTGC ACACCTTACT GATCACTCAA GTAATATGGC AAATGGGGAC ATTCGTAAGG TCCTCGCTTC AGCTTCCTCC TATAAGCAGA ATTCGAAGAA CTCCCTGCAG TCAAATATGC TCGAATACAG TATTTCCCGA CACTCCGTTG CAGAGACTAC ATCCTCCCTC ATCGACAGAG GCGCAAACGG CGGACTTGCC GGAAGCGATG TTATAATCCT TAACAAAACA GGCCGTTCTG CGAGCATCAC GGGTATTAAT GACCATACTT TGCCTGATTT GGACATTGTC ACCGCCGCTG GCCTCGTTGA ATCACAACAT GGACCCATCA TTGTCATACT CCATCAGTAT GCCCACCATG GAAAGGGAAA AACGATCCAT TCTAGTGCTC AACTTGAGTA CTACAAGAAT ATTGTCGAGG ACCGTTCCCG TGTTTTAGGC GGTAAACAAC GTATCATAAC TCTAGATGAT TACGTTATTC CCCTACACGT TCGTCAAGGA CTAGCTTATA TGGACATGAG ACCTCCTTCC AATGCAGAGT TTGACACGTT ACCCCACGTT GTACTTACTT CCGATGTCGA CTGGGACCCG TCCATTATCG ACAACGAAAT TGACCTTGTC ACAGACTGGC ATGATGCCAT ACAGGACCTT CCCAGCGACC CGTACGTTGA ACCCCGTTTC AATTCAACTG GTGAATACTG ACATAGGCAC GTTGCGACCT TTGACATTTT CTCGTCATAT GACTTTGTTC ATCGGTCCAC GGCTATCGAT AATATACTCT CGCCAAATCA ACATGACATG ACCCGCAATT CGCACAATTA CGAAGCCTTG CGTCCTTGTC TTGGCTGGGT CTCCGCCAAC ACAGTCCAGA AAACCATCAT GGCCACTACG CAATTCGCTC GTGAGGTCTA TAATGCACCT ATGCGTAAAC ATTTCAAGTC TCGTTTTCCG GCACTTAATG TTCACCGGCG CAACGAAGCT GTGGCTACCG ATACCATTTG GTCGGACACA CCTGCTGTCG ATAACAGCGC TAAATTTGCG CAATTATTTG TCGGTAGACG ATCGCTTGTT ACCGACATTT ATCCTATGAA AACAGACAAA GAGTTTGTTA ATGCACTCGA AGACAATATT CGTCATCGGG GTGCCATGGA TAAACTCATC AGTGACCGTG CCAAAGCCGA GATCAGCAAG AAAGTTTCTG ATATTACTCG TGCTTACCAC ATTGATCAAT GGCAAAGCGA GCCCAACCAC CAGCACCAAA ATTATGCTGA ACGTCGAATT GCAACTGTTG AAGCAAATGC AAATAAAATT CTTAACCAAA CTGGTGCACC TAATTCTACC TGGTTATTGT GTGTTTCCTA CATTTGTTAT TTGTTTAATC ATTTGGCCCA TGAGTCTTTG CACGATCGCA CCCCCCTCGA AATTCTTAAC GGTAGTACTC CTGATATTAG TGTACTCCTT CAATTCCATT TCTGGGAACC GATCTACTAC CGACTTGAAG ACCCTACTTT TCCTTCCGAC GGAACTGAAA AAAAGGGCCA CTTTGTTGGA ATTGCTGATT CCGTTGGTGA TGCTCTTACC TACAAGGTAC TCACCAACGA CTCCCACAAG ATCCTTCTCC GATCTAGTGT TCGCTCTGCG TTGAAACCTA GTGAACCCAA TTTGCGTCTT 
GAGCCACATG AAGGGGAGAG TCCTCCTAAG CCCATCAACT TCATTAAGTC GCGCAGAACT GAGGACGGAA ATTCTTATGC CATCCACACG CTACCTGGTT TCACCCCGGA CGATCTCATC GGACGCACCT TTTTAACCGA TACCCAGGAC AATGGGGAGC ATTTTCGTGC ACGTATTGCC AGGAAAATTC TTGA
|
Protein sequence | MRCPRKVTPA VPAPAAATDS PADAASASEE EEEFGGFDSS DGQEPSGTAP SLPASSDDEV AAAMFNNGIK SSDDFRLLTK EDINDLCMRL KMGSMHTKRI LVFAKWMHHA PNSVDVAKEF TASVLRFEMM TRAAASYDNV TTTAAKAEKS ATSLLPEPFD GSQKKWLTFR YGFEAWAGAS GSTFTACIAH HSDRYSKADP TGPHTSPRDV SDLFALSPVV NITRNATIFY TLMSLTSAGD AWGLVEPHEH TKDGRSAWIS LCAFYEGTSQ VGLTTEQARA IVMESVYIGL SKQFSFTKYV ARHISANNAL LRNKEGYSDA QKTNFFLKGI TDPALLPYKA TAEARLDDGN FNRVVNYMRT SATKLSSKDR SDSRNWEALT PAIRESILSA KRSIPPPGRE AKRAKSSDTD NSSSTVESYS QPPSSKKPIR KHTCETHVQV DSSTPETLLR DAPTDISPHV TTKKVTFGAG VLFGRYANRV SLNRMVRSGS HFDQAPWRKS DFRLNDATLV RIRQNRSRGT KTPTNYGEAV IDTGADTVCV GAGYSVLSYT GRSVSLRGFH DDGETFDCER IPVVTAATAY DYDDGTTVIL IFHEALNLGP TQTTSLINLN QIRHAGHQTD DIPKFLSQGK SLHGIETLDG DYIPFELKGH ASLLYSRVPT QHELDNCQHI DLTCDQPWDP NSKDWEDNEA KYTRHDRSRR ACYTNSVPVD ILPDWPPLPV SPGSVVPDFH NRVMNCHGIV PCDMSPSDVR PRDVNTVNGG WSLHKGEVSR MEVNSVLLHE IPRYNDVLLT VLVSRQTTAA YRAEFSKPTL SPSQGNVKLG TSVPRPITLP TLFVLHKVSE VSKETFIGLC SASLEAATTI LLTTGIENPL VLQGLSLLAR SDITHSSKTA TEVVEGSEPV LATSTAQPKT WLVTHFNMVT ATRTTSNASA FAHLLDTVLA LPATSPIRSS LVLHELDDLD GLLSIFEGQI ETLEYLPVSS EGDATPVPIK LRMGHQQLLR YLLLWIRQLA HDKGGPLSNY ELISLMKEDF SLFRRSPSTH LPNAVPTPSS QSSTPSTMVG NSSRSAVADF KRGVIRDKTH YPVLKDDRYW DNFYRTFVVT AVSHNVDNVL DPAYSPTNTD DILLFREQKK FVYSALEHCL QTDMGKNIVR EHAFDFDAQT VFAKVVKHYT ESTAAKISSG TTLSWTGTAE AFILHWKNHL RIYNDTVPVT EKLPPQLCLS LLESSVRDVS ELRQVNTTAN LDLAKGGSPI NYENYLSLLL AAATLYDKGN NFSNSRSPKS KRSAFVTETT FPDDEYGVDY DIDLSPSILY EANAHNRRAG DQNRERQSNV NCERPYIPRE MWDKLSDDAK EILRGMSSPK EGNASANSKS SSAFHANSHS LTDTGHSSST DESLHENDND KFHDCGNDTE LLAHLTDHSS NMANGDIRKV LASASSYKQN SKNSLQSNML EYSISRHSVA ETTSSLIDRG ANGGLAGSDV IILNKTGRSA SITGINDHTL PDLDIVTAAG LVESQHGPII VILHQYAHHG KGKTIHSSAQ LEYYKNIVED RSRVLGGKQR IITLDDYVIP LHVRQGLAYM DMRPPSNAEF DTLPHVVLTS DVDWDPSIID NEIDLVTDWH DAIQDLPSDP HVATFDIFSS YDFVHRSTAI DNILSPNQHD MTRNSHNYEA LRPCLGWVSA NTVQKTIMAT TQFAREVYNA PMRKHFKSRF PALNVHRRNE AVATDTIWSD TPAVDNSAKF AQLFVGRRSL VTDIYPMKTD KEFVNALEDN IRHRGAMDKL ISDRAKAEIS KKVSDITRAY 
HIDQWQSEPN HQHQNYAERR IATVEANANK ILNQTGAPNS TWLLCVSYIC YLFNHLAHES LHDRTPLEIL NGSTPDISVL LQFHFWEPIY YRLEDPTFPS DGTEKKGHFV GIADSVGDAL TYKVLTNDSH KILLRSSVRS ALKPSEPNLR LEPHEGESPP KPINFIKSRR TEDGNSYAIH TLPGQWGAFS CTYCQENS
|
| |
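The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be normalised and sanity-checked follows. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The Translation table field in the record is empty, so the sketch assumes the standard genetic code, and that is an assumption rather than something the record states. Only the first three blocks of the gene sequence are used here, because the gene is marked as spliced and a direct translation of the genomic sequence is expected to diverge from the protein once the first intron is reached.

```python
def clean_sequence(display: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(display.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Assumption: standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; unknown codons become 'X'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First three blocks of the gene sequence, copied from the record.
excerpt = clean_sequence("ATGCGTTGTC CGAGGAAGGT CACTCCCGCT")

print(len(excerpt))          # 30
print(gc_fraction(excerpt))  # 0.6
print(translate(excerpt))    # MRCPRKVTPA
```

The translated excerpt agrees with the first ten residues of the listed protein sequence. The GC fraction of this 30 bp excerpt is not expected to equal the 49% reported for the whole gene.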