Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42799 |
Symbol | |
ID | 7196164 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1161122 |
End bp | 1165356 |
Gene Length | 4235 bp |
Protein Length | 1343 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176733 |
Protein GI | 219109961 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.528448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCCCG TTGGTGACGC CCTGTTGGGA AATGATGTCG ACTTTCACCC TCCCTTTCCG TCCGACAAAG CCGACTACTT GCAAGCTACA GTATTGCAGG GTCTAAGCTC TGCGAGTGCT AGGGTTATTC TGTATTCTGT AGAGAGTCCA TCAGCAACGG TCACAGTCAG TCCTGCTGTG ATGAACCCGC AAAAGTTGCC AGCAGGTTCG AGATACTCAA CGTTCAAACT TGCGGTGTAC GATCCCTCGA TCCGAGAGAC GTTGTAATAC TGTTGTAATT TGATCCAGCA ATTATAGCAA TCCAGCAAGA ATAGCGTCGT TGCTAAGTAA GAGCGCGGTC TCTACTTTCC GCAACAATTC GAATATGAGC CACTCAGCAT TATCAGATTA TGAGGACCAT GACGAAGAGG AAGACGAGTG CCGCGTGTGC CGCGGTCCAG AGGAAGAAGG GTAAGCATCA CTTTGATACA GCTGTCGATC CTCGTGCCAA GCCATCCTGG ACCTTAAGCC GCAGACGAGC ATATTTTGGT CTTCCAATCG TGCCTTTTCG GTTTTGGCCG AACGAGTTTG TATGAAGCAC TCTCTCCAAC TTATCCCTGA TCTCACTTTG TCTGTTTTTG TGCGTCAGAC GACCCTTATT TAAACCTTGC AAATGCTCCG GGAGTATCGG TCTGACGCAT CAGGATTGCT TGCAATCCTG GCTTGAGGTG CAGCGAGGCG ACGGCCGGTG CGAGCTATGT CACACTGAAT TTCGCTTTGC TCCTCAATAC GACAATGATG CCCCCGAGCG ATTGCCCGCT TCTCAAGTTG TCTTGAGCTT GATGCGACAG TTTTTCTCTC GCTGGCTTCC AGTCTTGATA CGTTGCGTAT TTGCCGCCAG TCTCTGGCTA CTCGTAGCCC CCCTTCTTAC GGCGTATGTC TACCACGCAT GGATGCATCA ACCGTCGGTG GTCTGGGACC GCTGCTCCGA CTGGAGTCTC ATTCCTGGTG ATATGGTATC GGGGGCCGTC CTTGTGGGGG TCATCATTGT CAGCTTTTTA TCAATAATGA GTTTTATTGA CTTTTTGCGA GTGGAATGGC ACCCTGACGG TCGGCCCAGG CCGCGCTGGG GGGAAGAAGG ACCGGAACCA GCCGTGGGGG AAGCCCCCGC TCCCGATGAA AACGCCATTG ACAATGCCGT TTGGGATGCC TTCCAGCGAC AGGTTGTGGA GCGACATCGA CGGCAACGAG GAAGAGCGGT ACCTCAACGA GTCGAACATG AACTTCTGCA AGCACATGCG CCAGGACAGC CACGAACAGA GGAACGGTTC TCGAATGAAA ACGTGGAACT TGACGCATCT GGAAGCGACT CGGAATCGAG TTGGCAAGAC GACGATGACC GTGACGATGA TGACGATACG GTTTCGGATG ACGAGTGGAT CGAAAACGTT GAAGAAGACG ACGATAGCGT AAATGATGAC AACGATCGAG ATGAGCCACA GTTGCATCCA CCTGCTGTCC CCATTGTCGA CGATCGAGAC GACGGTGACG AAAATCCTCA GGCATTTGGT CGTAACAATT TTGACATGGA CCCGGATGAC GGGATGGATA TGGATATCAA CATCGCACTC GATGAGTTGT TGGGTGCACG GGGACCAATA ACTTCGGTGG TTCGCAATCT TTTGTGGCTG CTGGCCTTTA ACACTGTGTA CTTGGGCTTT TTTGGATTTA CACCAAAAGT GCTGGGGACC ATAACGTCAA CGATCTTTAG AAATACTACA GTATGGTCAC CCATGGTTTT CACTATTGTC ACCAACGCTA CGGTGCCCGA TGACACTCAA ATCGCAAATG AATCTCTGTC GATTTGGACA GCGTACAGGG CCATTGAGTC TGAGAGCGCA AGTGCCAATA CAACATTTCG ATTGCACGAT CTTTTTTTGG TACTCTTGGG CTATGCTTCG TGCGCTGGTA TGGTTGTTCT CATTCGCTTT CTGTGGTTAG CGTCTCAAAA GATTCGAGCT CTTCGGAGCG GGCGTGCTGA CAACCCAGTA CCCTTGCGTG AACTCCAAGA AGGGTTTGAG GAGATGAATC GAATCATGCG CATGGGTCCT GAACAAATGA ACATGGTAGA TGACAATGTT GCCATTCACG TTTTCCTTAC CACGACTCTC GATGCGACTT TGGCAATTAC GAAGGTCGGC GTACTTTTGT TCATGAAGAT GTTTTTGCTA CCTATTTGGT TGGGTCTATG CTTGGATGTT TCTTCGCTCC CAATCTTAGG AAGCTCGTTC GAGGAAAGAA TCGCTTACGC TGGGAAAGAC TTATTCTCTT TTCTCTTGCT TCATTGGGTC GTGGGCATAA CCTTCATGCT TCTAGTGACG GTATCTGTCC TTCAGCTTAG GGAAGTAGTA CATCCTGAAC TACTAGCGCA GACGATTCGG CCACAAGAGC CCCAACCAGA TCTTCTTGGT AACTTAATGA ATGAAAGCAT CATTACGCAT ATGAAACGAA TGGTGCTTTC CCTTGTCATC TACGTGGTGC TACTTGCTAT GTACATATAT CTCCCAATTC AGGCAATCAT GGCAAGTGGC GTTAGCGCAG ACCTATCAAT GGCACAACTC AAGTTCTGGT ACCCTATAAT GCCAGAGCTT CAGGTTCCTC TGGAGCTACT TACATTCCAT CTTTGCATGC TGGCACTTCT CGAAAAGCAC AAGAACTCAA TCGGTGAAAT TCAGCATTAC TGGCTCAAAT TTATATCGAG GCTTGTGGGG TTGACAGACT CCCTGATTCC CATGCGCGTA GATTGCTTTG AGTATGTTGG AGTGCTTCCA ATATTCGAGC ATGAAGCTGT CTCGCCTTTT TGGTCCAAGC TGGCAAAAAA CGAGAATGAG AGAGAGAAAC TTCTGGATGA AAGCGTAGCA ACTTTTCTGA AGAGCGACAT TCCACGAGTC AATATAGGTC AGTCCAAAGC TAATGGACAA CGTGTTCTGG CATCTAAAGA CTATGTTCGG CTTCCGGATG TTCTTCCTGG TAGATTGCTG CGCAGTCGTT CCGTTCTCAT GCCGACAACG ATTGGAAAGT ATCGTCTTCA ACGAATTCTC TCGCTCGACG GTACTCCCTT GATTGAAATC TGGAAAGAAG AACGAGGCAC GCCCATTGCA CGTCCACCAG AAGGGTGGGA TGATCTTGGC GCAGGAGGAG CTGATGTTCA AGGACGCTGG GCCTGGGGAC GGGAGAAGAA ATCCGTCGTT GAAGAAGCTA TTGCGCATCG AGTGGATTTT TTCGGTCGAC ATAAAGCATC AGTATCCTAT TGGGCCGTAT GGGTTAAACT CATTTTCATG TTCTTCTCAG CGTGGCTGTC GACCACTTGT TTCATTTGCA TTTCGCTTAT ATGCCCGTTG GCTTTTGGCC GAACTCTGTA TCATATACTC CAAGTACCTC AAGCCTACGT TCATGACCCT TTGGCTTTCG TTGTCGGCTG TCTTATCTTT TTCCCTGTGG CACGATGTAT TGGACGATGG AGTCTCTCTG GAGAGAACTC TCTGATGCAG AGGCTCTTCT CATGGCTGCG GTCTTATAGA CGCCCGCCCT CTGCCAAAGC AAGACTTTTG CTTGTCACAG CTATTACATG TTTCGGTCTT GCACCTGTCC TACTTGGCTT TATCTATCAT GCTCTCCTTG TGAAATTGCC TCCTTTCTTT GCTGGAACAC AGGAGTGGAT CGCACTCTCG CTTTTCTCAT CGTATTGGGC AACCGGATTT GTTTTGTTAT TTGCGTGGGC GCGGCTCTGT ATTGCTGAAG CATTTACCAA AAAGTTTTGG AAAGGTCTAA TGGGAGCTGC TGGCGATGTC GACGATAACG AAGGCAATGT CCAGAATAGT TTACGGTGGG CTTGGCAAGG AAAGGAAGGG CGCGTATCTC GGTTTACCAA GTGTTGGAGA AAGGTTGTCT TAACCTGGGA ATTTGACCAG GTGGATCTGA ATACGCTAGT CGACGACGTT GCCACTCCTG TTATCACTGC TCTAGCAAGA GTTGTAGTTC CATACTGCAT TTTTATGATG CTAATTGTCA GAACTTTTGG ACTAGACCAG GTCACTGTTG CTACCTTTGG CCGAATAGCA TTGGGGCTTC TGTGCGCCGA GGGAGTTTCA CGAACTTGGA GGATACAGTT TTCGAATTGG CTTGAAGCCG CCCACCAAAT GGCACGAGAC GACCGCTACT TGATTGGCGA GATGCTGATG AATTACGATG GATGA
|
Protein sequence | MSPVGDALLG NDVDFHPPFP SDKADYLQAT VLQGLSSASA RVILYSVESP SATVTVSPAV MNPQKLPAGS RYSTFKLAVY DPSIRETFNY SNPARIASLL SKSAVSTFRN NSNMSHSALS DYEDHDEEED ECRVCRGPEE EGRPLFKPCK CSGSIGLTHQ DCLQSWLEVQ RGDGRCELCH TEFRFAPQYD NDAPERLPAS QVVLSLMRQF FSRWLPVLIR CVFAASLWLL VAPLLTAYVY HAWMHQPSVV WDRCSDWSLI PGDMVSGAVL VGVIIVSFLS IMSFIDFLRV EWHPDGRPRP RWGEEGPEPA VGEAPAPDEN AIDNAVWDAF QRQVVERHRR QRGRAVPQRV EHELLQAHAP GQPRTEERFS NENVELDASG SDSESSWQDD DDRDDDDDTV SDDEWIENVE EDDDSVNDDN DRDEPQLHPP AVPIVDDRDD GDENPQAFGR NNFDMDPDDG MDMDINIALD ELLGARGPIT SVVRNLLWLL AFNTVYLGFF GFTPKVLGTI TSTIFRNTTV WSPMVFTIVT NATVPDDTQI ANESLSIWTA YRAIESESAS ANTTFRLHDL FLVLLGYASC AGMVVLIRFL WLASQKIRAL RSGRADNPVP LRELQEGFEE MNRIMRMGPE QMNMVDDNVA IHVFLTTTLD ATLAITKVGV LLFMKMFLLP IWLGLCLDVS SLPILGSSFE ERIAYAGKDL FSFLLLHWVV GITFMLLVTV SVLQLREVVH PELLAQTIRP QEPQPDLLGN LMNESIITHM KRMVLSLVIY VVLLAMYIYL PIQAIMASGV SADLSMAQLK FWYPIMPELQ VPLELLTFHL CMLALLEKHK NSIGEIQHYW LKFISRLVGL TDSLIPMRVD CFEYVGVLPI FEHEAVSPFW SKLAKNENER EKLLDESVAT FLKSDIPRVN IGQSKANGQR VLASKDYVRL PDVLPGRLLR SRSVLMPTTI GKYRLQRILS LDGTPLIEIW KEERGTPIAR PPEGWDDLGA GGADVQGRWA WGREKKSVVE EAIAHRVDFF GRHKASVSYW AVWVKLIFMF FSAWLSTTCF ICISLICPLA FGRTLYHILQ VPQAYVHDPL AFVVGCLIFF PVARCIGRWS LSGENSLMQR LFSWLRSYRR PPSAKARLLL VTAITCFGLA PVLLGFIYHA LLVKLPPFFA GTQEWIALSL FSSYWATGFV LLFAWARLCI AEAFTKKFWK GLMGAAGDVD DNEGNVQNSL RWAWQGKEGR VSRFTKCWRK VVLTWEFDQV DLNTLVDDVA TPVITALARV VVPYCIFMML IVRTFGLDQV TVATFGRIAL GLLCAEGVSR TWRIQFSNWL EAAHQMARDD RYLIGEMLMN YDG
|
| |