Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54686 |
Symbol | |
ID | 7202147 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 288517 |
End bp | 292557 |
Gene Length | 4041 bp |
Protein Length | 1226 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181175 |
Protein GI | 219121650 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATTCGACTC GTGCGTCTTT CGGAAAACGA ACAGGTTTTG TTTATCTACT CTAAAAAGAA TAAAGTCACC AAATCGATCA CAAGGAACAG GAATCTTACA CTAGTAGCTC GAACATTCTC CCTGTCGGCG CATAGTCTTT CACCGCATCA TGTCCAACAA CGGGGCGTCG TACCAAAAGA CCGAGGGCTC GTCCGGGAGT TACCAAGAAA TAGGTGGTAA CGGCGATGGT TCCTCGGCTG CGGGTGGTTC CGGCACTATC AAGAAAGGTT TGATCGCAAT TGGTCTCGTG GCCATCCTGG GCATTGTTTA CACCGTCACC AGTACGCGGG GCGATCCGCA AGCCAGCGTG CAGAAAAGCA TCGAGTCTTC CAGCACTGGA TTACAGGTCA AGGCGAACGG CAAGCTCAAG CTTTTCGACG ACAACAGTAA GTCGCGCTTT CCCAGATTGT CGCTCTTTCA GACTCTTCTG TATCTAACGG CTCTCTGTTG CTTGTGTTGT TTCGCAGACC GTTACGTGAT GGAAGATTAC GATGCCAAAT CAACCTTTTC CAGTTTCTTA CCCGGCGTCG CTGGATACTT TGGCAAGCCC GTTTGGGCAT TTTACGTCAA CCGCGGTCAA GCCATGGCCA CTTTCGGTAC TGAATCGAAG GATTATCCCA TGCTCGAATT CAATCCCGCT AACAAGGCGT ACCAATTGAC TCCCTTTGTG GGCTTTCGTA CCTTTGTCCG CGGCGTGCGC GGCACGACAT CCTTCCAAAC GGAACCCTTT GCTCCCGCCA AAACACGTAA TCTCGAAGAT GGGGATGATC AAGCTGACAA GACCAAACCC AAGCGTATCT TGTACGTTGG TGCCAACGAG CTTGAGATTC AAGAAATTGA CGGGGTCCTC GGTTTGACGA CCAACGTTCA ATATTTCATC CTTCCCGAAG AAGATTTTGG CAGTTTGATT CGTCGCACAA CGTTATCCAA CTCGGGAGAC ACAGACTTGA CGATTGATGT CCTGGATGGA CTTGCCAAAA TGGAGCCCTT CGGTGGACCG CTGGATGGTA TGCTCAAGGC CATGGGACGT ACTTTAGAAG GTTGGATGGG CGTCTACCAT GCGGATGATA CCTTGACTAT GCCCTTTTAC AAGCTTTCCA CGGAGCCTTC CGACTCGGCA TCGGTCAAGA TTGAGGAGTC CGGCCACTAC TGCTTGTCCT TTTACGAAGA TGATGAAATG CAGGCGACTC TACTGCCCAT TGTCTTTGAT ACGTCCAAGG TCTTTGGGGC CGACACATCC CTCGAAATGC CACGCGGTTT GTCCTCGTCC TCCGTGGACG ACATTTTGGA TACGCCCCAG TACGGGTTCG CCAAAACTTC GTCCGCATTT TCGGCCGCGA GTGACATTAC TCTCAAACCT GGTGAGAACA TTACTATTGC TACGGTCTAC GGTCGGGCTT TCCATGTGGA CGACGTCCCC AAGATTGCCG GGGTCGTTAC CGCGCCCGGT TTTGTCAAAA GCAAATTCGA ACGCGCCCGT TCTATGATCA ACGAACTTAC GGCGGGTGTC GAAACCAAAA CCGTAAAACC GCTGTTTGAT GGTACCGTCA AACAAATGTT CCTCGACAAC AGCCTCCGTG GTGGTATCCC GACCATCATG GGCTTGGTCG ATTCCGACGC GACCTACGAC GAAGATTCTC GCGTCAAAGT TTTCCATTCC TTTTCTCGTA TTCACGGTGA CATGGAGCGT GACTACAACG CCTTCAAGAT TGACGCCACA TACTTTTCGC AAGGTCCCGG TAACTACCGT GATGTCGCAC AAAATCGACG CAATGATGTG ACGTTCTTCC CGCGCATGGG TTCGTTTGAT ATTCAAATGT TTTTGTCTTA CATTCAAGCC GACGGGTACG AGCCGCTGAC GGTGGAAGCT GTAGTGTACC GCTTTGCGGA TTCTGACAAG GCCGTTGAAA TTGCCAAGAA CGTGACGAAC GATGCCAAAT CGGCCAAAAT GTTGGGCGAC GTTTTGAACG GCGGCCCATT TCGACCCGGA CAGCTGTTTG CCTTGTGCGA AAATTTGGGC ATCAACTTAG CTGTCACCAA TGAAGTATTC ATCAATAGAG TTTTAGAGTA CGCTGAGGAC CGTGCGATGG CTGTTTTTGG ACAAGGATAC TGGGCCGACC ACTGGGACTA CTACATGGAT ATGATCGATG CGTATCTCGA AATCTTTCCG GATGTCGAGG AAGAGGTCAT GTACGATAAG CCTTTGCGGT ACTTTTTCTC GACTGCAACA GTAAAGCCTC GTTCCGAAAA GTATGTGTTA ACGCTTACGT ACGATGCCAA AAGCAAGCAT GTCCTTCAAT TGGATTCGAC GTATTACGAT TCGGAAAAGG CTGAGGAACA GCAGGCCTTC TTAGACCAGA ACACTGGCCT TTTAGGTATC GAAGCAAACT GGCAGCGTAC ACGCGAAGGA GAGCCTTTCA TGAGTTCCGC CATTGCCAAG CTATTTCTGT TGGGTTCCAT CAAGTTTGCC ATGCGCGATG CCTGGGGCAT GGGCGTTGAA TACGAAGGTG GCCGTCCTGG ATGGTTGGAC AGTATGAACG GCCTCCCTGG TATGGTCGGT AGCGGCATGC CGGAAACATA CGAACTTTAC TTGCTCCTGC AATATGTCAA GAAGGTTGTG GACACTTACG ACCGTCCTGT GCACATTCCG GTGGAACTAC ACACAATGCT GAGTACCATG AACGACGCAT TAGATACTTT GGAAAAAGCT GGCTACGTGG ACACGGAAGA TATGCCTGCG GCCGTGCCCA AGGCGTTGTT TGAATATTGG GACATTGTTG CTGCCGCTCG TGAAAGCTAC CGCAATGATG TCCAGTACTA TTTCTCTGGC AACACGACGG CGTACCGCGC TACGACTGTG TCGAGGATGT GCGACCGATG GCTCGGAGAG ATCCAGAAAG GCATTGACCG CTCGTTCAAA GTTGCTACGG AAGGATATGG AGATGACGGT ACCTCAGGAA TTCCGGCTAC CTTCTTTGCC TACGACATAA CCAAGTGGGA GCTTAACAGT AACCGTAACG CCGAAGGTCT TCCTCTCGTC AACGCTCTCG CAATGAAGGT TCGCAAATTC CCTTTGTTTT TGGAGGGGCC AGTCCGCTAC ATGAAGATCA TTCAGGATAA TAAGGAACGA ATGAAGGACA TCTACGAGCG TGTTCTCGAC TCTGGCCTCC GCGACAAGCA ACTCAATATG TACTTTTTGT CCGCCAGTTT GGAGGGCCAA ACTTTTGATA TGGGTCGCCA GATTGCCTTT GCGCCTGGTT GGCTTGAGAA TCAGTCCATC TGGCTTCACA TGAGCTACAA ATATTATTTG CAGCTCTTGC GAGGAAAGCT CTACGAAGAG TTTTTTAGTG AGATGAAGGG AGGCGGTATC CTACCATTTA TGGACCCTGA TGTGTATGGT CGGTCACTTA TGGAGTGCTC GTCCTTCATC GCCTCGTCCG CCTTCCCGGA CCCGTCGATT GTTGGCGAAG GTTTCCTTGC TCGTCTCAGT GGTTCCACGG CCGAGTTCAT GGATATTTGG AAACTTATGT TCATTGGACC CGATCTTTTC TCTTACAACG ATAAGGGTAA AGTCGAAATG AAGCTGATTC CTGCCTTACC TTCGTGGTTG TTCGACGATC CCGATGGTGA TAGCGAGCCT ACTCTGGATG ATGATGGCAA CTACGTTGTC TCGTTTAAGC TTTTTGCATC CATTCCAGTG ACTTATCATA ATGCTGGTGG AAAAGACTTG TTCGGGATTG GCCCAAAGAG CTATAAGGTC AGCATGTTTG ATTCCAAACC ATTGGAGATT GATGGCCCAG CTATTCCAGA AAAAATAGCG TTCAAAATTC GTCGTATGTC CGGTGTCAAG TCGATTGACG CTTACTTCTA ATGTATGCCT TTTTTGGTGA TTGTCTCTAA ATTGATCAAT TTACAGTAAC TTGATGCTCA ACGATAGGGT GAGGATTTAT TTGTATCGCA AGTCGAAAAT TACGAGTATA AACTGAGATT TACGTTTCCA T
|
Protein sequence | MSNNGASYQK TEGSSGSYQE IGGNGDGSSA AGGSGTIKKG LIAIGLVAIL GIVYTVTSTR GDPQASVQKS IESSSTGLQV KANGKLKLFD DNNRYVMEDY DAKSTFSSFL PGVAGYFGKP VWAFYVNRGQ AMATFGTESK DYPMLEFNPA NKAYQLTPFV GFRTFVRGVR GTTSFQTEPF APAKTRNLED GDDQADKTKP KRILYVGANE LEIQEIDGVL GLTTNVQYFI LPEEDFGSLI RRTTLSNSGD TDLTIDVLDG LAKMEPFGGP LDGMLKAMGR TLEGWMGVYH ADDTLTMPFY KLSTEPSDSA SVKIEESGHY CLSFYEDDEM QATLLPIVFD TSKVFGADTS LEMPRGLSSS SVDDILDTPQ YGFAKTSSAF SAASDITLKP GENITIATVY GRAFHVDDVP KIAGVVTAPG FVKSKFERAR SMINELTAGV ETKTVKPLFD GTVKQMFLDN SLRGGIPTIM GLVDSDATYD EDSRVKVFHS FSRIHGDMER DYNAFKIDAT YFSQGPGNYR DVAQNRRNDV TFFPRMGSFD IQMFLSYIQA DGYEPLTVEA VVYRFADSDK AVEIAKNVTN DAKSAKMLGD VLNGGPFRPG QLFALCENLG INLAVTNEVF INRVLEYAED RAMAVFGQGY WADHWDYYMD MIDAYLEIFP DVEEEVMYDK PLRYFFSTAT VKPRSEKYVL TLTYDAKSKH VLQLDSTYYD SEKAEEQQAF LDQNTGLLGI EANWQRTREG EPFMSSAIAK LFLLGSIKFA MRDAWGMGVE YEGGRPGWLD SMNGLPGMVG SGMPETYELY LLLQYVKKVV DTYDRPVHIP VELHTMLSTM NDALDTLEKA GYVDTEDMPA AVPKALFEYW DIVAAARESY RNDVQYYFSG NTTAYRATTV SRMCDRWLGE IQKGIDRSFK VATEGYGDDG TSGIPATFFA YDITKWELNS NRNAEGLPLV NALAMKVRKF PLFLEGPVRY MKIIQDNKER MKDIYERVLD SGLRDKQLNM YFLSASLEGQ TFDMGRQIAF APGWLENQSI WLHMSYKYYL QLLRGKLYEE FFSEMKGGGI LPFMDPDVYG RSLMECSSFI ASSAFPDPSI VGEGFLARLS GSTAEFMDIW KLMFIGPDLF SYNDKGKVEM KLIPALPSWL FDDPDGDSEP TLDDDGNYVV SFKLFASIPV TYHNAGGKDL FGIGPKSYKV SMFDSKPLEI DGPAIPEKIA FKIRRMSGVK SIDAYF
|
| |