Gene Information |
Locus tag | PHATRDRAFT_44703 |
Symbol | |
ID | 7197930 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 1322562 |
End bp | 1327088 |
Gene Length | 4527 bp |
Protein Length | 1453 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178682 |
Protein GI | 219115773 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.816493 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGGCT TTGGATCTTC ACTCCGCATA GCACGTCGGT CCGGCTGGGA AGGAGCCTAC CTCGACTACG AAACCCTCAA ACTCTTGCTT TCGCAAATCG AAGCCGTCTA CGAAGAAGAG ACGCACCGCC AAACATCGCA CGAATGGAAC GTCTTGGGCG AACAAGAAAC CGCAAGGGAC TATCGGGAAG AATTGTTTCT GGAGTCGGAC TCGGACGAAG AATACGCGTC CCTGGAAGAC GATGTTGACA TCAACGACAA CGACGATCTT TCTGCCGGTA TGGGCGCAGC CGACGAATCG GGACAAAACG TCGACTTTTC AATGGGAACG TCGTCCCGAT CGAGCAAACC CTTCAAGTTG TCCTATTCCC ACGAAGCCAC CTCGAGTGAC GAGGAAGAGG ACCAGGTGGA CGTTGGCTGT GGAGCCGGTA CCTTGAGTTC CTGGACACCC CGGACCTGGG ACACCAGTAA TATTACTCGC AAAAAGCCAC CGTACCAGCA GCAAGGCGGA AACGCCGCCA AACGCAAGGC TCATCGGTTG CCACGATTGA GAGAAGACGA CGACTTCTAC GCGAGCGCTG GCGGAAGTGG CCACGCGATT GGGGGAAGCG GCGTTCGCTC AGGGATCGGC ACTCTTCCCT CTCACGGAAC AACGCTCACC AACAATACCG CCACCCAAGC ATTTTACATG GCTGGGGGTT CCGGCAGTGA CAGTGCCGAG TATTTGCAGA CCTTGGGAAG CTCTCACACT CCGTCCATTC TCAACGCCCC CATACGTAGA ACGGGCGAAA GTGCGTCATT GCTGCCACCC ACCACACCAT CCCAATCGAA TGCGTCCTGG TATACGTTCC ATACATCTCC ATCCGTACGG ACGAGTCACA ATATGGACAA TACAACTCCA CCGCACGTAA CTTCAGCAGA TTTTCTGCGA GACAACCACC TTCTACCCGA GAGCATCAAT CTCACACCAC CATTGTCCGG ACGCATGAAC CACCGGGGAG AGGACGAACG TAATCGCGAC CGCGTCGCTC GTCGCAAACG TCGGCACAGA CTCCGTGCCA TACGCAAACA GAAGGAACGG AAGGTACCCC GGCACATTCG TGTCGCACAT TCCAAAGCTA GGGCCATTAC GGAACGGTTC CTGGGATTAC TGCGAGCCGA AACAGAAAAG GTCATGTTGT TTGCTCAGTC GAGATTGGGA GAATTGGCCG ATACAGCTGG TAGTTTGCGC TTTCCTTCGA TCGATGAGAT CGACTATGTT CCACACCAAA CACAGGGCTC GGCCGCCTAT GATTATCGCC TTACCGACGG TGGGATGCAC CCTTCGGCAT CCTCTTCAGA AGACGACGGC CCGGCAGGAG CATGTCCCTG GTCAGACAGC TCGGACGAAG AAGACGCGAC TACTGCCAAC AACGGTCGCG CAAGTCCCCG TGAAGGATCG TACATAATAC CCAAAGGTCA CCTGTCGGAA GAAACTTTGG GACGAGCTAA TCCCAGCGCG TCTAAAAGTG TTGCATCTTC CGCCAAACAG CTCAAACGGG CGGCCAAAAT TAAGGAGAGC AAAGGCCAGC GAAATGTTAA GCAGCACGCG GAAACGCTGT CATCCGTGCA GCGACAGATT GCTCATTTTG CTCGCCTACG AAAAAATAGA CCTGTCTTTC AACGGAATGA CCAAATTCTT GGGGAAGATA TGTTGCTGAT ATCGGCCGTG GAAGAGGCGG ACGGCTTTAC CGCCGTGGGG GTAGAACTCA TGCACGTGCT CCGCTACATT TGTGTCAACT TGATTGCCGT TCGCAAAATA 
TGTCGCAAGC ATGATCGTCT CCTGATGAAT CGAATGCTTG GAGGATACTA TCACCGCACA AGGAATTTGG GGCACGATCG GTACGCCCAC ATAGAGGATG TGCAAACCCT GGGTGGTCTA CTGGCACGAG TTTCTGGCGA TATCTACGAA GCTCATCCAG CGTTGATTGG GCATATGACG TATTATAAAC TCGTGGGAGT GTACGATCGA AAGGTCCAAA AGCTAGCAAA TTCACGTACC GTGCAAGTAA TTTCTGCCTG TCTGGCCCTG GCCCTATCAG AGCACGAAAT AACACAGTCG AGAGCCGAGA AGCTGACCAA GATCCAGTCT ACAACGTCAT CGGACACTCC AAAACGATCA GGACATGGCG GCACCTTGTC ATGGGTACAA GGATTGAAGA ACAGCTTGTT TCAGCCAGCT GTAAAATCCG ACGATGAAGA AGATGACGGG CCGCCATCGA CGACTTCCAC GGTATCTTTG ACAAGGCTGC GATTCACTGT CACATCAATT TTTGCTTTGC GAGAAGCAGC TCGATACAAA ATGGACCATT ATGCTACATA CTTGTCTAGA TCGTTGATTT CGTTTACAGG ACAGCCCGCT GCCGGCGAAG GACTGGACGG CTGCTCGCGA GTGACTCTCG ATTTTTTGGT GTCGTACAAT CCCGATGCAG CATTGCTTTT CGACTCTTCG GTACTGTACA ATGGTATCAA GCACGGACTT TGGCTCGGTG AACCTGTGAG TGAGGTCATG GTCTCAACGT TGGCAGCTGC TACAAGTCCA GATCCTGCGT TTCTGCTCAA TGGTTCATCG ACACTAAGTC CGGAAGAGCT TGCGGTGGCA AATGCAGTCA GCATCGTACC CGGACCTAAG AACTTGTTCC TCCAAAGTTT TTTGAAGGGA CTTCTTCCTA AACAACTTGC TGTGAAGGGA AGTCCGAACA TATCAGAGGC TCCATTGAAG ATCTTGCGAT TAAGTCAATT TTCATGCTTT CTTTATTCGG TAAGTTTATG CTGCCTTTTT CCGACTGAGA CTACGCTTCA GAGTTCATTA CGACTCACAT TATATTGAAT CCCTCTATCC AGATGAATTA TTTTGTCGCT CATTCGAGTA CAAATACTTT TACGCACGCG ATGGGGGCTC ATTCAGCCTA TTCCTCTGTG ATTATTGGGA TTCCTAATGT GGCAGCTATC ATCGTCGCGG TCATACATGT ACAAGCAGTT GCTTCGGAAA AGACCGCTGG GCGGTTTCCC AGAGACAGTA CAGGTCTTCT GCGTGGTTGT TTTCTGGTAT CTGCAATCAT GGGCGTGCTT GGAAACATTG CGCACAGTCT CGCGATCGAA AAAGAATGCC TAGCACTTGC GATATGTGGA CGTTTCCTCT TGGGGTTTAG CTCCGCCGAG ATATTGCATC GACAGTTGTT GTCGACTTGC TTGCCTGCGT ACATCGTTTC AGAATCATCG ATGCTTGTGC AGCTTCGAGC AGTCGGCACT GTTCTTGGGC TTATCGCCGG CACAATCATT GAATCCTTGC CGATTTCCTT CGATGCACTC GGAATTCGCG TCCTTCAAAC ATCGGGCTGG CTGATGGCAG TCCTCTGGAT GACTCAACTG ATAAATCTAT TATTGAGCTC AGGATTAGTT GAAGGGCGGA TGAAAGCTTT AGGGTGCGAT TTTTCGGTAC TAGAGGGACA AGAGATCAAT GGTCCTGCAA AAGCTTCGAG TGTCGACTAC AGTTCCGACT CATCTAGCTC CGTTGAGCCA GGAACACCTA CGAGCGTACT GTACCGATCT TCTTCAGACG TTACGTCCCA 
TGATCCTTTC ACTATAGCCT ACGGGAGTCA GCAAGAGGAG CAAATAATGT CAGACCAAAA CACTGCCAGC ACGGAATCGA CCCCTCTAAA GACTTTTGAT CACGGCATTC GACGACGGCC CTTTCGCCAA ATCACCACAT TTTTTGCTCG AACAAAGAAG CTGCTTGCAT TTCATATTGC AATCCCTATA TCTTTGGCAA TTCTCCTGTA CACTAGCTAT GCCACTGAAG TATTCTTCAC CGCCACTCCG ATTGTTGTCT GCCGATACTT TGGATGGAAC GGTGATCATG CCGGCGTCTT TCTCGGCGGC CTCGCTCTTT TGGTTCTCCC GACCTACTTT GTATGTGAGT TGGTGGCTCG ACGATACGAG GAACGTACTG TTCTCAAGGT AAGTCGATAG CAAAATTGCG GCGGTACGGA ATCGTTGGCA ACCTGCGATG GCCAATTTTG ATTTCCTGTT CCTTCTACAG CGATCACTGC ACGTTATAAG TTGGGGGCTG TTTTTTATGA TTAATTGGGT AAGCATATGC TCACTGGCGG GGCGGTTTTC AAGTCTGCTA GCGGAGAGGC CTGACGGTCC CATGGCTCAC CCTTACGACT GGAAGTTGGG AGTCTTCCAG TACGTGATCG GATTGACGTT TACCTTTATT GGTCTCACAT CGCTCGAGGG GGCGGCCCAG GCACTGGCAT CTAAGGCGTC ACCGTCTCGG CTAAGCAGCG TGTCGGTACA TGTGAGTAGT CTCGCTGTCT TTCTTTCCTT CACTGGCCGC ATCTTTGCCG ATGGTCAAAT CTTTTTTGTG GAGCTTTCGC ACAAGCTGAT AAATGTGGAC ATTGTGAACT CGCTGGTCGT TCCAGTATTC TTGGCTTGCT TTGTCGGCCT GTATTTTGTA AAAAAACACT ACTTTTTCCT AATGTAA
|
Protein sequence | MVGFGSSLRI ARRSGWEGAY LDYETLKLLL SQIEAVYEEE THRQTSHEWN VLGEQETARD YREELFLESD SDEEYASLED DVDINDNDDL SAGMGAADES GQNVDFSMGT SSRSSKPFKL SYSHEATSSD EEEDQVDVGC GAGTLSSWTP RTWDTSNITR KKPPYQQQGG NAAKRKAHRL PRLREDDDFY ASAGGSGHAI GGSGVRSGIG TLPSHGTTLT NNTATQAFYM AGGSGSDSAE YLQTLGSSHT PSILNAPIRR TGESASLLPP TTPSQSNASW YTFHTSPSVR TSHNMDNTTP PHVTSADFLR DNHLLPESIN LTPPLSGRMN HRGEDERNRD RVARRKRRHR LRAIRKQKER KVPRHIRVAH SKARAITERF LGLLRAETEK VMLFAQSRLG ELADTAGSLR FPSIDEIDYV PHQTQGSAAY DYRLTDGGMH PSASSSEDDG PAGACPWSDS SDEEDATTAN NGRASPREGS YIIPKGHLSE ETLGRANPSA SKSVASSAKQ LKRAAKIKES KGQRNVKQHA ETLSSVQRQI AHFARLRKNR PVFQRNDQIL GEDMLLISAV EEADGFTAVG VELMHVLRYI CVNLIAVRKI CRKHDRLLMN RMLGGYYHRT RNLGHDRYAH IEDVQTLGGL LARVSGDIYE AHPALIGHMT YYKLVGVYDR KVQKLANSRT VQVISACLAL ALSEHEITQS RAEKLTKIQS TTSSDTPKRS GHGGTLSWVQ GLKNSLFQPA VKSDDEEDDG PPSTTSTVSL TRLRFTVTSI FALREAARYK MDHYATYLSR SLISFTGQPA AGEGLDGCSR VTLDFLVSYN PDAALLFDSS VLYNGIKHGL WLGEPVSEVM VSTLAAATSP DPAFLLNGSS TLSPEELAVA NAVSIVPGPK NLFLQSFLKG LLPKQLAVKG SPNISEAPLK ILRLSQFSCF LYSMNYFVAH SSTNTFTHAM GAHSAYSSVI IGIPNVAAII VAVIHVQAVA SEKTAGRFPR DSTGLLRGCF LVSAIMGVLG NIAHSLAIEK ECLALAICGR FLLGFSSAEI LHRQLLSTCL PAYIVSESSM LVQLRAVGTV LGLIAGTIIE SLPISFDALG IRVLQTSGWL MAVLWMTQLI NLLLSSGLVE GRMKALGCDF SVLEGQEING PAKASSVDYS SDSSSSVEPG TPTSVLYRSS SDVTSHDPFT IAYGSQQEEQ IMSDQNTAST ESTPLKTFDH GIRRRPFRQI TTFFARTKKL LAFHIAIPIS LAILLYTSYA TEVFFTATPI VVCRYFGWNG DHAGVFLGGL ALLVLPTYFV CELVARRYEE RTVLKRSLHV ISWGLFFMIN WVSICSLAGR FSSLLAERPD GPMAHPYDWK LGVFQYVIGL TFTFIGLTSL EGAAQALASK ASPSRLSSVS VHVSSLAVFL SFTGRIFADG QIFFVELSHK LINVDIVNSL VVPVFLACFV GLYFVKKHYF FLM
|
| |
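The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon (the gene sequence shown ends in TAA). The helper names `gene_span_length` and `coding_length` are hypothetical, introduced only for illustration. Attributing the leftover bases to introns is an inference from the "Is gene spliced | Yes" field, not something the record states directly.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
start_bp, end_bp = 1322562, 1327088
protein_aa = 1453

gene_length = gene_span_length(start_bp, end_bp)
cds_length = coding_length(protein_aa)

# Bases in the gene span that do not code for protein; for a spliced gene
# these are presumably intronic (an assumption, see the note above).
non_coding_bp = gene_length - cds_length

print(gene_length)    # 4527, matches the "Gene Length" field
print(cds_length)     # 4362
print(non_coding_bp)  # 165
```

The computed span of 4527 bp agrees with the Gene Length field, which supports the inclusive-coordinate assumption. The 165 bp difference between the span and the coding length is consistent with a spliced gene.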