Gene Information |
Locus tag | PHATRDRAFT_46097 |
Symbol | |
ID | 7201437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 262409 |
End bp | 265438 |
Gene Length | 3030 bp |
Protein Length | 980 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180604 |
Protein GI | 219119700 |
COG category | |
COG ID | |
TIGRFAM ID | |
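The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive coordinates (which is consistent with the reported 3030 bp) and that the coding region carries one stop codon after the 980 codons of the protein. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 262409
END_BP = 265438
PROTEIN_LENGTH_AA = 980


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Base pairs needed to encode the protein (3 bp per codon).

    Assumption: one stop codon follows the last amino acid.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


if __name__ == "__main__":
    total = gene_length(START_BP, END_BP)
    coding = coding_length(PROTEIN_LENGTH_AA)
    print(f"gene length:      {total} bp")
    print(f"coding length:    {coding} bp")
    print(f"untranslated gap: {total - coding} bp")
```

Under these assumptions the gene span is longer than the coding region, so part of the listed gene sequence is not translated. That fits the record: the gene sequence begins before the ATG that encodes the leading MNDD of the protein.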
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CTTGGCGTTG GTGAATTTAA TCCAAGCTAC GCCTTGGAAT GAACGACGAC CGCAAAGAAG CCGAGGATTT GGCGGATGTC AGCTACGAAG TCCCCATTGA TTCGGGAGAA GTCGTGGATC GGGCACGGAT TCTACCTTCG CACGAGACGT ACGGTTCTAC GGATAACCGC TCGACGCGGC AACGCAAATG GATCGACGAC GGTGTGAATC AGAAATCTGC CAGTACCGGA AAACTGCGAT ACCCGAGCAG TCTTGATTTT GAGAGAGTCA TCAATGATTA TTCCATTCAA GCGACCAAGG ATCGAGTGCT CGTGCAAGAA TTAGAAGCTG ATAGGAATCC ACCAGATCCA TCGGAGACTG ATCGACACGC GCTTCTGCAC GCACCCGATG GACTCTTGAC CTACAATTCG TTTGAAGAAC CCAGTCGAGA CGATTCGTTC CGTCCGCCAC TACCCCCTCC GCCTCCACCT CCTCCCCCTA TGGCGTACTT GAAACGGAGA CCCAAAAAGT CACCACTGGG GTACACTGGA CGGACTGCTA CACGATGGCT ACTGACCAAT GCCACTGGGC TCATGACGGG GTTAATATCC ATCATGATTG TTAGTGCAAC GGATTTCATT CAGACGTGGC GATCACATAC TATAGACTAC TTATGGAAGA ACGACAAGAA CCATCACCGA TTAACGACTG TGTTTATTCT TTACGCGTCC GTCAATCTCT CTCTTGCTCT GGCGTCATCG GCTCTTTGTC TATTCTTGGC TCCAGAAGCT GCCGGATCAG GTATCCCCGA AATCAAAGCT TATTTGAATG GGGTGCGAGT CAAACGCTTC ACTTCCGTGC AACTCTTCTT TGTCAAAATT GTTGCCACGA TTCTTTCGGT ATCGTCGGGT CTCGCGATTG GACCAGAAGG ACCTCTGGTA CACATCGGTG CTATTCTAGG CGCGAGTTGT ACCAAGCTTT CTAGTCTCAT GCTCAGGGTC CTTCCCAAAT CTTGGTCAAC TCATTTGTGG TCGTTCGTCA CAATGGATCT TTCTCACTTT TCAACGGACG GAGAACGTCG TGATCTCGTT AGTATCGGAG CGGCTGCTGG CTTTGCAGCT GCCTTTGGTG CACCCATCGG AGGTCTACTC TTCACCGTCG AAGAGGCTTC AACATATTTT GATCAAAGCA TGTTCCTGAA GACTCTCTCG GCGACGGCGC TAGCGACATT CTGCTTGGCT GTACATCATG GTGATTTGAG CCATTACAGT ATCATTTCTC TGGGTGATTT CGAATCATCC GACTCCAATA TTTTCGTGAA TCGAGTCGAG CAAGTGCCAC TCTATTTTAT TGTCGCTATC GCTGGGGGGA TCCTGGGAGG ACTTTTTTGT CGATTCTGGG AGTTTCTGCA GCGATCTCGA CAGCGTCTCA AGCAACGTCG CTGGTCGTAC GAACTGCTTG AAGTAGCCTT TGTTAGCTTG CTTACGTCGT CGGTGACATA CTTTGCACCC TTCATGAGCT TCGCTTGCCG GGCGGTAGCT CCCACCGACG ACATCGTTTC CGAAAAGAGC CTTTTCGACC CTTGGATGTC GCACGCGCAT CAGTTCGACT GCCCCACAGG GTCAGTGAAT GAGCTCGGAA CGATCTTTTT CGGCTCACGC GACGACGCTA TCGGCACAAT CTTAAGTGAC CCTTCGCAAT TTGACCCGAG GACATTATGG ACGGTTGGCA TACTATTCTT TCCTCTTATG ATACTGACCC TTGGTGTGAA CATTCCATCC GGAATATTTA TGCCAACGGT ACTGATTGGC TGCTCACTCG 
GTGGCGCAGC CGGTCTCGCC TTTCAAAACT GGATCAGCGA GGATCTGTCG CCATCCACGT TCGCCTTGCT AGGTGCTGCT GCTCTCTTGG CTGGTATTCA ACGATCTACC GTCAGTCTTT GTGTGATTCT CGTTGAAGGC ACGGGACAAA CCAAAGTGTT GATTCCCGTT ATCATTACGG TTGTGGTCGC GCGCTACGTA GGAAATTTGG TCAGCAAGCA TGGCTTGTAC GAAACTGCCA TTGAAATCAA CCAGTATCCA TTTCTCGATC ACGAGCCCAA GAAGCGCTAC GATATATTCC AGGTTGGAGA AATAATGAGC ACACCGGCAG TGACATTGGG CCCGCGGGAA CGGGCGCACA CCCTTGTCAA GCTTTTGCGT GACTCTGGGC ATCACGGCTT TCCTGTGACA GAAAAAGACA CGGGAAAATT TCTCGGGCTT GTACGACGGG ACCAAATTGT TGCTTTACTG GAATGTGGGA TCTTTGAAGA CGAGCATGAA TGGGATGATG ATTCATCTAC TGGAACCAGT TCCATGCCGG GGACGCCTTC TACTGAATGG ACGCCAAAGC CAGGAATCGG AAAGTCATCG CTGATGCATT TGGCTTTCCA TATTCCAGAT GACCGCTACG ACTACTTGAC GGATAATCAG GGTGCAATCG AAGCAGTAGA AAATATTAAC AAAATGATGG TTGAAGACGA GTTCGACGCA AACGCTTGGC TTGTATCAAT TCGACGGAGT CGAGAACACT TGGCCGGTCT GGAAAATAAC GAAGAGGATT CGGCTTGCCC TCACATTGTG GTTGGAGACG ATACACTACC ACCAATTTCA CAGAACCGCC GATACATTCC TAAAGGCACA CTCGGGAGCA CCCGAGCAGC TGTTTCCCAG GGCCGCTTTG CTACGGTGAC TACCAATTCG AAAGGCGATG TCTACGTTCA ATGGCTGAAT CCAAGCTGCA AGCGCAAATG GGTCCATGTT GCCGCCGTCA TGAATCGTGG CACGTACTGT GTGACAGAGA CGACTCCTTT GAGCAACGCC CATTTTCTCT TCACCTCTCT TGGATTGCGC CATCTAGTGG TGCTTGGCGG CAAAAGAGGA GGCACGGTTG TTGGTGTTGT CACACGCATC AATCTTCTCA AAGATTTTAT TCAGGAGCGC ACAGGATGTA AGTTTTATTG AGGGGCGTCA CCTTATTTAT AGCATCAATT TAGACTCTCT ATGACATACC
|
Protein sequence | MNDDRKEAED LADVSYEVPI DSGEVVDRAR ILPSHETYGS TDNRSTRQRK WIDDGVNQKS ASTGKLRYPS SLDFERVIND YSIQATKDRV LVQELEADRN PPDPSETDRH ALLHAPDGLL TYNSFEEPSR DDSFRPPLPP PPPPPPPMAY LKRRPKKSPL GYTGRTATRW LLTNATGLMT GLISIMIVSA TDFIQTWRSH TIDYLWKNDK NHHRLTTVFI LYASVNLSLA LASSALCLFL APEAAGSGIP EIKAYLNGVR VKRFTSVQLF FVKIVATILS VSSGLAIGPE GPLVHIGAIL GASCTKLSSL MLRVLPKSWS THLWSFVTMD LSHFSTDGER RDLVSIGAAA GFAAAFGAPI GGLLFTVEEA STYFDQSMFL KTLSATALAT FCLAVHHGDL SHYSIISLGD FESSDSNIFV NRVEQVPLYF IVAIAGGILG GLFCRFWEFL QRSRQRLKQR RWSYELLEVA FVSLLTSSVT YFAPFMSFAC RAVAPTDDIV SEKSLFDPWM SHAHQFDCPT GSVNELGTIF FGSRDDAIGT ILSDPSQFDP RTLWTVGILF FPLMILTLGV NIPSGIFMPT VLIGCSLGGA AGLAFQNWIS EDLSPSTFAL LGAAALLAGI QRSTVSLCVI LVEGTGQTKV LIPVIITVVV ARYVGNLVSK HGLYETAIEI NQYPFLDHEP KKRYDIFQVG EIMSTPAVTL GPRERAHTLV KLLRDSGHHG FPVTEKDTGK FLGLVRRDQI VALLECGIFE DEHEWDDDSS TGTSSMPGTP STEWTPKPGI GKSSLMHLAF HIPDDRYDYL TDNQGAIEAV ENINKMMVED EFDANAWLVS IRRSREHLAG LENNEEDSAC PHIVVGDDTL PPISQNRRYI PKGTLGSTRA AVSQGRFATV TTNSKGDVYV QWLNPSCKRK WVHVAAVMNR GTYCVTETTP LSNAHFLFTS LGLRHLVVLG GKRGGTVVGV VTRINLLKDF IQERTGCKFY
|