Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48479 |
Symbol | |
ID | 7203764 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 619377 |
End bp | 622349 |
Gene Length | 2973 bp |
Protein Length | 976 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182995 |
Protein GI | 219125451 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGAGA GTGGTACGAT GATCGACAGT GCGGGCAACC CTCCCCCCTC TTTCGTGGCA AAACCCTTCC CCTATAAAAC ACTTCCGGGT ATCCCGAACG CACAATCCCC ACCGGAACGC GATCCCGTCA CGAAACCATG CGATTGTTCC GTTACCGTGA GTCGCACGGG GCAGGGGGCA CCCAACTCAT TCCGCCGGTG GAGTAGAGGA CCCGCTACTC GTGACCATCA CATTTGTTTA TTGATGGATG GATGGATAAG CGGAGCGAGT GCGACAACAA TGACGACGAG CGAAAAGCAA CGGCGGGAGG AGCCGCCAGT GTCCCGACGT CTCCGACGGG ACACCAGCAA CCATAGAATC AAATCCACAA TTTACAATCT CGACGCACAA GACGATGAAC TGGAGCAAGC GTTGGAAGCC TACAATGACG ACACTACACG AACGAGTACT AACGATCGCT GTCTCCAAAC CAAAGGAGTC AAACGAACAA TGCCGGAACG CCAAAGTGGA CCGACGGTCC AGAGCGCATA CGAAGAGGAA GTGGAGCATA GTAGTACCAA CAATAGGCCA AACAAATCCA TGGGGAGGCG ACCACGACAA CCATCAACCA AGCGCAAGAA ACGAACGCCG ACAAAGGGCA TCACAAAAAG AATCCCTGTC GACGATCAAT CGGGAGATGA GGAAGAGACC GATCAAAAGA CACCAGCTCC GAAACGGCGA CAACCTCGAC GTTCAACCCG AACAATTTCC ACTATCCGAC CGAAACCATC CTTATCCGAA GTAGAATCGA GTAACGAAAG CCACCAAGCC GAGGACGAGG ATGAGGACGA TAATGAGTCA GTTATTATTG CACCCCTTTC CGCCATTACC AAAACGCTGA AGTGTCCTCA TTGCCCGAAA ACATTTGGTA CCGATGGTGG TCTACGTTAT CACGTCGCCA ACTTTGTTTG CCAACCTGAT TCACGTCCAG GAGGTCCCGT CGTTAGGGGT CGCCGTGGCA AGAGTGCCTC AGGCGGGATG GATGGTTCAT CCAAGCGTAA ATTTCGTAGA ATTCGGGGTG CCGCGAAAGA TCGTACCTGC CCAGACTGCC ATCGAGTCTT TACCAGCGTT TTGGGCATGA CCTATCACCG CGAAAAGGCT GTTTGTCACC GTAAAGGCCA AAAGGACGCG GGAACAGAGT CATCGGCCCT GCCTTTCGGT ACTTTGGAAG CAGGTTCAAA GTTTGTTACC AACTGGGGTG TCGTGCAAGT CATTCGGGAT GATCGTGCTA CACCCGTAGC TGAGCCTTTG CAGAAACCCA AGGACCTAGC CCGTTCCTTT CAGGCGCATA AAAGTAGTCG CGAAAATCAA CTGGAAAAAC AATACGCCAC CCTTGCAGTC TTGTCCCTGA CACGAAGAAA GCAGTTGCTG GAGGAGTACA AGACGAAGGC CGATTCGAAT ATTACCCCAC AGTCCGTATG GATGGCCTAC TTTGGTACAC GGGAAGACCC GCGGGAGATC CCAAAATCTC GACAAGCGGC TCCTTTTAGA CTGGGACAAT TGCGAGACGA CCCCTTGGCT CCGGAGAATG CCTTCGCGGA CCGCATTGTG GAATGCATTG CTATCGCCGA TGACCGGAGA CGATTCGTAG GTCTCTATGA CGATTCGGAA ACGACAGGGT CGTCATCTAT GGGGAGATAC CCGACCAAAC TCTTTTTAAG TCGTCGGCTT CTCACCGAAT CATACAACCC AAGTGGTTCC ATACACATGT GTCCGTCCTG TGGGCGGTCG TTTGGTTCCA AACCTGGTTG TAAGTACCAT TTGGTCTCCA AAGTATGTAC TAGTAAATCA GATGCGCAGG GAGAGCTGAG ACAGAAACGA CTTGGTGATA TTGAAGACAG GTCTCTCCGA CTTTTGGCAA AAGGGGCCGG ACCAGAGCGT CGTAAGTATC GTCCGCCACA GCCTGTGCAC CGTGACATCC CCGCAATGGA CAGTGTAACG CCACACAAGC GACTTGACAG CGTTTCCGAC GATGATGACA CCAATAAATT GGCCGCACAA TTACAAGAGA AGGAGCAAAA GACCTACGAC ACCAGGAAAG AAGAAAATGT ACCTTCACCT GATGAATGCA TCAAAGAGCT ATTTCAACAG CTTCGCTTTG AGCAGTCCAA GCAACTTGGC CCCATGTACA CTGATGTCTT TCGAGTGCTC AAGTTTAAGC GTTATGTTTC CAGACCTGCG AAGAAACGAA AGAAACGAAA GATTGTTGTA AAAAAAATCA AGGTAGTCAA GAAAGTAAAA AGATCAAAGG CCTCGAAAGA GAGCACCACC TTGGACGGGA CTTCAAAGAA ATCAAAGAAG ACCGAGGTGG TATCGACAAG AACAACATAC CCTTCACTCG TTCTACCGCC ACCACCTCTG CCAACACAAT TCATTCAAAC TCAAAGCCAT ATACCGATCC CTCCTATTAT CGATACACGA GTTCTGGTTG GCGAAGTGGA CGCCGGAAGG TACCCGAGTA TAAAACGGGA CCCCGCTCGC ACAAATCAAG ATATTTGTTC CATCTGCAAG AGAGGCAACC GCTTAGTAGC ATGCGACTTT TGTCCTTTAT CAGTCCACTT TCGTTGTGTA CGCACAAAAT ACTTGCTTAA AGATCCTGAA CCGGAAGACG ACTTCATGTG CAATACCTGT ATCCAGTACA TTTGGCACCG TCGTGCTCGG GCTGAGAAGC GAAGAATTCA GAAACTGGGT GAAGACAAAG TGCAAACTGA CCAAACGGCT GCGGAATCGG TGGCTCGACT TACAAAAGGC GCAGTGGAAG GCGAAGAATA CGAGTGTGTC GCATCCCAGG CCCGCCGTCT AGCCGATCTT TCGGAGCTAC TGATGGAAGC CAAGGTCCGT CTAAAGCAAA ACATGGCGAT GGCTAAAGTC AATGACATGC GGAGAGCTAT GATAAGTGGT CAAGTAGTTT CCAAAGGTTC CACTTCAATA TGA
|
Protein sequence | MRESGTMIDS AGNPPPSFVA KPFPYKTLPG IPNAQSPPER DPVTKPCDCS VTVSRTGQGA PNSFRRWSRG PATRDHHICL LMDGWISGAS ATTMTTSEKQ RREEPPVSRR LRRDTSNHRI KSTIYNLDAQ DDELEQALEA YNDDTTRTST NDRCLQTKGV KRTMPERQSG PTVQSAYEEE VEHSSTNNRP NKSMGRRPRQ PSTKRKKRTP TKGITKRIPV DDQSGDEEET DQKTPAPKRR QPRRSTRTIS TIRPKPSLSE VESSNESHQA EDEDEDDNES VIIAPLSAIT KTLKCPHCPK TFGTDGGLRY HVANFVCQPD SRPGGPVVRG RRGKSASGGM DGSSKRKFRR IRGAAKDRTC PDCHRVFTSV LGMTYHREKA VCHRKGQKDA GTESSALPFG TLEAGSKFVT NWGVVQVIRD DRATPVAEPL QKPKDLARSF QAHKSSRENQ LEKQYATLAV LSLTRRKQLL EEYKTKADSN ITPQSVWMAY FGTREDPREI PKSRQAAPFR LGQLRDDPLA PENAFADRIV ECIAIADDRR RFVGLYDDSE TTGSSSMGRY PTKLFLSRRL LTESYNPSGS IHMCPSCGRS FGSKPGYAQG ELRQKRLGDI EDRSLRLLAK GAGPERRKYR PPQPVHRDIP AMDSVTPHKR LDSVSDDDDT NKLAAQLQEK EQKTYDTRKE ENVPSPDECI KELFQQLRFE QSKQLGPMYT DVFRVLKFKR YVSRPAKKRK KRKIVVKKIK VVKKVKRSKA SKESTTLDGT SKKSKKTEVV STRTTYPSLV LPPPPLPTQF IQTQSHIPIP PIIDTRVLVG EVDAGRYPSI KRDPARTNQD ICSICKRGNR LVACDFCPLS VHFRCVRTKY LLKDPEPEDD FMCNTCIQYI WHRRARAEKR RIQKLGEDKV QTDQTAAESV ARLTKGAVEG EEYECVASQA RRLADLSELL MEAKVRLKQN MAMAKVNDMR RAMISGQVVS KGSTSI
|
| |