Gene Information |
Locus tag | PHATRDRAFT_48621 |
Symbol | |
ID | 7194827 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 396586 |
End bp | 400778 |
Gene Length | 4193 bp |
Protein Length | 1313 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183088 |
Protein GI | 219125650 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGGTCAAA CCGATTGAGT TACCGATTAA TTACAATACA ATCGGGGCCT TACGAGGCTT TCCCCTTTAT TCATCTTACT CTCTCATTGT ATCAACAATA CATTGCGTGA GTATAGAAGA ATTCGTTAGT TTATCAGACC CATGTTGTCA TTGCTGGCGT TGCTCTCGCC AACGTTCTTC CTTTACACGA TTGTCTCTCT TTTTGAACTG AGCCATGGCC CGAGTTCGCA AGGCAACCGG TCCTACCCGG AAGGGAGCGA CCGAAATGGT GCCGGAGGAG CGAGTGGAAG AAGAAACGCC CTTTGAGGCC GTTGAGTCGC CGTCCAAGGA CAGTGACAAT GAGACGCAAC CATCGTCCAT GGGCGATGAT AATGACTCAC AGTCTGAGAT CGAGTCGTAC AAGATTGATA CCGACATTGA TTTCAAGTAC AACCCAAACT TTTTTGAGGA CAAGAAAGCC CTTGAAAGTG TTCTAAGGAA TACTATGGGA TTTGGAGATA TCCATGTGAA GTCACTCCAA AACGAAGGTT TGAAGACCGC AAATGATTTC TTGCTTATTT CTATGAGTGA CATCAATGAT CTTTGCGACA AGCTTTTGTT TGCAACAGTT TACAGGGCTC GCCTACGGGC ATTTGCTACA TGGTTACGTA GTCAACCCGA CAACATAAAT ATTACCCAAG AATGGACAAT TCCAGTTATG CAATTGGAAA TGCAGATGAA GGCGCAAGCG TCTCCATTTG GAACCTCCGA GACCAACAAA ACAGACAAGT CAGTCTCCAG TCTGGTGCCT GATCCCTTTG ATGGTACACA GAAGAAGTGG CTCGCCTTTC GATACAGTTT TGAGGCATGG GCCGGAGCAA GTGGGCAATC TTTTGATGCC TGCATCTCAC ATGACTCGGA GCGATATTCC CGTTCAGAAC CAACAGCGAC CTACAATGAC ATCAATGACG AACCTGATTC ATTTAAATAT GACTGGAACG TTAAGTCAGT TCGCAATTCA AACATCTTTT TTATGCTCAA GTCGCTCACA AGCGGCGGAG ATGCATGGGG CCTTATCGAA CCTTACGAGG TTTCAAAAAA TGGCCGTCAT GCCTGGATCG CCTTGTGTGC GTTCTATGAA GGGGCCAGTC AGGTGGGCTT AACCACAGAA GAAGCTCGCA CTACAATTCT GACATCGAAG TATACCGGAC AATCCCGGAA CTTCACTTTT ACCAAGTATG TTCAAAAGCA TCTTACTGGT AACAACATAT TGGCTCGCAA CAAAGAGGCC TACACGGACT CACAGAAAAC AAACTTTTTC CTACGGGGAA TTGTTGATAC TGAACTTATG GCATTCAAGG CAGCTGCTGA AGCTAACCTA AATGAATGGA AGTTCGAACG CGTTGTCACG TACATGCGTA CTCAAGCCGC CAAGCTCACG AGCAAGGACG GTAAGGATTC CCGAAACATT CGTCAGGCTA CGGGCTTGTC GAAAAACAGG AACAACAAAA ACAACCGGCG CAAGCGCTCG GAATACCAAA GCCAAGGCAA AGGTAATAAA GAGTCGGGCA AAGGAAACAA TGCTCCTAGT ACTCAACTCC GCAAGGACAT CTGGGATGAA TTGTCTCCCG AGATAAAGGA TGCCATCAAA GCGGCAAAGC GTAGAGCGTC TACGGACCCG CGCACGGCTA AAAGAGCCAA GACTAGTAGT ACGGATAACT CTAACGCAAG CGTTGAGTCC TACTCGCCTG ATTTAAGGTC AATGTCTACT GAAATATTTA AAGCAGATGA TGACAAGGAC TTGGCTTCAG GTCAGCCTGA 
GGCGAAAGAT ACACCACTTC ATTTGGAACT TGAAGATACG CTTAAGAAAC CTACATATGG AGCAGGTACC CTATTTGGGC GATCTGCTGA CAGGGTCTCC TTTAATCGTA TGGTATGCAG TTCAGAAGAA AACAAAGTCA CTCCTTGGCG CATGTCAGAA CTACGGCTTG CGGATGCAAC AATAAGACGC ATTTGTAAGA ATCGCACACG AAATCCTACC GGCCGTTCAA CATGGGGCGA AGCTGCCATT GATACTGGTG CCGACACAAT TTGCATTGGT TCAGGCTATA CTGTACTTGC CCATACAGGT CGATATGTGA GTCTGCGAGG TTTTCATGAC AGTGGTGATA CTCTTGATCG AATTCCAGTT GTGACGGCTG CTACAGCATA TGACTACGAT GACGGAACCA CCGTTATTCT GGTTTTCCAT GAAGCTTTGA ATCTTGGGCC TACACAGTCC ACATCTCTCA TCAACTTGAA TCAGATTCGG CACGCCGGAC ATCAGACTGA TGACATTCCG AAGTTTTTAT CCCAAGGGAA ATCTCTTCAC GGAATTGAAA CAATTGATGG CGACTACATT CCTTTTGAAT TGAAGGGACG CACATCATTG TTGTACTCAC GAGTACCTAC TCGCCATGAG CTTGAGAACT GCCTGCACAT TGATCTCACA TCTGATCAAC CCTGGGATCC AAACAGCAAA GACTGGGAGG ATAATGAGCA GCGCTACACG CGTCATGACC GACAACGGAA TGCACGCTAT ACCGCAACTG ATAATGCGGA TGAGGAGAAC TTTTACCATG GGTATTTCTC TCTCCCTGAC TCTAAGGAGT TCCCGGTTCT ACCGGCAAAC AATAATGTTA TGAACCCAAA TGATGTCGTA CGCGAGATCA AATATGCTAC TGCACGGGTT TCAAAATCTA GCCCACGGGA TCTAGATGTC GATCGAGACA AACTTCGCCG CATCCTGGGA CATGTTCCTA TGGAAGTAGT TGACCGAACA CTGGAAGCTA CAACACAACT TGCGGAACGC TCTGGCAAAA TGCCACTGCA TCGACGTTTT AAAACGAAGT TTGAACAATT GCGATACCGC CGGTTGAAGT GTACGTTATA TAGCGACACT TTCAAATCTA CTGTTAAATC CTCCCGAGGA CACACGCATA CCCAAGGGTT TGTATGTGGT GATTCTTACT TTGTATACCA CTTTCTTATG AAAGCGGAAT CCGAAGCAGA CCAAGGTCTT GCGTCAATTA TACAAGATAT AGGAATTCCG GCACAAATTC ACACCGACAA CGCAAAAGTG GAAACCTTAA GCAAATGGAA GAAAATCACT TCCGGTCACT GGATAAAAGT CACAGTCACG GAACCATACT CACCGTGGCA AAACCGTTGC GAACACGAAT TCGGTGCGGT TCGGATCCAG ACACGACTTG TTATGGAAAC GACACAATGT CCAGAACAGC TTTGGGACTA CGCCATTACC TACGTGGTAA TTGTGCGTAA TAATACCGCT CGCAAAGCCT TAAATTGGCA AACGCCATTA ACGGTTATGA CAGGTGACAC GAGCGATATT TCAGAATTGT TGGATTTCGA GTTCTACGAA CCGGTACAAT ATTTTGACAA TCCTGAAATT AAATTTCCAC AAGCTAAGAC TAAAGTTGGT CGGTGGCTTG GTATTGCAAC AAATGTTGGA CAAGCTATGT GCTACTATGT CCTAACAGAC AAAGGAACCG TGATAGCGCG ATCCACAGTC ACACCACTTC ACAAAGTTGA TTCAACTGCT TTGCAAACCT CTCTTACAGC TTTTGATGCT 
ATGATAAGGG ATATTTATCA GCCTACTGAT TTTGCTCACA GCACTAAAAA GCAAGCAGCC TCGTTACGAC GAGATGAAGC AATGAAGGTT GCCAGAAAAA CTGGTGAACC TGAAGATCCA GGAGTCCGTA ATAGACATGT TCTGTATGAC TTAAATGAGG GAGCCGACCA TGACCAAGTG GAACCAGGAC TATCAGTTGA TGATTACTAC GGTAACGACG ACGAAAAAGA GTCTGGTTCG TCGGATCTCC TTGTCGGCAG CGAAGTACTC CTTACTAAGG GAGGTATACA ACATCTAGGC AAAGTCACCA ACCTTGATAA AAATGGCCAG CCCAAGGGCT CAAACGAAAC AACCAATTAT GTTGTTGAGT TCAATGATGG TACTGAAGAG ATTCATGGAT ACAATGCTCT GCTTGACGCT GTGTATAAGC AAGTCGATGA TGATGGTAAT GAATGGTATA CTTTTGAAGA TATTGTTGAC CATCAAAGGC GCCCACGTGG CGGCCGAGGA CGAACGAAAG GTTGGTTCCT CCGTGTTAAA TGGGCCAATG GTGAATACAC CTGGGAGCCT CTTACCTCTT TAA
|
Protein sequence | MARVRKATGP TRKGATEMVP EERVEEETPF EAVESPSKDS DNETQPSSMG DDNDSQSEIE SYKIDTDIDF KYNPNFFEDK KALESVLRNT MGFGDIHVKS LQNEGLKTAN DFLLISMSDI NDLCDKLLFA TVYRARLRAF ATWLRSQPDN INITQEWTIP VMQLEMQMKA QASPFGTSET NKTDKSVSSL VPDPFDGTQK KWLAFRYSFE AWAGASGQSF DACISHDSER YSRSEPTATY NDINDEPDSF KYDWNVKSVR NSNIFFMLKS LTSGGDAWGL IEPYEVSKNG RHAWIALCAF YEGASQVGLT TEEARTTILT SKYTGQSRNF TFTKYVQKHL TGNNILARNK EAYTDSQKTN FFLRGIVDTE LMAFKAAAEA NLNEWKFERV VTYMRTQAAK LTSKDGKDSR NIRQATGLSK NRNNKNNRRK RSEYQSQGKG NKESGKGNNA PSTQLRKDIW DELSPEIKDA IKAAKRRAST DPRTAKRAKT SSTDNSNASV ESYSPDLRSM STEIFKADDD KDLASGQPEA KDTPLHLELE DTLKKPTYGA GTLFGRSADR VSFNRMVCSS EENKVTPWRM SELRLADATI RRICKNRTRN PTGRSTWGEA AIDTGADTIC IGSGYTVLAH TGRYVSLRGF HDSGDTLDRI PVVTAATAYD YDDGTTVILV FHEALNLGPT QSTSLINLNQ IRHAGHQTDD IPKFLSQGKS LHGIETIDGD YIPFELKGRT SLLYSRVPTR HELENCLHID LTSDQPWDPN SKDWEDNEQR YTRHDRQRNA RYTATDNADE ENFYHGYFSL PDSKEFPVLP ANNNVMNPND VVREIKYATA RVSKSSPRDL DVDRDKLRRI LGHVPMEVVD RTLEATTQLA ERSGKMPLHR RFKTKFEQLR YRRLKCTLYS DTFKSTVKSS RGHTHTQGFV CGDSYFVYHF LMKAESEADQ GLASIIQDIG IPAQIHTDNA KVETLSKWKK ITSGHWIKVT VTEPYSPWQN RCEHEFGAVR IQTRLVMETT QCPEQLWDYA ITYVVIVRNN TARKALNWQT PLTVMTGDTS DISELLDFEF YEPVQYFDNP EIKFPQAKTK VGRWLGIATN VGQAMCYYVL TDKGTVIARS TVTPLHKVDS TALQTSLTAF DAMIRDIYQP TDFAHSTKKQ AASLRRDEAM KVARKTGEPE DPGVRNRHVL YDLNEGADHD QVEPGLSVDD YYGNDDEKES GSSDLLVGSE VLLTKGGIQH LGKVTNLDKN GQPKGSNETT NYVVEFNDGT EEIHGYNALL DAVYKQILLT IKGAHVAAED ERKVGSSVLN GPMVNTPGSL LPL
|
| |
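The coordinate and sequence fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not the database's own method: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the stated Gene Length), and that a GC value is simply the fraction of G and C letters in the sequence. The function names `span_length`, `clean_sequence`, and `gc_fraction` are hypothetical helpers introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Drop the spaces and line breaks that group the sequence into 10-letter blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C letters in a sequence (grouping whitespace is ignored)."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Start bp and End bp taken from the record above.
    print(span_length(396586, 400778))  # 4193, matching the Gene Length field

    # First two 10-letter blocks of the Gene sequence field, as a short demo.
    print(gc_fraction("CGAGGTCAAA CCGATTGAGT"))
```

To check the whole record, paste the full Gene sequence field into `gc_fraction`; the grouping spaces and line breaks are stripped before counting.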