Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49706 |
Symbol | |
ID | 7198329 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 499228 |
End bp | 504166 |
Gene Length | 4939 bp |
Protein Length | 1222 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184471 |
Protein GI | 219128544 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTCC TGATCGGTGT CGCGATTTCT AGTCGAAGGG CTGAGTATCT GGTGTCGCTG ATTGTTTATG GAGCTATGCC ACAACTTTGT TTGAAAGGAA AGCCCTCTCT GGAAGCACGA GCCAGTTTCA TCTTAGACCG ATGCCATCGG CAACCGAAAA CTGCATTCAA AAGTATCCTG ATGTACTCAT TCTGTGTTGC AACAGTGTTA CCCAAAGTCA TCTTTAACGC AACAAGCTTT GTTCACTCTG TAATTCAACA ACAGTTACGT AACGCGAAGG CTTGTAAACC GACCGCAGAG CGGTAAACAG AAAGGTAAAG AAAGTTAACG AAGTCGGAAT TGGAAAGCGA AGAAAGCAAA CCTCAGAATG GGAATGTTCG CGTCGACCAC TCGCAGACAT CACGCTTCTC TGAGGAGTGT TCTTTGCTTG CTGTTCTTGA CGCTTTCCGT TTCAGCGCAG ACAACGCCGC AGCGTCAAAT TGTATCAGAG CTCTCTCTCT GCGTGGAGGT TGCTGGAGCG AGCCAAGACG ACGGGGCGTC CATATTTCAA GGGAATTGTA ATGACGGTAA CAAGCACCAG GTGTTCGACT TTATTCCTGC TAGCGGTACA GACAACGCGT ATCATCAAAT TCGGGCTTCC CACTCCAACA AGTGCCTTGG TGTGGCTGAT GGGGCATTAG CACCTGGGGC TGACATAGTG CAACTGTCTT GTGTCGACAA CGACCCCGCT ACGCTGTGGA AGCAACGTGA TGGGGTATTG ATCTTGCAGC ACTCTGGTTA TTGTTTGAGT GTCGATGCCA GTTCGCGTTC CACATCAGGG GCCTTTATGG TACAATGGTT TTGCGACAGT CCCAGCGTCA AGTGGCAAGT TAAGACAGTA TCCGAGCAAT ATGACGGACT ACAGCAACTT TTGAAACGCA GCAATTTTGC GCGCAGATCC GAATCTGGAT TCTACCTCAG TACGGGAACG ACGCACGACA ACCAGTTTCT TACTACGAAT TCGTCTTGGT CGGAGACAGA AACCGATGGG GCAACTTACT CGGGGTGTGG CAGTTATGGG CAAGACACAT TTAACCGTCA TCGCACTGGG CCAGCCTTTG TCTTCACCAT ACCGGGGTTC GTTGCGGGCG ATACGTACGA CGTAACAATG GGGTTCGCTG AGATGTGGGT CCCCCAATGC CAGGACGGAA AGCGAATTAT GAATATCACA GTCAATGGAA GAACGTTTGC GCAAGACTTG GATGTCTATA ATGCCGCTGG TGGGTGCCGC ACTGCTTTGA TTCTGACCAA GTCGTTCGAG GCCAGTAGCA GAGGAGAATT CGTGATCGAT TTTACCAGTA CGATTCACAA CCCTATGGTT TCATTCATTC AGATACAACA TGTTGGAGCA AACGCTATGC AAGCCCGAAT GCTAGCAGAT GTACCGACGG CTTCGCCAAA TCTAAGCATC GAGCCCACGA TTGCACGAGC AACAAACGTC TCCACCATAG TTTCAGAAGG ATTTAGTGGA CTCTGTGTTG ATATACTTGG GGGCAGTGTA TCAGATGGTG CTCAGGCTGT CCAGGCATCT TGCAACGGAG GTGATAGTCA GGAGTATCAA TTCTTGATTG TTGGTGAGGG CCAGTTCCAA GTTGTGGCCT CTCATTCACA AAAGTGTTTG GGTGTTGCCG ACTTGGATGT GACAGACAGT GCCGATGTCA TTCAATTGCC GTGTACGAGC AACACCCTTT GGTACGTTGT TGGCGGTGGT GCATACCTAC AACTTCGCGT TCTGCATAGT GCCAAATGCT TGAGCATATT CGGAGCCTCG ATTGCCCCAG GTACGAAGCT TGTTCAGGAA GCCTGTGATG AAGGCCCCGA TCAGCTCTGG CGCGTCGCGG ACGGCCTGTT CGCAGAGGAG ACGCTGGAGC CCTCGGCGTC GGCAGCTCCA AGCCCGAGCC CCAGTGTGAG CTTTGAGCCA ACGAGAGCGG GAGCGGGAAA GGGCCCCCCT GCTCCGACAA TTCAACCGAC ATTGTCCAAC GCTCCAAGTA GTATGCCAAG TCTGAGTGCC TTACGCGGAT CCTCGCCACC CTCACAACAA CCTTCACAAA CCGTAAGCGT TTCACCGAGT GTTAGCGTAG AACCAACGAT AGCACCAGGG CCTCCCTTGT TCTATCTCAA GACTGGAACT AACCAAGACC TTGCATACAT CTCCGGTGAT AATCTTGGTG TCTATTCCAA ACCTGGGGCC CAGATCTTAG GAGCTGGAAG GTACGGAGAA GCGACATTCC GGAGTCACCG ATGGGGAAAT ACGTTCGTTA TTACAATCCC TGGATTCAGT GTCGGTGACA TGTACACGAT ATCACTCGGC TTTGCTGAGA TCTACTTTTG CGCCGAAGGC AAACGTGTTA TGACCATTAC TGTAAACGAC GAGGTATTCG AGGCCGACCT GGATGTGGTT GCTGCTGCTG GAGCGTGTAA TACGGCTCTC GTGATGAGGA AAGATTTTGC AGCCAATAGT GATGGTGCTT TTGCAATTGC TTTTTTCAGC CCTATCAACA ACGCAATGGT TTCTTTTGTT GAGATCGATT TGGCGGGGAC GTTGTCGCCA ACCTTTTCCC CAGCTCCTAG CTTTGCTCCG ACTTTCAGTT CAGCACCAAC AGAGTCGTCG GAGGTGCCTT CATTGGCGCC TTCAACGTCC GTATCTCCGA GCACTTCACC AACTTCAAGT GCGGAACCCA CAGTAGCTCC GGAGCCAAAT CCGTGGGAAG GTGAATACGC TATGACTCTG GTTGCTGTAG CTGCTGCAAA TCTGGACGAT GGGAGAATTC TAGCTTGGTC AGCGTGGAGC AGGACTCATT ATGCCCGTAG TGTCGGAAAA ACTTTCATTT CCATTTTCGA TCCTGCAACC AATGAGAGTA CGGAGGGAGA AATCACAAAT ACGAACCACG ACATGTTTTG CCCTGGGACT GCTACTTTAG GCGACGGTCG AATTATGATT ACCGGAGGTT CCAATGCAGC GTCCGTTACT TTTTTCGACC CCTCCGCCAA TAGCTGGTAC AGAGGTCCCC CAATGAAAAT TCCGCGGGGA TACCATTCGA TGACTGTGCT AGGGGATGGA TCGGTCTTCA CTCTCGGAGG TTCATGGAGT GGTGCAGGAA GGGGCAACAG GGGCGGCGAG GTTTGGAGTC CAACCGGTGG GTGGGTTTTG AAGAGTAACA TTCTCATACC TGGAAGCTCA AGCTTGCTTA CGAACGACGT TGGTGGCGTC TTTCGGTCAG ACAACCACAT GTGGCTCTTC ACTGCCCCCA ACGGTAAGGT ATTTCATGCC GGACCGTCGA AAAGAATGCA CTGGATCGAC GTTGCTGGCG AAGGAGAAAT ATCCGACTCT CTTCTCCGTG GCAACGACAA CGACGCTATG AATGGCAATG CGGTCATGTT TGATATTGGC AAAATTTTTA CCGTCGGAGG CGCTCCCAAT TATGAGTATG GTGATAACGA GGGAACTAAG CTGGCCCACG TTATCGATAT CAATGCCGGA GAAGGGTCTG AGACTGTTCA GAGGGTAGGT GACATGGCCT TCGCAAGAAC ATTGGCTAAC AGCGTGGGCC TCCCCTCGGG AGAAGTGATT GTTATTGGTG GTCAAACGAA AGTATTTCTT TTCACAGACA GAGAAGCTGT TTTCGCTGCC GAGATCTGGA GTCCTAACAC AGGCCAGTTC ACGACTTTGG CGGAAATGAA GATACCGCGC ACCTATCACA GTGTAGCAAT CTTGATGAAA GATGGTCGCG TGTGGGCGGC AGGTGGCGGA CTTTGCGGAA ATTGTCCTAC AAATCATAAA GACGCTGAGA TCCTTACCCC ACCCTACTTG CTCAATGAGG ATGGTTCCCT GAAGACGCGG CCTGTCATAC AGTCTTCACC GTCTCGGTTA GTTCCCGGTG AGACAATTAA TGTATCGGTG GACACAAGCG GCAACCATAA TTTTGTGCTC ATGCGGATCT CCGCTGTTAC TCACTCTGTG AACAACGACC AGCGGCGCAT ACCGCTCACG ACTGTGGGTG GCGACAATAA TTCCTTTCAA TTGATTGCTC CGGACAACTA CAATGTGACT GTACCTGGAA CTTACTTTTT ATTTGCTATG AATGCTGATG GTGTTCCAAG CGTTGGAAAG ACGATTGTAG TTGACGCCCC AGACGGTCCG CAGCCAGACC CGCCCCTAAT TTTTCCTATC GAGTCTGCGG ACTTTAGTGG TCTGTGCGTA AATATTGCAA GCAACAGCTT CGAGAACGGG GCCCAAGCTA CTCAATATAC ATGTAATGAG AACGCTAACC AGCAATTTGA TTTCCAATCT GCTGAAGGAG GGCTTTACCG TATTGTCGCC TTCCATTCCC AAAAATGCTT AACGGTCACT CAGGGATCGA TGGGTGAAGG AGCAAACATC GTACAACAAC CTTGTGACGA CTTTTCCCAT CAGCTATGGA CCGTTACGGG ATCTGGGAGT GATCAGCAAC TCAAGGCTTC ACATTCCGGC AAGTGTTTGA GCATTTTTGA ATCCTCCATC GCGATTGGCG CGATATTGGT TCAATTGGAA TGCAATGAGG AACCCGCTCA ACTCTGGCGA ATTGATGACC GATTGAACTC ATCACAAGTA GAGGCCCCGT CCCTCTCCAA CCCACCGAGC TTAGCACCAA GTGTGAGTTC CGCACCAGCT GTAGCGTCGG CAAGTCTACC ACCGACAATA GCCCCAGCGC CGCCAATCTT CTCCCTTCGA ACGGGATCCC CGCTAGACCT CCCTTACATA TCGGCGGGCG GACCGGCAAG CAAGACTTAC CAGGATCCAA CTCCGTCAGC TATTTCCGGA GCCGGAGAAT ACGGGGATGC AACATTCCAA CGCCATCGAT GGGGGAACAC CTTTACTTTT ACCATCCCTG GCTTCGTGGC TGGCAACACA TACGCTGTC
|
Protein sequence | MPFLIGVAIS SRRAEYLVSL IVYGAMPQLC LKGKPSLEAR ASFILDRCHR QPKTAFKTQT TPQRQIVSEL SLCVEVAGAS QDDGASIFQG NCNDGNKHQV FDFIPASGTD NAYHQIRASH SNKCLGVADG ALAPGADIVQ LSCVDNDPAT LWKQRDGVLI LQHSGYCLSV DASSRSTSGA FMVQWFCDSP SVKWQVKTVS EQYDGLQQLL KRSNFARRSE SGFYLSTGTT HDNQFLTTNS SWSETETDGA TYSGCGSYGQ DTFNRHRTGP AFVFTIPGFV AGDTYDVTMG FAEMWVPQCQ DGKRIMNITV NGRTFAQDLD VYNAAGGCRT ALILTKSFEA SSRGEFVIDF TSTIHNPMVS FIQIQHVGAN AMQARMLADV PTASPNLSIE PTIARATNVS TIVSEGFSGL CVDILGGSVS DGAQAVQASC NGGDSQEYQF LIVGEGQFQV VASHSQKCLG VADLDVTDSA DVIQLPCTSN TLWYVVGGGA YLQLRVLHSA KCLSIFGASI APGTKLVQEA CDEGPDQLWR VADGLFAEET LEPSASAAPS PSPSVSFEPT RAGAGKGPPA PTIQPTLSNA PSSMPSLSAL RGSSPPSQQP SQTVSVSPSV SVEPTIAPGP PLFYLKTGTN QDLAYISAPS FAPTFSSAPT ESSEVPSLAP STSVSPSTSP TSSAEPTVAP EPNPWEGEYA MTLVAVAAAN LDDGRILAWS AWSRTHYARS VGKTFISIFD PATNESTEGE ITNTNHDMFC PGTATLGDGR IMITGGSNAA SVTFFDPSAN SWYRGPPMKI PRGYHSMTVL GDGSVFTLGG SWSGAGRGNR GGEVWSPTGG WVLKSNILIP GSSSLLTNDV GGVFRSDNHM WLFTAPNGKV FHAGPSKRMH WIDVAGEGEI SDSLLRGNDN DAMNGNAVMF DIGKIFTVGG APNYEYGDNE GTKLAHVIDI NAGEGSETVQ RVGDMAFART LANSVGLPSG EVIVIGGQTK VFLFTDREAV FAAEIWSPNT GQFTTLAEMK IPRTYHSVAI LMKDGRVWAA GGGLCGNCPT NHKDAEILTP PYLLNEDGSL KTRPVIQSSP SRLVPGETIN VSVDTSGNHN FVLMRISAVT HSVNNDQRRI PLTTVGGDNN SFQLIAPDNY NVTVPGTYFL FAMNADGVPS VGKTIVVDAP DGPQPDPPLI FPIESADFSG LDRWVKEQTS YNNLVTTFPI SYGPLRDLGV ISNSRLHIPA SV
|
| |