Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49856 |
Symbol | |
ID | 7198580 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 100363 |
End bp | 102983 |
Gene Length | 2621 bp |
Protein Length | 810 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184647 |
Protein GI | 219128916 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTGA ACGATGATCC GACCCTTCAT CCAAAGGAAT CGGTAGCATC GGACGCCGCC ACAGTGTACC TGCCTTCGGG TGAAGGGAGA ACCACTGCCG CTACTTTTCC TTATTCTCGA CACCGACCAC AGATACCGTG CATGACGGAT TTACGCAAAT GGCTCCAAGC ATCCCGAACG CCGCGAAGGT TGGTCGTACT ATTCGTCCTG CTGGAGTCTC TGGCATTCCA GAAAGGATCG ATGCGAGTGA CGGCAGAGAG CGAAGTACCC GACAACGTGG CAGGCCAGCT GTTGCGCCAG ACGGTAAAGG CTGCAGCAGC CTCGTCTCCG TCGCGAGATA GCTTGGACGG CGCCACTGCT AGCGGACTCG TCTGGAGTGA TCTGAGTGTA GTTTCGTCCA ATCGGGATGT CACCCTTCTG CACCCGTTTT CTGGATGGAT CACCAGCGGT CAGATTGGCG GGATCCTCGG ACCGAGCGGA AGTGGCAAGT CAACTTTTCT ATCGGCCCTC TCGGGTTCTT CACGACAACT TTACCAAACC GGGCAAGTCT GGCACTATCT ACATACCGTT GTGCACGGCT CCAAGGACAC ACAAACACCG ATACAATTGT CTCGAATACC CACACAAGAG GTTGCTTGGC TTCAGCAACA CGACGATTTT TTCAGCATGC TAACGGTCCG GGAGACTCTA GATTTGGCGG CATATTTAGA GCTACCCCAC TTAGTCCTGT CGCAACGAGA CGCTTTGGTT CAAACTCACT TGGATGCGCT GGGCCTAGCT CACGCCGCCG ACCGACCAAT TGGCTCGGAT CTTACCGGTT TGGGCACTGC ACGCTTATCC GGTGGTGAGC GCCGACGATT GTCGGTGGCG TTGGAACTGT TGACGGAAAA ACAACTTCTT TTAGCGGATG AACCCACGTC GGGTTTGGAC AGCAGTATCA GTGTCAAAGT GATGCAAAAT ATCCGAGACG TTTGTCGCAA GCGAAATATC CCTTGCTTAT GTGCAATTCA TCAGCCTCGA TCATCCATTT GGCACTTATT GGATACCCTC ATCCTGATGG CCCCCGGTGG ACGCGTGGTA TACGCCGGCC CGAAATCCGA AGCCGTCGCA TATTTTGCGA CCCAAGGGTA CCGTTGCCCC GACGCGACCA ACCCGGCCGA GTACTTTGTC GATCTCGTTT CCGTCGATAC CGAAGACGAG CAGGTGGCCG CTATCGACGA AGCACGGATT GATAAACTCG CTTCCGTCTT TCGTGACTAC CAACAAACAT CTTTGCTTCT GCCTGCCAAA CGACCTCAAG TGAATTTGAG TCTGGATATC GAGACTGATA TACAACAACC TTCAAATGGT CAATCGATGA GACGGGCTTT CCAAGAAAAA AGCCAACTTG GGCTCCTGAA ATTTCTGTGG GTTCCACGAT TGGGAGCCTT GCTAAAGAGG TCGTGGCGAC AAAATGTTCG CAACTGGGAA ATCAATATTT TCCGAGCGTT CGCCAGCGCG GGTAACGCGA TTCTCCTGGC TCAAATCTTT CCAACTGTCC GAGGAAGTGT CGCCAAAGCC AATAGTGTAG CCGACAGGGT GGCACTGTTG TCGTTCGGTG CAATCAACAT GTGCTTTATT GCATTTATGA AGACTGTCAC GTTAATCGCG GAAGAGAAAC CGGTTGTTCA ACGGGAACAA TCACGTCGTC AGTACTCGAG CTTGGAGTAC CTGGTGGCCA AGGTTTTAGC AGAATTTCCC TTGGACTCCT TGTTTTCCGC TATCTTTACA GCCTTCCTAA AAAAGTGCTC AGGAGTCCGG ATTTCGTGGG CCAAGCTGAC TGGGGTCTTT AGTTTGTTGA CGGTGTCTGG CGCTTCGCTT GGTTTGATGC TGGGCAGCTG GCTTCCAACC GAAAAACTGG CTACGACGGG CAGTATTCCA GTTCTAGTCG TATTGATGGT TGTGGGTATC AGTAAGTGCT GTTCCATGTG TTTGACTGAA TCGATTTGGT TTTGTGTTGT CCGCAGCCTC TCAATCATGG TGGATACTCT CGAATAACGT TGTAGTCAAT CCGAGTGGCG TAGATCAGTC CACCCCTCCA CCGGCGGTCG TGCAGGTATT GAAACGTTGT AGCCCATTTG CCTACGCTAT TGAAGCGCTC TGTCTTGGGG AGTACCCCGG AATGGAATTC GAACGTCAGT CAGGCTGGTT CGGCCGTATC AGGGACTTGC CTAGAATGGG CGGATTGGTA CGTTTGTTGT TCGTCTCACG TTTACTCTTG CGTCGAACGA AAGAAAATAT CACTGATCCT CGCGGGTGTT TTCTTTTTGC AATCGTTGCA GGCCATGGTT CGAAATGGTG ATCAAGTCCT GGAGGCGCTG GGCTTACAAG ACAAGGGGTA TGTTCGAGTC ATGCAACACC TTGGAGTATT GTCTGCTGCG TACCTGGCAG TTAGTTGGCT GGGCATGCTT GTACAGGGTA GAAAACATGG CATGCATGGT GCGGTCGAAG CGGACACTAG CCAGCACGTA CAGCGAACCA AGGCTCCAAA GGACACCGAG GGCAGCTTTC TGTCCAAGTC AACAACGGAA ACGTCAACAT CACAACGACA TTTGAAGGTT CCTTTAAAGA TCCGAGTCTA A
|
Protein sequence | MTLNDDPTLH PKESVASDAA TVYLPSGEGR TTAATFPYSR HRPQIPCMTD LRKWLQASRT PRRLVVLFVL LESLAFQKGS MRVTAESEVP DNVAGQLLRQ TVKAAAASSP SRDSLDGATA SGLVWSDLSV VSSNRDVTLL HPFSGWITSG QIGGILGPSG SGKSTFLSAL SGSSRQLYQT GQVWHYLHTV VHGSKDTQTP IQLSRIPTQE VAWLQQHDDF FSMLTVRETL DLAAYLELPH LVLSQRDALV QTHLDALGLA HAADRPIGSD LTGLGTARLS GGERRRLSVA LELLTEKQLL LADEPTSGLD SSISVKVMQN IRDVCRKRNI PCLCAIHQPR SSIWHLLDTL ILMAPGGRVV YAGPKSEAVA YFATQGYRCP DATNPAEYFV DLVSVDTEDE QVAAIDEARI DKLASVFRDY QQTSLLLPAK RPQVNLSLDI ETDIQQPSNG QSMRRAFQEK SQLGLLKFLW VPRLGALLKR SWRQNVRNWE INIFRAFASA GNAILLAQIF PTVRGSVAKA NSVADRVALL SFGAINMCFI AFMKTVTLIA EEKPVVQREQ SRRQYSSLEY LVAKVLAEFP LDSLFSAIFT AFLKKCSGVR ISWAKLTGVF SLLTVSGASL GLMLGSWLPT EKLATTGSIP VLVVLMVVGI INPSGVDQST PPPAVVQVLK RCSPFAYAIE ALCLGEYPGM EFERQSGWFG RIRDLPRMGG LAMVRNGDQV LEALGLQDKG YVRVMQHLGV LSAAYLAVSW LGMLVQGRKH GMHGAVEADT SQHVQRTKAP KDTEGSFLSK STTETSTSQR HLKVPLKIRV
|
| |