Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48553 |
Symbol | |
ID | 7194722 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 195011 |
End bp | 198846 |
Gene Length | 3836 bp |
Protein Length | 952 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183174 |
Protein GI | 219125829 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.102709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACTTAGATGG GAGTTGAAAT CCCTCAAGCT GTACGAATTG GTCTTGCGAG ATTTGAAAGG AATGAAAAAG CGCTTATTCG ACTCCTATCA GTGTTTTCCC CGCAATCGAC AAATGCGCCT TCGTTTATGA TCACAATTTA CAACATTTTT TGTTTTAAGC TGTGTTCCGT AAACCATGCT TGCATCACCC CACTTCCAGG CCTCATTTGC CCTTTGCTAC TTCCTTCTTG GTCGATTTAG TTTGGGCGCC ACATTGGAGT TTGACAAGTT TCGTGTCCTC CCAAGCAAGT ATGGAGATCA GAACCTCGTC TTCCTAAGAA AAGCAACATC GAATGCTCCA GATGTAAATC ATGATTTCAT AAGTGGCCTG AAGTAAGTAG GATTCTACGT TCTTGGGTCG GAAACAACAA TCTTGATTCT AAACGTGCAT CCCTTGCGAA CCATTAGCTT GGAACAAAAC CCAGCCTGGC TGCTTGCCGA GTTGTCCAGG TCAATGTCCA TGGAGATATC TTCAAATCCT ACCGACGTCA ACGAGGGAGG TGTAAATACT TCCCCAACAA ATTTACCAGA CTCAGGCCAA TCACAGTCAA CTGTATCAAC CTCACCAACG TCCGCACCGT CCATATCTAT CTTTCCGTCC GTTTCGGCTG AACCGTCGTC TACTCCTTCT ATCTCTCAAG CGCCATCGCT CTCAGTAGGT CCATCAGTCG CGCCGTCCAT GACTATCTTA CCATCCGTTT CGAGGGAACC GAGTTCTTCC TCATCGAACT CTCTAGCACC ATTACTCTCA GCAGGTCCAT CCGTCGCGCC GTCCATGTCT ATCATGCCGT CCGATTCGAT GGAACCGAGT TCTTCTCCTT CGATCTCTCA ACTGCCATCG CTCTCAGCAG GTCCATCGGC CGCGCCTTCC ATGTCTATGT CACCATCCGT TTCGATGGAA CCGAGTTTTA CTCCTTCCAC ATCTAAAGCG CCATCGCTCT CAGCTATTCC ATCGGATACT CCGTCTCTAT CCTCGCTACC ATCCGATACC CCGTCCAATA TTCCATCCAG AGAGCCCTCT GGGCAGCCGT CGCCGTCTCT CTTTCAAAAC GATGCAATAT TTACATGCAC TGACGCTGGT GTGAACATTG CCCAGCTTCC AGTTGCCGAA CTTACCGGAA TCGATATTCG AGTTGGATAT GTCGTGGAAT CCTTGTCGGA TTTTGTGGAT TTTGAAGTTG CCTTAGAACG CAATATCCTA GCTGCGGCAA CTGTGGGGGT ATTAGACTGC ATTGATGGAG GTCCAGTCTT TGGGGTTGGT GGAGAGGTAC CTTCAGTGGA GATGAGTACC ATTGAGACAC CTGGAGTGGC CTGTATTCCA GAATTTTCCA GTTGTACAAC TTACCAAACA ACATTTCAAA TCGTAGTGGA CAAGAGTGTA GACCCCGATC TTGCTGCGTT CTTGGCATAT GTCCGATTGC AGGTACCTAT TTTTGACTAA CGTGTTGCAT CTTGTCGCAA AGCGAAACAA TCCTAAACCG AACTTTTGCT TCACAGGAGA ATATGGATGG AGGAAGCTTT ACAAGAGATA TTCTGATGCT GGATCGTATC GAATATGCAA GTCCACTACC ACTGCTCCCC CCAATTACAG CCCCAATTTC CACTCAGCCA CCGGCTTCAG GAACACGAGA AGGTTCTCTC ACTGTTTCAC CGTGGACAAT TGGTGCAGTC TTGGCCATGT GTAAGCTTGA AATCGCAACA TTGGTCACAT TTAGAGCAGG CTCTAATGAA CTTCCATTTG GTCAACAGGT ATGGGAGGCA CTGTGGCCCT CTGGGCTTGG GCTCGCAATC GGCAAACTCG AAATCGTAGG CATATGCAGC TCCTTGAAGA CATGTCGGTA TCTTCACCTC AGTAATTATA TTGTGAGAAA TCACCACGGT ATGGCAATCA ATCGTTTTAT TGCTCCTTTC AGTAAAGAGA TACAGAACAT TCACCTGACT TCATCAGAAA ACAAAAGGAA AGTTCGTGTA TGTCTATGGA AGAGCCCAAG AGAAATACCT AAGCTTGTGT GTATAAATGA TATACCCAGG TGTCACTTGT CCTTGACGTA TTTATTGCTA TCACTATTGA CCTCGGACTG TCGGCACTAG TACAGAGGGA ACTTGGAATA TATTGGACAA TAGAATGAGT GACGTGTGAC GGCTAATGTT TCGGATTAGG TAGCAGTTGG TGAATCGCTT GGTTTCAACA ACGGCCCGGG AGTGCTGTCA GCTGTAGCCA AGGTAGCGGT TGTAGTGGGC GCTGGGCTTG CGCTTTCCAA TGTTTTGGTT ACAGTAGGAC TGTTGGTGGC TACCGCAGCT ATTGGCGAGT CACTTGGCTT CACAGCTGAC ACTGGCGGTT TGCTTGTCGG TACTAAGGTA GTTGAAACTG TTGGAGACGC CGTCATCGCA AGAGTGGGCG TACGCAAACC AGGTATTCTT ATGGTAGGCC TTCTGCCGCG AGGACTACTA GGTATCATGG ACGGTAAAGA CGTGGGAAGT AAAGTAGGTC TTGAGCTTGG CTGCGAAGAC ACCGTCGGAG ATGCGCTTGG TATCGATGTA GGAAACTCGC TTGGGTGTTC ACTGGGAAAT TTACTTGGAA CAGCGGATGG CACAAGGCTG GGTAAAGAAC TCGGAGTTTG GGTTGGTTCT ATGCTGGGAA CTGGACTTGG GAAATTACTA GGAGAATCGC TCGGAAAGTG GCTTGGCTCC ATGGAGACGG AAGGCAAGAA CGAAGGAGTA CCGGATGGAG CAACACTTGG TTTAGCAGAC GGCTCAGAGG ATGGATCGGA CGAAGGCATC ATCGAAATGG AGGGCGACGA GGATGGTTGA CTTGATGGTT CAGATGAAGG CATCATCGAA ATAGAGGGCG ACGCGGATGG TTGACTTGAC GGTTCAGATG AAGGCATCAT CGACACAGTC GGTTGAAATG AAGGTGAACT ACTGGGCTTT GAAGATGGGT TCGTGCTTGG CTCACTTGAC GGCTGCATTG AAATACTAGG TTGCGCTGAA GGCTGCACAG AAACAGTCGG ATCACTTGAT GGCACGGAGG ACGGTGTAGA GGATGGCATC GTCGAGACAC TCGGACTGCT CGAAGGAACC GACGAAGGAG AAAAGCTAAT CGAGGGAACT GAAGAGGGAG TACCAGAAGG ATACATAGAG GGACTAGCGG ACGGAACACG AGACGGGGAC ATTGATGAGC TTGGGAAAGC AGACGGCTCC TTGCTTGGAG ATGAGGTTGG AACGCTACTC GGAGCACCAG TCGGCAAAAA AGTAGGAGCC GTGCTAGGCT GGCGACTTGG CGAAGAACTC GGGTTTGAAC TAACAACCAA CGTAGGTGCG GAAGTTGGCT CAAAACTTGG GGTGGCGGAT AGATTGGAAG CGACAACACC GGATGGCGAC GCAGTCGGCA GTCTAGATGG AGGAATCGAA GGTTTGCCGC TTACAGGCGA GTCGGAAGGC TCAGTGGTGG AAATGTCATC AGGAATCTCT GTTGGACGAA AGGTTGTAGC AGAAGCTAGA TTTGAGGTCG AATCATATGC GGTAAATCCA ACTGCCGGTG GGTCAGTGGT AGCGCTCGAC GTCAAAAGAG TTGTTGGAAC ACTCGTGTTT GGCATTCTTG TTGGAGTTTC TGTTGGCACC GGCGGTGTTC TTCCCAGAAG CGTCATTCGA TCACTTCCTT CGACAAACGC TGTTGCAAGG GCTAGAAGCC CGATAGCGAG TGGATAAACA ACCCTCATAA ATTGGTGTCT GTAAATCAAG CTCAAAAATT TACACTGTGC TTTTTTATGT CGATGGGATT GATCTTTGAC AAAGAT
|
Protein sequence | MLASPHFQAS FALCYFLLGR FSLGATLEFD KFRVLPSKYG DQNLVFLRKA TSNAPDVNHD FISGLNLEQN PAWLLAELSR SMSMEISSNP TDVNEGGVNT SPTNLPDSGQ SQSTVSTSPT SAPSISIFPS VSAEPSSTPS ISQAPSLSVG PSVAPSMTIL PSVSREPSSS SSNSLAPLLS AGPSVAPSMS IMPSDSMEPS SSPSISQLPS LSAGPSAAPS MSMSPSVSME PSFTPSTSKA PSLSAIPSDT PSLSSLPSDT PSNIPSREPS GQPSPSLFQN DAIFTCTDAG VNIAQLPVAE LTGIDIRVGY VVESLSDFVD FEVALERNIL AAATVGVLDC IDGGPVFGVG GEVPSVEMST IETPGVACIP EFSSCTTYQT TFQIVVDKSV DPDLAAFLAY VRLQENMDGG SFTRDILMLD RIEYASPLPL LPPITAPIST QPPASGTREG SLTVSPWTIG AVLAMCMGGT VALWAWARNR QTRNRRHMQL LEDMSVAVGE SLGFNNGPGV LSAVAKVAVV VGAGLALSNV LVTVGLLVAT AAIGESLGFT ADTGGLLVGT KVVETVGDAV IARVGVRKPG ILMVGLLPRG LLGIMDGKDV GSKVGLELGC EDTVGDALGI DVGNSLGCSL GNLLGTADGT RLGKELGVWV GSMLGTGLGK LLGESLGKWL GSMETEGKNE GVPDGATLGL ADGSEDGSDE GIIEMEGDED GCAEGCTETV GSLDGTEDGV EDGIVETLGL LEGTDEGEKL IEGTEEGVPE GYIEGLADGT RDGDIDELGK ADGSLLGDEV GTLLGAPVGK KVGAVLGWRL GEELGFELTT NVGAEVGSKL GVADRLEATT PDGDAVGSLD GGIEGLPLTG ESEGSVVEMS SGISVGRKVV AEARFEVESY AVNPTAGGSV VALDVKRVVG TLVFGILVGV SVGTGGVLPR SVIRSLPSTN AVARARSPIA SG
|
| |