Gene Information |
Locus tag | PHATR_18469 |
Symbol | |
ID | 7203970 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 209917 |
End bp | 211915 |
Gene Length | 1999 bp |
Protein Length | 547 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186017 |
Protein GI | 219112867 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
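The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the stated Gene Length) and that the coding sequence ends in a single stop codon; the function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
start_bp, end_bp = 209917, 211915
protein_aa = 547

length = gene_length(start_bp, end_bp)

# Nucleotides needed to encode the protein, assuming one trailing stop codon.
coding_nt = (protein_aa + 1) * 3

print(length)              # 1999
print(coding_nt)           # 1644
print(length - coding_nt)  # 355
```

Under these assumptions, 355 bp of the gene span are not accounted for by the coding sequence, which fits the record's statement that the gene is spliced; the record itself does not say how that remainder divides between introns and any untranslated flanking sequence.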
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0427076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAATAGACG CGCACTTCAC CAACGTGAGC TTTCATTGGG TACCTGTTCA ATGTTATACC GCTGGTGTTG GTGTGGTTGA CAGTTTTTCG GGAAGTGCGT TTTTCCAAGC GGACGAACAC ATGAGCCAAA ACCAAGGAGT ACGGAGAACG TTTTCGTCTC GATCGAATTC GAACGCTGGC GGACAAGACA GCAATCTTTT AGCAAAGACG GACGCTGTGG ACGAAGGGCA TCTATCCAAA GGATTTTCGG CAGCCTCGAC ACTCTCGGCA CGAAACAATA GTGTGAATAT GCCTCTCCCA ACCTTGCCGG CCTTAGCCGA CAGCTTTGAA CCGGCTCAAT GCGACGGAAA GAACCCTGCC ACTTCGCAGA TAACAGGACG AGCCGGTCCC TCGTGGTTTG TCATTCAAGT AACTTTATTG GCCAGTTTGG GTGGTATACT CTTTGGATAC GATTTGGGTG TTATATCTGG AGCCCTTCCA CAATTGACAT CCTACTTTGA CCTACAAAGC GCGCAACAAG AAATCGTAGT TTCCGTCTTG TACGTTGGTG GAGGATTCGG CGCCGCCTTG GGAGGAGCTC TCTGCGATAC CTACGGACGC AAAAGTACTA TTCTTGTCAC GGATGTATTG TTCCTGTTGG GGGCAATAAT ACTGTACGCG GCAGCCTCGT ACGGGATGAT AATTTGTGGG AGAATTGTTG TGGGATTCGC CATTGCTGTC TCAGGGATTG CCGATGTTTC GTACCTTCAT GAAATTGCAC CAATACAGTA CCGTGGCTCG ATTGTTTCCG TTAACGAAGC TTGCATTGCA CTGGGGTTCT TACTTGCATT TGGTGTCGGT GGATGGATGT CTCGAGAGGA AACCAATAAC GAGGGATGGC GAGGTATGTT TGGGATAAGT GGTGTGGTTG CCTTTATCCA GCTCATCGGA ATGTGGACCA TGCCGGAATC GCCCACATGG CTAAAGGATC GGGGATTACA TCGAGAAAGT GAGGCAGTGT TGCGGCGCAT TTATCCGGAA CCATTTGTTT CCAATTTCGT CCATAGTGAC GCCAATTCTG GTCCCGAAAA TAAGGTTGTT TCCAGCATAA TGTACGAAAC ACTTTCACCC AAACCAAGGA AACCCTCCGT GCATCCATCA GTGTCCTCTC TTGAAGCAGG AACTTCGCCT TCACTATCGG TCAATGCCGG CTTCTTGGCC AAATCGACAT ATGCTTGTCG GTACTCTCAT TACCTATGCA CCCAATTGAA AGCATTCGCC GTGACTTCCA TGCATACCTA CCGAAGGCAA GTCTACATTG CCCTTTTCTT AGCAGTCTGT CAACAGCTAT GTGGCCAAAC GAACGTGCTT AGCTATGCAC CCTTGATCTT CGCTGGAGGA AACGCATCGA AAAGCGGAGA TTTCGTCCGA GGATGGGCGA CCCTTTCGAT AGGCATCGTC AAATTTGCCG TTTCGTGTGT AGTCATCTGG AAAGTTGATG CGCTGGGACG ACGACATTTA CTACTAGCTG GATTGGGAGT GGTCGCAGTG GGTTTGCTTT TCTTGAGTAT TGCTTTTCGC GGAGCCGAAG TCTCTGACAA GCCGGTAAAA GGCGGCGATG AGCCGACCAC TACCTTGATT GACGAAGGCG ACCGTGCCTT CTCACTCGCC TTACCAGGCG TATTGCTTGT TGTAACAGGA TATTCCATGT CCTTTGGGCC GCTCACATGG CTGCTAACAT CGGAACTATT TCCGACCGAT ATTCGAGGAC GAGCTTTAGG AGCAAGTACG ATCATCACTT ACTTTTGTGC ATGGGTCGTG 
ACGAGCACTT TTTTATCCGC GCAAGAATGG CTGGGTGCTA GCACCGTCTT CACGATGTAT TTTCTCGTTA CAGTGGCAGG ATTCCTCTTT GCGATAAAGG CAATTCCGGA TACCGGCGAG AAAAGCACCA GAGAGATCGA TGACAGTTTG GATCAAATGG CGTGGTGGCG TCCGCGGAGA AATGACGTCT CGCGCACTC
|
Protein sequence | MSQNQGVRRT FSSRSNSNAG GQDSNLLAKT DAVDEGHLSK GFSAASTLSA RNNSVNMPLP TLPALADSFE PAQCDGKNPA TSQITGRAGP SWFVIQVTLL ASLGGILFGY DLGVISGALP QLTSYFDLQS AQQEIVVSVL YVGGGFGAAL GGALCDTYGR KSTILVTDVL FLLGAIILYA AASYGMIICG RIVVGFAIAV SGIADVSYLH EIAPIQYRGS IVSVNEACIA LGFLLAFGVG GWMSREETNN EGWRGMFGIS GVVAFIQLIG MWTMPESPTW LKDRGLHRES EAVLRRIYPE PFVSNFVHSD ANSAFAVTSM HTYRRQVYIA LFLAVCQQLC GQTNVLSYAP LIFAGGNASK SGDFVRGWAT LSIGIVKFAV SCVVIWKVDA LGRRHLLLAG LGVVAVGLLF LSGDEPTTTL IDEGDRAFSL ALPGVLLVVT GYSMSFGPLT WLLTSELFPT DIRGRALGAS TIITYFCAWV VTSTFLSAQE WLGASTVFTM YFLVTVAGFL FAIKAIPDTG EKSTREIDDS LDQMAWWRPR RNDVSRT
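As a rough illustration of how the Gene sequence and Protein sequence rows relate, the sketch below translates a 30-base fragment copied from the gene sequence (the stretch beginning at the ATG that matches the start of the protein) and computes the GC percentage of that fragment. The Translation table field in this record is blank, so the standard genetic code is an assumption here, and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard genetic code (assumed; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Keep only A/C/G/T, dropping the spaces used to group bases in the record."""
    return "".join(ch for ch in seq.upper() if ch in "ACGT")


def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of seq."""
    bases = clean(seq)
    if not bases:
        return 0.0
    return 100.0 * sum(ch in "GC" for ch in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; stop codons are shown as '*'."""
    bases = clean(seq)
    usable = len(bases) - len(bases) % 3
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, usable, 3))


# Fragment copied from the Gene sequence row.
fragment = "ATGAGCCAAA ACCAAGGAGT ACGGAGAACG"

print(translate(fragment))   # MSQNQGVRRT
print(gc_content(fragment))  # 50.0
```

The translated fragment matches the first ten residues of the Protein sequence row. Because the gene is marked as spliced, translating the full genomic sequence in one frame is not expected to reproduce the whole protein; intron positions would be needed, and they are not given in this record. The GC figure printed here is for the fragment only, not for the full gene.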