Gene Information |
Locus tag | PHATRDRAFT_47952 |
Symbol | |
ID | 7203137 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 538325 |
End bp | 540069 |
Gene Length | 1745 bp |
Protein Length | 420 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182248 |
Protein GI | 219123888 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
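The coordinate fields above agree with one another if start and end positions are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the check, with hypothetical variable and function names:

```python
# Values copied from the Gene Information fields above.
start_bp = 538325
end_bp = 540069
protein_length_aa = 420


def inclusive_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed convention)."""
    return end - start + 1


gene_length_bp = inclusive_length(start_bp, end_bp)

# Coding bases implied by the protein: three per residue, plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)              # 1745
print(coding_bp)                   # 1263
print(gene_length_bp - coding_bp)  # 482
```

The 482 bp difference would be bases outside the coding sequence, which is what one expects for a gene flagged as spliced; the record itself does not break this down further.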
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.870123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAGTACTT CCAGATTAAT GTTGCAGTTG GGCAATCCTG ACGTGTGAGT CGCGCGTTTC TCCCTTCTTC TGGGCAGAGC AGCTTCCAGT AAAAATTTCC CTTCGAAAAC GTGAAACAGT GTAGCCGACA GCGGGGTATT GACTACTTGA GACTGTTCAC TCTCCGTTCA CAAAATATTA CTATCACAAC ACATTCCAAC TGTGCACTCC CAAGTTGCGT CTCTTTACGG AACAAGGTTG ACTAGTGCTG TGGCATTTGT CATTGCGAAT CAGACAAAAT GTCGTGGTAT TTCGCAAGTC TACCTTCGGC ATCGGATCGT TGCTTGGACG CCTCAACAAG ATCGACTCTT CATCAACTTC TGAAAAACGA AGAAGAACTG AGCAGTTTTC AGCGTACTAG AGTATGTCCA GGAGATTGTC AACGGAGACT TTTGGATAGT CTCATCACTC CTCCTTTCTT CTTTAGAGGC GAGGCACCAT CTGGTCTCGG GGATTACAGC GACGTTCCGT TTCATCAGCT TCCCAGAACT TTTTTGGGAA GTGAGCCAGC AATAGCTCCG GCTGAGATGC GCAGCCTGCG TTTTCCACTC CCGGCTAAGC TCAACGCTAT TCTTTCCAAC CATGAAAATG AGCATATTGT TTCTTGGATG CCGCACGGAC AAGCATGGCG AATTCATGAT CTTGATCGAT TCCGTGATCA GATTCTACCT GTTTATTTTG ATTGCGGCTC ATGTGCTGGT AGCATCAATG CATTTTTGCG CCTTTTAAAG CTTTGGGGGT TCCGTCAGCT CGCGCATGGA CCCGACATTT CCGCATTTTA TAACGAGGTA TGCCTGCTTC ATAAAGTTTC CAAAAAATGC AATATTTGTG TTATCACAAC AATTGTACTA AGATTTTCTT CCCTCTTCAA ACAGATGTTT CTTCGAGGCA TGCCGAATCT GCATAGACTG ATGCGCGCTT TCGACAGCAA TATAAGGCAA TCTCTTTATA CCACCCCTGA GCCCAATCTA TCAGCTTTTC CCGTGTTGCA ATACAACCAG CCTGCGTCTC GAATAGACCC GACTTCAAAC GCGCTACACT CGCTCAAAAT GTCCCGAGAT CTTCACTTTC GCTCGCTGGT GCTATCTTCT ATGCTACTTT CAAAGGGCGC GTTGTGCAAA AAGGGGTCAT CCGTTGTTTC TTCACAAAAG AGCAGAGCGT CTTTTTGCAG CAATACCAAA GAGGAGACTT CAGACCTTGA TGGTTTAGAT TTAAATACTG GCTTGCTTGC AGCACAGGAG CTTCACTTGA GCAGCATGGC GAACATGAAT CAAGATCGGC AAGTTGATAA ATCCCAAGGC CTGGCTTCGG GTTTGGTTCG CATCGGATTC TGTGGTAGGA AAGATTTTGC CGGTAGCCGT TTAGAGGACC CTGAAAAACT CCAGCCCACC AAGAATGGAA CAGGGTCAGA ATCAGTCGTA TGCCCCGTCT CGTCCTCTTC AAACCTGATG ACAAGAAAAG GGCGGAAAAG ATGGCTGCCT TCATTGGGCC CCAAGCAAAC ATTTTCTGGT CCCGATCCGG GCTTATGCGA GTGGTTTTGT ACCAATCCAG AAGTGTTTGC TTCCTTGGCA GACTCGCCGC AATCTTGATG AGAAAGGTGG ATGTGAGAAG GAGATACCCC GTGCATCATA ACAGTTGATG AGATTTTCTT TTTGTTTTCA TTCCATTCAA AAATTGACAG TATTTTAAAG AAACGAATAT AATGT
|
Protein sequence | MSWYFASLPS ASDRCLDAST RSTLHQLLKN EEELSSFQRT RVCPGDCQRR LLDSLITPPF FFRGEAPSGL GDYSDVPFHQ LPRTFLGSEP AIAPAEMRSL RFPLPAKLNA ILSNHENEHI VSWMPHGQAW RIHDLDRFRD QILPVYFDCG SCAGSINAFL RLLKLWGFRQ LAHGPDISAF YNEMFLRGMP NLHRLMRAFD SNIRQSLYTT PEPNLSAFPV LQYNQPASRI DPTSNALHSL KMSRDLHFRS LVLSSMLLSK GALCKKGSSV VSSQKSRASF CSNTKEETSD LDGLDLNTGL LAAQELHLSS MANMNQDRQV DKSQGLASGL VRIGFCGRKD FAGSRLEDPE KLQPTKNGTG SESVVCPVSS SSNLMTRKGR KRWLPSLGPK QTFSGPDPGL CEWFCTNPEV FASLADSPQS
|
| |
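The sequences above are printed in space-separated blocks of 10 characters, so the spaces have to be removed before computing a length or GC content. A minimal sketch of that clean-up, using hypothetical helper names and a short toy input (the first three blocks of the gene sequence) rather than the full field:

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input in the same 10-character block layout.
# Paste the full "Gene sequence" field here to check the whole record.
example = "AAAAGTACTT CCAGATTAAT GTTGCAGTTG"
seq = clean_sequence(example)

print(len(seq))                       # 30
print(round(gc_fraction(seq) * 100))  # 33
```

Run on the full gene sequence, the two printed values can be compared with the Gene Length and GC content fields listed in the record.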