Gene Information |
Locus tag | PHATRDRAFT_48468 |
Symbol | |
ID | 7203697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 585662 |
End bp | 587130 |
Gene Length | 1469 bp |
Protein Length | 431 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182865 |
Protein GI | 219125181 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCCCTCCC GTCCTCCAGT ATCGGATCCT AACGACTCCA TCGGCTGCGC ACACAAAATT CCTCCCACGC AAACCTTTCA TAATCATAAA AAGTCAGATT TACTGTTAGA TCGAAGCAAT CATGATGCCG TCCGCCTTTT TTGTGCTGGG TACCATCAGT CTTCTTTTAA GCGAGTCTCA GGCCTGGACG ACTCGCCCAG CGCAGACATC AATTAGACTT TTCTCGCGAC TTCATTTATC CCGACCCTCC TCCGAGTCAT GTTCGAAACA AACGAATGTC GATAACGAAA GACGCTCGAA GAGATTTGAT AGTTTCTCTC GACGCGAGTT GTTCGCGGCT ACCGCTGGTG CAGCTGCTAG CATTGCCTTG GGGCCGTCCA CAGCGGCATT CGCGCCAGCC CCGTCGATTG TTACAACCGC CGCAACGTGC GACACCACTG TATCAGTATG GCAACGCGGT GACAGAATCG TATACATCCT GGGAACAGCA CACATTAGTG AAATATCCTC TGATCTCGCT GGCCAACTAG TAAAGGATGT ACATCCCAGT GCCGTTTTTG TCGAGCTCGA TCTTAAGCGG GTTAGTGGAG TTACGGTGTC GCCCGGTACA CCCGTGACTA GTCGTCTACC CATTTCAACC GACGTAACAG AGTTACCTGC CACAGGAGAA GGTGCGTCTA CAAAGCAATC GAAAATTATT GTCTCCGTAC CAGCCCTTCC CGACTCTGTC GCGTCACGCC CCACCGAATC GACTGGTATT GCCTCGCTAG CGGCAACCAC CAAACAAGAC GATGAGCTCG CAAGCGCCTC ACCCGTCCTT ACAGAATCCC CACGCCGCGG GCTTGGTCAG CGTATGCTTG GTTTTGGTGC AGCTGCTGTC GGTAAGGCCA TTCAAGGCAT GTACAAAAAC TTGAACGACT CCGGATTCAA GCCTGGCGAA GAATTCGTCG TAGCTGTACG GGAAGGGCAA AGAATTGGGG CCGACATAGT GCTAGGTGAC CAAGATGTTG AAGTTACGCT TCGTCGTATG ACCCAAGCTC TAGCTCAAAC GGATCTCAAT AAGCTCCTTG ATCCTGATTC GGAACTAGAA CGCGGCATGC GGGAGCTCAT GGGAGACTCG GATCCGTCTT TGGCGAGTTC GCCGGACGCC TTTAAGTCAG AACTCTCTAC CTATGTGGAG AATATGAAAA CACGGGATAG TGTTCGAAAG ATAATGGCTC AGCTCCAGAA AGTTGCACCC GCACTGGTAC AAGTTATGCT AACAGAACGC GATGCTTACA TGGCGGCGGG CCTCGATACA CTGAACCAGT TTGAAGTCAT AACTGCCGTC ATGGGTATCG CGCACATGGA TGGCGTCGAA CGCAATTTGC AATCACAAGG ATGGAAACAA ATGCGCCCCA GTTGCCCCCG CGTGTAAGCT CCTATTTAGG CTAGACTCTG CGATAACCAT GATAAACTCT CCAAATCTT
|
Protein sequence | MMPSAFFVLG TISLLLSESQ AWTTRPAQTS IRLFSRLHLS RPSSESCSKQ TNVDNERRSK RFDSFSRREL FAATAGAAAS IALGPSTAAF APAPSIVTTA ATCDTTVSVW QRGDRIVYIL GTAHISEISS DLAGQLVKDV HPSAVFVELD LKRVSGVTVS PGTPVTSRLP ISTDVTELPA TGEGASTKQS KIIVSVPALP DSVASRPTES TGIASLAATT KQDDELASAS PVLTESPRRG LGQRMLGFGA AAVGKAIQGM YKNLNDSGFK PGEEFVVAVR EGQRIGADIV LGDQDVEVTL RRMTQALAQT DLNKLLDPDS ELERGMRELM GDSDPSLASS PDAFKSELST YVENMKTRDS VRKIMAQLQK VAPALVQVML TERDAYMAAG LDTLNQFEVI TAVMGIAHMD GVERNLQSQG WKQMRPSCPR V
|