Gene Information |
Locus tag | PHATRDRAFT_48698 |
Symbol | |
ID | 7194683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 665132 |
End bp | 666872 |
Gene Length | 1741 bp |
Protein Length | 566 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183135 |
Protein GI | 219125747 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTCCC AAAAGGGACT CAGTGAGTCC GGAGTAGGAA TTCCCAATTG GAGTCCGCCG GCGTGTATCG AGAGTATCAC AAAGAAATCC TCCAACAAAT TGGCAATAGC TGACCATGGC CGTTGCCGGA AGATGACTAT GGGACCTCGT CGCCCTGTAC AAGGTAGCGA ACTATCGTAC CAGCCCGTGT CGATCGACGG CGAAGCAGAA TGGGAGGGCA GTTGCGATTT ACAAGGCTTT TCTGGAACCA GTCATGCAAG ACAGTCTCTG CCCTCCTCAA CAAGTTCGAG TCTCTCGCCG TGTTGCGACT CCGGTACAAT TATGGCTTTT GGTGCCTTGG GCGTCTTGAA CAATATGCCT TACGTCATCA TGCTAGCCGC AGCGAAGTCG ATTTCGGAAG GAGGGACCGC TTTGGTCTAT TTGGCGAACG TCGTACCCGG ATTGATAGTC AAGATTTCGG CACCATATTG GTTCGACAGA GTCTCCTACC GCCGGCGTCT GTGGACTGCG TCCCTATTAA TGGGTACCGC ATTTGTTTTG GTGGGTCTCT GTTCTCGGGA GTCGATCGAC GAATCATCAT CGCAGACTGT TTCGTCGCGT ACTTATGGGC AGCTTTTGGG AGTGGCCTTG ATGAGTGTAC AGTGTGGATT GGGAGAAGCA TCCCTATTGG CGTTGGCTGG CGGGATGGAC GCGGCGGCTT TGACGGCCTT TTCGTCAGGT ACCGGTGTAG CGGGCCCCGC TGGGTACTTG TGGAAAATTG TGCTCACGAG CAGCCTCGGA TGGTCCCTCC GGCGGATCGC GTATACGGCA ATCCTCCTGC CTATCCTTTA CGCTGGAATC TACCAACGTT GTATTGAACC AATCCTTCGG GGAGACGCTG GGGACGTAGT TCCTGAGTAC GACACCTCCC AATTGGAAGG ACCGAGTATT CGCGATCGAC GTCCGGAAAC GGATGTCCCG CTTGTTCCTG GAGAGACGAC GGGATTGGGG GAAGAGCTAC AAGTCGTTGA CCGCAGTCGT AGCACTTGCG ATAACGACAC ATCTCGCGAA GATTGCACCA ACGGGAGCGA TGCGTCGCAA GACCCACTTT TCACGGTTCC GATTGCGTCC ATGACTGGGA CGCAAAGATT TCACCGAGTA TTGTCACTGT GGAGGTACAT TGTTCCTCTC TTTACAGTGT ACGCTGCAGA ATACGTTTGC CAAGCAGGGG TGTGGACCGC TCTCGGATTT CCCGTATCAA ACGCGGCCTC CCGGAATCAC GCTTATCAAA CCGCCAACGC GTGCTATCAA ACGGGGGTAC TGGTAAGTCG TAGTAGCGGC AACTGTTTCC GGCTAAGTCT TGTATGGCTC TGGATTTTAC CTGGATTGCA AATCGTCAAT TTGGTCTTTT TCACGTACGT CGCCGCGAAC GCGCCGGATG CGGACACGGA GGGGGGACGC TGGTTCTGGT ACTACACACC CACGGTCTTG TACGGTGCGG CCGTTGGAAC TGGTTTGCTT GGTGGTGGCG TCTATATTCA CGGATATCAG CGCGTGGTCG CGGACAGCAC CCAAAAGGAC CACTGTGAGT TTGCTTTGAG TTCCGTTTCG GTCGCCGAAG GTCTGGGGGT TGGATTTGCG GATATTCTGA GTTTGTGGTG GCAGTCCTGT CTGTACAAGG CCAATAATCT GGCGGGAGCC GTTGTCTCCT GCCCATTCTG AAATCACCTG GGAAGGATTT AGAAAGGTAT CGGGCGGTAA G
Protein sequence | MGSQKGLSES GVGIPNWSPP ACIESITKKS SNKLAIADHG RCRKMTMGPR RPVQGSELSY QPVSIDGEAE WEGSCDLQGF SGTSHARQSL PSSTSSSLSP CCDSGTIMAF GALGVLNNMP YVIMLAAAKS ISEGGTALVY LANVVPGLIV KISAPYWFDR VSYRRRLWTA SLLMGTAFVL VGLCSRESID ESSSQTVSSR TYGQLLGVAL MSVQCGLGEA SLLALAGGMD AAALTAFSSG TGVAGPAGYL WKIVLTSSLG WSLRRIAYTA ILLPILYAGI YQRCIEPILR GDAGDVVPEY DTSQLEGPSI RDRRPETDVP LVPGETTGLG EELQVVDRSR STCDNDTSRE DCTNGSDASQ DPLFTVPIAS MTGTQRFHRV LSLWRYIVPL FTVYAAEYVC QAGVWTALGF PVSNAASRNH AYQTANACYQ TGVLVSRSSG NCFRLSLVWL WILPGLQIVN LVFFTYVAAN APDADTEGGR WFWYYTPTVL YGAAVGTGLL GGGVYIHGYQ RVVADSTQKD HCEFALSSVS VAEGLGVGFA DILSLWWQSC LYKANNLAGA VVSCPF
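The coordinate and sequence fields of this record can be cross-checked with a few lines of code. The following is a minimal sketch, not part of the source database: the function names `span_length`, `clean_sequence`, and `gc_percent` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the listed values (666872 - 665132 + 1 = 1741).

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Strip the whitespace that groups residues into blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a (possibly grouped) nucleotide string."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Start bp and End bp from the Gene Information table.
    print(span_length(665132, 666872))  # 1741
    # First two ten-base blocks of the Gene sequence field.
    print(len(clean_sequence("ATGGGTTCCC AAAAGGGACT")))  # 20
```

Passing the full Gene sequence field to `clean_sequence` and `gc_percent` would allow the Gene Length and GC content rows to be compared against the sequence itself.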