Gene Information |
Locus tag | PHATR_44003 |
Symbol | |
ID | 7204206 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 665478 |
End bp | 666755 |
Gene Length | 1278 bp |
Protein Length | 354 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186105 |
Protein GI | 219113043 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0157788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCAT TCTTGCAAAG ATATAGCTTT GATATCGAGT CGCTTAAAAG CATGCTTTCC TTTGTTTGCT TCACGTTGTC CTTCCCAGTA GTGGCGGCTT CACTATCGGG CAGCATGTCA ACTATTCGGA ATGTGGTGGT GGTCGGTGGA ACGCACGGGA ACGAGTACAC GGGGGTCTGG TGCATCAAGG CCCTTGACCG GGCTGCCGAA AGCCTCAGCG CTCTTTACCC TTCGTTACGG ATATCCACGT TACTTGCAAA CCCTGAAGCC TATCGTCAAA ATAAACGATT TGTGGATGAA GATTTAAATC GACAGTTTTC GGAAAATAAC CTTATGTTAC GCAGCATGGA CGGCGACAAT TTACCACGCA CAGTAGAATC CCTACGCGCT CACGAAATCA ATGAGATGCT CGGGCCAAAG TTTCAAAACC GATGTCATCG TGGATTTGCA TTCCACAACT AGCAACATGG GTCTGACGCT TATAATTGCC GAAGGCGACT CACTCATGGC TGCGGCAGCC GCTTATGTAT TGCTGAAGTG TGGAGGCGAA GCTCGCGGAG CTCGATGTTT AATGCACTCT CATCCTTCAC GAGAATCTCG GCCGAACCTA TCGTCGGCTG CACGGCACGG GTTCACTATT GAAGTCGGGC CTGTTCCTCA AGGAATACTC CGTCACGATG CCGTTGAAAG GACGCAACAA GCGGTGGCGG CGCTACTGGA ATTTCTGCAG CTTCACAATC AAGACAGGAC TCGTTTGCTA CAGAATCTGC GGCAAGCCTA TGATAATAAA AAAGTTCCTT GCTTTCGATC TGCACCAGCA CGGAGACCGG GAGAAATGTC AGGGAAAATC ACTTGGCCCT GTGATTCCGA AAATGTCAAT TTCCCAGCAG TGATGGTACA CAAGGACTTG CAAGATCGGG ATTTTCATGA AATACAAACA GGTGATCCCC TTTTTGTCGA TTTGAAAGGC AAGACCATTC CGTATACCGG GTCACATGGG TCGCCTGTCT ATCTCATGTT CGTTAACGAA GGCGGGTATT ATTACGCAAG TTCTGGAACT GGTATTGGCG TCGCTATCCG AGCTGATTTT AATCTGGAGT CGGGGAAATT TGTAGAAAAG ATCACGGTGG GAGAGTATGA TGTATCCTAG GATAGACAGG AATCTCACAG CGAGCTGCTG TACGACGGCG TAGCAAAATG TTGACATTAC CATCGTGGTG GATGAAAAAA AGAATGATTG CAAGGTAGGT CACCGTATGC TACTTGAAAG GAGGAGAG
|
Protein sequence | MQSFLQRYSF DIESLKSMLS FVCFTLSFPV VAASLSGSMS TIRNVVVVGG THGNEYTGVW CIKALDRAAE SLSALYPSLR ISTLLANPEA YRQNKRFNPY ALTKSMRCSG QSFKTDVIVD LHSTTSNMGL TLIIAEGDSL MAAAAAYVLL KCGGEARGAR CLMHSHPSRE SRPNLSSAAR HGFTIEVGPV PQGILRHDAV ERTQQAVAAL LEFLQLHNQD RTRLLQNLRQ AYDNKKVPCF RSAPARRPGE MSGKITWPCD SENVNFPAVM VHKDLQDRDF HEIQTGDPLF VDLKGKTIPY TGSHGSPVYL MFVNEGGYYY ASSGTGIGVA IRADFNLESG KFVEKITVGE YDVS
|
| |
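The coordinate and composition fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not the database's own method: the function names `gene_length` and `gc_percent` are hypothetical, the coordinates are assumed to be 1-based and inclusive (which agrees with the Start bp, End bp, and Gene Length values in this record), and the GC value is computed as a plain G+C fraction, which may differ from however the listed 48% was derived.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the spaces that group the bases."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Start bp and End bp as listed in the Gene Information table.
print(gene_length(665478, 666755))  # 1278, matching the Gene Length field

# Paste the full "Gene sequence" value here to compare with the GC content field;
# only the first two 10-base groups are used in this illustration.
print(gc_percent("ATGCAATCAT TCTTGCAAAG"))
```

Because the gene is marked as spliced, the Gene Length covers introns as well as coding sequence, so it is not expected to equal three times the Protein Length.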