Gene Information |
Locus tag | PHATRDRAFT_50914 |
Symbol | |
ID | 7200874 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 13894 |
End bp | 15200 |
Gene Length | 1307 bp |
Protein Length | 388 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179957 |
Protein GI | 219118365 |
COG category | |
COG ID | |
TIGRFAM ID | |
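The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check. It assumes the Start bp and End bp values are 1-based and inclusive, and that one stop codon is counted with the coding sequence; neither convention is stated in the record. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 13894
END_BP = 15200
PROTEIN_LENGTH_AA = 388


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa, include_stop=True):
    """Bases needed to encode a protein, optionally with one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)

print(gene_bp)           # 1307, matches the Gene Length field
print(cds_bp)            # 1167
print(gene_bp - cds_bp)  # 140 bases in the span that do not encode protein
```

The 140 bp difference is consistent with the record marking the gene as spliced. The sketch does not say how those bases divide between introns and other non-coding sequence.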
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATC TTCAGATACT TTCCAAGTTC ACCGTTGGTG GACAGGAGCT GCAGAACCGT GTCGTTCTGG CCCCTTTGAC CCGCGCTCGG TAAGCGGCAA GAACGAACTA TTTTGTGCCC AGTGTCGAAC ACGATTACTC ACTGTACTCG TGTTTCAATT CTATAGCTGC ACACCTACCG AAGATCCGCT CGATACCGTC TCCCGGACAC CGAACGACCT CATGGCGACT TACTATGAAC AACGTGCGTC GGCGGGTCTC ATCATTACGG AAGCCACTGC CGTTTCTGAA GAGGGCTACG GCTGGCTCAA CAGTCCAGAG CTTCGTACCG AAGCACAAAT GGAAGCATGG AAAAAGATCG TGGATAAGGT ACACGCCAAG GGATCCAAGA TTTATGTCCA ATTTTGGAAT ATGGGTCGAC AAGCCCATTC GTCTTTTCAC GTCGAATCCC AACGCGTAGT TTCGGCGTCC GACATTCCCA TGGCCGACTC TTTCAAGGTC AAGTCATCAA CCTTTGAAGA TGTACCGCCC GAAACACCCG TTCCCTTGAC GGTGGACGAG ATTCAAAGTG TGGTCGCAGA TTTCGTACAT GGTGCCAAAC TCGCTCGTCA GGCAGGCTTT GACGGAATCG AGATCCACTC CGCCAACGGA TATTTGATTG ATCAATTCTT GCAGTCCAAG ACCAACAAAC GCGCGGACCA ATACGGCGGA AGCATGGAAA ACCGCTTTCG CTTTTTGAAG GAAATTGTGC AAGGTATCGT GGACAGCGGA GCCTACCCCT CGAATCGGAT TGGCTTTCGA ATCTCGCCCA ACGGAGTCTT TGGAGACATG GGTAGTGAGG ACAACGCCCA GATGTTTACC TTTGTGGCGG CCGAAATGAG TAAACTCAAG GTGGCCTACC TGCATCTTAT GGATGGTCTC GGCTTTGGAT ACCATGGATT ATGTCCGGCA GTTACGGCTG CCGATATCCG TAAAGTCTTT GACGGTCCCA TTATTTGCAA CGTTGGACTT ACGAAAGAAA TTGCCGAAGG GATGATTCGC TCGGGTGCCG CTGATCTGGC CTGCTTTGGA CGTTTGTACA TTAGCAATCC CGATCTGGTC GAACGTTTCG CCAATGACTG GCCTCTAGAA CCTGAAGCTG CTTATCAGCA CTGGTGGCAA CACGTTGGCG CCAAAGGTTA CACCGATTGG CCAACGTACA AGCCATCCGA GGAAGATAGC GACGACGCTC AGAACGACGA GTAGGCTGCT ACTACAAGAG CTCGAAATTT CGTGAATGTG CCACGTTCCT GCACTTTGCG GTTGGGT
|
Protein sequence | MSNLQILSKF TVGGQELQNR VVLAPLTRAR CTPTEDPLDT VSRTPNDLMA TYYEQRASAG LIITEATAVS EEGYGWLNSP ELRTEAQMEA WKKIVDKVHA KGSKIYVQFW NMGRQAHSSF HVESQRVVSA SDIPMADSFK VKSSTFEDVP PETPVPLTVD EIQSVVADFV HGAKLARQAG FDGIEIHSAN GYLIDQFLQS KTNKRADQYG GSMENRFRFL KEIVQGIVDS GAYPSNRIGF RISPNGVFGD MGSEDNAQMF TFVAAEMSKL KVAYLHLMDG LGFGYHGLCP AVTAADIRKV FDGPIICNVG LTKEIAEGMI RSGAADLACF GRLYISNPDL VERFANDWPL EPEAAYQHWW QHVGAKGYTD WPTYKPSEED SDDAQNDE
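Both sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks, compute a GC fraction, and translate the leading codons. It assumes the standard genetic code (the Translation table field in the record is empty, so this is an assumption), and all function names are hypothetical. Only the first three blocks of the gene sequence are used as input.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from the first base; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAGCAATC TTCAGATACT TTCCAAGTTC"
seq = join_blocks(first_blocks)

print(len(seq))          # 30
print(translate(seq))    # MSNLQILSKF
print(gc_fraction(seq))  # GC fraction of these 30 bases only
```

The ten residues produced here match the first block of the protein sequence. Because the record marks the gene as spliced, translating the full gene sequence straight through would not reproduce the whole protein; intron sequence would have to be removed first.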
|
| |