Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48664 |
Symbol | |
ID | 7194902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 517656 |
End bp | 519392 |
Gene Length | 1737 bp |
Protein Length | 504 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183239 |
Protein GI | 219125965 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00116938 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGCAGGCAAC CCGACGTACG TTCAATGAAA CCAAATCCCC ATGATTACCG TTACTGTTCG TTTTGGAATC TACGTGATTT ACTGCTAGAC GCAACGCATC GTGACGTGCC CTGATACGGC GCTTTCTCCT TTTTGTCTCT GTCAATCGTC CTTCCTTGCG TCACTCCCAC CTTGCAACAA ACAACCGAAA ACGACACCAA CGACCACAAC AACCACACCG CAATGGCGTC GGCGTATCGG CAAGCCCGGG AGGCCAGCGA ACCGGCAGCC GTCCACGACG GTACCACGAA GACCACCGTC ATGGACAGTA GTAGCAGCAT GAGCGTGAGT AGTAGTAGTA GCGGTCCGAC GACGACGACT CCTCGTAGTA CGGCGCTTGG GAGTGCCAGC AGCGCTTTGC GCACCGTCAC CGGTACACCC CTCGTCGTCG CCCACGACGT GCCGTTCAAC TGGGTTGACG GTCTCCCCTA CATCCCGACA CGGTTTGCCG ATTTGGCCGA TCCCCAATCC ATCCAACGCG TCGTGGCTCT TCTCCTCCAA GCCACCCTTT CCAACACCAA CACCACCACT CCCGACTCGA CCGACCACGC ACCCGACCAA CCCCCTTCCG TACACTCGCC GAACCCCACT CCACCGACCT CCACCGTCGC CATTGCTCCG GACGATTGGA ACGTGATTTA TTCCAGTTTC CAACTCGAAC AAGTCGCGGG GGGAATTACC AATACACTCG TCCGCGTCAC CAATTTGTCC AGTTTTTTCG ATCCCACGAC GACTCCCGAT TCCGTCCTCG TGCGGATCTT TGGAGCCGTC GGTTTGATTG ATCGGGACGA AGAAACACAC GTCTTGGCTC GGTTGGCCGT TCGGGGCATC GCTCCCGCCT ATTACGGACG TTTCGGGAAC GGCCGTTTGG AAGCATGGCG GGATGGGATG CGGGCTCTCG CGACCTACGA ACTCGGCGAA CCCGACAAAC TCGTACCCAT TGCCCGGGAA GTTGCTCGAC TCCATCACAC TCATCTACAC GATATCGACC GCAGTGATGC CGATAACGAA TCCACTCCCC AAAACAACGA CAACAACGAC AGCATTACAT CCACGCACGA GCCTACCTTG TGGACGCAAT TGTACGATTG GTACGACCAG GCCTTGGTCG CCACCGCTTC CACCAAGTCG GTCACACTCG AGTTGTCCAG TTACCGGGCG GAACTGGACT GGGTCCGTTC CCTGACCCCA CCGGACACAC CCATTGCGTT TTGTCACAAC GACCTGCTCG CCGCCAATAT TCTTTACAAC GACAATCCCG ACCCTACCGA TCCCCGGGTG ATTCAACTGA TTGATTTCGA ATACGGGGGG ACCAATTACG TCGCCTTTGA CATTGCGAAC CACTTTAACG AATTTGCCGG AGGTCCCCCG ACGCATCCCG TACCGGACTA CGACAATCTA CCCACACCGG CACAACAATT ACTCTTTGCC GAAACCTATC TCGAACAAGA ACAAGAACTG CAACAGCAAC CAGGCGCGAC AACCACGGCC TGGAAGTCGG CACGGGAATT GTTGGACCAC GTACGGATCT TTGCCCTCGC CAATCACCTG TACTGGGGTT TGTGGGCCGT TAACCAAGCC GCCACCGAAG GGTGTGACGC ATTCGATTAC CGTACCTACG CTGTCAATCG ACTGAAACAG TACCACGTGG TCAAACAGGA ATACGCAGAT TCCACCGCGA TCAATGGTCA CGTGTAA
|
Protein sequence | MASAYRQARE ASEPAAVHDG TTKTTVMDSS SSMSVSSSSS GPTTTTPRST ALGSASSALR TVTGTPLVVA HDVPFNWVDG LPYIPTRFAD LADPQSIQRV VALLLQATLS NTNTTTPDST DHAPDQPPSV HSPNPTPPTS TVAIAPDDWN VIYSSFQLEQ VAGGITNTLV RVTNLSSFFD PTTTPDSVLV RIFGAVGLID RDEETHVLAR LAVRGIAPAY YGRFGNGRLE AWRDGMRALA TYELGEPDKL VPIAREVARL HHTHLHDIDR SDADNESTPQ NNDNNDSITS THEPTLWTQL YDWYDQALVA TASTKSVTLE LSSYRAELDW VRSLTPPDTP IAFCHNDLLA ANILYNDNPD PTDPRVIQLI DFEYGGTNYV AFDIANHFNE FAGGPPTHPV PDYDNLPTPA QQLLFAETYL EQEQELQQQP GATTTAWKSA RELLDHVRIF ALANHLYWGL WAVNQAATEG CDAFDYRTYA VNRLKQYHVV KQEYADSTAI NGHV
|
| |