Gene Information |
Locus tag | PHATRDRAFT_47666 |
Symbol | |
ID | 7202856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 469045 |
End bp | 470497 |
Gene Length | 1453 bp |
Protein Length | 434 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181913 |
Protein GI | 219123191 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000959743 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGCGA ATATTGGTAC TTCAGTCACC AACACTATTG TCGCCATCGG ACAGATGGGC GATGGGGATC AGCTCGAACG TGCGTTTGCT GGTGCCACCG TCCACGATAT GTTTAACTTT TTGTCGGTGG CCGTACTCCT TCCTGTAGAA GTCATTACTG GATACCTCTA CCACTTGACT AAGGCAATGG TCAAGAATGC ATCCACTGAG AAAGGAGATA AATGGGAAGG TCCCGTCAAG GTACTTGTTG CTCCTATCGG TTCGAAGATT ATCATCGCGA ATAAGGATAT TATCAAGGCA ATTGCCCAGA ATAAAGGCAG CTGTGATGAG GGCGACGGCT TCTACCCCAT GAACTGTACC GAGGACAGCT ATTCTGGCTG CGGCAAGTCA TTTGGTCTTA TAAGTTGTGA CAAGAAGTCT GGAGACTGTC CTGCTTTCTT CCAGTCTGAT GCGTCTGCCA AGGATGACAA GGTCTCCGGA GGTGTCGTCT TTTTCATTTC TATTGTCATT CTTTTCACTT GTCTTGCCGG TCTTGTGACT GTCCTCCAGA AAATGTTGCT CGGTATGTCT TCTCGTATTG TCTACAAGGC AACAAACATT AACGGATACC TTGCCATTGT GATTGGTGCT GGTATCACCA TGGTAGTACA GTCTTCTTCC ATTACGACCT CTACGTTGAC TCCTTTGGTT GGTATGGGAG CTCTCCGTCT TGAACAGATG TTACCCCTTA CTCTCGGCGC TAATATTGGA ACAACAATGA CTGCCATCTT GTCAGCACTT GTCACAGAAG GAACTGGATC TCTCCAGGTT GCACTAGCTC ATTTGTTCTT CAACTTGACT GGTATTGCCA TCTGGTATCC CCTTCCTTTT ATGCGCAACG TTCCGCTTGA AGCTGCTCGT AAACTTGGGC GGACAACACG AATCTGGCGT GGCTTTCCTT TTCTTTATAT TGCCGTGATG TTCTTTCTTA TTCCGCTGCT TTTGCTTGGT CTCTCTTCTC TCTTTGAGGA TGGCAGTAAG GGTTTCACTG TTCTTGGATC ATTTCTCACT ATTATCCTTG GCCTTGGAAT CCTGTACGTC ATGTACTGGT GTCGTTACAA GGAGGGCCGG GAGAAATGCT CAAGTTGCAT GGCCGAGCGT GAGAAGAAGC GCGTCATTAT GAAGGAGCTT CCTGACGACA TGATTTACCT CAAGGAACAC ATGAAACGCC TCATTGAGCA CACTGGTCTT CCAATTGTGG AAGAGGAAGA CGCTGAAGCT GGCAAAGAGA TTGACGAAGG TGATTCCGAT GAGGTAGAGG CCTAGGCTTC CGCCCCGTGA GGTAGATATC ACTGATTCTA GAATTGGTTT GAGGTCCCCT CTAAAATTTG GTTGTTTTAT TTGAATATTG TGCCAAGTCT GTTCTCGCTT GTTAAAGCTA GAGTTACTAT AAACCTTTGG AATATACCAC GCT
Protein sequence | MGANIGTSVT NTIVAIGQMG DGDQLERAFA GATVHDMFNF LSVAVLLPVE VITGYLYHLT KAMVKNASTE KGDKWEGPVK VLVAPIGSKI IIANKDIIKA IAQNKGSCDE GDGFYPMNCT EDSYSGCGKS FGLISCDKKS GDCPAFFQSD ASAKDDKVSG GVVFFISIVI LFTCLAGLVT VLQKMLLGMS SRIVYKATNI NGYLAIVIGA GITMVVQSSS ITTSTLTPLV GMGALRLEQM LPLTLGANIG TTMTAILSAL VTEGTGSLQV ALAHLFFNLT GIAIWYPLPF MRNVPLEAAR KLGRTTRIWR GFPFLYIAVM FFLIPLLLLG LSSLFEDGSK GFTVLGSFLT IILGLGILYV MYWCRYKEGR EKCSSCMAER EKKRVIMKEL PDDMIYLKEH MKRLIEHTGL PIVEEEDAEA GKEIDEGDSD EVEA
| |
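The derived fields in this record (Gene Length, GC content) can be cross-checked against the coordinates and the sequence text. The sketch below is a minimal illustration, not the database's own method: it assumes Start bp and End bp are 1-based inclusive coordinates, and the helper names `gene_length`, `clean_sequence` and `gc_percent` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(seq: str) -> str:
    """Drop the whitespace that splits the displayed sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean_sequence(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(469045, 470497))
```

With Start bp 469045 and End bp 470497 this gives 1453, which matches the listed Gene Length. Passing the Gene sequence string to `gc_percent` gives a value to compare against the listed GC content.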