Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40279 |
Symbol | |
ID | 7195756 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 588759 |
End bp | 590277 |
Gene Length | 1519 bp |
Protein Length | 420 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184165 |
Protein GI | 219127902 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACCG CCACGCGAAC TACATCTGAT GCAGCTGCCC TTGCTCATTT GCTAGACACA GGGCTCGCAC TATCTGCAAC TTCACCTATC CGCCTTAGCC TACAACTTAA CGGATATGAC AATCTACAAA GTGTTGTTGG TATTTTTGAG AGTGAACTGG AATCTTTAGA GTATGTCCCT TTGGCACTTG AAGAAGGAGT CCCACCTCTC CCGGTGAAGT TGCTAATGGC GCATCGACAG CTACTACGTT ATTTCCTACG TTGGATGAGC CATCTTCGGA ACACAAATGG GGGGCCACTA TCCCCTTATG AGATCGTTTC CCTTGCTAGA CACATACTGG ATGTCGCCTC CTGTCCAGTT GTCAGAATCA CCGTCTACTC CCCTTTGGCA ATCCAGCACC CCTATGGCCA TACCTGGAAA CCATAGTCAC GCTGCTGTTG CTGATTTCAA ACAAGGAGTG AAACATGACA AGATGCACTA TCCAGTGCTT AAGGATGATC GCTACTGGGA CCATTTGTAC CGTACCTTTG TCGTTACCGC CGTATTGCAC AATGTAGATA ATGTTCTAGA TCCGGCTTAT TCTCCAACAG ATCCAGATAA AATCTCACTT GTTAAAGAGC AGAAAAAGTT TGTCTATTCC GCCCTAGAAC ACTGTTTGCA AACCAATATG GGTAAAAACA TTGTTCGAGA ACATGCTTTC GATTTCGACG CCCAAGTTGT ATTTGCAAAA ATCGTTAAAC ATTACACGGA ATCCACAGCC GCGAAAATCA GCTCCGGCAC TACTCTCTCA TACTTGACCT CTGCAAAATA CGGCAGCTCC TGGACCGGCA CTGCGGAAGG TTTTATCTTG CATTGGAAAA ACCATCTACG CATCTACAAC AATACCGTGC CAACTTCGGA ACAGTTGCCA CCGCAACTCT GCCTCAGTTT GCTTGAGTCC TCTGTTCGCG ACGTCTCCAA ACTACGTCAA GTCAACACTA CCGCGAATTT AGATTTAGCT AAAGGGGGGT CTCCCATTAA CTATGAAAAT TATCTAAGTC TACTTCTCGC TGCAGCAACT TTATACGACA AAGGGAACAA CCTTTCCAAC TCTCGTAGCC CTAAGACCAA GCGTAGTGCC TTTGTTGCTG AAACCATCTT CCCCGACGAC GACTACAGCA TTGATTACGA CATTGATTTA TCTCCGTCCA TTCTGTACGA AGCGAATGCT CACAACCGCA GAGCAGGAGA TCAAAATCGA GACCGCCAGG GCAATGTCAA CCGTGAACGA CCGTATATCC CCCGTGAGAT GTGGGATAAA CTGTCCGATG ATGCAAAGGC AATTCTCCGA GGCATGTCTT CTCCCGCGGA AGGTCAAGCC TCGCCTAACA GCAATCAACA CCCTCATTTT AATGCCAATT CTCATTCTCT AGCCGACATG GGACACCCCT CCACAACCAA CGACTCGTTG AATGAAAGCG ACAACGAAAA AATTCCACAA TTGTGGAAAC GATACGGAGT TACTTGCCCA CCTTACTGA
|
Protein sequence | MVTATRTTSD AAALAHLLDT GLALSATSPI RLSLQLNGYD NLQSVVGIFE SELESLDHAA VADFKQGVKH DKMHYPVLKD DRYWDHLYRT FVVTAVLHNV DNVLDPAYSP TDPDKISLVK EQKKFVYSAL EHCLQTNMGK NIVREHAFDF DAQVVFAKIV KHYTESTAAK ISSGTTLSYL TSAKYGSSWT GTAEGFILHW KNHLRIYNNT VPTSEQLPPQ LCLSLLESSV RDVSKLRQVN TTANLDLAKG GSPINYENYL SLLLAAATLY DKGNNLSNSR SPKTKRSAFV AETIFPDDDY SIDYDIDLSP SILYEANAHN RRAGDQNRDR QGNVNRERPY IPREMWDKLS DDAKAILRGM SSPAEGQASP NSNQHPHFNA NSHSLADMGH PSTTNDSLNE SDNEKIPQLW KRYGVTCPPY
|
| |