Gene Information |
Locus tag | PHATRDRAFT_36570 |
Symbol | |
ID | 7201716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 784031 |
End bp | 785449 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180896 |
Protein GI | 219120310 |
COG category | |
COG ID | |
TIGRFAM ID | |
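The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of this unspliced CDS is a stop codon (the gene sequence listed in this record ends in TAG). The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 784031
END_BP = 785449
REPORTED_GENE_LENGTH = 1419      # bp
REPORTED_PROTEIN_LENGTH = 472    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print("gene length matches:", computed_gene == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, both computed values agree with the reported Gene Length and Protein Length fields.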
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTATCTT TATCGATGAA GTCGATATTC TCTACCGTGA TTTTGCTCCT TGCTTCTCCA GTTGTGGTGT GTGTCGACAG TGAGAAACAG CAAGATCCCA CGGAAACCTT ACAAGAATTT CAAGCTCGCA TGGCATCTTC AATCCAAGGC CTCCTTGCGC AACAGAATCA AACTGATGCT TTGTGGTTTG GTATTTCAGA CCCATTTTAC GGAAGGGTAT ACTGGACGTT TGGATTTGCC AACAAGGGAA GTAGCACACC TGCTACCGCG AACGATCATT TCCAGATTGG ATCCATTTCC AAATCTTTTG GAGCAACTGT GATTCTCTTA CTGGCAGAAA GGGGCGATTT GAGTCTGGTG GATACAATCG CCAATCTCGC GCCGGAATTG ACAGCTCAAT TTCCTGTGTA TGCCGACTTT ACTGTAGCGG ATTTGCTTGG CATGCAAGCT CTAGTTCCTG ATTTTCTGGA TGATCCGACC GGTCTCGTCA GAAACCTGAC GGCTGATCCT ACCTTGCGTT TCTCTATTCC AGAAATGATC CAAGCCTCTT TGCGTAGTGG ACAAGTGGAG GTATGTCCAC AAGGGGAGGA ATGCGCTTTT TACAGTACTA CCAATTTCCT CGTTTTGGAG TATATTGCCG AGAAGATCAC CGACACGCCT ATGCCTCAAT TGATTGCTGA TCTCATCACC AAACCGTTGG ACATTGGTGA TACAGTGGAG CCAAAACGCG ATAGTGACGG TGTCTTACCA GACCCCACCG TCACTCCGTA CGCGGGCAGT AAATGCGTTG AAGAGTTTCA GCAGAGAGGC ACTACCGTCG ATCCGGGTGC AGACTTGACC GAGCTATCAC GTGCCGTTTC CAGTTTTGGA ATTGGCGGAA ATATGTACAG TACTATCCGC AATCTGGTTG TATGGGCACA AAGTGGCACG GGCGATTCTT TGTTAGCGAA CAACACAGTC ACAGATCGTC ACGTCTACCG CACTATTCCA TCCACCGACC GATCGTATGG GATGGGTCAA TATCAGCTCG AGGGGAACTG GTACGGTCAT GAGGGTGGTA CGTTTGGGTT TGGGTCGCAG GCCTACCGAA ACGACTGTTT CAACGCAGCG TACGCTGCTG CCGTGAATAC GTGTGCGTAC ACGGACATTC TTGACCAATT GCGCGTTTTG TACCAAGCGG AGCTGGAAGG GCGCATTACG GACACTGGCG GTAGTCCCAC GATGAGCCCA CCAGAAAGAG AAAGGGGGAA TGGAGGTGGA GGAGGATCTG GTAGTGGTGG TGGTAGTATA GCTGTATCTC CAGAGGCTTC GGATCTACCA ACGGCGGCCC CGGTCGGTGG CCCGTCCCTG GGAGCGACTC TTTGTTACGC TGCGTGCATT GCCGTGAACC TTGTCGCCAC GGTTCTGACT CTCCGTTAG
Protein sequence | MVSLSMKSIF STVILLLASP VVVCVDSEKQ QDPTETLQEF QARMASSIQG LLAQQNQTDA LWFGISDPFY GRVYWTFGFA NKGSSTPATA NDHFQIGSIS KSFGATVILL LAERGDLSLV DTIANLAPEL TAQFPVYADF TVADLLGMQA LVPDFLDDPT GLVRNLTADP TLRFSIPEMI QASLRSGQVE VCPQGEECAF YSTTNFLVLE YIAEKITDTP MPQLIADLIT KPLDIGDTVE PKRDSDGVLP DPTVTPYAGS KCVEEFQQRG TTVDPGADLT ELSRAVSSFG IGGNMYSTIR NLVVWAQSGT GDSLLANNTV TDRHVYRTIP STDRSYGMGQ YQLEGNWYGH EGGTFGFGSQ AYRNDCFNAA YAAAVNTCAY TDILDQLRVL YQAELEGRIT DTGGSPTMSP PERERGNGGG GGSGSGGGSI AVSPEASDLP TAAPVGGPSL GATLCYAACI AVNLVATVLT LR