Gene Information |
Locus tag | PHATRDRAFT_51174 |
Symbol | |
ID | 7195359 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 87662 |
End bp | 89697 |
Gene Length | 2036 bp |
Protein Length | 623 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183546 |
Protein GI | 219126611 |
COG category | |
COG ID | |
TIGRFAM ID | |
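The coordinate and length fields above are related by simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which matches the recorded Gene Length of 2036 bp. It also assumes the gene span runs from the start codon through the stop codon, so any length beyond the coding bases is attributed to introns. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases needed for a protein of protein_aa residues (3 bases per codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
length = gene_length(87662, 89697)
coding = coding_length(623)
print(length)           # 2036
print(coding)           # 1872
print(length - coding)  # 164 bases not accounted for by codons
```

Under these assumptions, the 164 bp difference is consistent with the "Is gene spliced | Yes" field, since a spliced gene is longer than its coding sequence alone.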
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGGAC CGGATGGCAA AAAACTCAGC AACAAAGAGC GCAAACGCCT GCTCAAGGTT AAGTTAGCCG AAGAGCGAGA AGCGGCATAC GAAGCATCCG CATCCAAAGC TTCCAAGGAA GGGGCTCAAT TTGCTTGTTC ACAAACTGCC GTCAACGAGA AGGACCCGCA ATGGGAAAAC TCGCTGGACG TCAATATACC AAATTTTAGT ATCAGTGCGG CCGGAAAAAT CCTCTTCAAG GATGCCTCAC TGACAATTGG ACATGGTCGT CGGTACGGAC TGGTTGGTCC CAATGGACGC GGAAAATCTA CACTTTTGAA AATGATTCAT TCGCGCGATC TCAAGCTCCC GCCTCGCATT GACTTCTTGT ACGTTGAGCA GGAAGTGGTC GCTGACGACA CACCCGCAGT TGAAGCCGTT CTTAGAGCTG ATACGGTGCG CTGGAATCTC ATGGAAGAAG AAAAGACTCT GATGCAAGCC GTGGATGCTG GTGACGAGTC GGTCGAGAAG ATTGAACGTC TGCAGCAAGT CGTGGACGAG TTGACCGCCA TGGGAGCAGA TTCGGCCGAA GCCAAAGCCC GTCGTATCCT GTACGGTCTC GGGTTCACTA TGGACATGCA AACAAAGCCG ACCAAGATGT TTAGTGGAGG GTGGCGCATG CGTATTTCGC TGGCACGTGC GTTGTTTGTT GAACCGACAC TGCTCATGTT GGACGAACCG ACGAATCACT TGGATTTGAA CGCCGTAATC TGGCTCGACG AGTACCTGCA AAGATGGAAA AAGACTTTGT TGGTGGTATC CCACGACCAG GATTTCTTAA ACAGCGTTTG TCAGGAAATG TTGCATATCG AGGACCTCAA ACTAATTTCT TACAAGGGTA ATTACGACAG TTTTAAAAAG GCCGAAGCGA CCAAGTTTCA GCAACACATC AAGGCGTACG AAAAGCAGGA AAAGCGTCTG CGGGAATTGA AAAGGTCGGG ACAATCCAAG AACAAGGCAC AGGAAACTGT CAAGAAGAGC TCCAAGCGAG AAGCTGGTGC GCGGAGTCAA AAGTTGAAAA ACCAGGCAAT TGCTGCGGGA ACGGAAACGG CGGAACTATC GGAGCTCATT CATCGGCCAA AGGAATATCA AGTCAAGTTC GAATTTTCTG AGGTGAACGA GCTTACGCGC CCCGTGATTG AAGTGAACAA CGTGCATTTC CGTTATTCAC CAAAGCATCC CGTCATCTTT GAAAAGGTCG ACTTTGGAAT TGACATGGAC AGCCGAATAA CCATTGTCGG CCCGAATGGT GCCGGCAAAT CAACGTTGCT GAAATTGTTG ACGGGAGCGC TGAACGCGAC GGGGGGCGAC GTCCGTCGCA ATGGGCGCTT ACGAATGGGT ATTTACGATC AGCATTTTGT CGATCGACTA CCGATGAATA AAACACCAGT GGAACACTTG CGAGATCGTT TCGAAGAAGA AACCTACCAA TCGGTACGCA ATCGTCTCGG AAAGTACGGC TTGGAGGGAC ACGCACACGA AGTCGTGATG CGCGATTTGA GTGGTGGCCA AAAGGCTCGA GTAGTTTTTG TCGAACTGAG TTTGCAGTGC CCGCATGTGC TTCTGTTGGA TGAACCGACA AACAATCTCG GTACGTATCG TCAAGGAGAT GTGCCTCGTC GACTTGTGTC TTGTGTGTAC TAATAAATGT CAACATTCAT CTGATATTCG ATTCGTTTCA CTTTTTGACT GCAGATATCG AATCCATTGA CGCGTTGACC GACGCGATCA ACGCATTCAA CGGAGGAGTA GTCGTAGTTA 
CACACGATCA GCGATTGATT GAAGAATGTG AATGCACCTT GTGGGTTGTC GAAAAGCAAG GGGTCACGGA ATGGAAGGCT GGCTTTGACG ATTACAAGGA GAACATTCTG CGTGAGTTGG AAGAAGAAGT AGAACGGGAG GCAGTGATTC GGAGACAAAA GATTGACGCT GCGGCTATTG CTCGTGCCGA AAAGCTGGCC CGACTGGCAA GCAAAGTCAA AACCACGAAG AAGTAA
|
Protein sequence | MYGPDGKKLS NKERKRLLKV KLAEEREAAY EASASKASKE GAQFACSQTA VNEKDPQWEN SLDVNIPNFS ISAAGKILFK DASLTIGHGR RYGLVGPNGR GKSTLLKMIH SRDLKLPPRI DFLYVEQEVV ADDTPAVEAV LRADTVRWNL MEEEKTLMQA VDAGDESVEK IERLQQVVDE LTAMGADSAE AKARRILYGL GFTMDMQTKP TKMFSGGWRM RISLARALFV EPTLLMLDEP TNHLDLNAVI WLDEYLQRWK KTLLVVSHDQ DFLNSVCQEM LHIEDLKLIS YKGNYDSFKK AEATKSGQSK NKAQETVKKS SKREAGARSQ KLKNQAIAAG TETAELSELI HRPKEYQVKF EFSEVNELTR PVIEVNNVHF RYSPKHPVIF EKVDFGIDMD SRITIVGPNG AGKSTLLKLL TGALNATGGD VRRNGRLRMG IYDQHFVDRL PMNKTPVEHL RDRFEEETYQ SVRNRLGKYG LEGHAHEVVM RDLSGGQKAR VVFVELSLQC PHVLLLDEPT NNLDIESIDA LTDAINAFNG GVVVVTHDQR LIEECECTLW VVEKQGVTEW KAGFDDYKEN ILRELEEEVE REAVIRRQKI DAAAIARAEK LARLASKVKT TKK
|
| |
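The sequence fields above are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch of how one might strip the grouping and compute length and GC fraction. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full field.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence shown above.
prefix = "ATGTACGGAC CGGATGGCAA"
print(clean_sequence(prefix))       # ATGTACGGACCGGATGGCAA
print(len(clean_sequence(prefix)))  # 20
print(gc_fraction(prefix))          # 0.55
```

Applied to the complete Gene sequence field, the same helpers give values that can be compared against the Gene Length and GC content fields in the record.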