Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42828 |
Symbol | |
ID | 7196487 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1255253 |
End bp | 1257168 |
Gene Length | 1916 bp |
Protein Length | 499 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176749 |
Protein GI | 219109993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGAGA GTACCCTTGC GAATAGTAGC ACTTTGGCCA CCATTGACAA GATCAAGTTC ACTTGCAACC AGGATACATG ATTGTGGGAG TCACAATGCC ACAGGATCCC GGCATGGCAA TACCTCCTTA TAGTCAGAGT GGGCAAGCGA ATGACGCATA TCCGGAAGGC GGCTCGGGCG AGAAATTCGA CACGAGCCAG CCACAGGCCG TCGCAATGCC GGCCCTCCAA CCAGCAGTAG ATGGTAATCT ACCGCCCCAA CAAGTCTTTG TGAAATCCCA TATTGACCCT ATGGGAAGCC CGGAAATGCT TGGCTTGGAA GGTCAGTTCC AGTCTATCGG ATTCGCGCAA GAAGGATTCG ATCACAATTC CAGCGCGCAC TACAATGGTG GCGATAGTAG TGGCAATGCC AGCGCCGACA ACGATGATGG CGAAGACGAT CCCATGAAGC TTTTTGTTGG ACAGGTATGT AATAAGATAC GGCTGGTCTC GCTTGGATGC ATCGAGTAGT GAGGGCTGGT TGGGTTCTGT GTATGAATCA TTGTTCTACA TTGCTTGGAT GAAAAGCGGA ATGTCGTACT GTATTTCCCG TAAGCGAGAG AATTTCGTGT CTCCTCAATA TCCGCTCATG TCTTGTACTT TCCACACAGG TTCCGAAGGC AATGAGCGAG GAGGATGTGT TTCCAACGTT TGATTCGTTC GGTCCGCTCA AAGATGTCGC TATCATTCGC GACAAGCACA CTGGTTTGCA CCGTGGGTGC GCGTTCGTCA CCTACTGGTC GGCTGCCGAC GCAGAACGCG CGCAAGAAGC GCTCCACGAC ACCTTTACCT TCCCCGGAGC GCGGAGAGCA GCACAAGTTA AACCTGCCGA ACCATCCGTC CCCGAGAACA AACTCTTTGT GGGAATGCTT TCACGCAAGG CAACCGAAGT GGAAATTCGC GAGCTATTTG AACCGTTCGG TGAAATTCGA GAAGTATACA TGATCCGCAA TGCTGACGGA TCTAGCAAGT GCGCAGCATT TTTGAGATAC ATGAAACGAG GCGCGGCTGT TCAAGCCATT GAAACTCTTA ACAATATTTA CATGATGGAA GGTGCAGCCA GGCCCCTCAT CGTTCGATTT GCGGACAATA AACATCAGCG CCATCAGCGC CAGATACGAA ACATCAGAAG ACATGAAATG ATTGCAGCAA TGGGTGGCGG CTATGCAACA TACCCCCCAC ATGTGCAGGT TCAGATGGGA ATGCCCGGGC ATCCCGGTGC TAGTCCACAG TACACTGTAC CCGTACCTCC TCATTACGTT GAAGCTGCAT ACGGGCCACC GAACGGTGCC CCAATGCCAG GGCATCCGTA CATGTACCCC CCTCAGCAAT ACGCTCCTAC ACCAGCATAT ATTTACCCAG AACACACGTC TGAAGAAACT AAACCGACCA ATAACCGTCC GCGTGAAGGC CCCGCTGGAG CGAATTTATT TGTATATCAT CTCCCTCATG ATTTAACCGA CGCCGATCTA GCAACAGCCT TCAATCCTTT CGGGAACGTT ATTAGCGCCA AGGTGTATGT CGACAAATAT TCAGGCGAAA GCAAGGGTTT CGGTAAGTTG CAGCACGCCT TCTGGTCTTT GTACCCCTAT AAATGTCATT CTCACGCTCT GAAAATCCTT GCCAGGCTTT GTGTCGTATG ACTCAGTTAT TGCGGCAGAA GCAGCAATCG AGCAGATGAA CGGATTTCAG ATCGGCAACA AACGACTGAA GGTACAACAT AAGCGTGTTC ACGGAAACCA TCCGAATGCT CCGTCCTTAA GCGATTCCCA AGACCCTCCC GAAGATTTGC TTTAAACGAC GCTTTCAACT TTTTCCCCAT GTTGATATAT CCTTCCAGCC CTTACCACTG AAACGATCCA AACGAAACAA CAACCC
|
Protein sequence | MYESTLANSS TLATIDKINQ SGQANDAYPE GGSGEKFDTS QPQAVAMPAL QPAVDGNLPP QQVFVKSHID PMGSPEMLGL EGQFQSIGFA QEGFDHNSSA HYNGGDSSGN ASADNDDGED DPMKLFVGQV PKAMSEEDVF PTFDSFGPLK DVAIIRDKHT GLHRGCAFVT YWSAADAERA QEALHDTFTF PGARRAAQVK PAEPSVPENK LFVGMLSRKA TEVEIRELFE PFGEIREVYM IRNADGSSKC AAFLRYMKRG AAVQAIETLN NIYMMEGAAR PLIVRFADNK HQRHQRQIRN IRRHEMIAAM GGGYATYPPH VQVQMGMPGH PGASPQYTVP VPPHYVEAAY GPPNGAPMPG HPYMYPPQQY APTPAYIYPE HTSEETKPTN NRPREGPAGA NLFVYHLPHD LTDADLATAF NPFGNVISAK VYVDKYSGES KGFGFVSYDS VIAAEAAIEQ MNGFQIGNKR LKVQHKRVHG NHPNAPSLSD SQDPPEDLL
|
| |