Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46359 |
Symbol | |
ID | 7201628 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 111219 |
End bp | 112565 |
Gene Length | 1347 bp |
Protein Length | 357 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180944 |
Protein GI | 219120410 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAATA TTCGTGTCCT GCGATTTCGT CAAATGCTGT TGGATCCTTT TGTACTGGGC GTTTGCCTTC TCGCGTTCGC CGGTACTACC ATCAATACGG TACACGCGCT CGTACCTAGC CAAGGTGGCG AACTCCAAAA GGCAGCCCAG CAGTTGCAGC ACCAGTACGT TTCACAGTCC CCGAGTGCTC CAGTGAGTAC GAGCCGAGCG GGACTCTCTC CTTCTGTCGC TCCTCCATTG TCGTCACGCA TCCCTGGGCA TCCTCCGCTG GCTTCGACTA CGAGTCGCGC GGCGGCAGCC AGGATGACCG AAGAGGAGAA CGAGTGGTAC ACTCCACCTC CGGCTCCCGT CCAATCCACC CAGATTCCTA CCGAAATCGC CGTCGTGAAC TCCGACGAGG CTTGGCGGAG TTTCTGGCTC TCGATGACGA CCGGCTCTGC GTAATTTCCT TTCACGCGTC CTGGTGCAAG AGTTGCCAAA AGTTCGGACT CCTCTATAAA TCGCTAGCGC ACAAACTCGG AGACAAGCGT GACCGCAAGA CGCAGAACAT CGTTGAACGC GGTTCCGTGC GTTTTGCCTC GGTCGAATGG GGCGCCAACA CGGCACTGTG CCGATCCCTC GGTATTAAGC GACTGCCTAC CACTCAAATA TACCACGCAG GGACCCTGCT CACCAGCTTT GCCTGTGCGC CGGCCAAATT TCAAACCCTC AAGGATCAAA TCAAGTACAT GACCCGGACG CTGCAAAACA GCGATCGACT CGCGGCGATC AAGGCAAATC CCTTGTTTCC GGACGAGCAA GCCGTCCAAG ATAAGGTTCA AGCGCTGCTA GCGGCCCATA CCGAACAGGA ATTTTCCAAG ACCCTAGACA TTGGCGCGGC GCTCATTGAC GCTACCGTTA TGGTGGCACC CCCATCATCC GATCAGTCGG ACGGCAGCAA TACGGAAGAC CATGGACAGG GGCTCTCCCG TGCCGAGCGT ATGGCGGCGT TTGCGGAACG GATCCGCTCC AAACAAAACC GAGCAGATCA TGTTGCAACA ACCGCTGAGA GTGTAGCCGA TTCCTCCCAC AAGGTGTGGT GGCGCTTTCG ACGAGCACCC TAGACAAACC CAACACTTAC CCACTCACAA AGTCCACGTT TGGGCTTTGT GGACATAGAT GTGTACCCTA GTACGAATGT GTGAAGCAGC AAACAGAAAA GAATAGAAAA AGACAACTTG ATCTGGAAGG GACCTTGTTC CAGACCATCA AAAAAAGAAT AGAGACTACA GAAAATGAGA CGCAGTGTTA GCAACGTAAC TATATATGGA TACCGCTCAT TTAAATAGCT CTGCTGCCAT AGTCCAT
|
Protein sequence | MTNIRVLRFR QMLLDPFVLG VCLLAFAGTT INTVHALVPS QGGELQKAAQ QLQHHEYEPS GTLSFCRSSI VVTHPWASSA GFDYESRGGS QDDRRGERVV HSTSGSRPIH PDSYRNRRRE LRRGLAEFLA LDDDRLCVIS FHASWCKSCQ KFGLLYKSLA HKLGDKRDRK TQNIVERGSV RFASVEWGAN TALCRSLGIK RLPTTQIYHA GTLLTSFACA PAKFQTLKDQ IKYMTRTLQN SDRLAAIKAN PLFPDEQAVQ DKVQALLAAH TEQEFSKTLD IGAALIDATV MVAPPSSDQS DGSNTEDHGQ GLSRAERMAA FAERIRSKQN RADHVATTAE SVADSSHKVW WRFRRAP
|
| |