Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33217 |
Symbol | |
ID | 7204343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 332777 |
End bp | 334150 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186326 |
Protein GI | 219113485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGGA CTGCTGCCGT TCCGCCGCCG GGATCCTCGG CGGACGAGCC TTCCGCGGTT CCGACACTCC GTGTGATGCG CCTCCAGAGT CCGGCACTGC ACAATCCCGC GGCGGGTTCG CTCGACAACC AAGCTGCGTT GCACAACTCT CTCTGTCTAC CCGAATCGTT GGGAGTGTAC GTGGGAGAAA CCTTCACGGC ATATTTGGGC GTGTTGAACA CCTCGACACG CCAGTCTATT CGGCGTTTGA CGGTCCTGGC ACAGTTGCAG ACGCCTTCCA ACCGATGGCA ACTCCCGTCT CTATTAGAAA AAGGGGTCGA CGTCAATCCC GCCAACGGGG TGGACGCCAT TGTCGCACAC GCTATTGAGG AGCCCGGACA ACACATTTTG CGCGTGGAAG TGGGCTACCG CACGAACGAT GGAGGGCTGC AAACCTTTCG CAAATTCTAC CGCTTTCAGG TAGTCAATCC CTTGACGATC CAGCAAACGA CGACCCGCAT GGGTGACAGT CAATGTCTCG TTTCTCTATC GGTAACATAC AACAAGACTG CGGACGCGAC CGGGCCTCTG GTGATTGCCA ACGCCGCGTT TCGGCCGGTG GACGGTTTGG TCGCGCGACT TCTGGATGGA CACGTCTCGG AGTCAACCCC CGACGCCAAA ATGTCCGCCT TGCAGCTCCT GGATAAGTCT GGTCTCCTCC AACCGGGATC AATTGTACGG TACCTGTTCC AGATTGAAGC CACTTCGAGA GAAGCCGTGC TGAAAGGAAT CGCCGCCGGC GATTTACTTG GCCAGGCCGT CCTGACCTGG CGCAAGGCCA TGGGTGAAAC GGGTCAAATC TACTCCGCCA GCATTTACTG TCCACCAGTC CAACCCGATC AGGAACATTT TGTAACGCAC CGATCGGGTC TTTCCGTAGA CGTGGCGGCG GCTGCCGCCC AGCGCGCGCA GAGTACGCCG AGCGCCGTCG ACCAGACTTT ATTGACCAAC CGATTTCCGG TCACGGTCGA ACCGATCGAT CCGCCTCGTC GAATGCTACC AAACGTACCC ACACGTGTCC AATTTCTTGT CGTCAATCAC TCCCAACAGG CCATGACCTT ACAGTTGCAA ATGCGAGTAG CCGATATGCA GCAAGGTTTG GTAGCTTGCG GTACCACCTT TACCAACTTG GGTGTGGTCC AACCCGACGG GGGATCCAAA GTCGCCACGA CTCGCCTGGT AGCCCTGGAG ACTGGATTGT GGCGAGTGCA AGGATGCGTC GTCGTGGATC TAGCGTCTGG CCGGGAGATT CCGCAACCGC CACTCTTTTC CGTTATGGTC GTAGGGGAAA CGGATCCCTC CGATAACGAC AACACCCAAA TCTGTCGGCA ATGA
|
Protein sequence | MSRTAAVPPP GSSADEPSAV PTLRVMRLQS PALHNPAAGS LDNQAALHNS LCLPESLGVY VGETFTAYLG VLNTSTRQSI RRLTVLAQLQ TPSNRWQLPS LLEKGVDVNP ANGVDAIVAH AIEEPGQHIL RVEVGYRTND GGLQTFRKFY RFQVVNPLTI QQTTTRMGDS QCLVSLSVTY NKTADATGPL VIANAAFRPV DGLVARLLDG HVSESTPDAK MSALQLLDKS GLLQPGSIVR YLFQIEATSR EAVLKGIAAG DLLGQAVLTW RKAMGETGQI YSASIYCPPV QPDQEHFVTH RSGLSVDVAA AAAQRAQSTP SAVDQTLLTN RFPVTVEPID PPRRMLPNVP TRVQFLVVNH SQQAMTLQLQ MRVADMQQGL VACGTTFTNL GVVQPDGGSK VATTRLVALE TGLWRVQGCV VVDLASGREI PQPPLFSVMV VGETDPSDND NTQICRQ
|
| |