Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33692 |
Symbol | |
ID | 7198005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 133025 |
End bp | 134572 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178172 |
Protein GI | 219114753 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCCG ACGGCGCCAG CACGAATCAG TACTCTCCAC AGTCGTTCAA TAGCAGGATG ACTGCCGTGG TTCCTTTTGT GGTCCTTCAA CAACCCGACT GGGCCACTCT ACACCGCACA GTGCAGGCAT TTCGCCAAGA ATCGCAGGCA GCCTTACAAG GGCTCAGTCG TGACTTGCAG GTTGCCTTGG CTACCGAGCA AGAATATCGC CAAGTATTGC AGGAGAGCAC TGAATCCGAC TCCGGTACTC TGGCCGATTC TCTCCAAAAT CGTATCGATC GGCTATCGAT ACTCTTGGAA CACAACGCCA AACTTCTCGG TACCTTGCTC CAACCGTTTC CAACGATCCG CGTTTTGCCT AGTGCCAACC GAGAGGGGCA AGTCGGTACA GTGAACTCCC CAAACCCCAA TGATACAATC GATCACACCT TCTCCCGGGC ACACACCTTC CCCATTCCCC GCGGAGCGGC AGCATTCTTT GAGTCCTCTT CTCCGCAGCA GCCGTACGAT TCGGGTGCCC AGATATTGGC ACACATTGTC CGCGATTGGA GTGACGCAGG GCGTCCCATT CAGGCTTCTC TGTACGATTG GTGCGTCGAA CAAGTACTGG CCTACCGCAC GAGGACCCCA TCGTTGCATC CAACCGTCCG CCAGGCACAG CACGATTCGA CTCGACCAGC AGACCGGATC CTGGTTCCGG GTGCAGGGTT GGGACGCCTG GCGTGGGAAC TCGCCGCGTT GCCAACGCCG TCGGGAAACA ATAGTGCGGA GCATCGTGCT GTGTATGTGG AAGCAGTGGA ATGTTCCGTA TCCATGGCGG CCACCGCCGC AATGATCTTG CCCCATACGT ATCGACACAA GTTGGACGAA TCTGTTACTA CTACACCTGG TGGCTGGAGT GGACGGGCCG CCGCGCATTG GACGGCCTAC CCCTACGTCG TGGATGCCTT TTCTAACGAA GTCGATAGCG AACGACGGTA TCGAGCCGTG CATTTTCCTT CCGTGGATCA ATCCGAGAAG GCGTACGAAA CCGGTGTCGA TGTGGGCGCC GAGTCGTCCG ACCGGTACCG ACGGAGTCGC CATCTTGACT CCCGGAACAA TCTGTCGTAT ACGATAGCGG ACTTTACGAC CTACCGAGGG TTGACGGAAA CCAGTGGAGC GTACCGGTTC GTTGTAACAT GTTTTTTCCT CGATACCGCC ACGAACGTGT ACGACTACGT GGCGACGATT CGGCACGTGT TGGAAGGACC ATCCCGAGAC CGCGATATGG CACTCGATGA TGAGCGTTGT GGTGGTGACG GTGACGGCGG CCTGTGGATC AACGTGGGTC CGTTGCAGTG GCACCGCAAT GCGGTATTGC ATCCGAGTGC GAACGAGCTG CGTAGTATTG TGGAACGCAT GGGCTTTACT ATTCTGTATT GGAAAGTGGA CGCAGCACCC GTGGAATACC GCGACGAAGT TGTTGGAACG AGAGGTCCGA AGGAGGAACC GCGATCCACT CATTACGATG CCTATTGTCC GTTGCGCTTC GTCGCACGGC GCAATTAG
|
Protein sequence | MDSDGASTNQ YSPQSFNSRM TAVVPFVVLQ QPDWATLHRT VQAFRQESQA ALQGLSRDLQ VALATEQEYR QVLQESTESD SGTLADSLQN RIDRLSILLE HNAKLLGTLL QPFPTIRVLP SANREGQVGT VNSPNPNDTI DHTFSRAHTF PIPRGAAAFF ESSSPQQPYD SGAQILAHIV RDWSDAGRPI QASLYDWCVE QVLAYRTRTP SLHPTVRQAQ HDSTRPADRI LVPGAGLGRL AWELAALPTP SGNNSAEHRA VYVEAVECSV SMAATAAMIL PHTYRHKLDE SVTTTPGGWS GRAAAHWTAY PYVVDAFSNE VDSERRYRAV HFPSVDQSEK AYETGVDVGA ESSDRYRRSR HLDSRNNLSY TIADFTTYRG LTETSGAYRF VVTCFFLDTA TNVYDYVATI RHVLEGPSRD RDMALDDERC GGDGDGGLWI NVGPLQWHRN AVLHPSANEL RSIVERMGFT ILYWKVDAAP VEYRDEVVGT RGPKEEPRST HYDAYCPLRF VARRN
|
| |