Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_30898 |
Symbol | |
ID | 7198909 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 49373 |
End bp | 51007 |
Gene Length | 1635 bp |
Protein Length | 422 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | vacuolar protein |
Protein accession | XP_002184958 |
Protein GI | 219129570 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTGAAAGTA CACTTGTATA TCCGACCAAC CTTTTCGCTG TGGCTTTGTT TCCCCTTCAA TTTCGCTCCC TGTCTCAATT TCTAGCATGG AGAATTCGTT AATCCCACAG GGCATCGAAA TGGTGCAAAA AGCCATTTCG GCAGACAACG AAGGGGAATA CGAAAAGGCA CTGGGTTTGT ATCGCGACGC TCTGGCTCGT TTCACCATGG GACTCAAATA CGAAAAGAAT GAAGCGCGTA AGAAACTTAT TCTTGAACGC GTGGAAGGAT ACATGAATCG GGCTGAGGAA CTCTCGGATT ACGTCAAGAA ACAATCGGAA TTGGATAAAA ATGGAGGAGG AGGCGTAGCG GCCAAGAACA AAGACGATGG TGACGACGAC GGTGACGCCG ACAAGAAGAA ACTACGGGGG TCGCTTTCAG CAGCAATTGT CACGGAAAAG CCAAACATTT CCTGGGAGGA CGTGGCTGGT CTGGAAAACG CCAAGGAGTC ACTCAAGGAG GTAAGCAGTC CCGTACTTGA CCTTCACGAG TCTCTGCGAC CGGCTGGAAG CTGATCGTTT CCGACTATAA GCAAAAAGGA AAAGTACGCC GTGATACGAG TACATTATCC TAACCCTTTT TTCCTCGCCC TTACTGCTCG CTTACTACAG ACTGTCATTT TGCCAACAAA ATTCCCACAA CTTTTTACTG GAAAGCGGAA ACCTTTCAAG GGCATCCTCC TCTACGGTCC TCCTGGAACG GGTAAATCCT ATTTGGCCAA GGCGGTCGCA ACCGAAGCGG ATTCCACCTT CTTTTCCGTA TCATCCGCAG ATCTCATAAG CAAGTGGCAA GGCGAAAGCG AACGACTCGT TCGAAACCTC TTTGAAATGG CCCGAGAATC GCCCGGTAGT CGCGCCATTA TATTCATCGA TGAAGTCGAC TCCCTCTGCG GTAGTCGCTC AGAGGGAGAA TCGGATTCTC TTCGTCGTGT CAAGACGGAA TTTCTCGTTC AAATGGACGG TGTCGGCAAA CAAGACGGAC AAGTCCTCGT GCTCGGGGCC ACCAACATTC CGTGGGAATT GGACGCGGCT ATTCGTCGGC GTTTTGAAAA GCGGGTGTAC ATTCCCCTCC CGGAAGCCGA AGCTCGATCT TACATGCTCA AGTTGCATCT AGGCGACACG CCTAACGATT TGGAAGAGGA AGATTTTGAT CGCCTGGGTA CAATTACTGA AGGAGCGTCC GGATCTGACA TCCAAGTCCT CGTAAAAGAA GCCTTGATGG AACCCCTGCG AAGATGCCAG CAAGCCAAGC AATTTTACAA AGATGAGGAA GGCTATTTTC ATCCGTGTAC AAAATACCCA AACTGTTCCA AGTGTCCACC GAAGCTGTCA TCGGATAAGC CCGGCAAGGA TTATTCATGC AAAAGTTGCG GTGCGGTTCG TATGAGTTTG TGGGATGTCC CAGGCGAGAA GCTTAGGGCC CCCAAGGTCG TGCGTAAAGA CTTTGAAAAG GTTATGAAGC ATTCCGTAGC CACCGTATCA CCGGATGAGC TCAAGCGGTT TGTGGATTGG ACCAAGATGT TTGGGCAAGA TGGCGCATAG TAAAATTGGA AAGGGATGTT GCGCTTCTAA ATTAGTGCAA TAATTGTGAT GAGTT
|
Protein sequence | MENSLIPQGI EMVQKAISAD NEGEYEKALG LYRDALARFT MGLKYEKNEA RKKLILERVE GYMNRAEELS DYVKKQSELD KNGGGGVAAK NKDDGDDDGD ADKKKLRGSL SAAIVTEKPN ISWEDVAGLE NAKESLKETV ILPTKFPQLF TGKRKPFKGI LLYGPPGTGK SYLAKAVATE ADSTFFSVSS ADLISKWQGE SERLVRNLFE MARESPGSRA IIFIDEVDSL CGSRSEGESD SLRRVKTEFL VQMDGVGKQD GQVLVLGATN IPWELDAAIR RRFEKRVYIP LPEAEARSYM LKLHLGDTPN DLEEEDFDRL GTITEGASGS DIQVLVKEAL MEPLRRCQQA KQFYKDEEGY FHPCTKYPNC SNLWDVPGEK LRAPKVVRKD FEKVMKHSVA TVSPDELKRF VDWTKMFGQD GA
|
| |