Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_36992 |
Symbol | |
ID | 7204457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 830018 |
End bp | 831163 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185793 |
Protein GI | 219121125 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.40266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAC GATTCCGCCG TAAAGCATTT CTTTTCAGGG TATGCTTTCT AGTGTTGGCT GGGCAAGTCC TGCTACTCTG CCATAGATGG GCCGAGGAAG AACAGCAAGA ACACTACTCT GTCGAGATAT TGCCGAGAAT TCATTTTACG CCAGCTGAAA GTAACTTATC ACTGAAATCT TCATCCGCAT TGATCACAGA ACCCAGTTTT TCGTCCGGCA GCCAACGAAG GGATGAACTA TCTGATACTT TATGGCCCGC AGACACTTTC CGCAATAGAA CTTTAGTGGT TGTTCTGGGT AGTCTTCGCT GTGGGGAAGC GGCATGGAAG AGTCTATACC GCAATGTCCT GGATGTCAAC AACGCTGATT TGGCCGTTAT TACGCAGGCT AAACCAAAAT CAGACAACAG CCGTCATACC GTATCTTTGC TGCAACGGGC CAAATATGTT TGGGAGGTAC CTTCCTTTAC CGACTGGGCG GATGCGTTGG ACTTAATGGC GGGAGGAAGA GCCTGGAGGG AACGGGTACC AGAATATTAT ACAGACCACG AATCGGGTAT TCTGGGTGGT GCCTTGAATT ACTCTGGGAG CGGCGCAATT ATCTTCTGGT ACCGATGGTT CCTTACGGAG CGGATTCGGG AACTGAATTG GAAGTCTCGG TACGATCGGT TCGTTATTAC GCGAAGCGAT CATTTTTATC TCTGCCCACA CGACATCAAC GAACTGGATC CTTACTTTAT GTGGGTTCCA CAAGGACAGA CCTGGGGTGG TGTCACCGAT CGCTATTTGG TCGTTAATGC ATCGAATGTT TTGGAAGCAT TGAACATACT ACCCCCCTTA TTGAAAAACC CTTCGAGATA CTCCAATTAT TTGCAACGGA AAGTCAATAC TGAGCATTTT TTGGCCATGC GCTGGCGCGA ACAGTCGCTC TTTCCAGACA AGCGCAACAA GTTTCCCCGA ACCATGTTCG TGTGTATGGC TCGCGAAGAT GCCCTCACAT CGTGGAAACC ACTGGGCATC GAAGTGTTCC CGGGTGTGTT CACCAAGTAT CACATCGAAT TTTCGACGAG TCAACAGGCC TGCCGAATGT CTCGACAAGA ACGACTAGAC TTCATAGCAC GCAAATCTAA TTTATTGCCG ACTTAA
|
Protein sequence | MSKRFRRKAF LFRVCFLVLA GQVLLLCHRW AEEEQQEHYS VEILPRIHFT PAESNLSLKS SSALITEPSF SSGSQRRDEL SDTLWPADTF RNRTLVVVLG SLRCGEAAWK SLYRNVLDVN NADLAVITQA KPKSDNSRHT VSLLQRAKYV WEVPSFTDWA DALDLMAGGR AWRERVPEYY TDHESGILGG ALNYSGSGAI IFWYRWFLTE RIRELNWKSR YDRFVITRSD HFYLCPHDIN ELDPYFMWVP QGQTWGGVTD RYLVVNASNV LEALNILPPL LKNPSRYSNY LQRKVNTEHF LAMRWREQSL FPDKRNKFPR TMFVCMARED ALTSWKPLGI EVFPGVFTKY HIEFSTSQQA CRMSRQERLD FIARKSNLLP T
|
| |