Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48723 |
Symbol | |
ID | 7194994 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 56541 |
End bp | 58420 |
Gene Length | 1880 bp |
Protein Length | 542 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183279 |
Protein GI | 219126051 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.187612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTGCAATAG CTGGATCTTT CCAATTCCCA GTAACGGACA GTTCGCCATT GCCAAAAAGA GATTGTTAAC GACGTGCTCA CAGCGCAGTG GCCTAGTCCT TTGTTCGCAT CACATACAAC ACGAGTTGCA GCAAAGAATC ACTTCAAAGC GCTCAAAAAA ACCTTTGTCC CAACATGAGA GTAGCTCATT TTCCTTTCGC GGATGGTCTT TTGTTGGTAC TTGGCGTGCT GTGGTGTACC GATGGCGCCT TGGCGTTCAC CGCGGCTCCA TTGCACACTC AGTCGCACTT GGATTCAAAG AATGGCGTCT TCGGGAACGG AAAACTGACT GACAATCGCT TTTTCGGCAA GGCCGATACA GTTAGGCCCA TTAGCACTCC GCGCGGTTCG GCTGTGGGAA ACTTGAAGGC TTCCCCGTTG GTCGCACTGC TCTCGTCACC ACTAGGTTCC GTTGGAGTGC TCGCCAGTAT CGTCTTGGTG CACGAAATGG GACATTATCT GGCGGCTCGA AGTTTTGGTA TTTCCGTGGA AGAATTTAGC ATCGGATTTG GCCCCAAGCT GTTGGGCTTT CGAGCCTTTG GGGACGAATT CAATTTACGA GCGTTACCCC TGGGAGGCTA CGTCCGATTT CCCGAAAATT ACAACGCGAC ACGAGTCCGC GAAATGGAAG AAGCGGCATA CTTAGCCGCA AAGGAGAATG GTGCGTTGGA AAAACCCGAC GCTGCATCGG AAATACTGAA CATGGTAACC TTTGGAGCCG TCGAAGCTCG GCAGCAAAGG GAAAAGGAGC AACAACTATT ACAACAAGTG GAAGAATTTA ACAACTTGCC GTTTTGGAAA AAAATGGTCA AGACACCTCC GCAAAAGTCG CTTGATCGCG GAAACGTTGA GATTGAGTAT TATGATGATC CTAAGCTTTT ACAGAATCGG CCGTGGCAAG AACGAGCTGT AGTCCTTAGC GGAGGAGTGG TAAGTCTGGT TGCGCCACAC GTATATCTTT CGTCTCGACA ACGGGGAGTA ACTCATTCTC TCTTGGATTG GAAAAGGTAT TTAATTTGCT CTTGAGTTTC TCGATCTACT TCGGCCAAAT CAGCGTCGGG CCGGGACTTC CCCAACCTGT CTTTGATCGT GGAATCGTCA TCAACGCGGC ACCAACTTCC AACGCAGCGG CGAGCGGTTT GCTTCGAAAA GGCGATATCG TGTACGAGAT CAACGGCTCG CCAGTTTCTG TTTCGTCATC GCCGTCACCG TATGAGGCAC AGAAATCTAT CAACGAATTT ATTGCCAAAA TTCGAACGGC ACCTGAAGGA CAGCCAATAA AACTCGTTGT GAGGCATCCG AACGAAAAAG AACTGGTTAA TGTTGACGTC GTTCCCAAAA AGTTGGACGC TGCTGGACCT CAAACCATTG GAGTTTTGCT CGCACCAAAC TACATCAAAT CAGAAGTGTT ACGTACCGAC AACGTCGGGG AAGCTGCTTC GTTAGCATAC AAGTACGCTT ATTCATTGAC GAGCCAAACG GCTGCGGGTC TGGGGTCCTT ATTTGGGGAT TTGTTTTCCG GAAAAGCGGG CTCCTCCAGC AACCAAGTCT CGGGCCCTAT TGGCTTGATA CGTACAGGTT CCGAAGTCGT CGCGACTCAA GATCTGACTA CCGTCTTATT GTTTGCCGCG GCCATCAGCA TCAATTTGGG TGTTGTAAAT GCCCTCCCTT TGCCTGCGCT GGATGGCGGA CAGCTTCTCT TCGTCATTGC TGAAGCACTT ACCGGACGTA AAGTCAATCA ACGATTACAG GAAGGAATTA CTGGAGCGGC AGTTTTGCTG CTACTGCTTC TGAGCGTGGG TGCGGCCGTC GGCGATGTTT CCTCTATTCT TGGACGGTAG
|
Protein sequence | MRVAHFPFAD GLLLVLGVLW CTDGALAFTA APLHTQSHLD SKNGVFGNGK LTDNRFFGKA DTVRPISTPR GSAVGNLKAS PLVALLSSPL GSVGVLASIV LVHEMGHYLA ARSFGISVEE FSIGFGPKLL GFRAFGDEFN LRALPLGGYV RFPENYNATR VREMEEAAYL AAKENGALEK PDAASEILNM VTFGAVEARQ QREKEQQLLQ QVEEFNNLPF WKKMVKTPPQ KSLDRGNVEI EYYDDPKLLQ NRPWQERAVV LSGGVVFNLL LSFSIYFGQI SVGPGLPQPV FDRGIVINAA PTSNAAASGL LRKGDIVYEI NGSPVSVSSS PSPYEAQKSI NEFIAKIRTA PEGQPIKLVV RHPNEKELVN VDVVPKKLDA AGPQTIGVLL APNYIKSEVL RTDNVGEAAS LAYKYAYSLT SQTAAGLGSL FGDLFSGKAG SSSNQVSGPI GLIRTGSEVV ATQDLTTVLL FAAAISINLG VVNALPLPAL DGGQLLFVIA EALTGRKVNQ RLQEGITGAA VLLLLLLSVG AAVGDVSSIL GR
|
| |