Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46782 |
Symbol | |
ID | 7204538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 441122 |
End bp | 442302 |
Gene Length | 1181 bp |
Protein Length | 352 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185703 |
Protein GI | 219120943 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.211133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAAAATTCG AAGTTACAGT TACACAGTTA GTTACATTAT TTAGTCAAGT CAACCCAGAA AGCAGCAAAC CTGACAATCA CAGAACAGGT TTATTGCGGA ATGTGAGATC TCCATTGTTG GTATGATTGA TGACAACACT TCAGTTGGCT CAAAGAACTG GTCCGTATTC AAACAAACTC AAGTTCGATA CAAACGCCAG GTTGACGCTC CTGATCGGCT GGACGATGTA GATGACTTTG TGGATTTCGG CCGAGCCGAT GGGCGGATTC AGCGCATTGT CGTACCAAAA AGCGAAGATT TTGCTTTTTA CAAAGGGCCC GTCTATGGAG TAAATGAGTT CCCTGGATTC TTATACGCAC CGCAGGCGCT TTCTGAACTG CTACAAGCAG AGCTTTCCTT TTTGGCGGTT TCTTCGTTTT GTGAACGCCC TCACTCAACC AATATTGACA AGGTCCCTTG CAAAAATTGG GAAATAGACG ATGGACAACG ATGCATGTGG GAAGAATGGA AATTAGAGCA AATGGAAACT TATACAGAGG CATCCCAAAT GACTTCCAAA AGTTCGTCCA GACCAAAGTA TAGAAGTTTC AGAAAGCTAT CCTGGGCTAC GATGGGCTAT CATTACGACT GGAATACTCG ATCGTACAAT GAAAAGGCAA AATCACCGAT GCCAAAATTG TTGGAACGGA TTGCGGAAAT ATTCGCTGCA ACGTCTCTTC TTGTCGACGG ACAGGATCCA TGTTTCACGG CTTCAGCCAG CATCGTCAAC TTCTACACGC CCAAGTCCAT GATGGGTGGA CACCGGGATG ATTTAGAGCA TGCTCTGGAC AAACCAATTG TTTCTATTAG CTTAGGACGA CCGGCCGTAT TTCTGTTGGG TGGAAACACC AAGGATGATC AACCAGTAGT AGCGATACTA GTTCGACCGG GAGATGTTAT GATGATGGGA GGGGCATCCC GGTTGCGCTA TCACGGAATG GCCCGACTAC TGCCTACGAC CGGTCTACCC TCAGTCGAGA AAGACCGTGT GCCAGACTGG GATTTGCAGC TTTCTGCAAA ATCGTTAGGA AAGGAAGCGG AACTTTCGCA GTTTGAAGAG GACGACCGAA GGGCTTTGGC ATCTTTTCTG GAACAACATA GAATCAATAT CAACGTTCGC CAAGTATACT CCGGAACGTA G
|
Protein sequence | MIDDNTSVGS KNWSVFKQTQ VRYKRQVDAP DRLDDVDDFV DFGRADGRIQ RIVVPKSEDF AFYKGPVYGV NEFPGFLYAP QALSELLQAE LSFLAVSSFC ERPHSTNIDK VPCKNWEIDD GQRCMWEEWK LEQMETYTEA SQMTSKSSSR PKYRSFRKLS WATMGYHYDW NTRSYNEKAK SPMPKLLERI AEIFAATSLL VDGQDPCFTA SASIVNFYTP KSMMGGHRDD LEHALDKPIV SISLGRPAVF LLGGNTKDDQ PVVAILVRPG DVMMMGGASR LRYHGMARLL PTTGLPSVEK DRVPDWDLQL SAKSLGKEAE LSQFEEDDRR ALASFLEQHR ININVRQVYS GT
|
| |