Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39785 |
Symbol | |
ID | 7195641 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 31934 |
End bp | 33203 |
Gene Length | 1270 bp |
Protein Length | 320 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183796 |
Protein GI | 219127134 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGGCA AGAACAATGC CAGTCGCAAC GTCGACTACG CGATTCATGA GTCCACTTTC CCCGTCAAGC TTCATTATCT TCTGAGCGAG ACGGAGGACA ACGGGAGCGA TCATATCATC TCGTGGCAGC CCCACGGACG GGCCTTTCTA GTACACGACC ACGGGGCCTT TGTGGATCAC GTCCTTCCCA AGTAAGTCAC TTCTCAACGC GTAACAGTAT TGGGTATGCG TCACACGGTG GATGCCTTTG CCTATGATTG GGACACGGAG GAACCCGTGT ATCCTTTTAT GTATTACACC TTGCGTTGTA CTGTGTGTTG TTGTTGTTGT TGCTTGGCAT TGCTTGGCAT TGGATCTTTC TCTACTCACC CGTTGTTCAA AAGCTGGTTC AAGCAATCCA AGTTCCCTTC GTTTCAGAGA CAGCTCAACT TGTACGGCTT CAAGCGCTTT ACGGCGGGTA AGTGGCGCGT ATTGTGTATG TAGAGAGAGC TACGGGACAA TATTACTATG TGAGACCACT TCTCACGTGA TTCATTATTC GTTCGTTGAA TCGTTCTTCG GGACCCGCAC AGGCCGCGAC AAAGGAGCCT ATTATCACGA AATCTTCCTC CGTGGCCGAC CCCATCTTGC GCACCGCATT CCTCGCGTTA AAGTCAAGGG CAGCGGGGTG CGCAAGCCGG GAGCGCCCGA GTCGGAACCC AACCTTTACC TCCGACCCTT TTTGCTCACT TCGGACTTTA ACGGTGACGC CACGGCCGAG GAAGAATTGC ACACCGTCTC CAAGAAACCC CACACCGTGA TCCCGGACGG TCCCAGCAGC CAGAGTGGAC CGGACGTGGC CCGGATCGCC GGTAGAGCCC TGCCGGTTGG CTCCGTGGCG GCGCTGTACG CCAACGCTCC GCCACCGAGA CCTCCTTTTC CGGGTCGCCC GTCGTTGCAT CACTTTCTGG CAGCGCAACA CTGGTCCGGT CCTCCACGCG GGTTCGGTGT CCCGATGCAC AACCCTACCA TGAGAAGATT CGACCCCACC GCTTTATCTC CGCACCAATT GATGTCCCTA CAAACCGCAC TGGAGGAAGA CAATATTCGA CAGCGAGAAG CCCTCCTGAC TTCGTACGCC ACCTTCCCCG GACGGAGCTC GGAGCCGGTG GCGCCATCCG CTAGTACCCT CCAGAAGTTG TCCAAGAAAT TCGCGGCGAC GTCCGCGTCG ACCGCCGCCC AAGATGTAGC CACACTGCTC CAATTGGCTG CGTCCCTCGG ATACCGCTAG
|
Protein sequence | MGGKNNASRN VDYAIHESTF PVKLHYLLSE TEDNGSDHII SWQPHGRAFL VHDHGAFVDH VLPNWFKQSK FPSFQRQLNL YGFKRFTAGR DKGAYYHEIF LRGRPHLAHR IPRVKVKGSG VRKPGAPESE PNLYLRPFLL TSDFNGDATA EEELHTVSKK PHTVIPDGPS SQSGPDVARI AGRALPVGSV AALYANAPPP RPPFPGRPSL HHFLAAQHWS GPPRGFGVPM HNPTMRRFDP TALSPHQLMS LQTALEEDNI RQREALLTSY ATFPGRSSEP VAPSASTLQK LSKKFAATSA STAAQDVATL LQLAASLGYR
|
| |