Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32758 |
Symbol | |
ID | 7197568 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 735083 |
End bp | 736618 |
Gene Length | 1536 bp |
Protein Length | 483 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177675 |
Protein GI | 219111847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTTC CTTCTTGGTG GAAAAACGAG GAATCCAGCT TTTTAGGCGT TAATAAGGCT ATTCAGCGCA ATATTGAAAA CGGGCGCTGG AAAGAGCCGA AGCCTATACA GATGCAAGCT ATTCCTACTT TGCTGGAGCG GCGTGATTTC ATTGGATCGG CGCCGACCGG TTCGGGAAAG TCGGGTGCCT TTATCATTCC GTCCCTCTTT CTAAGCAGCG CTCCGCATCA CGTGTATTAC GACAACATTA GTAGTCCAAT GAACGGAAAA TACAAAACAA TCAAAAAGAA ATTGAAGTCA TCTCGAGAAG GTGAGATTCG GGGGTTGATC TTGGCCCCGT CCTTGGAATT GGCAGCTCAA TTGCATCGCG AAATCGAGCG CCTTGGTATT GGCAAACCCG GTGGATTGTC GTCTCTTCTA CTGTCTAGGT CAAACGCGTC CCAAGTTATT GGAGGTAGTG CCGGGGGAAA GAGTGGACTC GATATGCTCG TCTCTACTCC ATTGCGGCTC GTCGATGCGA TAGAAAAGGG CTTGCGGCTA AATTCAGTAC GAATAGTGGT ACTTGACGAA GCGGATCGTC TCCTCGACGC TACTGATGGG AAAAGAGCCC GAAAGCCAAA GGGGGAGGCA GGAACGGAGT CGATTGTAGA AGACGAAGAG GAGGAGGAAG AAGAAGAAGA TGACAGTGTG CATGGTGCTT CTGGCTCCAT GCAGAGTCAA TCTTTTCTTG CACAAATGGA CATTGTACTG AGTGAAGTGC CATCAACGGC AACGCGTGCT CTATTCTCGG CTACCGTCAC GCCGACTGTA CGCTTCTTGG CTGAATCTAT TCTGCGAAAC CCTTTAGACG TCACAATTGC CAACTCAGGC TCCGTTGGTG GTGCAAATAC AGATATCGAG CAAGAGTTAA TGTTTGTTGG AAAGGAACAG GGCAAACTCC TTGCTATACG GCAACTGGTC CAGCGAGGAC AGCTCCATCC TCCTGCCATA ATATTTTTGG AGAGTAAAAA TCGAGCGCAA GCGCTATTTG GAGAACTTTT GTATGATGGT ATTCATGTGG ACGTTATCCA TGCGGGTCGC TCCAAATCTG CCAGGGAAAA TGCGGTTGCC AAGTTTCGTC GAGGTGATAC TTGGGTTTTG ATCTGTACCG ATCTCGTCGC TCGTGGTGTG GACTTCAGAG CCGTGAATAT GGTCATCAAT TATGATTTAC CCTCATCAGG TATCGACTAT GTACATCGTA TTGGTCGAAC AGGGCGCGCC GGTCGCAAAG GAAAAGCCAT TACCTTCTTT ACAGAGGGGG ACTTTGAGAA CTTGCGAACT ATTGCCAATA TTATCAAGCA GAGCGGCTGT AACGTAGAGG ACTGGATGCT GAATCTACCA AGAAAGAGCA ACAAAAAGAA CGCGAATTCC GTGGTTCGAC GACGGGAACG AATCAGCACG ACACCCGAGT ATGACCGCCA AAAAAAACGC CGACGACAAC AGGCCATAGA ACACAACAAA AAGAAGTTGA AAATGTCAAC TACTTCGCAA GAATAG
|
Protein sequence | MSFPSWWKNE ESSFLGVNKA IQRNIENGRW KEPKPIQMQA IPTLLERRDF IGSAPTGSGK SGAFIIPSLF LSSAPHHKLK SSREGEIRGL ILAPSLELAA QLHREIERLG IGKPGGLSSL LLSRSNASQV IGGSAGGKSG LDMLVSTPLR LVDAIEKGLR LNSVRIVVLD EADRLLDATD GKRARKPKGE AGTESIVEDE EEEEEEEDDS SQSFLAQMDI VLSEVPSTAT RALFSATVTP TVRFLAESIL RNPLDVTIAN SGSVGGANTD IEQELMFVGK EQGKLLAIRQ LVQRGQLHPP AIIFLESKNR AQALFGELLY DGIHVDVIHA GRSKSARENA VAKFRRGDTW VLICTDLVAR GVDFRAVNMV INYDLPSSGI DYVHRIGRTG RAGRKGKAIT FFTEGDFENL RTIANIIKQS GCNVEDWMLN LPRKSNKKNA NSVVRRRERI STTPEYDRQK KRRRQQAIEH NKKKLKMSTT SQE
|
| |