Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50966 |
Symbol | |
ID | 7201653 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 226422 |
End bp | 228121 |
Gene Length | 1700 bp |
Protein Length | 468 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180967 |
Protein GI | 219120458 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACACACATC ACGGTCCTTC TACTGTAACC ATCCCAGTCG TATACTATAT AGTCTCATTA TGAAATTCGC TCTCGCCTCT TCTGTTTTGG TTTTTACGGC TGTCGCGTCG TCGGCCTTTG TCCCGCACCA GGCCTTGGCC CGGGCCGGGA GTTCCTCGTT GCACAGCGCT GTTTCGGATA CCTACACCTT TACCAAAAGC GAAGAAATCT TTGCCGAAGC CAAAACTGTA CGTGTCCATT CCCTCTTTAC GTTATTTGGT GAGCTGAAAA TAGTGATGCT CACGGAATGA CTTGCATGTT TGTTTGATTT TGGTTATTGT TGTATTTGGC AGTTAATGCC CGGTGGTGTT TCGTCGCCAG TCCGCGCGTT CAAGTCCGTC GGCGGCAACC CCGTTGTCTT TGACAAGGTC AAGGGAGCCT ACGCCTGGGA CGTGGACGGC AACCAATACA TTGATTACGT CGGAACCTGG GGACCGGCTA TTCTCGGACA CGCCGATGAC GACGTCCTCG CCGCCATTAC CGAAACCATG GCCAAGGGAA CTTCCTTCGG TGCCCCCTGT CCCTTGGAGA ACGAGCTCGC CAAAGCCGTC ATTGCCGCCG TGCCTTCCGT AGAAATGGTG CGATTCACCA ACTCTGGTAC CGAAGCCTGC ATGGGAATGA TTCGTTTGGT CCGCGCACAC ACGGGTCGGG AAAAGGTCAT CAAGTTCGAA GGATGCTACC ACGGACACGC CGATTCATTT TTGGTCCAAG CCGGATCCGG CGTCGCCACG CTCGGTCTCC CCGATTCACC CGGGGTCCCC AAGGCTGCCA CCAGTGCCAC TTTCTGCGCC GAGTACAACA ACCTCGAATC GGTCAAGAAA ATTTTGGAGG AACACAAGGA CGAGTTTGCC GCCATTATTC TCGAACCCGT CGTGGGCAAT TCCGGATTCA TCAAGCCCAC CAAGGAGTTC TTGCAGGGTC TGCGCGATTT GGCCACCGAA AACGGCGCCC TGCTCGTCTT TGACGAAGTC ATGACGGGAT TCCGCGTCGC CTACGGAGGT GCTCAGGAGT ACTTTGGCGT CACCCCCGAC GTCACCACCA TGGGCAAGGT CATTGGCGGT GGACTCCCGG TCGGAGCCTA CGGTGGCCGC AAGGATATCA TGGAAATGGT CGCCCCGGCC GGTCCCATGT ACCAGGCCGG AACTCTCTCC GGCAACCCCC TCGCGATGCG GGCGGGCATC GAAACCCTCA AAAAGCTGAG CGCCCCCGGA ACACACGAGG AACTCGAACG CAAATCCCAA AAACTGATCG ACGGCATTGC CGCCGCCGCC GAAAAACACG GCCACGACTT TACTTCCGGG TGTGCCGGAG GTATGTTCGG ATGGTTCTTC ACCAAGGGAC CCGTCACCAA TTTCAGTGAG GCCGCCAAGT CGGATAGCGA AAAATTTGGC AAATGGCACC GCATGATGCT GGAACGGGGT GTCTACCTGG CGCCGTCCCT ATACGAAGCC GGCTTTATGA GTATGGCACA CACGGATGCA GATATTGAAA AGACAATCGC CATTGCCGAC GAAGTCATGG CAAAGTTGTA AAACAACAAG GGAATACTAG TGGGTAGCCT ATCTTGACGA GAATAACGTA GCATCGCCTT ACACAACCGT CCCACCGGAA CTTTTTGGGC AAGCAAAAGA TTCAAATACA AACGTAACCT ATTTACATAT
|
Protein sequence | MKFALASSVL VFTAVASSAF VPHQALARAG SSSLHSAVSD TYTFTKSEEI FAEAKTLMPG GVSSPVRAFK SVGGNPVVFD KVKGAYAWDV DGNQYIDYVG TWGPAILGHA DDDVLAAITE TMAKGTSFGA PCPLENELAK AVIAAVPSVE MVRFTNSGTE ACMGMIRLVR AHTGREKVIK FEGCYHGHAD SFLVQAGSGV ATLGLPDSPG VPKAATSATF CAEYNNLESV KKILEEHKDE FAAIILEPVV GNSGFIKPTK EFLQGLRDLA TENGALLVFD EVMTGFRVAY GGAQEYFGVT PDVTTMGKVI GGGLPVGAYG GRKDIMEMVA PAGPMYQAGT LSGNPLAMRA GIETLKKLSA PGTHEELERK SQKLIDGIAA AAEKHGHDFT SGCAGGMFGW FFTKGPVTNF SEAAKSDSEK FGKWHRMMLE RGVYLAPSLY EAGFMSMAHT DADIEKTIAI ADEVMAKL
|
| |