Gene Information |
Locus tag | PHATRDRAFT_30810 |
Symbol | |
ID | 7198795 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 206157 |
End bp | 207880 |
Gene Length | 1724 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184914 |
Protein GI | 219129476 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGTCACGTG AAAAACCATG GTGGATGTCT TTTTAATTGT CGCGACCGTG GTCGCCATTG TGATTCTTTT GATCATCGCT TCGTATCTCT TGGTTCACTA CCAACATCCC GACGACCACA ATGATGCTTA CGTACCAAAG CTGATCGTTT TACTCGGCTT TGTCTTGGCT GGAGCGACTG TCCTCATGTT GCCGCTGGAT GTCGCTAACA ACGAGGGCTA CGCCGGTAAG CCACGGAATC CGCTGTTTCT AGTAGAAATT CTACGTAGAA CGGAAGCACG ACTGACAAGT CCCGTTTTAC CTCCATCTAT TTGTTGACTG CATCTCAACG TTTTCTTATT GATGTTGTCT ACAGGTTGCG AAGGCTACGA TACGGGATTA TGTGGTGGGC TCAATATGGA ACTCATGTGG GATATTGTGT TTTGGATGAT TCCCATTTGG GTCTTTGTTT TGATCCCTTT CGCTACCTTC TATTACGAGG CCGACGATGG CATGCTCATG GCCGGCACCG CCTACGCACC CAATCCAGTC AGGCAGTCGC GTATTGGCCA AGCCATATGT TATCAACTGT TCGTTTTTGT CATTATCGGT GTCATTTTTG CCGTCACTTA CATTAGTCTG TCGGACTCGA AAATTCCTGT CCAAGAATAC GTGGGGCCAG CATTAGGGAA GGTTAATCAA GGGTTCACCT ACTCCGCGCA AAGAAACGCA ACCGACGATT TGCTTCCTTT CGATTCCGAC GGATTGCAAC CTTGGGGAGA CTCGGATACC ACCTACCTAT CAAACGTCGT GGACAACGGC GAGCAGACCC TGGTATTGCA GGTGTCCTTG AGTACCTTTT ATGCTGGACT CATGGCGTGG TTGGGCTGGT TCCTGTTTGC CATTTTTGGA GGTATCGGCT TGGCGGCACT TCCATTGGAC TTGTACTTGA TGTTCAAAAA TCGACCGCGG CATATGGATG CGGCAGAATT TGCCGAAGCC CAATTGTCCC TGCGGGAACG GGTCAACGAA ATGGTAGACA TTGGCGAACT TATCAAGATT GAACGGGAAC AAAAGGCGCA GGCCGGGCTA ACGTCGGCGT TTGCCACCTT CTCGCTAAAT TCGGATACAC GGAAGGCAGC ACGCGATGAA AATCAAGCTG TTCTGGGTTT CAAACAGGCT GTCTACCTTT TGGAACAGGA TGTGGAGGAC TTTCAGAATG CAACCGTGAA TTACAAGAAG TACAATGTCC TGATACCCTA CATTGCTTTG CTGCTGAGCT TGTGTGCCTT TATTGTCAGT ATATTCTGGT TCATTCACGT AATTGTTTAC GTCTTCCCCA GTCCACCGTT GGCCCCATTT CTGAACAATT ACTTCGAGTG GTTTGACAAG TGGTTTCCGT TATTTGGGGT ATTGTCGGTC GCACTCTTTG TTTCGTATTT ACTTTTAGCG GCACTTAAAG GCTGCTTCAA ATTTGGCATC CGTTTCTTGT TCTTTCACAT TCATCCTATG AAAGTCGGCA AAACCTACAT GAGTTCCTTT ATGTTCAATA TTGCCCTGGT CCTATTGTGC GCCTTGCCCG CGGTTCAGTT TTCGCAGGCG GCCTTTGCCG ACTACGCAGC CTTTGCAGAA ATTCGACAAA TCTTTGGCGT ACAGATACAG TTTTTGCAAT TCTTTTCCTT CTTCTGGACG AACAACGTAT TTATTTACTG CTTCTTAGCC TTCACAGTGC TAACGTCCAT CTAT
|
Protein sequence | MVDVFLIVAT VVAIVILLII ASYLLVHYQH PDDHNDAYVP KLIVLLGFVL AGATVLMLPL DVANNEGYAG YDTGLCGGLN MELMWDIVFW MIPIWVFVLI PFATFYYEAD DGMLMAGTAY APNPVRQSRI GQAICYQLFV FVIIGVIFAV TYISLSDSKI PVQEYVGPAL GKVNQGFTYS AQRNATDDLL PFDSDGLQPW GDSDTTYLSN VVDNGEQTLV LQVSLSTFYA GLMAWLGWFL FAIFGGIGLA ALPLDLYLMF KNRPRHMDAA EFAEAQLSLR ERVNEMVDIG ELIKIEREQK AQAGLTSAFA TFSLNSDTRK AARDENQAVL GFKQAVYLLE QDVEDFQNAT VNYKKYNVLI PYIALLLSLC AFIVSIFWFI HVIVYVFPSP PLAPFLNNYF EWFDKWFPLF GVLSVALFVS YLLLAALKGC FKFGIRFLFF HIHPMKVGKT YMSSFMFNIA LVLLCALPAV QFSQAAFADY AAFAEIRQIF GVQIQFLQFF SFFWTNNVFI YCFLAFTVLT SIY
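
The derived fields in this record (gene length, protein length, GC content) can be cross-checked against the coordinates and sequences listed above. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, which is consistent with 207880 - 206157 + 1 = 1724 bp. The helper names `span_length`, `ungroup`, and `gc_fraction` are hypothetical and not part of any database API.

```python
# Hypothetical helpers for sanity-checking a gene record like the one above.
# Assumption: coordinates are 1-based and inclusive on the replicon.

def span_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length for 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def ungroup(sequence: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(sequence.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    seq = ungroup(dna).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the record: Start bp 206157, End bp 207880.
    print(span_length(206157, 207880))  # 1724
    # First two display blocks of the protein sequence.
    print(len(ungroup("MVDVFLIVAT VVAIVILLII")))  # 20
```

Passing the full gene sequence to `gc_fraction` and the full protein sequence to `ungroup` gives values to compare with the listed GC content and the 523 aa protein length. Because the gene is spliced, the gene length is not expected to equal three times the protein length.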