Gene Information |
Locus tag | PHATRDRAFT_37459 |
Symbol | |
ID | 7202371 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 193918 |
End bp | 195261 |
Gene Length | 1344 bp |
Protein Length | 422 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181676 |
Protein GI | 219122695 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
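The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon. Neither assumption is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 193918
end_bp = 195261
protein_length_aa = 422

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
coding_length_bp = 3 * (protein_length_aa + 1)

# Bases inside the gene span that the protein does not account for.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)
```

Under these assumptions the span matches the listed gene length of 1344 bp, and the 75 bp left over after the coding bases is consistent with the gene being marked as spliced.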
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGGC GTATTCCTGA AGAGGACCCT TTAAAGTGGG AACGCATGTA TCAAGAAGGA GGGTAGGTCG AGCAAGGACC AATTTCATTG GTGAGTCTTC TGTACCCTAA CCAAGATTCG GTTTTCCTGT CCTTCAGAAA CGCTGCTCCT TTCAAGCTCG AAGGAATGAT GAACCTGCAG CAGTCCAGCG AAGTCAGAGT TGTTTCGTTT GATCTCGACA ATACTTTATG GGTCACATCG GCGACTATTT CCGCCGCCAA TGAAGCTCTC GCGGCCTTCC TCGACGCACG AGGCGTCGTT CAACCTCAGC GAATAGAAAC AATAATGGGA ATTTTATTCA AAGAAAACAA AGAACGATAC TGCCCCATTG AAGTGGAACA GGCAAAAGCT CCAGCTTTAT TAACACTACT CCGAAAAGAT GCCATTCGAA AAATTCTTTT GGACGACAAC GGATACTCGT CCGAGAGTGC TGAATGCTCT GCCGAAGAAG CATTTCAGAC TTGGACAAAT GCGCGCCACG ATGCCATTAC CTTTAACATG GCTGAAGCTG TGAAAGAATG TCTTCAAGAA ATAGCTGCTA TTCAAACGTC GGATGGACAT TCGGTCGTGA TTGGAGCCAT TACGGATGGC AACTCAGATC CACGCTTGAT TGATGAGCTA TCCAAATATT TTCATTTCTG CGTCAACGCC GAAAAAGTTG GAATAAGCAA ACCTGACAAA CGAATCTACC TAAAAGCTGT ACAGGAACTG GCCGGTCACC CTAGCTTAAA ACATCTCCTT CCCGACGATG ACGCCCAAGA CTATGAATTG GAATCAAGAT TGGGACCGTG GTGGGTTCAT GTGGGTGATG ATTTCATCAA AGACGTAGTC GCTGCAAAAG ATCTGAATAT GCGTAGCGTC TGGGCTCGAG AGCTGGTCCT CAACAAACAG GTAGATTATG CATTGTCGGA GGGAAAGCCG GAGCGAAGCG TTGAAGCTCT GGTGAAAGAT GTTTCTAAGA ATGAAGTAGT TAAGATGCAG GTAGGGGCTA CAGATTACTT GGTGAATTCT CTTCACCAAG AGTTTGCAGA TGCAATTGTC GACCGCTTTG GTGAAGTTGC CACCGTTCTA AATGCATGGC ACAGTGAAGG ACTGGTCAAA ACCTCTACTC CTCTCCAAAT TGTCGAGAAC GATGTGACGG TACAGGAAGA AGTCGTGCTA CGCCCCGAAG TAGAATCGGG AGACACTGAA AACGACAGAA CGCCAAACAT AAAGAACGGA GGATCAAAAT TTTGCCTGTT TTGTAGGAAT ACACTTCCTG GAGCCGCGAA GTTCTGCTCG GAATGTGGGG AGGGACAACA TTAG
|
Protein sequence | MEGRIPEEDP LKWERMYQEG GNAAPFKLEG MMNLQQSSEV RVVSFDLDNT LWVTSATISA ANEALAAFLD ARGVVQPQRI ETIMGILFKE NKERYCPIEV EQAKAPALLT LLRKDAIRKI LLDDNGYSSE SAECSAEEAF QTWTNARHDA ITFNMAEAVK ECLQEIAAIQ TSDGHSVVIG AITDGNSDPR LIDELSKYFH FCVNAEKVGI SKPDKRIYLK AVQELAGHPS LKHLLPDDDA QDYELESRLG PWWVHVGDDF IKDVVAAKDL NMRSVWAREL VLNKQVDYAL SEGKPERSVE ALVKDVSKNE VVKMQVGATD YLVNSLHQEF ADAIVDRFGE VATVLNAWHS EGLVKTSTPL QIVENDVTVQ EEVVLRPEVE SGDTENDRTP NIKNGGSKFC LFCRNTLPGA AKFCSECGEG QH
|
| |
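The relationship between the gene sequence and the protein sequence can be illustrated on the first 150 bases. This is a hedged sketch, not part of the record: the intron position (bases 63 to 137 of the gene sequence, 1-based) is inferred here by comparing the two sequences, and the standard genetic code is assumed because the Translation table field is empty. All names in the code are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# First 150 bases of the gene sequence, copied from the record.
gene_prefix = (
    "ATGGAAGGGC GTATTCCTGA AGAGGACCCT TTAAAGTGGG AACGCATGTA "
    "TCAAGAAGGA GGGTAGGTCG AGCAAGGACC AATTTCATTG GTGAGTCTTC "
    "TGTACCCTAA CCAAGATTCG GTTTTCCTGT CCTTCAGAAA CGCTGCTCCT"
).replace(" ", "")

# Assumption: a single intron at 1-based positions 63..137 (inferred, not listed).
INTRON_START, INTRON_END = 63, 137
intron = gene_prefix[INTRON_START - 1:INTRON_END]
spliced = gene_prefix[:INTRON_START - 1] + gene_prefix[INTRON_END:]


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


peptide = translate(spliced)
print(len(intron), intron[:2], intron[-2:])
print(peptide)
```

With these assumptions the removed segment is 75 bases long, begins with GT and ends with AG, and the translated prefix agrees with the first 25 residues of the listed protein sequence.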