Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_23830 |
Symbol | |
ID | 7198959 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 317127 |
End bp | 319215 |
Gene Length | 2089 bp |
Protein Length | 497 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185010 |
Protein GI | 219129679 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.028524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCGAAGAGA CCTCACGACA GAGAGAAGAA GGCAAACAAA TCGACTTATG TTCGTTTTCT TTGCATCTTC GAGATTCATG CCGAGGAGTC GCTTTTCTAG CGCTTCACTG TCATGCAACA GGTAAACTCG ATTGATGGCT GCCTGAGTCT TCCTCGAATT CTCTTTTCCA GCATGAAATC TTTGGCCGGA AATCGATTTT ACCGGTCTAT TTGACTTTTT TTCAATGATA GCCCTGAGTG AATTTTAAAG ATCCGATACC GCAGAGAGCT CTTGGATACA ATGCGAACTG TTCCAGACAT TGCGATGATT GAAAAGTGGA AAAGTCGTCA CCTGAAGGGG AGCTTATAAA ACGACAGCCA CTCTTCAACG ATCTTCCCGA AAAGCTAGCT TTCCATTTTG AGACCGATAT CATCATGAGC GTTGACATGA GTCCTTTTCT TTGGATAGCC GTCCTCGGAG GTATCTGTGG CTTTGGTTAC GGGTTCTTTA TCGGCGCGAA CGACGTTGCA AACGCTTTTG CTTCTTCCGT GTCCTCCAAG TCTGTAACTT TGAAGCAAGC CGTCATCATC GCTAGTATCT TCGAGTTTTC GGGGGCCTTT TTTCTCGGAG CTTCTGTTAC GGGTACAATT CGATCCAAGA TTATCGACAT CAACCTCTAC ATTGACGAGC CTGAGCTTCT CATGTTCGGT ATGTTCACGT CCTTGCTCTC GGCCGTCATT ATGTTGGCCA TCGCGACACG ATTCGGTCTC CCCGTGTCCA CTACGCACGA CATTGTTGGC TGCATCATGG GCTTTTCAAT TGCCGCCAAA GGCTTCGACT CGGTAGACTG GGATGTGGCT CGCAAGCTGT TCATGTCGTG GGTCGCATCG CCTTTGATCT CGGGTTGTGT TGCTGCCTTT CTGTTTGGAA TGGTCAAATA TTTTGTAATG AAGACGGACA ATCCCTACCA GCGAGCCTAC TACACCTTCC CCATTGTTCT TACGATTGGA CTCGGTATTG ATATCTTCTA CATTCTGTAC AAGGCGAGTT CCAATTTTTC GGGATTTTCT GACAAACTCG AGCTGTACTG GGTTCTTCCT ACATCCTTTG GTATTGGTCT CTTTGCTGGT ATCTTGTGGA TCTTCGTATT TGGTCCTTGC GCCAAGAAGC GTGTTGAACA TTTGAAGATC ACGAGGGACG AGGCTGAGCA CGAAGCTACT ACTTGCGATC CCGAATCAAC TCCGGGATCT ATAGCTGAGG ATCGCAAAGC GTCAAACAGC CGCGAGGAGA TTCACGATCT CGATGACAGC GGGCATCCCA TGAAGGAGTC GACCGACGAA CCAAGTTCAG AAGAAAAATC GCCCTCGACC ATCGTCGATA ACCTCAAGCT CTTGTCTAAA AAATTTGGAG ATTCTACTTA CAACCAAGAT CTCCATGCTC AGTCTATGCA TGAGAATCCC CGCACAGCCG AGATTTGGGA GCAGGGTGAA GTCTACGACC CGGATGCAGA AATGCTCTTT TCATACGTCC AGGTCTTCAC TGCTTGCCTA AACTCTTTTG CCCACGGTGC CAATGACGTG TCCAACACGA TTGCTCCTTT GAGTGCCATT ATCCAGCTCT ACCAGGATGG TGTTGTTGAG AAGAAATCGG AAGTTCAAAA GTGGGTTCTC GCTTATGGAG GCATTGCCAT TGTTCTCGGT CTCGCTCTCT ACGGATACCG TGTAATGAAG TCGGTCGGCT ACAAACTGAC TCGCTTGTCC CCTTCGCGCG GCGCTTCAGC CGAGCTTGCC GCTTCGTTGA CAGTTGTGAC CGCTTCTTTT CTCTCGATTC CAGTGTCGTC GACCCAGTGC ATCGTTGGCG CCGTCTCGGG GGTTGGTTTG ATTGGCGGCT GGAAGAATGT CCAGTGGCTC TTCCTTGCCC GTGTTTGTGT TGGATGGGTC GTGCTGTTCT TTGTCGCGGT GTTGCTCTCG GCTGGGGTCT TTTCCTTTGG AGCTTTTTCC CCCTCTGCCT GAGACAGCTT GCGCATGCCC ATGCACTGGA TCCGTTTAAA CAATGGGTAG AATATCCCAA TTTGAGTAGC ACTTAAAATA ATTCCATGAA TATATATGC
|
Protein sequence | MSVDMSPFLW IAVLGGICGF GYGFFIGAND VANAFASSVS SKSVTLKQAV IIASIFEFSG AFFLGASVTG TIRSKIIDIN LYIDEPELLM FGMFTSLLSA VIMLAIATRF GLPVSTTHDI VGCIMGFSIA AKGFDSVDWD VARKLFMSWV ASPLISGCVA AFLFGMVKYF VMKTDNPYQR AYYTFPIVLT IGLGIDIFYI LYKASSNFSG FSDKLELYWV LPTSFGIGLF AGILWIFVFG PCAKKPEDRK ASNSREEIHD LDDSGHPMKE STDEPSSEEK SPSTIVDNLK LLSKKFGDST YNQDLHAQSM HENPRTAEIW EQGEVYDPDA EMLFSYVQVF TACLNSFAHG ANDVSNTIAP LSAIIQLYQD GVVEKKSEVQ KWVLAYGGIA IVLGLALYGY RVMKSVGYKL TRLSPSRGAS AELAASLTVV TASFLSIPVS STQCIVGAVS GVGLIGGWKN VQWLFLARVC VGWVVLFFVA VLLSAGVFSF GAFSPSA
|
| |