Gene Information |
Locus tag | PHATRDRAFT_49908 |
Symbol | |
ID | 7198530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 263682 |
End bp | 265922 |
Gene Length | 2241 bp |
Protein Length | 709 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184762 |
Protein GI | 219129156 |
COG category | |
COG ID | |
TIGRFAM ID | |
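The length fields above can be cross-checked against the coordinates with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding region ends in a single stop codon that is not counted in the protein length. Both are assumptions about how this record is laid out, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 263682
end_bp = 265922
protein_length_aa = 709

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced coding region is one codon per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Whatever remains would be sequence outside the coding region.
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, non_coding_bp)  # 2241 2130 111
```

Under these assumptions the computed gene length agrees with the listed 2241 bp, and the 111 bp difference suggests that the listed gene sequence extends beyond the coding region on both sides.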
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CAGCTAGTCA CGTTTCCAGT TCATCATCTC ACGACCAGTG TTGACCGTTT TTATTTTCTC TCAGCGAGGC CTTGACAGCA ATGTCTCTGA GTACGCTCCT ACTTCTCCAC GTATTCCCGG CGCTGCTCCC GCCGAAGGAC GGTGACGAAT GGGAATACTA CGCAGTCCTG GGGTTCGATG ATCCGCGTCG CGTTTCGATC GACGACATTC GTAAGGCTTA CCGCAAAAAG TCACTCGAGC TGCATCCCGA CAAAGTTGCG CAGCGGCGGC AGCAAAACGC TGCTGAAGCC GCGGCGGAAT ACGAAAAGGT GCAGGAAGCC TACGGGGTAC TCGGCGACGA GAAGCACCGG CAACAGTACG CGGTGTTGGG GAATTCGCCA GCACGCTACC GGTTTGTACA CACGGGTGGC CTCTACAACC CAGCTCGTTT GTACGAAAAT CTGGCCAAGG CCAGCACGAT TGACAAGACG CGGCTGGTTG TGGTGGTATC AATCGTCTTT ATGATTCTGT TGCTGCAGCC CATCCTTGTT GCTGCCAAAG TGAATCAAGT CATTGAACAA GAGGGTACTT TACAAAATAC CGACTGGGTT GTAATTCTGA TACCAACATG GATGTTGTAC GGGCTGCTGC TAATATTCTG GGTAGCAATG GCAATTCTTG CACCAAATAA AGCCAAACCT CAAGTTATTG CAACCTTGCT CACTAGCGTT TGTTTTCTCA GTGGATTGAT TATTTTGGCA AAGAAATGGG ACGAGCCGTT ATCGACCAAC AAACACTGGC ACCGAGCTTC GGTCCCGTTT TATTTGGCAA TCTTTTGTCG TGTTTTGAGC GCATACCTTG TGGTTGCGGC GGCACAGGAC GAATTTCAGA AAATGGTTTC TCTACAGCAT TTGGAAACCA CGGAAGCATG TTGTTTGGAG GACATGTCGG AGGAACGACG AGCTCAAGTC CTGAAGGACA ATCACGTCAT TACCGTAGAC GAGGAGGCTG TTGCAGCCGC TTTGGAAATA TTGGCGAAAG AAGGGGTTGA GTTCACGGAC GACGACGTAG AGGCTATACG CGTGACGACA TCTCCGGAAT TCGAAGCAAT GGACGCTATG ACCCAAGCTG TGGTAACACC GGCAGGGAAC ATGCTCGTTT TCGGCGTCAC TTTCATTGCA TTGGTGGCAG CGAAAGTGGA AGGCCAGATT GATGCTTCTT GGTGGCTAGT ATTTTTGCCA ATTTGGATTT TGTTGTGTCA AGACGTAGCT CGAGGACTCT TCAGTTGCTG TTGTGGCTCT GTAAGTGGTG AAGAAGTTGT TGTTATGGGA CGATCGGACG AGAACAGCGC CGAAAAGGAG AAAGGCGAAA ACGACACAGA CATCGAGGTG GGACATTTGC CGGTATCAAC CGGAATGGCG GCTATGCAAT CGTCAATGGA GTGGGCCACA GCCGAAGGTG ATATGAAGGA TTTCCATTCT CCAACCTCTG CCCCCGTTGA GCCATTAGAC ATTGGTGAAG CAGGCAACAA GGAAATTCCT AGCGACGAAG AAGCGAGTCA AACAGATCAA CTCGGATCAA AGAATGAAAC ATTTCTACGG CAGAATCCCC ACGGAGCACC GGCAAACAAC TTGACTGTAG AACACTCGGA GGAAGAGCAG GACACAGAAG GAAGCAATCT GGCACCCGCA GAGCAGGTGG ATAAAGATAC GAGCGAGAAC GCCGAGGAGC CAGAAATTGA ATTCGACGAG GACACATTCA GGGCTTGGCA ACAAGCGCAA ACTGAGGCTG ATCAAAGTGC AATGGATGCC CAGGCCAAAG 
CACAAGGTCA ATGCTGCGTG TCTCTCTTCC AGGTCATTGT TGTATGTTTG ATCGTCGGAA AGCTGGAAAA CGATTTCGAA AGCGATACTC AGGATCCATC AGACACAGGT TACAGCGCCT TCTGGATCTT GTTTCCGATA TTTTTGATTG CTGGGTTGAT GTTTTGCTGC TGCTCTTGCC TCATCTATAC TGCGGGAGCT ACTGGGTTGG ATGAGTTGGT GGACAAGGCG AAACGAAAAG ACGATGAGGA ACCTAACGAG GCCTCTGCTT CTAGCAATGT CGTTCCAACA CCCCCACCAT CTCACGCCCA CGAGCCCGTA AAGTCGGATG ACCTTGATAC GACACCCGTG AGCAAAGTGA ACGAACACAA TGCTGAAGCT TCTAACGAAA ATATGAACGA TTTAGACTAA TGGTAAATTC GGACAAGTTG TATTTGTGTA C
|
Protein sequence | MSLSTLLLLH VFPALLPPKD GDEWEYYAVL GFDDPRRVSI DDIRKAYRKK SLELHPDKVA QRRQQNAAEA AAEYEKVQEA YGVLGDEKHR QQYAVLGNSP ARYRFVHTGG LYNPARLYEN LAKASTIDKT RLVVVVSIVF MILLLQPILV AAKVNQVIEQ EGTLQNTDWV VILIPTWMLY GLLLIFWVAM AILAPNKAKP QVIATLLTSV CFLSGLIILA KKWDEPLSTN KHWHRASVPF YLAIFCRVLS AYLVVAAAQD EFQKMVSLQH LETTEACCLE DMSEERRAQV LKDNHVITVD EEAVAAALEI LAKEGVEFTD DDVEAIRVTT SPEFEAMDAM TQAVVTPAGN MLVFGVTFIA LVAAKVEGQI DASWWLVFLP IWILLCQDVA RGLFSCCCGS VSGEEVVVMG RSDENSAEKE KGENDTDIEV GHLPVSTGMA AMQSSMEWAT AEGDMKDFHS PTSAPVEPLD IGEAGNKEIP SDEEASQTDQ LGSKNETFLR QNPHGAPANN LTVEHSEEEQ DTEGSNLAPA EQVDKDTSEN AEEPEIEFDE DTFRAWQQAQ TEADQSAMDA QAKAQGQCCV SLFQVIVVCL IVGKLENDFE SDTQDPSDTG YSAFWILFPI FLIAGLMFCC CSCLIYTAGA TGLDELVDKA KRKDDEEPNE ASASSNVVPT PPPSHAHEPV KSDDLDTTPV SKVNEHNAEA SNENMNDLD
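The relationship between the two sequences above can be illustrated by translating the start of the gene sequence. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is blank. It also assumes the gene sequence is already shown in coding orientation even though the gene lies on the minus strand. The function name `translate_from_first_atg` and the constant names are hypothetical, and only the first 110 bases of the gene sequence are used.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate_from_first_atg(dna):
    """Return (0-based offset of the first ATG, translation up to a stop or the end)."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    start = seq.find("ATG")
    if start == -1:
        return -1, ""
    residues = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        residues.append(aa)
    return start, "".join(residues)

# First 110 bases of the gene sequence listed in this record.
gene_prefix = (
    "CAGCTAGTCA CGTTTCCAGT TCATCATCTC ACGACCAGTG TTGACCGTTT TTATTTTCTC "
    "TCAGCGAGGC CTTGACAGCA ATGTCTCTGA GTACGCTCCT ACTTCTCCAC"
)

offset, peptide = translate_from_first_atg(gene_prefix)
print(offset, peptide)  # 80 MSLSTLLLLH
```

The translated prefix matches the first ten residues of the listed protein sequence, with the first ATG found 80 bases into the gene sequence. Taking the first ATG as the start codon is a simplification that happens to work for this record and is not a general gene-finding rule.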