Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_23672 |
Symbol | |
ID | 7198792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 179007 |
End bp | 180232 |
Gene Length | 1226 bp |
Protein Length | 290 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184909 |
Protein GI | 219129466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0834708 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAAGTAGTA AAATCCGTTC CCTCGCCGTC GACGGCCCAG AGTTCAAACA AACATACACC GCTAGACTGA CACTGATTGA CAAGTGTGAC ACTGCACGCC CCATTTTGCC ATACCCTCTA GCGGCAGGGA ATCCGTCCTA CCATTTACCG GTACAACTAC AACTACAGTA CGTTAACTAT TTGTTGTCCA TCGTTGGCTC CGTCCCCAAA AGCGATCACT ACTGCTACTA CCACCGCAAC GGTAGGATTC ATTGAACGAG CCGTTACCAG TAGGGCTTTC GAAAAAACAT CACGTGCACA CACGCCCATT GTTGCTAGTC TGTTTTGATT TTCTACAGCT TCATCGACGC ATCATGCCAC TGTCGGACGC GTGGATTGAT TTCGTGGCGG GCTGGTGTTC GGGTGCCGCC GCCGTTTTGG TTTGCCAACC CGTCGATACG GTACTGACAC GGCTACAGGC CGGGACGGTC TCGTCCCTCG TCACCGGTAC CACCCGTGCT ACTACAACCA CCACCACCGC GCGTACCGCA ACCCTCGACT TGACGCAGTC TGCTGGATTC CAGGCACTCT GGCGCGGGGC CTCGCCCATG ATTACCGCCG TGCCTTTGCA GAACGCTCTG CTCATGGGCG GCTACGGCGT CGGACAAGCC TACTCCGCCG GAAACTTGGA TTCCGCGGAT CGCCTCGCCG CCATTTTCGT CGGCGGCTGT ACCGGTGGAA TTCTACAATC CTTCCTCATG AGTCCGGTGG AACTCATCAA AGTCTCACAG CAGGTCCGCG GTACCCCCTT GCGGGACGCC GGTGGCGCCG TTCTCACGCA CTGGGCCCAA CCCGCCGCCT GGCGGGGACT CTACGCCACA CTGCTCCGTG ACGGCATACC GCACGGCGTA TGGTTCGCCA CCTACGAAGT CGCCAAGGAC GGCCTCGAGG AAACGCTGGG GAAGGATTCC GTCAGTGTAC CTCTCGCCAG CGGTGCCCTA GCGGCGACCG CCGCCTGGGC CGTCGGGTAC CCCGCCGATC TCATCAAAAC AAGGATTCAA GCCGCCGGTC CCACCGAAAA CCACGGTATC GTCGAGACGG CGCGCGCCAT TATGAACGAA GGAAACGCAG GTGTGGCCGG GCTCTACCGA GGATTTGGCT TGAAACTGGT GCGGTCGATA CCCGCCTCCA TGATTGGCTT TACCGTTTAC GAATTCGTCA AGAAGCAAGT ACAAGACCGA TGGTAG
|
Protein sequence | MPLSDAWIDF VAGWCSGAAA VLVCQPVDTV LTRLQAGTVS SLVTGTTRAT TTTTTARTAT LDLTQSAGFQ ALWRGASPMI TAVPLQNALL MGGYGVGQAY SAGNLDSADR LAAIFVGGCT GGILQSFLMS PVELIKVSQQ VRGTPLRDAG GAVLTHWAQP AAWRGLYATL LRDGIPHGVW FATYEVAKDG LEETLGKDSV SVPLASGALA ATAAWAVGYP ADLIKTRIQA AGPTENHGIV ETARAIMNEG NAGVAGLYRG FGLKLVRSIP ASMIGFTVYE FVKKQVQDRW
|
| |