Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50479 |
Symbol | |
ID | 7199275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 172509 |
End bp | 174938 |
Gene Length | 2430 bp |
Protein Length | 698 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185439 |
Protein GI | 219130578 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGTCTTTCG TGCACGAAAA TTCGAACCCT TCTTGCGATC CAGCGTGGGA GAGAGTCAAC CTGCTGCCCC TTAACTTTCA ATCTCCTCAC TGCTCTAGTC AATTTCCCGG CACTCTACTT TCGCTTTTGA AAGTTCGAAA GTCTTTCGAC TGCTACATTG GCAATATCTA ATTCTCTTTG AAGTAGGAGA CCCCGTTTGT CTTCCTCGTG TTCTCATCGT ACCGTTGTGT CCGACGCAGA CTGCGATCTT CCGAGGAGCG ATCGCTTGGA TTCAAAGCGC GTTCCACAAA TCATGACCTC CACCACTGGT AATGTCGCTG TTGATCCCGC TACTGCGGAA GGCCTCGGCA TCCCGCTGAC GGACATCACG GACCCGGACG ACTCGGACGT GCTCTGCGGT CGCGGAGGTG CCGCGCTGCG ACATCCGGGA AATCAAACGT ACCGGCGTTT GGTCAATCTC AACAAGGGCC TTTACATTAC CTGCCTCAAA ACGGAAAAAT TGAAAATTTC CCGTTCAATT GTCGCGGCGA TTCGCGAGCA GAAGGGTCGA TTCTTGGAAA AAGACTCGAA AACGGGAAAT TGGTACGATA TTGGGGACAA GAAAGCTGTG GAAAAGACTT CACAAGCACT CCGCGAAGGA CAGCCCAAGC TGCGCCAGAA GATTGTGGAA ATGGGGGGTG GTGTAGCGGG TACCGCAGCC TTTATGGAGT CGCAATTGGG GGATTCGAGC AGCGCATTTT CCCACACAAA TAGGAACGAT ATACCCCCAC CTCCACCTTC GATGTTGGGG ACTCCGGATA TGTCGCATTC TCCTCATCTA GGCCACAACG GAAATGCTTT GAATTCGGTC GGCATATCGG CACATGCCGG AATGCCTGGA CTCTTGCCGC ATAATCCTCA CGACGCGGCT ACCGCTGCAG CGTTGCGGCG CCAACATCTG GATCTCCGTC AACAGCAACA AGACTTGGAG CAGCAGTTGC ATGCGGCTTC GATGGGAAAC GCGATGAACT TCAATCAACA ACAGCAGCAA CAACAGCAAT CGCGTTCCAA AGACCTTCAC CAGGATATGT TGCAGCGCCT TAGTTTGCGG GACGTCTCGT CCGACCCCAA CGCCTACGAT ATGCAGGAGC AAACCAATCG TCTTCGGCCA TCCCTGACAC AGCGGGGGCC GCAGATAGCA CAAGAATTGG GGATTCGAGA TTCCCAACTC TCCCTTTTAT CAGATTTCTC GGCGTATGGA TCCGGTCAAC AACTTTTGGT TAACATGTCG CTCGGGAGTA TGGACCCTGG ATCCTTTCGG TACCAACAGC ATCCACAGCA AATGCAGCAG TCACTGCAAA GTATGGATTC TGGCTCGTTT CGTCAGCAGC TGCAGTCTCT ACAGAGTATT GATTCCGGTT CCTTTCGACA GCAAATGCCG CGCCAACAGC AACACCACCA GCAACAGCAG CAACAACAAC AGATGCAGCA GCAACATTTT CAACAACAGT ATCAGCAGCA ACAGCATCAG GGATTTTCTG CACTGGACCA TGGTCAAAGC GACGAGTGCC ATCCCCGACC TATCCAACAG TCGTTGAACG ATAGAGGAAC TAGCTCCACC ACCGCGCGCG AAGACTACTC CTCAAGTAAT GATGGCAACA GAAAGACAAA TGATGGAGTT TCGAATCCAT CTGTGAACAG CGTGGTAACG GCGTCTTCGA ACAGTGATCC ACATTGTAAC GGCAACACAA ATTCAGGCTC CAGCACTAAC AGTAGCAAGC TTGCGGGCCT AGATCGTCGT CGTGTGTTCG CCAAGATGAA GTACACTCGA CCGCCATCTG AGATGAAAAT GAAACCGGAA GATTCGGCTC GTTCGATGCA AGACGGCATG TCAGACTTCC ACATGGTCGA GTCCACCATG AGCTTTCTCT CCAACATGTC CCAACTGTCA GCGGCGGACA AAGGTGGTAA TGGAGAGAAG ACGGCGTCGG CGGGCGCGGA CGGTGCTTCG TCCGCAGAGA TACTGGTGTC TGCCGTCCCG ACACCTGTCT TTCCAGCTGG TGTAGAAACA GCAAAGGTGA TTGATCACAG CAGTGACAAT CATGAACGCA TGAGTACATA CTCGGAAGCA GCGTCGGGGA GTCGTCGCTC GATCATGTCT GGTCTATCAC GGATTAGTGA CGCGGATATA TCCATATTCT CGGACCTTTC CCGAAAGATC GGCAACGTCT CAACACGATC CATCGCCATG AGCGATATTT CGGCCATCGA TATGCAAGAG CAAGACAACG AAGACGAAAG CACAACTTCG AACTTTGAAG GCGCTTCCAT TGACCCTATT GATCCTATAC GGTCGCCACA ACGGCTTTCC GGCGGGAATT ACTCGGAACC GTATGACTTT ACAATTTGAT AGCATTGTTT TTAATTGTTA ACTGCACCAA AGTTATGGCT
|
Protein sequence | MTSTTGNVAV DPATAEGLGI PLTDITDPDD SDVLCGRGGA ALRHPGNQTY RRLVNLNKGL YITCLKTEKL KISRSIVAAI REQKGRFLEK DSKTGNWYDI GDKKAVEKTS QALREGQPKL RQKIVEMGGG VAGTAAFMES QLGDSSSAFS HTNRNDIPPP PPSMLGTPDM SHSPHLGHNG NALNSVGISA HAGMPGLLPH NPHDAATAAA LRRQHLDLRQ QQQDLEQQLH AASMGNAMNF NQQQQQQQQS RSKDLHQDML QRLSLRDVSS DPNAYDMQEQ TNRLRPSLTQ RGPQIAQELG IRDSQLSLLS DFSAYGSGQQ LLVNMSLGSM DPGSFRYQQH PQQMQQSLQS MDSGSFRQQL QSLQSIDSGS FRQQMPRQQQ HHQQQQQQQQ MQQQHFQQQY QQQQHQGFSA LDHGQSDECH PRPIQQSLND RGTSSTTARE DYSSSNDGNR KTNDGVSNPS VNSVVTASSN SDPHCNGNTN SGSSTNSSKL AGLDRRRVFA KMKYTRPPSE MKMKPEDSAR SMQDGMSDFH MVESTMSFLS NMSQLSAADK GGNGEKTASA GADGASSAEI LVSAVPTPVF PAGVETAKVI DHSSDNHERM STYSEAASGS RRSIMSGLSR ISDADISIFS DLSRKIGNVS TRSIAMSDIS AIDMQEQDNE DESTTSNFEG ASIDPIDPIR SPQRLSGGNY SEPYDFTI
|
| |