Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47922 |
Symbol | |
ID | 7203118 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 434966 |
End bp | 436468 |
Gene Length | 1503 bp |
Protein Length | 394 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182226 |
Protein GI | 219123843 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTCGTTTC ACACACCGAG AGACATCACG AAGGCTGCGT GAGAACAAAG CGAGACGGAA GAAGAAAAGC GTTGACCTCG TCCAACAGTA AGGTCTTCGG GCGCTCTCTA CTGCAATTTT TTTCGCCAGA TAGATTCATC TAGATGAGTT ACTCAATAGC GCCTTCTCTA ATATTGCTTT TCGGCGTGTT GTCCGGCGGG CAAGCCTTTC AATTCAAAGC ATTTGCTCCT AGATCATTGA CACGAGTTCT GGCGGAACCG TCGACGACAG CTCCCGCTGA TTCTCAGACA TCTTCTTCGA ATAAGGGAAA GACACTGGGG CTTCTTACGT TTGACCTAGA CGATACGTTG TACCCTATCG ATCTCGTTCT GAATGAAGCC AACGCAGCGT TCGCTCGTGC CATGGAGAAT TTTGGATACA AAGGCATCCA GCCTAGCGAT ATCAATGAGA CGAGCAAAAC GATTCGCGAG GAAATTGCTG CTCGCGATCC TCAAGAAGCG GCGGCCTTGA CACACACTGA ACTCCGTAAA TTGGCCATCC GACGAGAGAT GGAGCAAATT ACCATCACTC GAAAATTGCA ATCGTGCGCG GATGATTGGG CAACGCCAGT AGCTGACTTG TCCCCAGTGG TCGTCAAACA CGCCAAAAAG TACGTGATCT GAGCTTTTGA AAGGTTGCTG ACTTCACTTG GTAATCTCTG ATAATATTGG ATCGACTGCT CCTTCTTTTC GTCTTTAGGT GGGCGACGGA AGCCGTTTCT CCGACAATCG TTCAGGCTGT TTTGAACGCT TGGGAAATGG AACGACACCA CGCCGCTGAA CGCCACCTAT ACCCCGAATG CGTGGAAGTG TTTGAGCGTA TTAAACAGGA CCACCCAGAC GTCATTATTG GAGCTGTCAC GGATGGCAAG GCCAATCCGC TTTTCATGAC CTTTACTTTA GCTCCGTACT TTGACTTTTG TATGAGTTGG GAAGACGATC AAGCCGGACG ACGAAAGTTC TTCAAGGAGC TCGGGTCCAT TGAAGGAAAC GCAGATCTAA AGTGGATTTA TGATGCCGCT CTTGAAAAGT ATCAGGAGTT GGCATCGGCA GCTGCTGCAT TACAAAAAGG AGATTCTGCC CAAGACCCAA ACAAGATCTG GATTCATGTA GGGGACGACT TGGCCTACGA TGTTGGCGGC TCGGCCCAGA GTGGAGCAAA AACAATTTTG GTGGAGCTTG ACGACGAGAA GTATCACCAA ACGGCACGTC ACCGATTCGA ATTGACGAAA CAACCAGACT GGTCCACCAC GTCCGACATT GAACTCGAGA AGCGCAAAGT CATGAACGAG GCTGCAACAA ATCAAATCGA TCGCAAGATT AAATTCCTTA CGAGGCTACC AGAAGCCATC AACGAAGTCC TCGAGGAGCA GAGCTAAAAG TGGGCTCATG GACAAGCTCT CGGTGGGTAA CGAGCAATTT CAAGTTAAAA ACATGTATAT TTAACATTGC ATGATTGCTT TTC
|
Protein sequence | MSYSIAPSLI LLFGVLSGGQ AFQFKAFAPR SLTRVLAEPS TTAPADSQTS SSNKGKTLGL LTFDLDDTLY PIDLVLNEAN AAFARAMENF GYKGIQPSDI NETSKTIREE IAARDPQEAA ALTHTELRKL AIRREMEQIT ITRKLQSCAD DWATPVADLS PVVVKHAKKW ATEAVSPTIV QAVLNAWEME RHHAAERHLY PECVEVFERI KQDHPDVIIG AVTDGKANPL FMTFTLAPYF DFCMSWEDDQ AGRRKFFKEL GSIEGNADLK WIYDAALEKY QELASAAAAL QKGDSAQDPN KIWIHVGDDL AYDVGGSAQS GAKTILVELD DEKYHQTARH RFELTKQPDW STTSDIELEK RKVMNEAATN QIDRKIKFLT RLPEAINEVL EEQS
|
| |