Gene Information |
Locus tag | PHATRDRAFT_47384 |
Symbol | |
ID | 7202531 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 482487 |
End bp | 484283 |
Gene Length | 1797 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181566 |
Protein GI | 219122468 |
COG category | |
COG ID | |
TIGRFAM ID | |
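The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive; the record does not state this, but the assumption is consistent with the listed Gene Length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of nucleotides needed to encode a protein of the given length."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values copied from the record above.
START_BP, END_BP = 482487, 484283
PROTEIN_AA = 491

length = gene_length(START_BP, END_BP)
cds = coding_length(PROTEIN_AA)
print(length)        # 1797, matching the Gene Length field
print(length - cds)  # 321 bp not accounted for by the coding sequence
```

The 321 bp remainder is consistent with the record marking the gene as spliced. It would correspond to non-coding sequence such as introns or untranslated regions, although the record itself does not break this down.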
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCAGGATCAT CCAACGCAAG CAGTTGTAGC AAGCTGTATT CTTCTATACG CACGCACGCA CATACATTCC AGCGAAACGA ATTTCTCACT CGGGAGTAGG ATCCACGTCC ACCGCACGTG TGTGAAAAAG TTCGTTCGCT CGATTGTCGA GCCATCGCCT CACACTGTCT CTTTGCTCTA CTAATTGTCT TATGAGTTCG GAAAAACCCG AAGAAAAGAA CCCCGAGGAA TCTCCTCCGA AAGTAGATTT GCCGATAGGC GAAGAGAAGG AGAAATTGGC AGAAAACTCG GATGTACTGG ATGGGGTTGC GACGGTGGAT CAAAAGGAGG AAGCTGCCGA AACCCGCATC AACTCTAAAA AGGACAGCGA TGAGCTTACT AAGAGATTGC AAAAGCTTGC TTTGGAACGT CAAAAAGCGA GCACGGGGGA TCGTTTGAAA GTCATTCAAT CTGACACGAG CTCGCACTTG TCCAGCGTGA AAACATTCGA CGAACTCAAT TTGCCGAAGC ATTTATTGGA CGCTGTTTAT GCCATGGGTT TTGATCGACC GTCGGCGATT CAAGAAGAAG CTCTACCGCG AATTTTGGCC GACCCCATGC GGAATTTGAT TGGACAAGCT CAGGCTGGTA GCGGCAAATC GGCGGCCTTC ACCTTGGGAA TGCTCTACCG GATCGTGGTT GATTCACCAG CGACGACGCA AGCTCTGTGT GTCACACCAA CACGGGAATT AGCCATTCAG ATTGTGGATA AAGCGGTCAA ACCGCTGGCG GCTAATATGA AAGGTCTCAA AATATGTCTA GCCATCGCCA ATACGTTCAT TGACCGCGGA AAGACTGTAG ATGCACACTT GGTCGTTGGA ACCCCGGGAA AGGTTTCTGA TTTCCTCAAG CGTAAAAATC TAAATCCCAG AACGATTAAA GTCTTCGTCT TGGACGAAGC GGACCATATG GTGGAAGAAG GTGGTCATAG AGCGAATTCG CTCGTTATCA GAAAGGTCAT GCCGCCAACC TGTCAGTCGC TCTTCTTTTC GGCTACTTTT CCTCCCGAGG TCGTTCAGTT TGCGGAGAAG ATGGTTGAGA AACCCGACAA GATTCTCATC GAAGATGGAC CCGAATTTTT AGTAAGTCAA TTCACAAGCC ACTTTTCCTT CGTTCGTTCA TGGTGTCGCC GAGGATGCTC ACTCTTTGTT TCAAATCTTT ACTTAAGGTC GTGGACAATA TTCGACAACT CTGGGTTGAC ACACGAAACT ACGAGGGCGG AAAAATCGAG TTTTTGGCTG ACATTTATTC TCTCATGAGC ATCGGTCAAA GTATCGTCTT TGTTGGTACT GTTGTTCAAG CTGACAAAGT GTACAACACG CTGACGAGCT CTGGGTACAC CTGCTCCGTG CTGCATAGTA AAGTTGGCCC CGAAAACCGG GACACAACTA TGGAAGCCTT TCGCAACGGC GAAAGCAACG TCTTAATTAC GACGAACGTT TTGGCACGAG GTGTGGACGT TGATAATGTT GGCCTCGTCA TAAACTATGA TGTGCCAATA GACAAGGATG GCAATCCTGA TCATGAAACG TACCTCCATC GCATTGGTCG CACCGGGAGA TTCGGACGGA AGGGAACAGC AATCAATCTG ATTTCGGACG AAAAGTCCAT TGGGATATTG GCTGCCATTG AAAAGTTTTA TTCGCCCGCC AAAGAAATGA TCAAACAAGT AGAGGCTGAT CCCGAAACAC TAGCGGACCA CATCCAAATC TAACATTAGC GAGACAGGGA TCAAAAATAG GAAACAATTT GCTCCTT
|
Protein sequence | MSSEKPEEKN PEESPPKVDL PIGEEKEKLA ENSDVLDGVA TVDQKEEAAE TRINSKKDSD ELTKRLQKLA LERQKASTGD RLKVIQSDTS SHLSSVKTFD ELNLPKHLLD AVYAMGFDRP SAIQEEALPR ILADPMRNLI GQAQAGSGKS AAFTLGMLYR IVVDSPATTQ ALCVTPTREL AIQIVDKAVK PLAANMKGLK ICLAIANTFI DRGKTVDAHL VVGTPGKVSD FLKRKNLNPR TIKVFVLDEA DHMVEEGGHR ANSLVIRKVM PPTCQSLFFS ATFPPEVVQF AEKMVEKPDK ILIEDGPEFL VVDNIRQLWV DTRNYEGGKI EFLADIYSLM SIGQSIVFVG TVVQADKVYN TLTSSGYTCS VLHSKVGPEN RDTTMEAFRN GESNVLITTN VLARGVDVDN VGLVINYDVP IDKDGNPDHE TYLHRIGRTG RFGRKGTAIN LISDEKSIGI LAAIEKFYSP AKEMIKQVEA DPETLADHIQ I
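The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting and compute a GC percentage that could be compared with the GC content field. The helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence above, as a short demonstration.
snippet = clean_sequence("TCAGGATCAT CCAACGCAAG")
print(len(snippet))         # 20
print(gc_percent(snippet))  # 50.0
```

Applying the same functions to the full gene sequence would give a value to compare against the listed GC content of 48%. How the database rounds that figure is not stated in the record.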