Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40362 |
Symbol | |
ID | 7198279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 211951 |
End bp | 213636 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184322 |
Protein GI | 219128233 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0281204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACTT ATTTCGTTGC CTTTGCGTTC ATCATCTGGA CTTTGGCGTT ATGGCGACAC GTCTCGCGTC GGGCGGCATT CATCAGCGAC CCAGACCGGC GTCTCCCGTC CCCTCTCTGG CACGTCCCCG TTGCTTCGTC CCACGGGGAC CACAACGACC CAATGACTAC ACCCACCAAC GTCAGTCGTA CGGTCGAGTC GTTGCTCTCG TTGCGCGTCG GAGACTTGCC TGGCTACACC GGATGGGCCC GGCCCGTCGG AACACTGTCA CCGTACTTTC GACAAGTATC CGACGCCCAC GGCCGTCCCA CTCGAACCAT CGTTACGGTG GGACGCGTCT GGACCGTACG AGTGGCGTGT ACGGGACACG TACGGTGTTT CCGTCCCCAA CGGGCCATTC TGGACGTACG TGCGTACGGG CCCAGTGTCG TGGCCGGAAT GGTCGTGCCG TCCCGTGTCC GCCGGGCCTA CGACCCATCC AACACCACCG CCAGTCACGA CACGATTCCG GGATATTACG ATGTGCAAAT TGTCTTTCCC GATGCCGGAG TCTACCACGT CGAAGTAGTC CTGGCCTTTT CCAACGCACC GGCGTGGAAC GAATTTCCAC TGGCTGGACC CGAACCCGAC TACGAAGGCT ACCTTCTACC AGATTTCCCC TTAACCGTGG TTGTCGATCC CGTAGAACTA GTCAGCGAAG ATCCTACGAC GCACGCGAGT CAAGACAATC GACCAGTATG TACAAACGCA GACTTGCTCG AAACGTCGCC AACCAGTGCA ATAATCAAAG GTCGTTGGAG AGTGTCCGAC AAGGTTCAGG ACCGACGCTT GGTCGAAGAT GCGTACGAGA TTGATCACAG TCCATCCAAC ATTAGCCGAA CAGGATATGA GCAGGGCGTC AATTCGCTGG GTATTACTAT GGCCTTTGAA TACCAAAAGT GTAAACTCGC GACTGTTCAC CAAAGTAAGC GGATCTTGAT GGTGCCGAAA ACGAAGGATT GGTACATTCT GTTCGTTGGT GACTCCAACA TGCGGGCACA GCATTTGACC TTTCAATCTT GGCACGGTCG CGATCGGAAT CAATATCCCC AAAGTGGCTA CATTTCGACT GCCAAGGGTC TCGCAAAGCA GCTACCCCAA ATTAGAGAAA AACTGAGTGC TTTGAAGAAG CACGCCACTA ATCGGACCGA GTTTTACGTC CTGTTCAATG CCGGCTTGCA CGACATTGCG CGCCTCTGTA GTCGAAAGTG GTCCCATGAG AGATTTAACA ACGGTGACAC CCGACCTTGC GTGGAACAGT ATCGGCAGCA TCTAGGCGAG CTGATCAATG GCATCAAGGA CCTCTCACCC AAGCTTGCCG TTCTGCAAAC AACCATTGCT GGTTGGCCCA AATGGGGCAA TTTTGGTTTT GCGTGGCCGC CATCTCAAGG ACAACCATTG CCCTTCCACT CGTCCACATG CCAATCCTTT AATGAGATTG CCTGGGAAGA GGCAACGAAA GCGGATATCT CTGTGATGGA TGCGTATTGG TTGACCGTGC CTCGGCCAGA TCATCGACAA GTCGATAACG AAAATGCGAT TGGAAAGCAC ATGGTACACG TTGGACCAGA GATACACAGC GTTATGCAGC GCAAGTGGAT TTCCTTGATT GAGATTGCCT TGGGAAACGA CAACCCAGTC ATGTGA
|
Protein sequence | MSTYFVAFAF IIWTLALWRH VSRRAAFISD PDRRLPSPLW HVPVASSHGD HNDPMTTPTN VSRTVESLLS LRVGDLPGYT GWARPVGTLS PYFRQVSDAH GRPTRTIVTV GRVWTVRVAC TGHVRCFRPQ RAILDVRAYG PSVVAGMVVP SRVRRAYDPS NTTASHDTIP GYYDVQIVFP DAGVYHVEVV LAFSNAPAWN EFPLAGPEPD YEGYLLPDFP LTVVVDPVEL VSEDPTTHAS QDNRPVCTNA DLLETSPTSA IIKGRWRVSD KVQDRRLVED AYEIDHSPSN ISRTGYEQGV NSLGITMAFE YQKCKLATVH QSKRILMVPK TKDWYILFVG DSNMRAQHLT FQSWHGRDRN QYPQSGYIST AKGLAKQLPQ IREKLSALKK HATNRTEFYV LFNAGLHDIA RLCSRKWSHE RFNNGDTRPC VEQYRQHLGE LINGIKDLSP KLAVLQTTIA GWPKWGNFGF AWPPSQGQPL PFHSSTCQSF NEIAWEEATK ADISVMDAYW LTVPRPDHRQ VDNENAIGKH MVHVGPEIHS VMQRKWISLI EIALGNDNPV M
|
| |