Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42478 |
Symbol | |
ID | 7196668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 176450 |
End bp | 178993 |
Gene Length | 2544 bp |
Protein Length | 809 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176536 |
Protein GI | 219109563 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.300834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTTTTGGAG ACCAGAACAC TTTCTGTCAG TATCGACACC GTAATCTCCC TCGAGCGTGT CCACTGGAGC AGTACATGGG GCGACAATAG ATTGTGACTT GCTGTTCGTA GGAAATGTAC AGCAGTAGGG GTTCTACGGC GCCAATCGCC CCAGTCCAGC CCAAAATTCT CGGTGAAATT CTGCTACCCG ATTCAAAAGA CTACAAATTG TACTGCTGGA AAGTACGGAA TGGGTCTTCC GTACGAAAAG GAGAGACAGT GGCACTCGCA ATCCACAGAG ACGGCAGCAA CGGCTTGGGC TCGTCTGAAG CGGGCGCAGT ATCATCGTCT TTGCCAACAT CCCAAACCCC ATCTACTGCT ACACAGCCAG CCTCCTCCAA CAGATCGCAA TATAAGCGAC CGACGAAACG AAGAAAACAC GTTGTACAAT CACCTGATGG CGATGCCGAA ACCATAATGG ACGAGAAGCC TGCAGCGGAT ACAAAAAAGG CTTCGAATGC CACCGCCAAC AACGTTGCTC CTGCCACACC GTTGACGCCT GCCCCTACTT CCGACTCAAT TTCGGCGTCC GAGAAGATAC CGATTCTGGC GACAGCGGAT GGTTTGGTTC GCATGGGAAG ATTAGAGGGA ATTCATCAGC AACATGATGT TCTTATCGGT TTCATCGAGG AGTGTACCCA TCCAACCGTT ATAGACGGTC TTTGTGCCGT GTGTGGCAAG TCTGTAGACA AAACAAAGAC ACCGGGCGAA GCATCGCACT CGGAGCAATC GCCTCAGTCA GACATGTCAC GGGTCACCGT TTCGGGACAT ACCGTCACTG TGTCGCGCGC GGAAGGTCAA CGAATGGCAC AACAAGATGC GGAGCGGCTA CAAAAGCGAA AGAAGCTGTC GCTCGTTCTC GATCTCGATC ATACTCTGGT ACACGCTACA AACGATACTC GTGCCCAACA ATTTTGCAAA TCTCGGGACG ATGTGCGAAC TCTTATTCTA CCAATGTTAC GTCCCAATGG GGAACCGCGG CAGCCGCAAC ATCCCGAATG GACGCAACAT TTTGTCAAGA TGAGACCGCA CGTGGAAGTA TTCTTGAACG AAGCCCAGGA CCAATACGAA ATTGGTGTAT ATACTGCCGG AACAAGAGAC TACGCCGAAC AAATTTGCAT TTTGCTGTGC CGGCACTTAC TGGGGGTTTC GCGGGACCAG CCAGAAATGG ATATGTTGCG ATACCGGATT GTTAAAATGG AACAGGCACT TTTCCAGTCG ACATCATGCG CGGACCCAGA GGAGCACGAA CCTACTCGCG CATCCGAATC TCTTGAAGCA CCACTACCAA GTGTGGATCG TTCGCACGAG AATGGTGGGA CGAATACGCT ATCCGACGAG GCAAAACACG GCGTGGACCC AGAGGAGTAC GACCGTGCAC AGGCATCCGA GCCCCTTGAA GCACCATTAT CAAGTGTGGA TCTTTCGCAC AATAATGGTT TGACGAATAC GACGTTACCC GACGAGGCAG AACACGCTGG ATCTAAACGC AAACGAGTAA CGTTTGGGGC TAGTCCAGAT GAAGCCAAAT CAGATGGCCC ATCGGCAATG AATCTTGAAA AGCTCAAGAC AGAGCTACGA GAAGCGCAAG CCCTGGAGGA CAAGGTACTG GAACTGCGGC AGCGTTTGTT TGGGAGCCGA ATAGTGTCTC GAACCGACGT TAGAGACTTG GGTCAGAATG TAAAAAGTTT GAAACGAATA TTTCCTTGCG GTGGAATCAT GGCGGTGGTA ATGGATGATC GGGAAGATGT ATGGGCCAAT GCAGCAGACA TTTTGACCGT TCGAAAGGGC GAACCACCCG ACAATCTTTT GCTTGTACGA CCTTATCATT GGAGTTCCTT TTTGGGGTTT GCGGACGTGA ACAATGCTTC CGGTGCTGAC CTGTCCGGAG AAAGTGAGGC GGGCGACGTC GAGACGGACG AGCAATTGTT GTGGTCACTC GATATTTTGC AGCGGGTTCA CCGTCGCTTC TACGAATCGG ATGGCAGCTT CCTTGGTGCT CTTACCCAGA CGGTCCCCGA TATCGTCAAG CAATTGCGAG CGGAAACACT GCATGGCGCG CATCTGGTAT TTTCGGGAAT GGTTCCGTTA CACCGACAGC AACAGCAACT AGAGTCCGGC GACAAAGTAG TTCCCCGACC GACCGTGATT CGCTATGCCG AAACGCTCGG GGCCAAGGTA TGGTCCAAAG TTACACCTGT GCTCACCCAC GTCGTGGCGG CGAAAGATGG AACTGACAAA ATTTTGGCAG CGCGAAAGCT TCCGGGATGC AGGATTGTCA AGCCAGGGTG GTTGATGGAG TGCGTGTGGA GTTTGACGAG GCGAGACGAA GGCCGGTATT TGCTGGGTGA TGCATCTCCG CGATTCTCGG AACTCCGCAC ACCACTGTCC GAGTATTCGA CCGCTAAGGA AAATTCGTCT AGTGAACTTG ACGATGATTC GGAAGATGAC GACTTGGCCG CCCAATTTGA AAGCGAATTA ATGGAAGAAG AGGAATACGT ATAA
|
Protein sequence | MYSSRGSTAP IAPVQPKILG EILLPDSKDY KLYCWKVRNG SSVRKGETVA LAIHRDGSNG LGSSEAGAVS SSLPTSQTPS TATQPASSNR SQYKRPTKRR KHVVQSPDGD AETIMDEKPA ADTKKASNAT ANNVAPATPL TPAPTSDSIS ASEKIPILAT ADGLVRMGRL EGIHQQHDVL IGFIEECTHP TVIDGLCAVC GKSVDKTKTP GEASHSEQSP QSDMSRVTVS GHTVTVSRAE GQRMAQQDAE RLQKRKKLSL VLDLDHTLVH ATNDTRAQQF CKSRDDVRTL ILPMLRPNGE PRQPQHPEWT QHFVKMRPHV EVFLNEAQDQ YEIGVYTAGT RDYAEQICIL LCRHLLGVSR DQPEMDMLRY RIVKMEQALF QSTSCADPEE HEPTRASESL EAPLPSVDRS HENGGTNTLS DEAKHGVDPE EYDRAQASEP LEAPLSSVDL SHNNGLTNTT LPDEAEHAGS KRKRVTFGAS PDEAKSDGPS AMNLEKLKTE LREAQALEDK VLELRQRLFG SRIVSRTDVR DLGQNVKSLK RIFPCGGIMA VVMDDREDVW ANAADILTVR KGEPPDNLLL VRPYHWSSFL GFADVNNASG ADLSGESEAG DVETDEQLLW SLDILQRVHR RFYESDGSFL GALTQTVPDI VKQLRAETLH GAHLVFSGMV PLHRQQQQLE SGDKVVPRPT VIRYAETLGA KVWSKVTPVL THVVAAKDGT DKILAARKLP GCRIVKPGWL MECVWSLTRR DEGRYLLGDA SPRFSELRTP LSEYSTAKEN SSSELDDDSE DDDLAAQFES ELMEEEEYV
|
| |