Gene Information |
Locus tag | PHATRDRAFT_45711 |
Symbol | |
ID | 7200483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 1015611 |
End bp | 1018021 |
Gene Length | 2411 bp |
Protein Length | 775 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179952 |
Protein GI | 219118351 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
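The Gene Length above is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic follows, with a simple GC-percentage helper. The function names `gene_length` and `gc_percent` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Start bp and End bp taken from the record above.
print(gene_length(1015611, 1018021))  # 2411
```

Passing the full Gene sequence field to `gc_percent` should give a value that can be compared with the GC content field, although the exact rounding used by the source database is not known.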
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00177654 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCCACCACTT CGGGAGATTT TTTTTTGCCG GCCGAAATCC ACCGCTTTAC AGAGCTAGTG CAGGGCGACC TCCCACTACA AGTATGAATT GCGTTCATGA ACTTAAGTTT TCGTGGCGCG TATTGCGGCT GCGTCGCCGA CAACCACTAT CCACAGCAGT GATGGCTACC GCAAGGGATC GCTGCCGAAT GCTTTCGGCG ATTTCAGATT GTCCATTTTC TTTACAACCT TGTCGAAAAC TCACGACCAC TGCTGCGAAA GTTGTCACTG AGAGCAAACG CGAGAGAAAA AATTATCTCC CTGCAGGACC AGAGGACTCA ACAATTAAGT TTGCTATGTC GCTGCCTAGC GGAACGATGA ATAACGATAT AGCTGGGTTT TGTCGAAAAG CAATCAATAT TCTTACTAAG AGAGGTACAC AACAATGCAT GATTATCGCA GAACAGATTT TGGTAAAGTT AATCGAGGAA AGGAAAGAAG TCAAAGTCAC GTTGTTCAAT CGTGTAATCC GCGGTTGGGC TAGAACCAAC ACAGACGATG CACTTTTCAA AGCAAGGTCT CTACTTGAGC GAATGTGGGA GCTGCAAGAA ACGCAGCCAA ATTTGTACCC CCCACCGAAT AACGGCACCT ACAGCAGCGT GCTGTATGTT TGTGGTCTGA GCTCTCACCC CGATACTCGA ACAGTAGCAC AACGACTCGT GCAAGAGCTG GAGGAAAGAC TACAGCCTTC TTTTCCTAGT TCCGCTATAT ATAGTCAACT CATTCGAGTG CACTCGAATC AAGCCAGCAC ACAGTATGGG GCTGCCATGG CGGCAGAAGA TACGCTTATC CACCTTTCAA GTCTGTCCAC CAAAGGCCGA CCGCATCCGA CGACAACGTC ATTTAACCGA GTTCTCAAGG CCTGGAGCAC GAGTCAAGAG GGAAGAGGAG CCGAACGGGC TCTTGAAATT TTGGAAATGA TGATCCAGCT GGAAGCTAAT CATGGCGTAG CGCACCCGGA CAAATACAGC TTTGGCACTG TGGCAGCAGC CTTTGCACGA CGAGGGCAAC CCCAGAGAGC GGAAGCAATT TTTAGAAAAG CCGTCGCACA CTTTCGAGCA AAAAACATCT CTCGTGCTGA AGATCGCGTA GATTTAACTT CTTGCCTCAA CGCCGCACTA TCGGCGTGGG CAAAAAGTGG TTGTGACGAG GCAGCAACAA TAGCGGAAAC GCTACTTCGC GACGGATACT CTTTGACTGA TATGAGCAAC ACCGCCCAGG CAGCGATCGC GTCGGGAGAC TCATTTTCGG TTGTTATTCG ACCGGACTTA GCAACACACA TGTCCTACAT GGAAGCTCAA GTCCGTTGTG GAAGAATGCT ACAAGCGCAT GCTCATGTTG TCGCTATGGT CGAAGCAGCT ATTGGCCAAG CTGGGCCGCC TCCAACAACA GACACATTCA ATTTTGTTAT TCATGGCTGG CTCCGCCTGT CTCAAGGGAG CGGGGGGAAA TATGTCTTAT CCCTTTTGAA TTCGATGATA AATCTCTCGG AGCAACACGG ATTTCCATGT GCTCCAAATT CGGGCACCTT CAACATGTGC ATTGGCTTGC TTTGTAAGGA AAACTCGCTC AAAGAGGCTT TCGACGTCTT GATAGATGGT GAGAACCGGC TTACTGTAGA CATGTTTCCA TACCAGGTTC TCATCAATGC TTTGGTAAAA ACGAAAAAGT ACGAAGACAC TGTGACCGCT TCTACCTTAT TACATCGTAT TGAGGCGGGG GTTAAGAAAG GAACTTTTCA ATGGGATCCG AAATCTGTTG 
GGTTGTACAC CGCTGTAATT TCAGCAATGG CGAACTTCAG ATCCAAGGAG TCCGCTGATT TAGCATTGCA GACACTTGGA CACCAGAAGA ATTTGGGTGT TCAACCGACA AGTCGAGCGT ATACTGCTGT TATTTTTGCT TTCGCTTCCC TTCGGAACGT AAAATCTGGC AGGGTTGCCT TCGATTTGTT TCGAGAAATG CAAGTTTTAG ACTCCGATCC CTCAAACAAG CTGAAGCTCG ATAGACTTGT CTTTGCAGCA ATACTGACGT CTCTCAGAAA TGCGAGAACC AAGGAGTCTG CGGAAAATGC TTGCGTGGTA TTGTCCTACA TGTTAGATTT GCACATTCAT GGTCGTCCCG ATATAGAGCC GGATGAAAGG TGCTACGATG CCTGTCTCAA TGCCCTTCTC AATAGTCGTG ATGCACATTG CGTTGAGCGA GCTGCTCGTC TTATTAAGAC AATTGTTACG AGACACCACG ACGGTGCCCT GTCGCATCTG CCATCAGCTG CAATAATCAA AAGTGTGTCA AAAGCTTGTT CGCGCTTGCA GACAGCTACA ATGCATGGGT TTGCCTCAGA ATTGTCTGCC TTGATAAAAC AGAAAGAGTA A
|
Protein sequence | MNCVHELKFS WRVLRLRRRQ PLSTAVMATA RDRCRMLSAI SDCPFSLQPC RKLTTTAAKV VTESKRERKN YLPAGPEDST IKFAMSLPSG TMNNDIAGFC RKAINILTKR GTQQCMIIAE QILVKLIEER KEVKVTLFNR VIRGWARTNT DDALFKARSL LERMWELQET QPNLYPPPNN GTYSSVLYVC GLSSHPDTRT VAQRLVQELE ERLQPSFPSS AIYSQLIRVH SNQASTQYGA AMAAEDTLIH LSSLSTKGRP HPTTTSFNRV LKAWSTSQEG RGAERALEIL EMMIQLEANH GVAHPDKYSF GTVAAAFARR GQPQRAEAIF RKAVAHFRAK NISRAEDRVD LTSCLNAALS AWAKSGCDEA ATIAETLLRD GYSLTDMSNT AQAAIASGDS FSVVIRPDLA THMSYMEAQV RCGRMLQAHA HVVAMVEAAI GQAGPPPTTD TFNFVIHGWL RLSQGSGGKY VLSLLNSMIN LSEQHGFPCA PNSGTFNMCI GLLCKENSLK EAFDVLIDGE NRLTVDMFPY QVLINALVKT KKYEDTVTAS TLLHRIEAGV KKGTFQWDPK SVGLYTAVIS AMANFRSKES ADLALQTLGH QKNLGVQPTS RAYTAVIFAF ASLRNVKSGR VAFDLFREMQ VLDSDPSNKL KLDRLVFAAI LTSLRNARTK ESAENACVVL SYMLDLHIHG RPDIEPDERC YDACLNALLN SRDAHCVERA ARLIKTIVTR HHDGALSHLP SAAIIKSVSK ACSRLQTATM HGFASELSAL IKQKE
|
| |