Gene Information |
Locus tag | PHATRDRAFT_41259 |
Symbol | |
ID | 7199068 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 320116 |
End bp | 323333 |
Gene Length | 3218 bp |
Protein Length | 915 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185258 |
Protein GI | 219130198 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.68146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCTG GGAAGCCTGC ACTGAAATTG GACCGACCAC CAGCACTTCA AAGCAAAGCA ACTTTTTACT TATTTTCTCG GCGTACTGGC AATTTTAGCT TCGACCAACT TACCAATTCA TCGCAGGCCA AGCGAAACGG ACGGCTCCAG AGTAACAAGC CATCGACCCT GCGAGGATTC CCTGACAGGT TTGACCTTAC TTTTGGCACT CTCTGCCTAT ACCGTCAATC TAGTCGAAAC TTGACGGTCC TCTCGAATGT AATAGCGATA GTTGCCGAGA GGAACTCTTG GGCAAAATCA AAATCTGTTG ATTTTTCGCG TCAGTCGGCT GGTTACCGTT CCCCGTCTGA TCTCGACCGA AGCTCCACGC TCGATACGTC AAAAACCGGT ACGAACCACA AAGTATCGTT AGTTTCGTTA ACTACCAAAC AACGTCACGA ATAACCATGA TCGGCAAGCT TTACTGGTAC TTCCTCTCTG TCTGTCTGTA TCATTATCAT TCACAGTCAA AAAGTTGACA GTGAATCTGA TTCTCAGCAT ACGAAACTGC TGTGCAGCCA TTATATCCTT TCCATGTCTA TTTTGCAAAT TCGTTCTGTA ATACCAGTGA CGTTATCGTT GTTCTCTTCT TTACGGAACG ATCTGAATTT GCGATAGAGT GTTGTTTTAC GGAACGATCT GAATTTGCGA TAGAGTGTTG TCGACAGTTG TGATGAGGAA GAAGAAAACC AATGATAAGG CAATCTTGGG TGACTGCGAG GCCAATACGG AAGAAATAAA TCAGCTTTTC GTTGGACTGT ACCGTGTTTG CGGCTTTTCC AATGGTGAAT CTCTGGAATC TCGTGAAAGC TCACGAAAGC AATTCGTCGA GTCTATGCAG GTCTTGTGTT CGGCGCTTGG ACAACTCCAT GACGAGCGAA AATCGGGTGA ACTCGGAATA CAGAGCATCT TGCCCGCTCT CGAGCAAGCC GCGTCTCGAG TACGGCTTGT CGCCCACGAT TTCATGGCTT ACTTGACACG AAGAGTTTGC GAGTTGGATT TTCCAATGAA AAAAATCCTG TCATCGCTTC GTGACAAGCC GGCTTTTACT ACTTTGCTTA TCCTGTCAAG GTCCTGGAGA ACGTGTCTCA ACCATCAAGT GCAACAGGAA AAGACTGTGG CATTGAAAGC GGTCTCATTA CTTGATCAAT TGTATTCGTC TTCCAGACTG AAGCATCCTA CAAACGACTT GCTAGCGTCC TTACAAAATT TAGTTAACAT TTTGACACGC ATGGACTTCA TCGCGCTTCC ACACGCTAGG ATTGGGCTTC AGTTGCTGGG TTCTCCCAAT AACATACAAA GAGTTCACTC GTCAGATTTA TCGCTGCTTG CTGTGGACTA CGACATGTCC CGGTCGACAG AGTATAACAA GGTGCCGGCG CATTGCACCG TCGACGTACC CATAACATGG CAGGGGATCG AAAATTTTAT AGAAAGGGAC GCTGGAAATT TCCCTCCTTT GACAAGCTTT CTACTGGTGG GTCCGGAAGG GAGCGGAAAA ACCCATATCT GCGATGTGTT AGAAAAGAGC TGCGTACGCT CGTCAATAGC AGGTAAGTAA GGATGGTTAA AATGGAACGA ATACCAATCT ATTTCTGCGC TAACATTTCG GTCAACGATC CGTAGTCCTC CGTCCTCGGC TTCCTCTGGA CATATTAGGT CAATCAGTTG GCGAGATGGA AGATGTCCTC GTCGCTCTCG TTGATTCAGC AAAAAGCGGG AGACAATCAT GTTTTGCGCT CATTCTGGAT GACGCTGATT 
TTCTCATTGC CACTGGCGAA AGTGGGATTG GCGAGGGCGG AGAGCGATTC TCAGGGCGTC ATCATATACA GTCGAGATCC CAATCTACAT TTTTCGCTCT GTTAGATAGT TTTCGCAGTG ATGCAATATC TTGTAGCCGA CTAATCCTGA TCTGCACGTC CAAAATGGAT CAGGATTGGA CCGCGGGCCG ATTTGACCGC AAATACCACA TCTTGCCACC AAATGAACAC GAAAGGAGAC TCTTTATCTG CTCAAATCTT GGTCTTCATA CACCAGTGAA ATGTAGCTTA ACGATACTTT TGGAGGATAT GGTAGAAGGG ACAGTAGGAC GAACATACTC GGAAATTGCA CTCTACTGCA GGCAAGCTGC GATTGATCAT GCTTCATCTG AAACGGTCGC CGAAGCGGAG ACCCTTCTGC ATTTCTTGAA ACGTCGTCTC CAGTCAATCA CTCCTGAGTC CTTGCGTAGT GGCGTGCTGG ATGAATTTGT GGATATGCGA GTTTGGACAG CCCGTGATTT GGGTAGTATG GAGACACTGG ACGATTCTGA GTCCTCCTAC CATCTCCCTT TGTTCGGTTC AAGTGCGGAG CAGGCATGGA AGGATCTTCA ATCAACTGTA ATCATTCCTT TATGCCGAGC AAGAGAGCTG GAAGACTTGA GGAATCCTTG CGGGTTTTTT TCTCCGCGAA TATTTGTTGG CGGCATGCTG TTGGCAGGTC TTCCAGGAAC AGGCAAGAGT TCGTTAGCTT TTCACACCGC AAAGATCGCC GCCCGACTGC TCCCGACTGT CAAATTTTTG GAAGTGAGCT GCACGTCTCT TATTCATAAA GAAGTCGGTG GATCGGAGCG TGCCCTTCAC CACTTGTTGG TATGCGCTCG CAAGGCTGCG CCCTGCATTC TACTGATGGA CAGTATCGAA ACAATTGCGG CTGTTCGTGG AAATGATGCA ACGACGGAAG GCACGATGGA TCGCTTGCTT TCAACACTTC TAGTCGAGCT AGACGGCGTG CAGGAACATG GGCAATCATC TGTTTCCTCA CCTGCAGGTA TTGCTGTCAT TGGCATAACG CACAATTCGG ATTGGATTGA CCCTGCATTG CTGCGGCCTG GGCGTCTAGA CAAGATTGCT ACTCTGGATT TACCCGACTA TCAAATTCGA TATGGCATTG CAGCTAGGGA CCTGAAAAGC GGGATAGCGG TTCCTGCTAA TCTTAACCTC CTGAATGTAA TTGCTGCAAA AACGCATGGG ATGAGTGGGG CGAGCGTCGC TGCCGTTTGC AGTGACTTAA AATTGGCATT TGCTCTGGGT AGCAACGTGT GTCAATCTGC GCTTGCGGAA ATAATACGTT CGCGACGGTA AGGATCGGTG CACTATGTAC TCTCTTTTGT AGAGCTTAGC CTGTATGTAC ATACTTAA
|
Protein sequence | MSSGKPALKL DRPPALQSKA TFYLFSRRTG NFSFDQLTNS SQAKRNGRLQ SNKPSTLRGF PDRFDLTFGT LCLYRQSSRN LTVLSNVIAI VAERNSWAKS KSVDFSRQSA GYRSPSDLDR SSTLDTSKTG TNHKVSVLST VVMRKKKTND KAILGDCEAN TEEINQLFVG LYRVCGFSNG ESLESRESSR KQFVESMQVL CSALGQLHDE RKSGELGIQS ILPALEQAAS RVRLVAHDFM AYLTRRVCEL DFPMKKILSS LRDKPAFTTL LILSRSWRTC LNHQVQQEKT VALKAVSLLD QLYSSSRLKH PTNDLLASLQ NLVNILTRMD FIALPHARIG LQLLGSPNNI QRVHSSDLSL LAVDYDMSRS TEYNKVPAHC TVDVPITWQG IENFIERDAG NFPPLTSFLL VGPEGSGKTH ICDVLEKSCV RSSIAVLRPR LPLDILGQSV GEMEDVLVAL VDSAKSGRQS CFALILDDAD FLIATGESGI GEGGERFSGR HHIQSRSQST FFALLDSFRS DAISCSRLIL ICTSKMDQDW TAGRFDRKYH ILPPNEHERR LFICSNLGLH TPVKCSLTIL LEDMVEGTVG RTYSEIALYC RQAAIDHASS ETVAEAETLL HFLKRRLQSI TPESLRSGVL DEFVDMRVWT ARDLGSMETL DDSESSYHLP LFGSSAEQAW KDLQSTVIIP LCRARELEDL RNPCGFFSPR IFVGGMLLAG LPGTGKSSLA FHTAKIAARL LPTVKFLEVS CTSLIHKEVG GSERALHHLL VCARKAAPCI LLMDSIETIA AVRGNDATTE GTMDRLLSTL LVELDGVQEH GQSSVSSPAG IAVIGITHNS DWIDPALLRP GRLDKIATLD LPDYQIRYGI AARDLKSGIA VPANLNLLNV IAAKTHGMSG ASVAAVCKLS LYVHT
|
| |
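The Gene Length and GC content fields above can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates, which agrees with the record (323333 - 320116 + 1 = 3218). The function names gene_length and gc_percent are hypothetical, chosen for illustration. The record does not say how the 48% figure was computed or rounded, so the GC helper is only one plausible definition.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case.

    Assumption: every non-whitespace character counts as a base.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    print(gene_length(320116, 323333))  # 3218

    # Paste the full Gene sequence field here; the 10-base spacing is ignored.
    print(gc_percent("ATGAGCTCTG GGAAGCCTGC"))
```

Stripping whitespace before counting lets the sequence be pasted exactly as it appears in the record, with its blocks of ten bases.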