Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50489 |
Symbol | |
ID | 7199325 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 196943 |
End bp | 199860 |
Gene Length | 2918 bp |
Protein Length | 387 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185398 |
Protein GI | 219130492 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.558278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACCGCACCTA CAAGTTCGCG ATTTACAGTT AACATATAAA GAAATGAGGT AGTTTTGTCA TCTACCACTT CAAGCCAGAC TCTTTTGGAT TGTACAATCC AGTTAGTTAG CTAGATGCAT TGAAAAACAA ACGAGACAAA CCAGAGTTAC TCATAGCAGA CCAAGAGATC GCCCGGGACG AAATACTGCT CTGCCTTTGG CCCAACCGGG GTCTACAAGA CCCGCACGCA TAGATATCCC GACATTTCCC TTTCCAGCAT CAACGGCAGG CCCGGCATCA TGACCGACTG GATATACCTT GACTTTTCCC GCACCACGTG GAACCATAGA TTTTGCATTC ATGGAGCCGG GCTCTACAGG GATTTGATAG ACAACAGACG GTAGCGCATT ACTACCCTTT ATGAGCGGGG GTAGCAAGAT TTCAAACTCG ACGGTAAAAA GCCGAGAACG CACCGCTTCA TCGTACGATG CTTCCAAAAA TTCTGATGTT TTCAGTTGGT GAGCGGCTGC CGATCCTTGA CGCCAGAACG CTCTAAGTTT CTTGCCGTCA CTCATCTTTC CGATTGCCCG CATCCCGTAT TTCCCTTTCT TTTTGTGAAG TTTAAGTTGA ACCGGGTTAG CAAGATATTT CTGTATGTGA CGCTGTAGCT TGCGGTCGAC TCCTTCCAAT ACTGGGGGCT CAAGTTCCCA CTCGACATCC TGCATCAAAA ACATTTCACG AGTCATGCCT GAGAGGGCTT CATTGGTATC CAGCGCCATT TCACCACGAA AGTCTGTATC CGATTGACCC GAAGATCGGA GAAAGCGCTG AGCGCTGGCA AAGGGATAGA TAGCGAGCAG CAAAAAACAA CAGAGAATAA CGGGCTTCAT CGTTAGTCTT AGGCTTAACC TTGTAAATCT GAGAATGTCG TTGTTTGGAT TTCTAGAAGT GCTGATGGAT TTCAGGTGAA ACACTATCCA TGTGATCGAA GGCGATCTCA CAGTCGAGAT GGCGGAAATT GGGAATGGTC GCGGAGAGTT CCATAGTAAT TTGTCGGCAA GAAATCTTAC ATCGATGAAA CTGTAAGCTT CTTCCACGCG GAGTTGCCGC CCTGGACGAC AAAGGTTCGG GATTCAACAA ATCCTATTAT GGGAGTCCGG CTCCATAAAA GCAATCACTT GTCCGGGTTC TCACGTCAAA TGTTCGCGCC GTACCTCGGC TTCCTTCGAT CAACCCTGCA CAAACTCCTC TTCATTGCCG TTGAAATAAA ATTGGAACGA CAGCTTTGGA GGGAGATTGG AGTTGTCGTC GATGGAGGAT TTTGCATTTT CTTAAACAAA CCCTTTTGAT ATCACAATGG CCATCGCAAA GAATCCCCAC CGGAGCCCTG AAATGTTCCG CATCCATCGC TATTCTTTGT GGTCGTTCGA GGTGGGTGCA TCTGTACAAC CACTTTCCAA GGGTAAACGA AAGAAGAGCC GCTTCATGAT TTCCCCTACA CCTACGCATA GCAACTTCTG GAACTCCCAT GACTTCACAT TGGATAAAAA GATTGGACAA GGATGTTTTG GGAAGATTTA TCGGGCCAAG TACCATCGAC CTATAGAATT GGCTCCCTCC AGGGATAGAA GCCAAAGTTC TGTCCACATA AACGCGAAAA AGTGCTCCTT TGTTGCTATC AAACAATTCT CAAAAATAAA GCTCATGGAA TCGAAAGACC GCGGAAGCCG TTCCCATGAG CTACTTGAAC GGGAAATTGG CATTCACAGC CAGTAAGTTT GCGAGTGGAA CGGCAACCCC AAGGAAACAT TTTCCTCACG TTTTTTTGAC AAATGGCAAT AGATTGCAGC ACAAGCATAT TTTGTCATTT TGGGGATACT TTGATAGCTT GTCGCATGTG TCTTTGGTTT TAGAGTACGC ACCGTATGGT GACCTTCTGA ACTATACCAC TCGGAATTTT CCATATTCCA GAGAACTCCG TCTCAAAGCC TCTAGCCATT TTGTCCGGCA AATTGCTTGC GCACTGGACC ACCTCCAGGC ATGTCAAATC GCCCACCGAG ACATTAAGCC CGAAAATATT GTCGTGGTTT CGCCCCGGCA AGTGAAATTG TGTGATTTCG GATGGGCTGT TTCTTTCCAA AAAGCTGGCT ACCAAACGAC ACTGTGCGGT ACGTCCGAAT ACGTTCCGCC TGAAATGCTA GCCTGCAACT GTAAATACCA AGCAGCATAC GTCGACTCAT GGGCTCTTGG AGTATTGACG TATGAACTCG TTGAAGGCGA GTCACCCTTT GTCCTAGACG CTTCTAAATG CAAAACGAAC CTTCCAAGGC AACAAATCCA TGGAAATGTT ACAACCGAAA TGGTTTTCGA TAAGATTCGA AACTTTCCTG GATTTTTTCC GCGGCATGGT CCATCCCAGC TCACCAGTGT GGAAATGCGT ACCTTTGCGG ATTTCGTGAC AGGCTTGATG CAAATCAACC CTGAAAGTCG CTGGTGCCCA GTTGATGCGC TGGAGCATTC TTTTTTATCA CTATCTTCAT TCCTCGTGGA GGGCCGCGAG CCTCCGAATA AAGAACACTC TCGACACAGC AGCTTCTCAA AGCCTGCGTC TTATGTCCAT TTTATTAACA ATGCGTCGGT AGGTTGACTC TCATTGTCAC TTACAGTTCT ATATAATGAA TTGTGAGGTA CGAGTAGCAT CTATTATATT CATCTTTGTT TTGAATCTTC TCTCGTGCGT GAACGATGAA TTTTCGGATT CGAAATCCAT GTGGTTGGAT GATTGAAGTG AGATGAAGCA TGAGGAAGAA CGGAGAACCT CATCAACACC TCTCCTTACA TTAATTATAG AGCGGGAAAC GTCGACACAT GCCATGCTGC CCTCGACGTG ACACTCATTG ACGCAAAAGC AAAGAATCTG TCATGCATTA ACGATTGA
|
Protein sequence | MAIAKNPHRS PEMFRIHRYS LWSFEVGASV QPLSKGKRKK SRFMISPTPT HSNFWNSHDF TLDKKIGQGC FGKIYRAKYH RPIELAPSRD RSQSSVHINA KKCSFVAIKQ FSKIKLMESK DRGSRSHELL EREIGIHSQL QHKHILSFWG YFDSLSHVSL VLEYAPYGDL LNYTTRNFPY SRELRLKASS HFVRQIACAL DHLQACQIAH RDIKPENIVV VSPRQVKLCD FGWAVSFQKA GYQTTLCAYV DSWALGVLTY ELVEGESPFV LDASKCKTNL PRQQIHGNVT TEMVFDKIRN FPGFFPRHGP SQLTSVEMRT FADFVTGLMQ INPESRWCPV DALEHSFLSL SSFLVEGREP PNKEHSRHSS FSKPASYVHF INNASVG
|
| |