Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_15212 |
Symbol | |
ID | 7194858 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 536183 |
End bp | 539362 |
Gene Length | 3180 bp |
Protein Length | 475 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183119 |
Protein GI | 219125714 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.178674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTTTC TTTCGTGTGA AGGGCACAAC GGAGCTGGGA AGTCGACTTT AAGCAACCTG ATCTCGTGTG AATTTCGTCC AACTAGCGGA GACGTAAAGG TCTTTGGTCA TTCCGTGACG AATGAAACAT GTGCTGTACG CAACCTCGTC GGAATATGCC GTCAAGACGA TTACCTCTAC CCAAATCTTA CAGCCAAGGA ACATCTTGAA CTTTACGCTG GACTACGGGG CGTGCCTCTG AAGAACATTC CCTCGGTTGT GCAGGAATGG CTTGAAAGTG TCGATCTACA ATCAGTTCAA GATCACTATA GCGCAAGCTA TTCCGGTGGC ATGAAACGCC GATTGTCTCT TGCGCTGGCA ACGATTGGTG GCCGTCCTCT AATTATATTG GATGAGCCAA CCACTGGAAT GGATCCTGTC AGTCGCCGCT TCGTCTGGCG CCACATTGAT TCTGTTAAAG AAGGTCGAGT CATCCTGTTG ACTACACACG CTATGGAAGA AGCCGACCTG TTGGCAGACA CAGTTGCTAT TATGCGAAAG GGAAAATTTG CTGCGGTCGG CACCCCACTA GAACTCAAGG CAGAGCATGG ATCGGCTCTC CAGTTTTCTG TCCTTGTCGA GCCGGAGCTC GTCAGCACAA CAGAGTCTTC GATTCGTGAT CGCTTTGCTC AGTACAAAGA ATTTTTCACG CTTACGGCAG GTAGCGCAGG AAACCTGTCT GCAAACATTC GACGGGTCAG CAAGACCGAT GGAGAAGAAG GCGTGGATGT TGATACGCTA ACCGAGTTTG TGGCGTGGTT GGAAAGTGAT GAGTCTGGTG TCTCCGAGTA CGGCTTTTCA AATAGCTGCC TGGAGGAAGT TTTCCTAGCG GTCACTCACG GGTTGACGGA ACAGAATCAG TCCGTAGTGG AAGAAAACGA GATACAAAGC AGTACTCAAG TCGCGCAGAA TAACAGCGGA ACTTTTCCAA GTACTTTCCC CGGCGATACG ATGCCTCAGG GGTTAGAATC TCAAAGGATG GCAGAAGAAG TGGTCATCAG AAGCATCAAT GATGTCTCAT CCGCGACACC CAGGGTGAAT GTATTGGGCC AGATTGAGGC TTTACTCATT CATTCTTTTG GAAGAAGCTG GACAGGTCAG GCTTCAAGAG GAAATTGGAT TTTTGTAGCT GTCATGACAA TCATTTCGAT TGTATCGGGT CTGGCTATTG GTAGAGCACA AGAACCTCTT TACCTTTTGC CTGTTCCAGT GATGATTCTG TCTCTCATGC TAGTTGTCTG TTTGACACCT TTGTATTCAG ATCGGGCCCT TGGTTTGTGG CACCTCGTGC GCACGCAAAG CTTAACACCC CAAGCATTCA TACTGAGCAC AGCAGTTCAT TCTTTCGCGG TCATGTTCTT CTATGGTCTG ATTGTCCTTA CCGCTTTGTT CGCAACACCA CTATTTCGGG AACCCCATAT TTGCACACCG GACGATTGGA ATTGCGGATT GCCGGGCTTC GGTTCGCGTC GCGAGATCTT CGAAACGCCT GTGTGGAATT TTGACGGTGG AGTCTACATG GAGCAATCAG TGCAACTGTC TGCAATTCGA ATTCCGGCTG GATACGGTTG GTTATTTGGC TCGGTTGCTC TCTTCGCCAT ATCATTTCCC GGAGCAGTCT TCTCAAGCGC ATACCTCCCA GGAAACCGGT TGGCTCTCGT TACAGTAACA TTCGTCATCC TTGGTCTCAG CGCGATACCA TTTGTATTTC TCCTGAAAAC ATACGACACC AATAGTGAAG GTGTTAGTAA TTGTGTACTT GAGTTGGACC CAACAAACCA GTGTGAAAAC ACATTCACCG TGGATGAAAT TGGGACGGCT TTTCTCAATT GTGTGGGTCG TAGCTTAGCT TTGGAGCAGA CCTTTTGCGC TCCATTGCCG ACGTCTATGA TACCTCAGCT CGGGCTTTTT CAAGCACTCA GTATGAGTAT GATTGGTAAA GTCAGATTCA TTTCGGATCC ACCTGCATAC TTGGACGAAG TTTTCCTTCC TAGTATTGGA GGCGATGCAA GCTGTGACGG AGACACCTGC CTTTTTCCAC TGGCCTATGA ATTTTATGTC AAAACCATGC TATACATGCT TTTGGGGTCA ATTTTACTGA CTATTCTTGG GCTGGCCATG GTTTTCACAT TCGCGTTCCC AGTCAAAATT GTTTTGCGAT TAAAAGAATT CCTAGTACTG GCGTCATCTT GCTCTTTGAC GAAGCAATCG CAAATAACGA TAGTAGATGA TACCGAAGAA AATGAGGAAG TTTCGAAGGA ACGCGCTGTT GTTCGTGGAA TGATGAAGAA CTTCCTTCTT GACACCAACG AGAAGATGTG TGTCAGCAGC AACAGCATAG AGCACGACCA AGTTTCGCCT GTACTGATGT ACAAGCTCAG CAAAGTCTAC CCATCGCTTG GCCGCTTGCC TCCCAAAGTG GGGTTGAAGG AACTTGATAA GTGGGGTTGA AGGAACTTGA TCTTCATGTT CCAAAAGGAC AAGTTATTGG ACTACTGTAA GTGTCATCTT TGTTTTTTCG TCGCTAAAAC AAAACAAACA AAAATAACAC GATTGTTCTG TCTCTCATGT AGGGGCAAAA ATGGTGCTGG GAAAACAACG GCGCTGAAAA TTCTTTCAAC AGCCCATGAC GCGACTGACG GAGTCGCGTT GGTCGCTGGT TATAACGTCA ATGCAGAGCA ACGCCGAGTT TTCGAACATT TAGGGAACTG CCCACAGTTT GATGTTGTGT GGGATCGTTA CACAGTCGAG CACCATCTGG TATTCTTCGC GCGACTCAAG GGATTGCCAC GGAAGGAAGT GAGAGACATC GCCATGAGGG TTGCAAATGC AGTCGGATTA GGTGCTCCGG AAGTTTTTCA TCGCCATGTT GGACAACTGA GCGGAGGCAT GCGTCGACGT CTGTCGATTG CCATTTCGCT CGTGGGTGCT CCCGATGTAT TGCTTCTAGA TGAACCGTCT ACTGGTCTCG ATCCGTCGAC GCGCAATTCC ATATGGGGGC TTATACATTC GTTTGCGACC CCCGAGCGCT CTATAATTAT TACAACACAT ATGATGATCG AGGCCGATAC GTTATGCAAT CGGATCGCGA TAATGAAGAA AGGGAAATTG GCGGTTGTTG GCACACAGCA AACTTTGAAA
|
Protein sequence | MFFLSCEGHN GAGKSTLSNL ISCEFRPTSG DVKVFGHSVT NETCAVRNLV GICRQDDYLY PNLTAKEHLE LYAGLRGVPL KNIPSVVQEW LESVDLQSVQ DHYSASYSGG MKRRLSLALA TIGGRPLIIL DEPTTGMDPV SRRFVWRHID SVKEGRVILL TTHAMEEADL LADTVAIMRK GKFAAVGTPL ELKAEHGSAL QFSVLVEPEL VSTTESSIRD RFAQYKEFFT LTAGSAGNLS ANIRRVSKTD GEEGVDVDTL TEFVACFACT DELDLHVPKG QVIGLLGKNG AGKTTALKIL STAHDATDGV ALVAGYNVNA EQRRVFEHLG NCPQFDVVWD RYTVEHHLVF FARLKGLPRK EVRDIAMRVA NAVGLGAPEV FHRHVGQLSG GMRRRLSIAI SLVGAPDVLL LDEPSTGLDP STRNSIWGLI HSFATPERSI IITTHMMIEA DTLCNRIAIM KKGKLAVVGT QQTLK
|
| |