Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20335 |
Symbol | |
ID | 7201027 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 772746 |
End bp | 775037 |
Gene Length | 2292 bp |
Protein Length | 665 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180115 |
Protein GI | 219118695 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACAATGGC CGTCGCTTGT GTCAAGGCAA GAGCAAGTGT ATATGAATCT TTCTACGTCC CATTGACTAC TGATCTATTT ACTAAGCCCA TAATAAGTTT CTTTGGAAAT GAAGCTCTTT TCTGAAACAG CCAAAAAGAA ACGACACCGC TCCGATTCCA TTGATTCATT AGCGAATTTC CGGCAGAATC TCCCTGTTTA CGCATATCGC ACGGAGCTAC TTCGAGCAAT ACGGAGTCCC AAAATTGTCC TTGTTACTGC TTCAACTGGT AGCGGCAAGT CGACCCAGAT ACCGGCCTAC TTGATCGACA GCGAGCACGT CATGGCCGTC ACACAGCCTC GACGAGTTGC TGCCACTACC CTCGCCCAAC GCGTGGCGCA CGAACACCAA GGCATTGTGG GACACCAAAT TGGCTATCGC GTAAGATTTG ACGATTGCAC TCGCAGCGAT ACCCAACTTA CCTACGTGAC GGATGGTATG CTCCTGCGAG AAGCCATGGT GGATCCACTC CTACGAAAAT ATTCGATCAT ATTTCTCGAC GAAGCGCACG AGCGATCCTT ACAAACTGAT ATTTTGCTCG GTGTCGTGCA GCGTGCGCGC AAATCACGAC AGCAGTCCTA CAGTAGGCGG CCATTGCAAA TCGTTGTCAT GTCGGCAACA CTACAAGTCC AAACTTTTGA AGATTTTTTC GGTAAAGAAA ATGTTGTCCG TATTGAAATT CCTGGGCGGC AATTTCCAGT GCAAATGCTG TATACTTCGG TACCCGCCGA AGATTATATG GAAGCTACTT TGGCGACGAT ATTGCAAATC CACACCCATG AGGAAGCTGG AGATGTGCTA GTCTTTCTAC CTGGACAGGA AGAAATCGAG GATTTGGCGA CGTTGCTAAG GTTGCAGCTA CAGGAAGAGG AAGCCTCCAA ATGGACCGGG GATCATGTTG TCCCTTTCGC GCATCAGCAC ACCAGGGGCA ATACCGTATC GCAACTGCTC GCGAATGGAG TTCTCATATG CCTCTTGTAT GCAGCCTTAC CTCCGGAAGC ACAGCTTGCG GCTTTTGCCG AAAAGCCAGA AGGGTGTCGT CGCAAAATAA TTCTCGCTAC CAACATTGCC GAAACGTCCG TTACATTGCC GGCGATTCGC TACGTTGTAG ACACCGGTAA ACACAAACTC CGACAAATCC TAGCCACTGG TATGGACAGT TTGACAGTCG AATCTGTCAG TCAAGCGCAA GCGGCCCAAC GCGCTGGACG AGCAGGCCGT ATTGGACCTG GACTCTGTTT TCGACTATAC ACTGAAGACG CTTTCGAGCG CTTGGATCCC GACAGTCTGC CTGAAATCCT TCGAGTCGGT TTGGCGCAAG TCATACTGCA ACTCAAGGGT ATGGGAGTGC AAGATCCCAC TACGTTTGAT TTCGTCACTC CGCCCGACAC ATCCAGCTTG GTTCGTGCCG CGAAACTGTT GTACGCCTTA GGGGCAGTCA ACGATGCCAT GGAGCTGACG GACTACGGCA AAAAACTAGC CAAGTTACCG CTAGATCCTG TGTTCGGCCA CCTGCTTCTC AAGAGTGCCG AGTACTCGTG CACAAGTGAG ATGTTGACGG CAGTGGCCGT CCTCTCTGCC GAGAACGTCT TCTATCGACC GACAACCGGA GAAGTTATCG CCAAAGCGGC CGCTGCCCAC CGTCGGTTTG CGAGCCACGA AGGGGATCTC CCTACCTTTC TCAATGTGTA CCAAGCCTGG GAACGAGAGG CGTCGTACGT ACCTCCCACT TCTGGTGGTC GCCGAGCCCA AAAAAAGCTT CTGCATACGA ACGGACAGCA TTCAAGAGTG TTGCATGGGG AATGGTGCCA GCGAAATTTT GTTTCTGGAC GTTCCTTGGG ACGAGCTTAT CACGTACGAC AGCAACTTAG GTCTACGTGT TTGAGGCCAG CGGAAAAGAA TGGGCTAGGG ATGGACGTGA ACGTCACTTG TGGAAAGGAT CGAGAAAGTT TTCTAAAGTG TGCGGCGGCT GGACTTTTTT TGCAAGTGGC CTCCCGCACC AAGGCAGAAA CGGAGATTGA TAGTCGGGGT CGATCGGGAA CTGTCGTCTC AACGCGAGGA CGGTATCGAA CGAAAATAGG TAACGAAACA GTTTCCATCC ATCCGACGTC TACCATGTTT GGAAGACATC CTGCACCAGC TTGTGTAGTG TTTACGGAAT TAGTCACGAC GAAAAAGACG TACATTCGCG GAGTGACACA AATCCGGGAA GAATGGCTAC ACGAGGTTGC TCCTGTCTTC TATCCAAAGT AA
|
Protein sequence | MKLFSETAKK KRHRSDSIDS LANFRQNLPV YAYRTELLRA IRSPKIVLVT ASTGSGKSTQ IPAYLIDSEH VMAVTQPRRV AATTLAQRVA HEHQGIVGHQ IGYRVRFDDC TRSDTQLTYV TDGMLLREAM VDPLLRKYSI IFLDEAHERS LQTDILLGVV QRARKSRQQS YSRRPLQIVV MSATLQVQTF EDFFGKENVV RIEIPGRQFP VQMLYTSVPA EDYMEATLAT ILQIHTHEEA GDVLVFLPGQ EEIEDLATLL RGNTVSQLLA NGVLICLLYA ALPPEAQLAA FAEKPEGCRR KIILATNIAE TSVTLPAIRY VVDTGKHKLR QILATGMDSL TVESVSQAQA AQRAGRAGRI GPGLCFRLYT EDAFERLDPD SLPEILRVGL AQVILQLKGM GVQDPTTFDF VTPPDTSSLV RAAKLLYALG AVNDAMELTD YGKKLAKLPL DPVFGHLLLK SAEYSCTSEM LTAVAVLSAE NVFYRPTTGE VIAKAAAAHR RFASHEGDLP TFLNVYQAWE REASYHSRVL HGEWCQRNFV SGRSLGRAYH VRQQLRSTCL RPAEKNGLGM DVNVTCGKDR ESFLKCAAAG LFLQVASRTK AETEIDSRGN ETVSIHPTST MFGRHPAPAC VVFTELVTTK KTYIRGVTQI REEWLHEVAP VFYPK
|
| |