Gene Information |
Locus tag | PHATR_18579 |
Symbol | |
ID | 7204395 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 617949 |
End bp | 619672 |
Gene Length | 1724 bp |
Protein Length | 510 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186383 |
Protein GI | 219113599 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.445158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAGAGGAGC AACGTCGAAA CTGTTACATT GTATTTGGAG TTGCCGAAAA TAGTGGTGCG AAGAAACGCT GAAATATGGC GGACGCAAAA TCTCTTTTTA CTCTGTTGGA AAGTGCAGCA GGATATGCTT TGTTCGAGGT AGTCGCATTT GAAGAGATTG GCGGCTTGTT GGAAGACTCT ATGGACACCG TCACCGATTT GCAACGCTTT AGCCGCGCAG TGAAACTTAA GGCGTTCAGT CCATTTGAAA GCGCCGCCGA AGCCTTGGAG AACGCAAATG CTGTTTCGGA ACACGCCATG ACAGGAACAC TTCACAATTT TTTAGAGATG AACCTCCCTA AGGTCAAAAA ATCCAGCAAG ACGGCTTCAT ACGCACTTGG AGTCATGGAT CCGGCCTTGG CCACGGCTAT TTCTGAGGGT CTTGGTGTTT CATGCCGTTC TGATGATACG ATTCGAGAAA TTGCCAGAGG CTGTCGAGCG CATTTGGATA CTTTTGTCAA AGGTCTCGAA GGCGGCGCAG CTGAAAAAGC CCAGTTGGGG CTTGGCCATT CCTATTCGCG TAGCAAGATC AAGTTTAATC CAGCTCGTTC CGATAATATG ATCATTCAGT CGATCGCTTT GTTGGATCAG CTGGATAAGG ATGTCAACAC CTTTGCGATG CGCATTCGTG AATGGTATTC CTGGCATTTT CCAGAGCTGA AAGACATTGT GAAGGATAAC ATTATGTTTG CCAGAGCAGC AGCGTTCATT CAAGACAAGA ACTCCCTTTT TACAAACAGC GCTTCTGATT CCGGAGAAAG TTCCGCAGAA AGCAATGGAA AGCTGGAAGG TCTAATAGAA ATTGTTGGGG ATGAGGACCT TGCCAAACAG GTCATCGCTT CCGCGCGTAC GAGTATGGGA ATGGACTGTA GTCCGGTCGA TATGATCAAC ATTGTTAACT TTACGACTCG CATGGTAAAG TTAGCGGAAT TCCGGAAACA ATTGGGCATG TACCTAACGG AAAAGATGTC CATTGTCGCG CCGAATTTGT CGGCACTAAT TGGAGACACT GTTGCAGCTA GACTTATCAG CAAGGTTTGT CCATTCTGAA TTATATTTTT GGTCTCTTTT TTACCGATCA TGTGCTCAAT TGTTTGATTT CTACCACACA GGCTGGTTCG CTGACAAACC TCGCCAAGGC TCCTGCGAGT ACAGTGCAGA TTCTCGGAGC TGAGAAGGCT CTCTTCCGGG CCCTCAAAAC CAAGGGTAAT ACACCCAAGT ACGGCTTAAT TTATCATTCT ACTTTCATAG GACGCGCCGA CGCAAAAAAC AAGGGCCGCA TATCTCGCTA TTTGGCTAAT AAATGTTCGA TCGCTACACG GATCGATTCT TTCTCGGACG AGCCTAGTCG CTTATATGGA GAAAAGCTTC GAGACCAAGT CGAGGAGCGG CTGAAGTTCT ACGAGACGGG CCAAGCACCA AGACGCAATT TGGATGTTAT GGAGGAAGTG AGCCGGGAGC TCAGAGCCGC CAAGGGAGAA GAAGACGACA ATGTTGAGAT GGAGGATATC TCGAAGAAGG ACAAGAAGAA AAAAGCGAAG AAAGAAAAGA AAAGCCGGAA GTCCAGTGAT GCTATTGAAG TTGATGAAGA ACTCAAACGG TCAGCGAAAA AGGCGAAAAA GGAAGCAGCT TCATCGGAAA AGAAGAAAAA GAAAAAGAAA TCTAAAAGCG GAGATTCTGA TTAG
|
Protein sequence | MADAKSLFTL LESAAGYALF EVVAFEEIGG LLEDSMDTVT DLQRFSRAVK LKAFSPFESA AEALENANAV SEHAMTGTLH NFLEMNLPKV KKSSKTASYA LGVMDPALAT AISEGLGVSC RSDDTIREIA RGCRAHLDTF VKGLEGGAAE KAQLGLGHSY SRSKIKFNPA RSDNMIIQSI ALLDQLDKDV NTFAMRIREW YSWHFPELKD IVKDNIMFAR AAAFIQDKNS LFTNSASDSG EKIVGDEDLA KQVIASARTS MGMDCSPVDM INIVNFTTRM VKLAEFRKQL GMYLTEKMSI VAPNLSALIG DTVAARLISK AGSLTNLAKA PASTVQILGA EKALFRALKT KGNTPKYGLI YHSTFIGRAD AKNKGRISRY LANKCSIATR IDSFSDEPSR LYGEKLRDQV EERLKFYETG QAPRRNLDVM EEVSRELRAA KGEEDDNVEM EDISKKDKKK KAKKEKKSRK SSDAIEVDEE LKRSAKKAKK EAASSEKKKK KKKSKSGDSD
|