Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38903 |
Symbol | |
ID | 7203683 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 491520 |
End bp | 493253 |
Gene Length | 1734 bp |
Protein Length | 535 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182849 |
Protein GI | 219125148 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.216492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCCTG CCACCCGGCA AATGAAAAGC GAGGCTGTCT ATGCACACAT CTTGGATAAC ATTCTCTTGT TGTCCCAAGA ACACCCTATC CGTCTTAGCT TCCAACAGCA GGGATATGAA ACAGCCATCG ACATTCTCTC TATCTTCGAG AACGAACTCG ATGCCCTCGG TTACAGGTCT CCCACGCCTG TCGACGGTGT AGACAACCCA CGGATCCCAC TGCTCATGGC GCATCGACAA ATCTTGCGTC ATTTTCTACG TTGGCAAGCA TCCTTTGAAC GGCAAAAGGG GAGCCCTATG AAGCCTTCGG AACTCATTGC GTTGAACAAC GAAGACTTTG TTCAGTATTG AGGATCGGCA CTTGGCCAGG TATCGACAAC CAGTAGTCCC TCAACCTTGG CCCCCACCGC AACGAGCATC ACCCCTAAAG TGCGATCTGC TGCTGACGAC TTCAAGCGTG GCGTTAAACG TGACAAAACG CATTATCCCG TACTCAAGGA TGACAAGTAC TGGGATAACT TCTACCGTTC GTTTGTGGTC ACTGCAGTCT CGCATAATGT CGAAAAGGTC CTTGACCCAA CGTATGTTCC TACAGAGCCC TCTGACAAAG CACTTTTCGA AGAGCAAAAG AAGTTTGTGT ACTCTGCACT AGAACACACA CTGCAGACAG CGGCGGACTT TACTGCGGCC ATTGGAAACC TGTACACCAG GAAAGTGCGT GAATTGGAAG AAGCAAAGGA TGAGGAGGAT GTTAAAGTCA AGCCACCGGC TCCGTTCTCG AAGGAAACGA AGTGGATTCC GTTCTTCAAG TTGTTGGTAA ACTACTTGAG CTCTGTGACG GGAGTTAACA AAGTGCCGTT GGATTATGTC GTTCGGAAAG ACGACGACGT TGCTGCACCG GATACCGAGT TCAAGACGGA GCACGAGAAG TTGGTGCTGT CGACTCCCCA TACGGGGACG GCGTTCAACA AGGACAACGG GAAAGTTTGG ATCCAGGTGA AACAGTTGAC TGTGAACGGT CCAGCTTGGA CTTACGTTGC GCCTTTCGAG AAGAAACGCG ACGGTCGTGG AGCGGTCAAG GCTTTGAATA GTCACTATGA AGGTGATGCG GTGATGTCCA AGTCCAAGGC GGCTGCATTT GATGTGCTTG AGCACACCAC CTACACTGGA GAACGTCGTA ACTTTGGTAT GGAACGGTAC ACGAACGCCT TGTCGACGGC ATTCTAGACC CTGGACAAGT ACGGAGAGAC CTTGACGGAG TCAAGAAAGG TGGATGTCTT CTTGCGCAAT AATCACTGCA CCAATCCCAA GATGCTCTCA GGAATTGCGG TAATTCAGGG AGACGCGGAT TGGATGTCCA ATTTTGCCAA GGCGGCCGAC TATTTGGCCT TGTTTACTAA CACCAATACC TCTCAAAAGA CAGGTTGTTC GATCTCAAGT GCTCAGCAGA CTAGTAACAA CAAGAAGAAG CCGGCTATTC GAGCGGGCAA CTATACTCCA AATGAATGGC ATCAGCTCTC GGACAAAGAA AAGGACAAAG TTAGAGCCAA GCGAGCGGCC GCCAAGTCCT CTCGCGATAA AAATAAGCGC TCGGCAGCAG CAATCACTCG TTCGAGCGAG AAACCTGACA AGGGGAGCGC GGATGGTGCA ACCAATGCAG GTGATCAGTT TGCTCTCTCA ACCAAGAAGA AGAAAAGGAA GACTGTTGGT TTTGAAGGCG AAACGAGCGA TTGA
|
Protein sequence | MVPATRQMKS EAVYAHILDN ILLLSQEHPI RLSFQQQGYE TAIDILSIFE NELDALGYRS PTPVDGVDNP RIPLLMAHRQ ILRHFLRWQA SFERQKGSPM KPSELIALNN EDFVHPSTLA PTATSITPKV RSAADDFKRG VKRDKTHYPV LKDDKYWDNF YRSFVVTAVS HNVEKVLDPT YVPTEPSDKA LFEEQKKFVY SALEHTLQTA ADFTAAIGNL YTRKVRELEE AKDEEDVKVK PPAPFSKETK WIPFFKLLVN YLSSVTGVNK VPLDYVVRKD DDVAAPDTEF KTEHEKLVLS TPHTGTAFNK DNGKVWIQVK QLTVNGPAWT YVAPFEKKRD GRGAVKALNS HYEGDAVMSK SKAAAFDTLD KYGETLTESR KVDVFLRNNH CTNPKMLSGI AVIQGDADWM SNFAKAADYL ALFTNTNTSQ KTGCSISSAQ QTSNNKKKPA IRAGNYTPNE WHQLSDKEKD KVRAKRAAAK SSRDKNKRSA AAITRSSEKP DKGSADGATN AGDQFALSTK KKKRKTVGFE GETSD
|
| |