Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43645 |
Symbol | |
ID | 7197358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1088103 |
End bp | 1090936 |
Gene Length | 2834 bp |
Protein Length | 862 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177749 |
Protein GI | 219111995 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0711576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATGGGGATG ATAATGGAAT GGAGCGGAAA GCGAAAATCA GCCCTTACAG TTTGTTGCTT TCAAACGTGG AAAGCCACTT ATTTGGAAAG AGCACTCACA AAAGGCGCGG CCTTTTCGGC AACGACCCCA AAATGGATTA TTTTCGCAGC GGCTGAGAGG CAATACCACT GTCGCCGAAA GGTTTGCTAC CACTACGCAC GGGATTCAAG TTGCATACAA ACGATCTTCA GAGTACAACC GCTCTATGCA CAACCATTTT CTTGCGGAAG CTGACAAATC AAATCGTAAG GAGGCCTCTG CAAGGCTACT GTCGGCCACA AAGCCTTTTT CTAGGTTACA ATGGAGAGAG GCAATTAGGA CATTGGAATG GTGGACAAAT GAAAATGGCG GTAGCGAGTT ATCCATCTGC ATTTTTGCGA AGCTTTGCGA ACAAGAATCC AAAGGTGAAA ATGAACTGGA CGAGACTTTG GCGAAGAAGG TGGCACAATC ATGGGCAAGG GATTTTGTGG CAGGGGTATC GACGCTCAGT GTGGCAGAGT TTTCTAGGCA GTTAAAAATT TGCCAGGAAG CGATTCCATC GTTGAGAATG GATGTTTCAG CTATCATTCA TAGTGCAAAC GCTAAAAGAC AAACTCAAGA GCGCTCGCAA GGTGAAAAGT TGGCAATACC GAATATTTCT TTACTTGAGG AGATGGAGAG CAAAATCGAG AAATCACAGA TCCAAGACGA TCCGTCGGTG TGCCCAGACG TGAGAACAAT AAATGCTGCC ATATCGGCGT CACTATCATT CGACGAGCCG GAACTTGCAG AACGTGTTTT GACTCGCATG TGGAATATTT ATGACTGCAG ACAATGGCAA AAAATTCGCC CCAACCACAT CACATATCAC AATATAATGG TGAGCTGGGT AAAAACATGC CGGTCGACTG AAAGAGCGGT GAGAGTGGAA CGTCTCTTGC AGCAATTAAA GAAGCGACAT AGTGAACATC ATTATGACAA GGGTCTAACA CCACAAAGGC AGCACTACAT TCTGGCTTTG TCGGCATACT CTAGAGACGG TGATCTTAAG CGGGCACAGG CCCTATTTGA CTACATGCTG AAATCCTTTA CATTGACGCG TGAAATCTCT TTAAAACCGA CTGGCAAGGT CTGTGCTGTG TATTTAAGCG CTTTATCGCG CAAACCAGAC GTGACTGCTC CTGTTCTTGC GGCGGCGGTG GTCCACAACC TCTTTGAGCG GTTTCATGAT AACATGGACC CTGATTGGCT GCCCACGCCT CAAATGTACA CTTCTTACAT GAAAATCTGG ATTGATAGTG GTCGACCTGA TGGTGCACAG CAAGCACAAA AAATTCTTGA TGACATGTCA ACTAAGCCAG GGAATCTCTC CCCGAACATT ATGCATTACG GTGCTGTTAT TTCAGGGTGG TGCAAAGCCG ATGAACCCGA CCGGGCGGAA TCGATTGTTC GATTTCTTTG TGAGAAAACC AAGGTCAAAC CAGATATCAA ATGTTTTCTC GAAGTTGTGA AAGCTTGGTC TTCCCGAAAT ACATCGGAAG CGAATAAAAG GGCTGAGAAG CTGCTGGTGC TTATGAAAGA AATTGAAGGT ACATCAGGTA CAGAGCGGTC GACAGCAACG CTCGATATCA TGCGAATTTT GTCGCGGAGC ATAAAAGGCG GTGACGCTCA GCGATCAGAA GATATCATTC ACCGAATGCA ATCACAATAC TATCTAGGAC ACTCGTCTTT GAAAGCAACC ACTGAACACT ACAACGCTGC ATTAGGAGCC TGGTCACGAT CGAACGAATT TGAAAGTCCT GATCGAGCAG AGGCTCTCTT GCTAGAAATG CTGGAAAGAT TTGACCACGG CGACAATGAC CTTTCTCCGA CAGAGCAATC GTTCACATCG GTAATCACAT GCTGGGGGAG GAGTAATCGA GAAGAAGCCG GTCTTCGAGC ACAGGCCGTT TTCGATGGTA TGCAAGACAG GAGGGATCAG CAGCATCTAA GCTCACTTTG TGTAACCGCA GCATCATACT CGGCGCTCAT TATGGCGTGG GCTCGAGCTG GTGAACCTAA TCGCGCTGAA ATTGTGTTGC AGGAAATGCA CAATGACTAT TTAAACGGAA ACAAAGCAGC TGAGCCAAAT CAAATCGTAT TCAATGCAAC CATAAATGCA TGGGGGAAGT CACAACGAAA GGGACGAGAA AAACGCGTCG ATCAAATCAT ACGGCTTATG CAGCAGCTGG GTGACCTTAC TGAAGTACGG CCCGATATCG TAACGTTTTC TACCGCCATA GCCTCATGTA TGAGAAGTGA ACTTTCAAAC GCTCATGAGC TGGCAGAGAG CTACCTTGAC GAAGCCAAAA GGCTGTACGC TGAGGGTGAC AGCGGTTGCA AGCCAAACTC AATGTGTTAT GGCGCTGTGA TTCAGGCAAT CGTACGTGGG GGCAAAAGCG ACGCTCCTTT GAGAGCCGAA AAATATCTCA ATGAGATGAT TTCCCAAAGC GAGTTGGATG CAAAGCAAGT TCAAATGGCA TTTGCCGCAA CAATTAGTAG TTGGGCGCAT GCTTCGTATC AGTCAGAAAA CGTTCTGAGG GCAGAAGCGC TACTTGACAC AATGACCGCT TTTGGGAGCA AGCGCGGACA GCAGTTTCTT CCGGGCCTAC GAGTTTACAC AGAAGTGCTT ATAGCGATTT CGAAAAGCGA ACCTCGCATA AGGGCCAAAA AAGCAACAGC TCTATCTCAA TCAATGCAAG CGAATAGCAT TTTTCCTGAT GAAATCTGCG AAAGAATATT TAACGCTTGT TTCATCCGAA AGAGAAGACG ATGA
|
Protein sequence | MHNHFLAEAD KSNRKEASAR LLSATKPFSR LQWREAIRTL EWWTNENGGS ELSICIFAKL CEQESKGENE LDETLAKKVA QSWARDFVAG VSTLSVAEFS RQLKICQEAI PSLRMDVSAI IHSANAKRQT QERSQGEKLA IPNISLLEEM ESKIEKSQIQ DDPSVCPDVR TINAAISASL SFDEPELAER VLTRMWNIYD CRQWQKIRPN HITYHNIMVS WVKTCRSTER AVRVERLLQQ LKKRHSEHHY DKGLTPQRQH YILALSAYSR DGDLKRAQAL FDYMLKSFTL TREISLKPTG KVCAVYLSAL SRKPDVTAPV LAAAVVHNLF ERFHDNMDPD WLPTPQMYTS YMKIWIDSGR PDGAQQAQKI LDDMSTKPGN LSPNIMHYGA VISGWCKADE PDRAESIVRF LCEKTKVKPD IKCFLEVVKA WSSRNTSEAN KRAEKLLVLM KEIEGTSGTE RSTATLDIMR ILSRSIKGGD AQRSEDIIHR MQSQYYLGHS SLKATTEHYN AALGAWSRSN EFESPDRAEA LLLEMLERFD HGDNDLSPTE QSFTSVITCW GRSNREEAGL RAQAVFDGMQ DRRDQQHLSS LCVTAASYSA LIMAWARAGE PNRAEIVLQE MHNDYLNGNK AAEPNQIVFN ATINAWGKSQ RKGREKRVDQ IIRLMQQLGD LTEVRPDIVT FSTAIASCMR SELSNAHELA ESYLDEAKRL YAEGDSGCKP NSMCYGAVIQ AIVRGGKSDA PLRAEKYLNE MISQSELDAK QVQMAFAATI SSWAHASYQS ENVLRAEALL DTMTAFGSKR GQQFLPGLRV YTEVLIAISK SEPRIRAKKA TALSQSMQAN SIFPDEICER IFNACFIRKR RR
|
| |