Gene Information |
Locus tag | PHATRDRAFT_49009 |
Symbol | |
ID | 7195401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 293366 |
End bp | 296485 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183718 |
Protein GI | 219126969 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
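The length fields in the table above can be cross-checked against the coordinates. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; neither convention is stated in the record, and the function names are hypothetical. It applies only because the gene is listed as unspliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(293366, 296485)
print(length)                  # 3120
print(protein_length(length))  # 1039
```

Under these assumptions the computed values match the reported Gene Length (3120 bp) and Protein Length (1039 aa).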
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGACC TCATTGTTTT AACCTCAAGG CTCTCTTACA AAATCCGCAC ATACCCTAGC AATCACACTC TCTGCTTGTG GCAACGCTGT TATCACGAAA CGAGTTCTGC CGATGACAGC ATCGACACCC AAGCCGAGAC TTGCAGCGTA GCCTCCAAAA TGAGCTCGAG GAACAGTCAG TTGAAGGGTC ATACTCTCCA ATTCGAAGAA GAACAATGTT TATCTTCTGC AGAGGTCAAA GTTAGTGACA AAAAATTCGA GCCTGGGCTG ACGTTGTCGG CGTTGCCCTT TTTACAGTCG TCGGAATCTT CCGGAGGAAG ATCGCTCTGG AAGTCTCACA AAGTCGACGA CTCAACACCG GTAACGGTTT GGCGCCAGTT GACCCTAAGG CTATTGCAAT CCACAGAGGA GCTGACACGC CACGAATGTT ATCTGATGGA AGAGTGCCTA CAATGGTGGA CCCGAAGACG CGTCAAATTC GATGTCGTCG ACCCGGAAGC GAGTGAAGTT GTTTGGAAGC TCTGGTACAA ACTCTTGACG GAATCGAACG GCAAGCCGTC GTCTTTGTTG CTCAATTCGG TATTGGATCA TTGGAGGTTG TCGATAAAAC AAAACATCAA AGTGCCCTAC TGGCCTGACG GAGTTATCTC GCATGTTCAG TCGCTAGCGC CGGAGCTAGT GGATGTCAAG AGTTTCGCTC TGATCCTTGG AGCGATGGCA TATTGGTATG ACGTGGACCC CTGGAAGGCA CAAGCCTTAC TTCAAGAGCT ACCTTCACAT GTAAAACCCA ACGCAATTGT TTGGAATTCT GCATTAACAG TTTGGGCCAA GGCGGATGCG TTACGATGGC CCGACGCTGC CCTTCGGGCG GAACAACTTC TAGAACGGAT GAAACAACAT CCCGACATTC AGCCCAACGA GGTGAGCTTC ACTTGTGTAT TGGAAGCTCT TGCCAACAGC CCTTCCGCAA AGGCTCCAGA AAAAGCTGAA AGAGTCTTTC AGGATATGGA GGATGCAGGA TTCCTGTCAC CCATCGCTTG TTTGCAAGTC ATGCAAGTTT GGGCGAAATC GGACAGCCAC CACGGAGCTG ACAAAGCCTA TGCTCTTCTT CACGAGATGG TGCAATTGTA CCTCAAAGAC CAATCAACCG TTAAAAAACC CATGAAGCAT TGTTTCTCAG TAGTCATGGA GGGATTTGCG CGTCGTGGCA AACCAGAGAA AGTCGAACGA ATTCTGTTGG AGCTCCAGGA TTTATACCAA CGCTTCCATG ATGATGACTT TGTGCCAACG GCTGCGACCT TCAATTCAGT CCTCTCTGCG TATGCGAGAT GTGCTCTGCC TGATAGGGCT GCGCGGGCCG AGCAATTGCT GATGAGTCTA CGTGAAATGG CCGAGTGGAG CGGTAATACT GCCTGCTTAC CCGACACTAC ATCGGTTAAT ACAGTTGTGC ATGCATGGGC AGAGAGCAAC GATAAGGGCG CAGTCGAAGG CGCTGAAGCC TTACTGAATG CGATGCAACT GTGGGAAGGA GTGCAGCCTG ATGCATACAC CTACACTGCT GTCATAAAGG CTTGGGCTCG CTCCGACAGG AATGGTGCCG CAATGCGATG CGAATCCTTC CTGAACGCAA TGTGGGCAGC ATTCGAAAAG GGGAATCACC AAGTCAAGCC AAATGATGTG ACATATGCTA CAGTGATCTA CGCCTGGTCC AGAAACAAAG GAAAAGAAGC TCCTTACCGA GCCGAGGCAC TTTTCCGAGA AATGATGGAA CGGCATAAGA ACGGCGATTC AACGTTGAAA 
CCCAGGGAGT CTTCTTATGT GTCTCTAATG ACAACTTGGA ATCGCAGCAA CCTGCGAGAA GCGCCCTCTC GAGTCCAATT TTATTTTAAT CAAATGCGAG GGAGTCACCT CGCAGGCGAC AAAAGTTTGC AACCGTCCGC GAAATTGTAC AATGCTGCTC TTTTCGCCAT GAAACGGGCA GGAGATGGGG CGGGGGCCGA AAACATTTTG GAGGTTATGT ATGCAGATTT CGAGAGGGGC AACAACAAGG CCCAGCCAAA TACGCATGTT TTCAACACGA TCATATCTGC TTGGGCAAAC ACTAGAACGC ATATCGCGCC AGAAAGAGCA GAAGCTATTG TTTTACGGAT GCTGGAGCTT CATTCAGACA AAGGATGGGA TTGTAAGCCA AATGCCATAA CATACACCTG TATTCTAGAC TGCTGGGCAA AATCAAGTCG TAGCGACGCT CCTGACCGAG CAGAGGAGAT CCTCAGACAC ATGCAACACT TAAGCGATAA AGGTGACGAA AACGTTAAGC CAACCACCTA CGCTTGGTCG ACTGTCCTTA CAGCGTGGTC GCGGTCGACT TCTCTGGATG CCCCTGTCCG AGCTCAGCAA TTATTTGACG AGATGTTACG AAAATTTGAA GCAGGCGACA AATCTCTTCG GCCGAGTGGA CCAGCATACG CTAGTGTTTT ATCCACTTGG TCGCGCAGCA ACCGACACGA CGCGCCTCAA ATATCATCCG AAATCTTAAA ACTCATGAAA GAACGCCACC TGGCGGATAC GTTAAATGAG AAGCCAAACC GGTACCACTA CTCAGCTGTC ATCAGCGCCT TTGCATCAAA GGGAGATGTT CAGAATGCCG AGGCATTGTT CGAAGAAATG AAACTTTTAA AGGACGTAGA GCCACATGAT GGTTGCTATA ATGGCCTCAT CAAGGCGTAC GGTCGTTCGT CATTACCCGA TGCAGCGGAA CGTGCCGAGT CTTTGCTTCG CACTATGGAG AAAGAATCAG CCGTTGGAGC CGACTGCAGT CCTACCATGA TAACCTACAC ATCGATTTTG GATATTTGGC AAAGAAGCCA AAGACCGGAT GCAGTCGACA GAGCTGAATC CCTCCTGAAG GAAATGCTGA AACTCGCTGA ACAGGGACGG GACAAGCTAA GCCCCAATGC GTCTACCTTT CTTGCTTTCT TACGAGTAAT TTCCAAAAGT TGTGCTACTG ACAAGGCCGC TCGTGCTGAG GAAGTCTTGT CGTTGATGAA AGCTTTTAAA TGCGCGCCAA CGGACGCTGT ATTTCGAGAA TTGAATAAAT GTCGAGCTGA TGCCTGCTAG
|
Protein sequence | MKDLIVLTSR LSYKIRTYPS NHTLCLWQRC YHETSSADDS IDTQAETCSV ASKMSSRNSQ LKGHTLQFEE EQCLSSAEVK VSDKKFEPGL TLSALPFLQS SESSGGRSLW KSHKVDDSTP VTVWRQLTLR LLQSTEELTR HECYLMEECL QWWTRRRVKF DVVDPEASEV VWKLWYKLLT ESNGKPSSLL LNSVLDHWRL SIKQNIKVPY WPDGVISHVQ SLAPELVDVK SFALILGAMA YWYDVDPWKA QALLQELPSH VKPNAIVWNS ALTVWAKADA LRWPDAALRA EQLLERMKQH PDIQPNEVSF TCVLEALANS PSAKAPEKAE RVFQDMEDAG FLSPIACLQV MQVWAKSDSH HGADKAYALL HEMVQLYLKD QSTVKKPMKH CFSVVMEGFA RRGKPEKVER ILLELQDLYQ RFHDDDFVPT AATFNSVLSA YARCALPDRA ARAEQLLMSL REMAEWSGNT ACLPDTTSVN TVVHAWAESN DKGAVEGAEA LLNAMQLWEG VQPDAYTYTA VIKAWARSDR NGAAMRCESF LNAMWAAFEK GNHQVKPNDV TYATVIYAWS RNKGKEAPYR AEALFREMME RHKNGDSTLK PRESSYVSLM TTWNRSNLRE APSRVQFYFN QMRGSHLAGD KSLQPSAKLY NAALFAMKRA GDGAGAENIL EVMYADFERG NNKAQPNTHV FNTIISAWAN TRTHIAPERA EAIVLRMLEL HSDKGWDCKP NAITYTCILD CWAKSSRSDA PDRAEEILRH MQHLSDKGDE NVKPTTYAWS TVLTAWSRST SLDAPVRAQQ LFDEMLRKFE AGDKSLRPSG PAYASVLSTW SRSNRHDAPQ ISSEILKLMK ERHLADTLNE KPNRYHYSAV ISAFASKGDV QNAEALFEEM KLLKDVEPHD GCYNGLIKAY GRSSLPDAAE RAESLLRTME KESAVGADCS PTMITYTSIL DIWQRSQRPD AVDRAESLLK EMLKLAEQGR DKLSPNASTF LAFLRVISKS CATDKAARAE EVLSLMKAFK CAPTDAVFRE LNKCRADAC
|
| |
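The gene and protein sequences above can be related by a direct translation. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field in the record is blank, and that the gene sequence is given 5' to 3' on the coding strand. The `translate` helper is hypothetical and is not part of any tool named in this record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Drop a single trailing stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 24 bases of the gene sequence listed above.
print(translate("ATGAAGGACC TCATTGTTTT AACC"))  # MKDLIVLT
```

The same function applied to the last four codons of the gene sequence (`GATGCCTGCTAG`) returns `DAC`, which agrees with the final residues of the listed protein sequence.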